Friday, November 28, 2014

Practical BigData Course with Live Project using Agile methodology


If you are thrilled by the "Big Data " buzz world and planning to jump on to it. Think Again. Although it may see promising Bigdata /Hadoop based carrier could be challenging. With so many different course available each claiming to be best , you are more likely to get confused then to get convenience .

It was these factors which forced to be join and give a more realistic and practical option for Hadoop skillset for genuine folks. Much of the content and inputs of the course is based on my  experience in BigData domain for last 5 years from a total of 16 + exp..

Please join linked in group for regular updates on my learning in Hadoop / Bigdata Real time work.

http://www.linkedin.com/groups/Online-Hadoop-Training-4838165

Sample Session:

Please refer below vids as sample from training session. i hope that should help you make up your mind. 

Training Course Details Session:



Hadoop HDFS Lab:


 Introduction to HADOOP
  • Distributed computing , cloud computing
  • Big data Basics and Need for Parallel Processing
  • How Hadoop works ?
  • Introduction to HDFS and Map Reduce
 Hadoop Architecture Details
  • Name Node
  • Data Node
  • Secondary Name Node
  • Job Tracker
  • Task Tracker
  • Safe Mode , FS Image and Edit log details
HDFS ( Hadoop - Distributed File System)
  • Hadoop Distributed file system , Background, GFS
  • Data Replication
  • Data Storage
  • Data Retrieval
  • Additional HDFS commands
  • HDFS Upgrade steps
  • NameNode Failure recovery steps and execution.
  • HDFS - Java API for Interaction with FS.
  • Node Decommissioning 
 MapReduce Programming
  • MapReduce, Background
  • Writing MapReduce Programs
  • Writable and WritableComparable
  • Input Format, Output Format
  • Input Split and Block size
  • Combiner
  • Partitioner
  • Number of Mappers and Reducers
  • Counters

Map Reduce Algorithms and Exercises
  • Sentiment Analysis using Facebook , Twitter and Youtube data
  • Frequency Count example with variations of MR .
  • Eclipse based development of end to end MR Job and Deployment.
  • Sorting Data using Key value Data Type Lab
  • Exercises- Martix , Partitioner, No Reduce, Distributed Cache Labs

Hadoop Streaming
  • Introduction to Hadoop Streaming
  • Streaming API details and use cases
  • Python Based Example for Streaming API
  • Exercise for Hadoop Streaming ( XML Files ) Based.
  • Exercises on Ruby
  • Exercise on C# using MS-Azure.
Apache Pig
  • Installation and configuration
  • Execution Types - Local , MR modes
  • Grunt Shell - Configuration 
  • Pig Latin - Examples and code
  • Data Processing using Pig Latin
  • Loading and Storing Pig Functions
  • Data Filtering
  • Grouping & Joining Operations
  • Hands on Exercises

Apache HBase Installation and Details
  • HBase and NOSQL Introduction
  • HBase Installation and Configuration.
  • HBase and Java Based integration
  • Hbase basic exercises

 Apache Hive  Instalaltion and Details
  •     Hive Installation on Single cluster Hadoop Node.
  •     Hive Services
  •     Hive Shell Description
  •     Meta store Details
  •     Hive QL Basics
  •     Working with Tables, Databases etc.
  •     Hive JDBC programming
  •     Hands on Exercises and Assignments

Introduction to Amazon Map Reduce (AWS-EMR)
  • Hadoop using Amaozon Web Service
  • AWS MapReduce and EC2 
  • AWS - S3 Service Model.
  • AWS-MR Architecture.
  • Streaming Exercise using EMR JobFlow.

Hadoop Infrastructure Planning
  • Basic Hadoop hardware and software req
  • Small , Medium and Large cluster 
  • Networking challenges in Hadoop Deployment
  • Disaster Recovery ( DR ) in Hadoop .
  • Performance Tuning a large cluster
 Hadoop Industry Solutions
  • EMC GreenPlum Introduction
  • IBM BigInsight Details
-----------------------------------------------------------------------

People planning to move from Mainframe, SAP , ERP ,.NET domain get specialized consultancy help to map there existing skillset and tune it as per Hadoop related job opportunities.
We go with a end to end , 3 phase approach to cater to your need for not only helping you with training but also with interview preparation and later job support.

  • Complete Hadoop and Eco system Training 
  • Live Interview Preparation with mock interviews
  •  Job support with coding help


Training Includes :
- Live and offline training on Hadoop , Ecosystem projects 40 Hours of recorded live training.
- 40+  Hadoop fully solved exercises (pdf form)
- Red Hat OS based Virtual machine for Labs.
- 2 set of Certification help questions
- Sample Resumes for Admin and Developer skillsets

Support includes :
- Helping and guiding student on day to day work w.rt to Hadoop.
- Coding and Helping with design suggestion for Hadoop , Hive , Pig and other tasks.
- Helping with documentation as needed - Design doc, Testing doc etc.
- Coding any Java related stuff as part of Hadoop Work.

Interview Preparation: 
- Walk through of Real time case study covering all Hadoop related tasks and questions.
- Writing and Discussing your resume and accordingly assessing your competency level
- 3 Mock Interviews on HDFS , Map Reduce and Misc Hadoop Topics .
- Sharing 2 certification Question Set .
- Guiding how to do Hardware estimation, Software configuration , Map Reduce code deployment etc.

You also has my open linked in forum to ask question with a Panel of experts .

Lastly about myself. I am 15+ exp Lead Hadoop Architect working for a Leading Financial Company . MY carrier spans across distributed programming , SOA and Cloud domain before moving to BigData space few years back.

Email: onlinetraining2011@gmail.com 
Skype:  onlinetraining2011

Course duration: 40 Hours
Medium:  GotoMetting