Certybox provides Instructor led sessions for Big Data & Hadoop Developer Certification. This course is an introduction to the Big Data Eco System, the need for Big Data and its applications. The course also covers Hadoop Architecture, Map-reduce framework, starting with installations, and explores other technologies like Pig, Hive, HBase, ZooKeeper, Oozie, and Flume. It focuses to provide participants hands on experience, so there would be multiple assignments, quizzes and a project.

About the Course

Certybox provides classes on Big Data and Hadoop Development. This course is an introduction to the Big Data Eco System, the need for Big Data and its applications. The course also covers Hadoop Architecture, Map-reduce framework, starting with installations, and explores other technologies like Pig, Hive, HBase, ZooKeeper, Oozie, and Flume. It focuses to provide participants hands on experience, so there would be multiple assignments, quizzes and a project.

Salient Features

  • High impact, proven training – Over 30,000 professionals globally have participated in our training programs.
  • Experienced, expert instructors – Our Instructors come with a rich, 10+ years’ industry experience & 3 Years experience in Big Data and Hadoop.
  • We have different training methodologies that help participants learn better.

Deliverables

  • Instructor-Led Training
  • Free 1 Year e-Learning Access
  • USB Drive with Virtual Machine With Built In Data Sets
  • 2 Simulated Projects
  • Receive Certification On Successful Submission Of Project
  • 45 PDU Certificate

Course Curriculum

INTRODUCTION TO BIG DATA AND HADOOP
What is Big Data? 00:00:00
Types of Data 00:00:00
Need for Big Data 00:00:00
Characteristics of Big Data 00:00:00
Traditional IT Analytics Approach 00:00:00
Big Data—Use Cases 00:00:00
Handling Limitations of Big Data 00:00:00
Introduction to Hadoop 00:00:00
History and Milestones of Hadoop 00:00:00
GETTING STARTED WITH HADOOP
VMware Player—Introduction 00:00:00
Installing VMware Player 00:00:00
Setting up the Virtual Environment 00:00:00
Oracle VirtualBox to Open a VM 00:00:00
HADOOP ARCHITECTURE
Hadoop Cluster in commodity hardware 00:00:00
Hadoop core services and components 00:00:00
Regular file system vs. Hadoop 00:00:00
HDFS layer 00:00:00
HADOOP DEPLOYMENT
Introduction to Ubuntu Server 00:00:00
Hadoop installation 00:00:00
Single node and multi node configuration 00:00:00
Hadoop Configuration in cluster environment 00:00:00
Installing Hadoop 2.0 00:00:00
MAPREDUCE
Introdution to MapReduce 00:00:00
Hadoop MapReduce example 00:00:00
Hadoop MapReduce Characteristics 00:00:00
Setting up your MapReduce Environment 00:00:00
Building a MapReduce Program 00:00:00
MapReduce Requirements and Features 00:00:00
MapReduce Java Programming in Eclipse 00:00:00
Checking Hadoop Environment for MapReduce 00:00:00
MapReduce 2.0 00:00:00
ADVANCED HDFS & MAPREDUCE
HDFS Benchmarking 00:00:00
Setting up HDFS Blocks 00:00:00
COMMERCIAL DISTRIBUTION OF HADOOP
Cloudera 00:00:00
Downloading Cloudera Quickstart VM 00:00:00
Starting the Cloudera VM 00:00:00
Exploring the Welcome Page 00:00:00
Understanding Hue 00:00:00
Understanding Cloudera Manager 00:00:00
Hortonworks Data Platform 00:00:00
MapR Data Platform 00:00:00
MapR Data Platform 00:00:00
Pivotal HD 00:00:00
IBM InfoSphere BigInsights 00:00:00
ZOOKEEPER SQOOP AND FLUME
Introduction to ZooKeeper 00:00:00
Features of ZooKeeper 00:00:00
Challenges faced in distributed applications 00:00:00
Coordination 00:00:00
ZooKeeper: Goals and Uses 00:00:00
ZooKeeper: Entities, Data Model, Services 00:00:00
Recipes of ZookeeperClient APIs 00:00:00
Introduction to Sqoop (Why, what, processing, under the hood) 00:00:00
Importing data into Hive 00:00:00
Hadoop Data Types 00:00:00
Input Formats in MapReduce 00:00:00
Output Formats in MapReduce 00:00:00
Distributed Cache 00:00:00
Joins in MapReduce Cartesian 00:00:00
PIG
Introduction to PIG 00:00:00
Components of Pig 00:00:00
Pig Data Model 00:00:00
Pig Modes 00:00:00
Pig Vs. SQL 00:00:00
Installing Pig Engine 00:00:00
Datasets for Pig Development 00:00:00
Pig Latin 00:00:00
Filtering and Transforming Data 00:00:00
Grouping and Sorting 00:00:00
Combining and Splitting 00:00:00
Pig Commmands 00:00:00
HIVE
Why another data warehousing system 00:00:00
What is HIVE 00:00:00
Characteristics of Hive 00:00:00
System Architecture and Components of Hive 00:00:00
Hive Data Models 00:00:00
Serialization/De-serialization 00:00:00
Hive file formats 00:00:00
Hive Query Language 00:00:00
HIVE: Installing, running, and programming 00:00:00
Hive Functions 00:00:00
Difference between Hive and PIG 00:00:00
HBASE
HBase introduction 00:00:00
Characteristics of HBase 00:00:00
HBase Architecture 00:00:00
Storage Model of HBase 00:00:00
When to use HBase 00:00:00
HBase Data Model 00:00:00
HBase Families 00:00:00
HBase Components 00:00:00
Row Distribution between region servers 00:00:00
Data Storage 00:00:00
Installation of HBase 00:00:00
Importing data into HBase 00:00:00
Exporting data from Hadoop using Sqoop 00:00:00
Sqoop Connectors 00:00:00
Introduction to Flume 00:00:00
Flume Use Cases 00:00:00
Configuring and Running Flume Agents 00:00:00
ECOSYSTEM AND ITS COMPONENTS
Hadoop Ecosystem 00:00:00
Components Overview 00:00:00
Overview of Apache Oozie 00:00:00
Overview of Mahout 00:00:00
Overview of Apache Cassandra 00:00:00
Apache Spark 00:00:00
HADOOP ADMINISTRATION AND TROUBLESHOOTING
Commands Used in Hadoop Programming 00:00:00
Different configurations of Hadoop cluster 00:00:00
Port Numbers for Individual Hadoop Services 00:00:00
Performance monitoring 00:00:00
Performance tuning 00:00:00
Troubleshooting and Log observation 00:00:00
Overview of Apache Ambari 00:00:00
Hadoop Security Using Kerberos 00:00:00

Course Reviews

5

5
8 ratings
  • 5 stars0
  • 4 stars0
  • 3 stars0
  • 2 stars0
  • 1 stars0

No Reviews found for this course.

TAKE THIS COURSE
  • $600
  • 180 Days
389 STUDENTS ENROLLED

    Get a Free Consultation

    Related Courses

    Copyright © 2019 Certybox All Rights Reserved