Big data Hadoop Administration - Course Introduction

About The Course




trainingbees, online HadoopAdministration course gives a holistic approach of learning of Big DataAdministration using Hadoop. The topics which is being covered are the case for apache Hadoop, Introduction to Hadoop and its Architecture, MapReduce and HDFSand MapReduce Abstraction.




Course Curriculum


The curriculum for Hadoop Admin Training program is being prepared by the real time trainers considering the current industry standards



Unit1: The Case for Apache Hadoop

Why Hadoop?

Core Hadoop Components

Fundamental Concepts



Unit2: HDFS

HDFS Features

Writing and Reading Files

NameNode Memory Considerations

Overview of HDFS Security

Using the Namenode Web UI

Using the Hadoop File Shell



Unit3: Getting Data into HDFS
Ingesting Data from External Sources with Flume
Ingesting Data from Relational Databases with Sqoop
REST Interfaces
Best Practices for Importing Data

Unit4: YARN and MapReduce
What Is MapReduce?
Basic MapReduce Concepts
YARN Cluster Architecture
Resource Allocation
Failure Recovery
Using the YARN Web UI
MapReduce Version 1

Unit5: Planning Your Hadoop Cluster
General Planning Considerations
Choosing the Right Hardware
Network Considerations
Configuring Nodes
Planning for Cluster Management

Unit6: Hadoop Installation and Initial Configuration
Deployment Types
Installing Hadoop
Specifying the Hadoop Configuration
Performing Initial HDFS Configuration
Performing Initial YARN and MapReduce

Unit7: Configuration
Hadoop Logging

Unit8: Installing and Configuring Hive, Impala, and Pig
Hive
Impala
Pig

Unit9: Hadoop Clients
What is a Hadoop Client?
Installing and Configuring Hadoop Clients
Installing and Configuring Hue
Hue Authentication and Authorization

Unit10: Cloudera Manager
The Motivation for Cloudera Manager
Cloudera Manager Features
Express and Enterprise Versions
Cloudera Manager Topology
Installing Cloudera Manager
Installing Hadoop Using Cloudera Manager
Performing Basic Administration Tasks

Unit11: Using Cloudera Manager Advanced Cluster Configuration
Advanced Configuration Parameters
Configuring Hadoop Ports
Explicitly Including and Excluding Hosts
Configuring HDFS for Rack Awareness
Configuring HDFS High Availability

Unit12: Hadoop Security
Why Hadoop Security Is Important
Hadoop’s Security System Concepts
What Kerberos Is and How it Works
Securing a Hadoop Cluster with Kerberos

Unit13: Managing and Scheduling Jobs
Managing Running jobs

Enroll now at herehttp://bit.ly/2puhvNA

Comments

Popular posts from this blog

Digital marketing Course overview

Sap Ariba Course Overview

Salesforce CRM Course Structure