Preloader

Hadoop Overview


Hadoop Training in Delhi

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs.


  • Module: Introduction

  •  What is Hadoop?
  •  History of Hadoop
  •  Building Blocks – Hadoop Eco-System
  •  What Hadoop is Good for and why it is Good
  • Module: Mapreduce

  •  Map/Reduce Overview and Architecture
  •  Installation
  •  Developing Map/Red Jobs
  •  Input and Output Formats
  •  Job Configuration
  •  Job Submission
  •  Practicing Map Reduce Programs
  • Module: Hadoop Streaming

  • Module: Distributing Debug Scripts

  • Module: Pig

  •  Pig Overview
  •  Installation
  •  Pig Latin
  •  Pig with HDFS
  • Module: HBase

  •  HBase Overview and Architecture
  •  HBase Installation
  •  HBase Shell
  •  CRUD Operations
  •  Scanning and Batching
  •  Filters
  •  HBase Key Design
  • Module: Sqoop

  •  Sqoop Overview
  •  Installation
  •  Imports and Exports
  • Module: Integrations

  •  Distributed Installation
  •  Best Practices
  • Module: HDFS

  •  Configuring HDFS
  •  Interacting With HDFS
  •  HDFS Permissions and Security
  •  Additional HDFS Tasks
  •  HDFS Overview and Architecture
  •  HDFS Installation
  •  Hadoop File System Shell
  •  File System Java API
  • Module: Getting Started With Eclipse IDE

  •  Configuring Hadoop API on Eclipse IDE
  •  Connecting Eclipse IDE to HDFS
  • Module: Advanced Mapreduce Features

  •  Custom Data Types
  •  Input Formats
  •  Output Formats
  •  Partitioning Data
  •  Reporting Custom Metrics
  •  Distributing Auxiliary Job Data
  • Module: Using Yahoo Web Services

  • Module: Hive

  •  Hive Overview
  •  Installation
  •  Hive QL
  •  Hive Unstructured Data Analyzation
  •  Hive Semistructured Data Analyzation
  • Module: ZooKeeper

  •  Zoo Keeper Overview
  •  Installation
  •  Server Maintenance
  • Module: Configuration

  •  Basic Setup
  •  Important Directories
  •  Selecting Machines
  •  Cluster Configurations
  •  Small Clusters: 2-10 Nodes
  •  Medium Clusters: 10-40 Nodes
  •  Large Clusters: Multiple Racks