Hadoop

  1. Introduction to BigData and Hadoop

  2. HDFS (Hadoop Distributed File System)

  • HDFS Concepts

  • HDFS Federation

  • HDFS High Availability

  1. Working with HDFS

  • The command Line Interface

  • The Java Interface

  • Data Flow

  • Data Storage using Flume and Sqoop

  1. MapReduce

  • Introduction to MapReduce

  • Running a MapReduce application on HDFS

  • Anatomy of a MapReduce Job run

  • MapReduce features

  1. Introduction to NoSQL Database

  • HBase

    • Introduction to HBase

    • CRUD and Table Administration

    • Advanced features like Filters, Counters and CoProcessors

    1. HIVE

    • Introduction to Hive

    • Hive Architecture

    • Hive Work Flow

    • Hive Data Model

    • Hive Query Language and its Operations

    • Hive User Defined Functions(UDF)

    1. PIG

    • Introduction to Pig

    • Pig Latin

    • Pig Data Model

    • Pig User Defined Functions(UDF)

    • Data Flow

    • Pig Latin Commands

    1. Hadoop Eco System

    2. One End to End Project