Module 1Introduction to Bigdata and Hadoop Ecosystem |
|---|
|
✅ HDFS and Hadoop Architecture ✅ MapReduce and Sqoop ✅ Basics of Impala and Hive ✅ Working with Impala and Hive ✅ Type of Data Formats ✅ Advance HIVE concept and Data File Partitioning ✅ Apache Flume and HBase ✅ Apache Pig ✅ Basics of Apache Spark, RDDs in Spark and Applications |
Module 2Real Time Processing |
|---|
|
✅ Introduction to Spark ✅ Introduction to Programming in Scala ✅ Using RDD for Creating Applications in Spark ✅ Running SQL queries Using SparkSQL ✅ Spark Streaming ✅ Spark ML Programming ✅ Spark GraphX Programming
|
Module 3Store And Query Big Data |
|---|
|
✅ NoSQL Database Introduction ✅ MongoDB - A Database for the ✅ Modern Web ✅ CRUD Operations in MongoDB ✅ Indexing and Aggregation ✅ Replication and Sharding ✅ Developing Java and Node JS ✅ Application with MongoDB ✅ Administration of MongoDB ✅ Cluster Operations |
Module 4MongoDB Developer and Administrator |
|---|
|
✅ Amazon EC2 Overview ✅ Amazon Machine Images (AMI) ✅ Launch and connect to an EC2 Linux instance Demo ✅ Launch and connect to an EC2 Windows instance Demo ✅ Introduction to EC2 Instance Types ✅ Overview of Amazon EBS and EFS storage with EC2 ✅ EBS snapshot and how to play with EBS while upgrading the EC2 instance ✅ EC2 Pricing ✅ EC2 Best Practices and Costs
|
Module 5Apache Cassandra |
|---|
|
✅ Overview Big Data and NoSQL ✅ Databaases ✅ Introduction to Cassandra ✅ Cassandra Architecture ✅ Cassandra Installation and ✅ Configuration ✅ Cassandra Data Model ✅ Cassandra Interfaces ✅ Cassandra Advanced Architecture ✅ Apache Ecosystem around ✅ Cassandra
|
Module 6Big Data and Hadoop Administrator |
|---|
|
✅ Big Data and Hadoop - Introduction ✅ HDFS Hadoop Distributed File System ✅ Hadoop Cluster Setup and Working ✅ Hadoop Configurations and Daemon Logs ✅ Hadoop Cluster Maintenance and Administration ✅ Hadoop Computational Frameworks ✅ Scheduling: Managing Resources ✅ Hadoop Cluster Planning ✅ Hadoop Clients and Hue Interface ✅ Data Ingestion in Hadoop Cluster ✅ Hadoop Ecosystem ComponentsServices ✅ Hadoop Security ✅ Hadoop Cluster Monitoring |
Module 7Apache Storm |
|---|
|
✅ Introduction to Spark ✅ Introduction to Programming in Scala ✅ Using RDD for Creating Applications in Spark ✅ Running SQL Queries Using Spark SQL ✅ Spark Streaming ✅ Spark Structured Streaming ✅ Spark ML Programming ✅ Graph Processing using GraphX and GraphFrames |
Module 8Apache Kafka |
|---|
|
✅Course introduction ✅ Big Data Overview ✅ Introduction to Zookeeper ✅ Introduction to Kafka ✅ Installation and Configuration ✅ Kafka Interfaces
|
Module 9Apache Cassandra |
|---|
|
✅ Introduction to big data and No-SQL Databases ✅ Introduction to Cassandra ✅ Architecture of Cassandra ✅ Installation and Configuration of Cassandra ✅ Cassandra Data Model ✅ Cassandra Interfaces ✅ Advanced Architecture and Cluster Management ✅ Hadoop Ecosystem around Cassandra |