he Big Data Hadoop Architect is the perfect training program for an early entrant to the Big Data world.With a number of required skills required to be a big data specialist and a steep learning curve, this program ensures you get hands on training on the most in-demand big data technologies. The learning is complemented with projects on a cloud based environment, Clouslabs, for real world experience. The program covers hadoop, spark, NoSQL databases and other hadoop ecosystem components and makes sure you are ready for your next Big Data assignment.
Module 1Introduction to Bigdata and Hadoop Ecosystem |
|---|
|
✅ HDFS and Hadoop Architecture ✅ MapReduce and Sqoop ✅ Basics of Impala and Hive ✅ Working with Impala and Hive ✅ Type of Data Formats ✅ Advance HIVE concept and Data File Partitioning ✅ Apache Flume and HBase ✅ Apache Pig ✅ Basics of Apache Spark, RDDs in Spark and Applications |
Module 2Real Time Processing |
|---|
|
✅Introduction to Spark ✅Introduction to Programming in Scala ✅Using RDD for Creating Applications in Spark ✅Running SQL queries Using SparkSQL ✅Spark Streaming ✅Spark ML Programming ✅Spark GraphX Programming
|
Module 3Store And Query Big Data |
|---|
|
✅NoSQL Database Introduction ✅MongoDB - A Database for the ✅Modern Web ✅CRUD Operations in MongoDB ✅Indexing and Aggregation ✅Replication and Sharding ✅Developing Java and Node JS ✅Application with MongoDB ✅Administration of MongoDB ✅Cluster Operations
|
Module 4MongoDB Developer and Administrator |
|---|
|
✅Amazon EC2 Overview ✅Amazon Machine Images (AMI) ✅Launch and connect to an EC2 Linux instance Demo ✅Launch and connect to an EC2 Windows instance Demo ✅Introduction to EC2 Instance Types ✅Overview of Amazon EBS and EFS storage with EC2 ✅EBS snapshot and how to play with EBS while upgrading the EC2 instance ✅EC2 Pricing ✅EC2 Best Practices and Costs
|
Module 5Apache Cassandra |
|---|
|
✅Overview Big Data and NoSQL ✅Databaases ✅Introduction to Cassandra ✅Cassandra Architecture ✅Cassandra Installation and ✅Configuration ✅Cassandra Data Model ✅Cassandra Interfaces ✅Cassandra Advanced Architecture ✅Apache Ecosystem around ✅Cassandra
|
Module 6Big Data and Hadoop Administrator |
|---|
|
✅Big Data and Hadoop - Introduction ✅HDFS Hadoop Distributed File System ✅Hadoop Cluster Setup and Working ✅Hadoop Configurations and Daemon Logs ✅Hadoop Cluster Maintenance and Administration ✅Hadoop Computational Frameworks ✅Scheduling: Managing Resources ✅Hadoop Cluster Planning ✅Hadoop Clients and Hue Interface ✅Data Ingestion in Hadoop Cluster ✅Hadoop Ecosystem ComponentsServices ✅Hadoop Security ✅Hadoop Cluster Monitoring
|
Module 7Apache Storm |
|---|
|
✅Introduction to Spark ✅Introduction to Programming in Scala ✅Using RDD for Creating Applications in Spark ✅Running SQL Queries Using Spark SQL ✅Spark Streaming ✅Spark Structured Streaming ✅Spark ML Programming ✅Graph Processing using GraphX and GraphFrames
|
Module 8Apache Kafka |
|---|
|
✅Course introduction ✅Big Data Overview ✅Introduction to Zookeeper ✅Introduction to Kafka ✅Installation and Configuration ✅Kafka Interfaces
|
Module 9Apache Cassandra |
|---|
|
✅Introduction to big data and No-SQL Databases ✅Introduction to Cassandra ✅Architecture of Cassandra ✅Installation and Configuration of Cassandra ✅Cassandra Data Model ✅Cassandra Interfaces ✅Advanced Architecture and Cluster Management ✅Hadoop Ecosystem around Cassandra
|