Hexagon Program / Hadoop
Duration: 2-3 Months
ApacheTM Hadoop® is a highly scalable storage platform designed to process very large data sets across hundreds to thousands of computing nodes that operate in parallel. It provides a cost-effective storage solution for large data volumes with no format requirements.
Course Coverage :
- 1. Introduction to Big Data
- 2. HDFS and MapReduce Architecture
- 3. Hadoop Configuration
- 4. Understanding Hadoop MapReduce Framework
- 5. Advance MapReduce – Part 1
- 6. Advance MapReduce – Part 2
- 7. Apache Pig
- 8. Apache Hive and HiveQL
- 9. Advance HiveQL
- 10. Apache Flume, Sqoop, Oozie
- 11. NoSQL Databases
- 12. Apache HBase
- 13. Apache Zookeeper
- 14. Hadoop 2.0, YARN, MRv2
- 15. Project