- Big Data
- Performance Tuning
- Distributed Computing
- Data Processing
- Software Architecture
- Scalability
- Apache Hadoop
YARN MapReduce Architecture and Advanced Programming
Completed by Hala Anas Kahwajy
February 26, 2025
17 hours (approximately)
Hala Anas Kahwajy's account is verified. Coursera certifies their successful completion of YARN MapReduce Architecture and Advanced Programming.
What you will learn
Learn the fundamentals of YARN and MapReduce architectures, including how they work together to process large-scale data efficiently.
Understand and implement Mapper and Reducer parallelism in MapReduce jobs to improve data processing efficiency and scalability.
Apply optimization techniques such as combiners, partitioners, and compression to enhance the performance and I/O operations of MapReduce jobs.
Explore advanced concepts like multithreading, speculative execution, input/output formats, and how to avoid common MapReduce anti-patterns.
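As a rough illustration of the combiner and partitioner ideas named in the objectives above, here is a minimal pure-Python sketch of a word-count job. It simulates the data flow only and does not use the Hadoop API; every name in it (`mapper`, `combiner`, `partitioner`, `reducer`, `run_job`) is hypothetical and chosen for this example, and the CRC32-based partitioning is an assumption standing in for a hash partitioner.

```python
import zlib
from collections import defaultdict


def mapper(line):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.split():
        yield word.lower(), 1


def combiner(pairs):
    """Pre-aggregate one mapper's output locally, before the shuffle."""
    local = defaultdict(int)
    for key, value in pairs:
        local[key] += value
    return list(local.items())


def partitioner(key, num_reducers):
    """Pick a reducer for a key (assumed scheme: CRC32 modulo reducer count)."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reducer(key, values):
    """Sum all counts that arrived for one key."""
    return key, sum(values)


def run_job(splits, num_reducers=2, use_combiner=True):
    """Simulate map -> (combine) -> partition/shuffle -> reduce.

    Returns the final counts and the number of pairs sent through the shuffle.
    """
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    shuffled = 0
    for split in splits:  # each split stands in for one map task
        pairs = [pair for line in split for pair in mapper(line)]
        if use_combiner:
            pairs = combiner(pairs)
        shuffled += len(pairs)
        for key, value in pairs:
            partitions[partitioner(key, num_reducers)][key].append(value)
    counts = {}
    for partition in partitions:  # each partition stands in for one reduce task
        for key, values in partition.items():
            word, total = reducer(key, values)
            counts[word] = total
    return counts, shuffled


if __name__ == "__main__":
    splits = [["a b a", "a c"], ["b b a"]]
    print(run_job(splits, use_combiner=False))
    print(run_job(splits, use_combiner=True))
```

In this sketch both runs produce the same counts, but the run with the combiner sends fewer pairs through the shuffle, which is the kind of I/O saving the combiner objective refers to. Compression and the other topics listed are not modeled here.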
Skills you will gain

