Introduction to Big Data with Spark and Hadoop
Completed by ANSHUMAN SINGH YADAV
January 22, 2023
19 hours (approximately)
ANSHUMAN SINGH YADAV's account is verified. Coursera certifies their successful completion of Introduction to Big Data with Spark and Hadoop
What you will learn
Explain the impact of big data, including use cases, tools, and processing methods.
Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.
Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.
Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.
Skills you will gain
- Category: Data Transformation
- Category: Development Environment
- Category: Big Data
- Category: Apache Hadoop
- Category: Open Source Technology
- Category: Data Processing
- Category: Debugging
- Category: PySpark
- Category: Performance Tuning
- Category: Docker (Software)
- Category: Apache Spark
- Category: Kubernetes

