
Skills you'll gain: Apache Spark, Apache Hadoop, PySpark, Extract, Transform, Load, Apache Hive, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Machine Learning Algorithms, IBM Cloud, Data Pipelines, Model Evaluation, Kubernetes, Supervised Learning, Docker (Software), Scalability, Graph Theory, Jupyter, MongoDB
Beginner · Specialization · 3 - 6 Months

Skills you'll gain: Data Flow Diagrams (DFDs), Apache Airflow, Data Pipelines, Data Modeling, Data Integration, Data Architecture, Data Warehousing, Apache Spark, Extract, Transform, Load, Database Development, Data Processing, Data Transformation, Data Quality, Data Validation, Configuration Management, Enterprise Security
Beginner · Course · 1 - 3 Months

Skills you'll gain: Apache Hadoop, Apache Hive, Extract, Transform, Load, Data Pipelines, Data Import/Export, Data Migration, Data Integration, MySQL, SQL, Relational Databases, Data Processing, Data Validation
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Snowflake Schema, Star Schema, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Data Modeling, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management
Intermediate · Specialization · 1 - 3 Months

University of Pittsburgh
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Data Pipelines, Distributed Computing, Big Data, Apache Hive, Data Processing, Data Storage Technologies, Data Storage, Scikit Learn (Machine Learning Library), Predictive Modeling, Scalability, Data Management, Data Science, Data Transformation, Information Technology, Data Analysis, Python Programming
Build toward a degree
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hive, Apache Hadoop, Data Warehousing, Performance Tuning, Data Architecture, Databases, Query Languages, Extensible Markup Language (XML), Data Processing, Data Transformation, Data Manipulation
Mixed · Course · 1 - 3 Months

Coursera
Skills you'll gain: Apache Kafka, Real Time Data, Data Pipelines, Data Processing, Scalability, Performance Tuning
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hadoop, Apache Hive, Big Data, Database Design, Extensible Markup Language (XML), Databases, JSON, Data Processing, Data Warehousing, Distributed Computing, Data Analysis, Scalability, Case Studies, Analytics, Data Pipelines, Query Languages, Social Media, Data Cleansing, Data Integration, Social Media Content
Intermediate · Specialization · 3 - 6 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Science, Databases, SQL, Query Languages, Data Manipulation, Java, Data Structures, Distributed Computing, Scripting Languages, Performance Tuning
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Apache Kafka, Apache Hadoop, Apache Spark, Real Time Data, Scala Programming, Data Integration, Command-Line Interface, Apache Hive, Big Data, Applied Machine Learning, Data Processing, Apache, System Design and Implementation, Apache Cassandra, Data Pipelines, Java, Distributed Computing, IntelliJ IDEA, Application Deployment, Enterprise Application Management
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Big Data, SQL, Test Case, Apache Hadoop, Analytics, Data Analysis, Query Languages, Databases, Relational Databases, Data Manipulation, Apache, Metadata Management
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hadoop, Apache, Database Systems, NoSQL, Big Data, Database Management Systems, Databases, Shell Script, Data Storage, Data Storage Technologies, Database Design, Software Installation, Data Access, Data Modeling, System Configuration, Command-Line Interface, Scalability
Beginner · Course · 1 - 4 Weeks