IBM
IBM Data Engineering Professional Certificate
IBM

IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

IBM Skills Network Team
Muhammad Yahya
Abhishek Gagneja

Instructors: IBM Skills Network Team

Sponsored by University of Texas at Austin

106,242 already enrolled

Earn a career credential that demonstrates your expertise
4.6

(5,278 reviews)

Beginner level

Recommended experience

6 months
at 10 hours a week
Flexible schedule
Learn at your own pace
Earn a career credential that demonstrates your expertise
4.6

(5,278 reviews)

Beginner level

Recommended experience

6 months
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2 

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

Placeholder

Advance your career with in-demand skills

  • Receive professional-level training from IBM
  • Demonstrate your technical proficiency
  • Earn an employer-recognized certificate from IBM
Placeholder
$132,000+
median U.S. salary for Data Engineering
¹
59,000+
U.S. job openings in Data Engineering
¹

Get exclusive access to career resources upon completion

  • Soft skills training

    Get free access to IBM’s People and Soft Skills Specialization

  • Resume review

    Improve your resume and LinkedIn with personalized feedback

  • Interview prep

    Practice your skills with interactive tools and mock interviews

  • Career support

    Plan your career move with Coursera’s job search guide

Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Professional Certificate - 16 course series

Introduction to Data Engineering

Course 113 hours4.7 (2,897 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Category: Big Data
Category: Data Management
Category: Data Architecture
Category: Data Engineering
Category: Data Processing
Category: Data Infrastructure
Category: Data Integration
Category: Database Management Systems
Category: Databases
Category: Data Warehousing
Category: Extract, Transform, Load
Category: Data Pipelines
Category: Data Mapping
Category: Data Storage
Category: Data Store
Category: Data Mart
Category: Data Lakes
Category: IBM DB2
Category: Information Management
Category: Data Storage Technologies

Python for Data Science, AI & Development

Course 225 hours4.6 (39,287 ratings)

What you'll learn

  • Learn Python - the most popular programming language and for Data Science and Software Development.

  • Apply Python programming logic Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas & Numpy, and developing code using Jupyter Notebooks.

  • Access and web scrape data using APIs and Python libraries like Beautiful Soup.

Skills you'll gain

Category: Python Programming
Category: Computer Programming
Category: Computer Science
Category: NumPy
Category: Data Processing
Category: Data Science
Category: Data Analysis
Category: Data Manipulation
Category: Web Scraping
Category: Pandas (Python Package)
Category: Jupyter
Category: Application Programming Interface (API)
Category: Algorithms
Category: Data Structures
Category: Software Development
Category: Extract, Transform, Load
Category: Object Oriented Programming (OOP)
Category: Object Oriented Design
Category: Data Engineering
Category: Information Management

Python Project for Data Engineering

Course 39 hours4.6 (711 ratings)

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement webscraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

Category: Data Manipulation
Category: Data Analysis
Category: Data Science
Category: Python Programming
Category: Data Architecture
Category: Information Management
Category: Database Management
Category: Unit Testing
Category: Web Scraping
Category: Data Integration
Category: Software Engineering
Category: Big Data
Category: Data Access
Category: Data Processing
Category: Data Engineering
Category: Data Management
Category: Databases
Category: Database Architecture and Administration
Category: Database Administration
Category: Extract, Transform, Load

Introduction to Relational Databases (RDBMS)

Course 415 hours4.6 (611 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM DB2

Skills you'll gain

Category: Database Management
Category: Database Management Systems
Category: Database Systems
Category: Relational Databases
Category: Database Architecture and Administration
Category: Databases
Category: Database Theory
Category: Database Design
Category: Data Management
Category: Data Modeling
Category: Database Development
Category: Database Software
Category: PostgreSQL
Category: MySQL
Category: SQL
Category: Data Architecture
Category: Information Management
Category: Information Technology Architecture
Category: IBM DB2
Category: Query Languages

Databases and SQL for Data Science with Python

Course 520 hours4.7 (20,871 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Skills you'll gain

Category: SQL
Category: Database Development
Category: Database Management
Category: Database Systems
Category: Query Languages
Category: Relational Databases
Category: Database Management Systems
Category: Stored Procedure
Category: Data Storage
Category: Database Design
Category: Database Architecture and Administration
Category: Databases
Category: Data Access
Category: Data Management
Category: Database Administration
Category: Data Modeling
Category: Database Theory

Hands-on Introduction to Linux Commands and Shell Scripting

Course 614 hours4.7 (1,440 ratings)

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

Category: Unix Shell
Category: Linux
Category: Linux Administration
Category: Unix
Category: Systems Administration
Category: Command-Line Interface
Category: Linux Commands
Category: Scripting Languages
Category: Operating Systems
Category: Bash (Scripting Language)
Category: Scripting
Category: Shell Script
Category: Computing Platforms
Category: IT Infrastructure
Category: Information Technology
Category: Computer Science
Category: IT Management

Relational Database Administration (DBA)

Course 720 hours4.4 (220 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Category: Database Systems
Category: Database Management
Category: Database Architecture and Administration
Category: Database Management Systems
Category: Relational Databases
Category: Databases
Category: Database Administration
Category: Database Theory
Category: Data Management
Category: Database Development
Category: Database Software
Category: IBM DB2
Category: SQL
Category: PostgreSQL
Category: MySQL
Category: Systems Administration
Category: Performance Tuning
Category: Information Management
Category: Data Storage
Category: Query Languages

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 817 hours4.5 (371 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement ETL workflow through bash and Python functions.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Category: Data Pipelines
Category: Data Engineering
Category: Data Integration
Category: Data Processing
Category: Extract, Transform, Load
Category: Data Management
Category: Real Time Data
Category: Information Management
Category: Dataflow
Category: Big Data
Category: Apache Airflow
Category: Apache Kafka
Category: Data Architecture
Category: Data Transformation
Category: Scripting
Category: Data Storage
Category: Data Wrangling
Category: Data Mapping
Category: Information Systems
Category: Databases

Data Warehouse Fundamentals

Course 915 hours4.4 (200 ratings)

What you'll learn

  • Job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • How to design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Category: Data Architecture
Category: Data Mart
Category: Data Warehousing
Category: Database Systems
Category: Database Software
Category: IBM DB2
Category: Data Modeling
Category: Star Schema
Category: Snowflake Schema
Category: Database Management Systems
Category: Databases
Category: Database Management
Category: Data Management
Category: Extract, Transform, Load
Category: Database Architecture and Administration
Category: Data Engineering
Category: Data Integration
Category: Data Lakes
Category: Data Infrastructure
Category: Information Management

BI Dashboards with IBM Cognos Analytics and Google Looker

Course 1011 hours4.7 (19 ratings)

What you'll learn

  • Explore the purpose of analytics and Business Intelligence (BI) tools

  • Discover the capabilities of IBM Cognos Analytics and Google Looker Studio

  • Showcase your proficiency in analyzing DB2 data with IBM Cognos Analytics

  • Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio

Skills you'll gain

Category: Data Analysis Software
Category: Business Intelligence
Category: Business Intelligence Software
Category: Looker (Software)
Category: Analytics
Category: IBM Cognos Analytics
Category: Data Presentation
Category: Data Analysis
Category: Business Analytics
Category: Dashboard
Category: Data Storytelling
Category: Interactive Data Visualization
Category: Statistical Analysis
Category: Data Visualization
Category: Data Science

Introduction to NoSQL Databases

Course 1118 hours4.6 (324 ratings)

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

Category: NoSQL
Category: Database Management Systems
Category: Database Architecture and Administration
Category: Data Infrastructure
Category: Data Store
Category: Operational Databases
Category: Database Systems
Category: MongoDB
Category: Apache Cassandra
Category: Database Theory
Category: Data Architecture
Category: Relational Databases
Category: Cloud Services
Category: IBM Cloud
Category: Cloud Applications
Category: Databases
Category: Database Management
Category: Data Management
Category: Data Storage
Category: Cloud Infrastructure

Introduction to Big Data with Spark and Hadoop

Course 1219 hours4.4 (403 ratings)

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Category: Systems Design
Category: Computing Platforms
Category: Application Performance Management
Category: Data Engineering
Category: Software Systems
Category: Extract, Transform, Load
Category: Data Processing
Category: Information Management
Category: Data Infrastructure
Category: Apache Spark
Category: Apache Hadoop
Category: IBM Cloud
Category: Data Warehousing
Category: Information Technology Operations
Category: Data Management
Category: Big Data
Category: Distributed Computing
Category: Scalability
Category: Apache Hive
Category: Data Architecture

Machine Learning with Apache Spark

Course 1315 hours4.5 (88 ratings)

What you'll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.

  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.

  • Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.

  • Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Skills you'll gain

Category: Apache Spark
Category: Analytics
Category: Artificial Intelligence
Category: Machine Learning Methods
Category: Machine Learning
Category: Artificial Intelligence and Machine Learning (AI/ML)
Category: Applied Machine Learning
Category: Statistical Machine Learning
Category: Machine Learning Software
Category: Feature Engineering
Category: Machine Learning Algorithms
Category: MLOps (Machine Learning Operations)
Category: Supervised Learning
Category: Statistical Modeling
Category: Data Processing
Category: Databases
Category: Big Data
Category: Generative AI
Category: Data Engineering
Category: Data Architecture

Data Engineering Capstone Project

Course 1416 hours4.7 (112 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Category: Database Systems
Category: Database Architecture and Administration
Category: Data Engineering
Category: Data Warehousing
Category: Data Pipelines
Category: Data Integration
Category: Data Management
Category: IBM DB2
Category: Dashboard
Category: Databases
Category: Operational Databases
Category: Analytics
Category: IBM Cognos Analytics
Category: Data Store
Category: MongoDB
Category: Data Infrastructure
Category: Extract, Transform, Load
Category: Big Data
Category: Data Architecture
Category: Information Management

Generative AI: Elevate your Data Engineering Career

Course 1512 hours4.9 (19 ratings)

What you'll learn

  • Leverage various generative AI tools and techniques in data engineering processes across industries

  • Implement various data engineering processes such as data generation, augmentation, and anonymization using generative AI tools

  • Practice generative AI skills in hands-on labs and projects for data warehouse schema design and infrastructure setup

  • Evaluate real-world case studies showcasing the successful application of Generative AI for ETL and data repositories

Skills you'll gain

Category: Generative AI
Category: Artificial Intelligence
Category: Database Architecture and Administration
Category: Data Engineering
Category: Data Architecture
Category: Data Management
Category: Information Management
Category: Data Governance
Category: Data Pipelines
Category: Data Processing
Category: Data Integration
Category: Data Warehousing
Category: Database Management Systems
Category: Data Ethics
Category: Extract, Transform, Load
Category: Data Modeling
Category: Database Development
Category: Databases
Category: Database Design
Category: Database Management

Data Engineering Career Guide and Interview Preparation

Course 1611 hours4.7 (62 ratings)

What you'll learn

  • Describe the role of a data engineer and some career path options as well as the prospective opportunities in the field.

  • Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.

  • Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.

  • Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.

Skills you'll gain

Category: Interpersonal Communications
Category: Interviewing Skills
Category: Recruitment
Category: Human Resources
Category: Human Resources Management and Planning
Category: Professional Networking
Category: Communication
Category: Communication Strategies

Instructors

IBM Skills Network Team
IBM
58 Courses1,056,739 learners
Muhammad Yahya
IBM
4 Courses69,383 learners
Abhishek Gagneja
IBM
5 Courses161,678 learners

Offered by

IBM

Why people choose Coursera for their career

Placeholder

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

¹Lightcast™ Job Postings Report, United States, 7/1/22-6/30/23. ²Based on program graduate survey responses, United States 2021.