IBM Data Engineering Professional Certificate
Prepare for a career as a Data Engineer. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.
Instructors: IBM Skills Network Team
109,612 already enrolled
(5,412 reviews)
Recommended experience
Beginner level
Basic computer skills and a grounding in IT systems. Comfort working in either Linux, Windows, or MacOS. No prior programming or data skills needed.
Master the most up-to-date practical skills and knowledge data engineers use in their daily roles
Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2
Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming
Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards
Add to your LinkedIn profile
Prepare for a career in the high-growth field of data engineering. In this program, you’ll learn in-demand skills like Python, SQL, and Databases to get job-ready in less than 5 months.
Data engineering is the practice of building systems that gather raw data and process and organize it into usable information. Data engineers provide the foundational information that data scientists and business intelligence analysts use to make decisions.
This program will teach you the foundational data engineering skills employers are seeking for entry level data engineering roles, including Python, one of the most widely used programming languages. You’ll also master SQL, RDBMS, ETL, Data Warehousing, NoSQL, Big Data, and Spark with hands-on labs and projects.
You’ll learn to use Python programming language and Linux/UNIX shell scripts to extract, transform and load (ETL) data. You’ll work with Relational Databases (RDBMS) and query data using SQL statements and use NoSQL databases as well as unstructured data. You'll also learn how generative AI tools and techniques are used in data engineering.
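The extract, transform, and load pattern described above can be sketched in a few lines of plain Python. This is a minimal illustration, not program material; the CSV sample, table schema, and 5% salary adjustment are all made up for the example.

```python
# Minimal ETL sketch: extract CSV text, transform the rows, load into a
# relational table. Data and schema are hypothetical.
import csv
import io
import sqlite3

RAW_CSV = """id,name,salary
1,Ada,90000
2,Grace,105000
3,Linus,87000
"""

def extract(text):
    """Extract: read raw CSV rows into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: cast types and derive an adjusted-salary field."""
    out = []
    for r in rows:
        salary = int(r["salary"])
        out.append((int(r["id"]), r["name"], salary, salary * 1.05))
    return out

def load(rows, conn):
    """Load: write the cleaned rows into a relational table."""
    conn.execute("CREATE TABLE staff (id INTEGER, name TEXT, salary INTEGER, adjusted REAL)")
    conn.executemany("INSERT INTO staff VALUES (?, ?, ?, ?)", rows)

conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
total = conn.execute("SELECT COUNT(*) FROM staff").fetchone()[0]
print(total)  # 3 rows loaded
```

In the program itself, the same three stages are built with production tools (shell scripts, Airflow, Kafka) rather than a single script, but the separation of concerns is the same.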
Upon completion, you’ll have a portfolio of projects and a Professional Certificate from IBM to showcase your expertise. You’ll also earn an IBM Digital badge and will gain access to career resources to help you in your job search, including mock interviews and resume support.
This program is ACE® recommended. When you complete it, you can earn up to 12 college credits.
Applied Learning Project
Throughout this Professional Certificate, you will complete hands-on labs and projects to help you gain practical experience with Python, SQL, relational databases, NoSQL databases, Apache Spark, building data pipelines, managing databases, and working with data warehouses.
Projects:
Design a relational database to help a coffee franchise improve operations.
Use SQL to query census, crime, and school demographic data sets.
Write a Bash shell script on Linux that backs up changed files.
Set up, test, and optimize a data platform that contains MySQL, PostgreSQL, and IBM Db2 databases.
Analyze road traffic data to perform ETL and create a pipeline using Airflow and Kafka.
Design and implement a data warehouse for a solid-waste management company.
Move, query, and analyze data in MongoDB, Cassandra, and Cloudant NoSQL databases.
Train a machine learning model by creating an Apache Spark application.
Design, deploy, and manage an end-to-end data engineering platform.
List basic skills required for an entry-level data engineering role.
Discuss various stages and concepts in the data engineering lifecycle.
Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.
Summarize concepts in data security, governance, and compliance.
Learn Python, one of the most popular programming languages for Data Science and Software Development.
Apply Python programming logic using Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.
Demonstrate proficiency in using Python libraries such as Pandas & Numpy, and developing code using Jupyter Notebooks.
Access and web scrape data using APIs and Python libraries like Beautiful Soup.
Demonstrate your skills in Python for working with and manipulating data
Implement web scraping and use APIs to extract data with Python
Play the role of a Data Engineer working on a real project to extract, transform, and load data
Use Jupyter notebooks and IDEs to complete your project
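The program uses Beautiful Soup for web scraping; the core idea can be sketched with nothing but the standard library's html.parser. The HTML below is a made-up sample page, not one scraped in the course.

```python
# Collect every link target from an HTML document using only the stdlib.
# Beautiful Soup provides the same capability with a richer API.
from html.parser import HTMLParser

SAMPLE = '<html><body><a href="/a">First</a><a href="/b">Second</a></body></html>'

class LinkCollector(HTMLParser):
    """Record the href attribute of every <a> tag encountered."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.links.extend(v for k, v in attrs if k == "href")

parser = LinkCollector()
parser.feed(SAMPLE)
print(parser.links)  # ['/a', '/b']
```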
Describe data, databases, relational databases, and cloud databases.
Describe information and data models, relational databases, and relational model concepts (including schemas and tables).
Explain an Entity Relationship Diagram and design a relational database for a specific use case.
Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM Db2.
Analyze data within a database using SQL and Python.
Create a relational database and work with multiple tables using DDL commands.
Construct basic to intermediate level SQL queries using DML commands.
Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.
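The DDL, DML, join, and view skills listed above can be tried out immediately with Python's built-in sqlite3 module. The courses use MySQL, PostgreSQL, and IBM Db2, but the SQL in this sketch is standard; the department/employee schema is invented for the example.

```python
# DDL, DML, a join, and a view in one small script using sqlite3.
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# DDL: create two related tables
cur.execute("CREATE TABLE dept (id INTEGER PRIMARY KEY, name TEXT)")
cur.execute("CREATE TABLE emp (id INTEGER PRIMARY KEY, name TEXT, dept_id INTEGER REFERENCES dept(id))")

# DML: populate them
cur.executemany("INSERT INTO dept VALUES (?, ?)", [(1, "Data"), (2, "Infra")])
cur.executemany("INSERT INTO emp VALUES (?, ?, ?)",
                [(1, "Ada", 1), (2, "Grace", 1), (3, "Linus", 2)])

# A view built on a join: head count per department
cur.execute("""
    CREATE VIEW dept_size AS
    SELECT d.name AS dept, COUNT(e.id) AS n
    FROM dept d JOIN emp e ON e.dept_id = d.id
    GROUP BY d.name
""")
rows = cur.execute("SELECT dept, n FROM dept_size ORDER BY dept").fetchall()
print(rows)  # [('Data', 2), ('Infra', 1)]
```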
Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.
Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.
Develop shell scripts using Linux commands, environment variables, pipes, and filters.
Schedule cron jobs in Linux with crontab and explain the cron syntax.
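The cron syntax mentioned above has five scheduling fields: minute, hour, day of month, month, and day of week. A small parser makes the field layout concrete; this is an explanatory sketch, not a substitute for crontab itself.

```python
# Split a cron schedule expression into its five named fields.
# '*' in any field means "every value".
FIELDS = ["minute", "hour", "day_of_month", "month", "day_of_week"]

def parse_cron(expr):
    """Map a five-field cron expression to a dict of named fields."""
    parts = expr.split()
    if len(parts) != 5:
        raise ValueError("a cron schedule has exactly five fields")
    return dict(zip(FIELDS, parts))

# "0 2 * * 1" reads as 02:00 every Monday: a typical nightly-backup schedule
schedule = parse_cron("0 2 * * 1")
print(schedule["hour"], schedule["day_of_week"])  # 2 1
```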
Create, query, and configure databases and access and build system objects such as tables.
Perform basic database management including backing up and restoring databases as well as managing user roles and permissions.
Monitor and optimize important aspects of database performance.
Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts.
Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.
Explain batch vs concurrent modes of execution.
Implement ETL workflow through bash and Python functions.
Describe data pipeline components, processes, tools, and technologies.
Job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.
Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.
Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.
Design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.
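What a ROLLUP aggregation computes — subtotals at each level of a grouping hierarchy plus a grand total — can be shown in plain Python before writing the SQL. The region/product sales figures below are made up for illustration.

```python
# Mimic the result set of GROUP BY ROLLUP(region, product):
# per-(region, product) detail, per-region subtotals, and a grand total.
from collections import defaultdict

sales = [  # (region, product, amount)
    ("East", "A", 100), ("East", "B", 50),
    ("West", "A", 70),  ("West", "B", 30),
]

def rollup(rows):
    """Accumulate every level of the region -> product hierarchy."""
    detail = defaultdict(int)     # finest grain: (region, product)
    by_region = defaultdict(int)  # subtotal per region
    grand = 0                     # grand total across all rows
    for region, product, amount in rows:
        detail[(region, product)] += amount
        by_region[region] += amount
        grand += amount
    return detail, by_region, grand

detail, by_region, grand = rollup(sales)
print(by_region["East"], grand)  # 150 250
```

In a warehouse engine that supports it, `SELECT region, product, SUM(amount) FROM sales GROUP BY ROLLUP(region, product)` produces all three levels in a single query.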
Explore the purpose of analytics and Business Intelligence (BI) tools
Discover the capabilities of IBM Cognos Analytics and Google Looker Studio
Showcase your proficiency in analyzing DB2 data with IBM Cognos Analytics
Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio
Differentiate among the four main categories of NoSQL repositories.
Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.
Perform common MongoDB tasks, including create, read, update, and delete (CRUD) operations.
Execute keyspace, table, and CRUD operations in Cassandra.
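The CRUD semantics practiced in MongoDB and Cassandra can be previewed with a toy in-memory document store. The method names echo pymongo's insert_one/find_one/update_one/delete_one, but this class is purely illustrative and not part of any course material.

```python
# A toy document store illustrating create/read/update/delete semantics.
class DocStore:
    def __init__(self):
        self._docs = {}
        self._next_id = 1

    def insert_one(self, doc):               # Create
        doc_id = self._next_id
        self._next_id += 1
        self._docs[doc_id] = dict(doc, _id=doc_id)
        return doc_id

    def find_one(self, **query):             # Read
        for doc in self._docs.values():
            if all(doc.get(k) == v for k, v in query.items()):
                return doc
        return None

    def update_one(self, doc_id, **changes):  # Update
        self._docs[doc_id].update(changes)

    def delete_one(self, doc_id):            # Delete
        del self._docs[doc_id]

store = DocStore()
i = store.insert_one({"name": "sensor-1", "status": "ok"})
store.update_one(i, status="down")
print(store.find_one(name="sensor-1")["status"])  # down
```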
Explain the impact of big data, including use cases, tools, and processing methods.
Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.
Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.
Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.
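Spark's core RDD transformations (map, filter, reduce) have direct pure-Python analogues, which makes their semantics easy to see before moving to a real cluster. The numbers are arbitrary sample data; the PySpark line in the comment is an approximation of the equivalent cluster code.

```python
# Pure-Python analogue of an RDD pipeline. In PySpark this would be roughly:
#   sc.parallelize(data).map(lambda x: x * x) \
#     .filter(lambda x: x % 2 == 0).reduce(lambda a, b: a + b)
from functools import reduce

data = [1, 2, 3, 4, 5, 6]

squares = map(lambda x: x * x, data)          # transformation: map
evens = filter(lambda x: x % 2 == 0, squares) # transformation: filter
total = reduce(lambda a, b: a + b, evens)     # action: reduce
print(total)  # 4 + 16 + 36 = 56
```

The key difference on a cluster is that Spark's transformations are lazy and distributed across partitions; only the reduce action triggers computation.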
Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.
Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.
Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.
Demonstrate connecting to Spark clusters, building ML pipelines, performing feature extraction and transformation, and persisting models.
Demonstrate proficiency in skills required for an entry-level data engineering role.
Design and implement various concepts and components in the data engineering lifecycle such as data repositories.
Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.
Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.
Leverage various generative AI tools and techniques in data engineering processes across industries
Implement various data engineering processes such as data generation, augmentation, and anonymization using generative AI tools
Practice generative AI skills in hands-on labs and projects for data warehouse schema design and infrastructure setup
Evaluate real-world case studies showcasing the successful application of Generative AI for ETL and data repositories
Describe the role of a data engineer and some career path options as well as the prospective opportunities in the field.
Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.
Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.
Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
When you complete this Professional Certificate, you may be able to have your learning recognized for credit if you are admitted and enroll in one of the following online degree programs.¹
University of Maryland Global Campus
Degree · 48 months
Heriot-Watt University
Degree · 18 months - 8 years
Illinois Tech
Degree · 12-15 months
¹Successful application and enrollment are required. Eligibility requirements apply. Each institution determines the number of credits recognized by completing this content that may count towards degree requirements, considering any existing credits you may have. Click on a specific course for more information.
At IBM, we know how rapidly tech evolves and recognize the crucial need for businesses and professionals to build job-ready, hands-on skills quickly. As a market-leading tech innovator, we’re committed to helping you thrive in this dynamic landscape. Through IBM Skills Network, our expertly designed training programs in AI, software development, cybersecurity, data science, business management, and more, provide the essential skills you need to secure your first job, advance your career, or drive business success. Whether you’re upskilling yourself or your team, our courses, Specializations, and Professional Certificates build the technical expertise that ensures you, and your organization, excel in a competitive world.
“
The IBM Data Engineering Professional Certificate opened my eyes to the wonderful world of data. It also gave me the basics to start doing some data projects on my own in order to remain competitive.
Learning from the U.S.
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Earn a degree from world-class universities - 100% online
Upskill your employees to excel in the digital economy
This is a self-paced Professional Certificate that you can complete on your own schedule in less than 5 months.
This Professional Certificate is open to anyone, regardless of job or academic background. The prerequisites are basic IT literacy, knowledge of IT infrastructure, and familiarity with Windows, Linux, or MacOS. No prior programming experience is necessary, though it is an asset, as is high school math.
Yes, it is highly recommended to take the courses in the order they are listed, as they progressively build on concepts taught in previous courses.
Data engineering is the practice of building systems that gather raw data, process and organize it into usable information, and manage that data over time. The work data engineers do provides the foundational information that data scientists and business analysts use to make recommendations and decisions.
You will develop practical skills using hands-on labs and projects throughout the program. By the end, you will have acquired the skills and knowledge to be job-ready for an entry-level career in Data Engineering.
Most people without formal training or degree in the data engineering field start as a business analyst or software engineer, or if they have job role specific skills, they can directly start in a junior level data engineering role. From there you can move into more specialized roles such as Database Administrator, Data Warehouse Engineer, Data Architect, or Big Data Engineer. Some choose to combine their data engineering expertise with data science or artificial intelligence (AI) to become a Data Science Engineer, or Machine Learning (ML) Engineer. Others progress their careers by taking on software engineering management roles or even achieve the executive role of Chief Data Officer.
Yes, learners can earn a recommendation of 9 college credits for completing this program.
To share proof of completion with schools, certificate graduates will receive an email prompting them to claim their Credly badge, which contains the ACE® credit recommendation. Once claimed, you will receive a competency-based transcript that signifies the credit recommendation, which can be shared directly with a school from the Credly platform. Please note that the decision to accept specific credit recommendations is up to each institution and is not guaranteed.
Please see Coursera’s ACE Recommendations FAQ.
This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.
If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.
Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate, or you can audit it to view the course materials for free. When you subscribe to a course that is part of a Certificate, you’re automatically subscribed to the full Certificate. Visit your learner dashboard to track your progress.
¹ Median salary and job opening data are sourced from Lightcast™ Job Postings Report. Data for job roles relevant to featured programs (2/1/2024 - 2/1/2025)