IBM
IBM Data Engineering Professional Certificate
IBM

IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

IBM Skills Network Team
Muhammad Yahya
Abhishek Gagneja

Instructors: IBM Skills Network Team

Sponsored by ESCA

89,476 already enrolled

Earn a career credential that demonstrates your expertise
4.6

(4,679 reviews)

Beginner level

Recommended experience

6 months
at 10 hours a week
Flexible schedule
Learn at your own pace
Earn a career credential that demonstrates your expertise
4.6

(4,679 reviews)

Beginner level

Recommended experience

6 months
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2 

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Details to know

Earn a career certificate

Add to your LinkedIn profile

Taught in English
Recently updated!

June 2024

See how employees at top companies are mastering in-demand skills

Placeholder

Advance your career with in-demand skills

  • Receive professional-level training from IBM
  • Demonstrate your technical proficiency
  • Earn an employer-recognized certificate from IBM
Placeholder
$132,000+
median U.S. salary for Data Engineering
¹
59,000+
U.S. job openings in Data Engineering
¹

Get exclusive access to career resources upon completion

  • Soft skills training

    Get free access to IBM’s People and Soft Skills Specialization

  • Resume review

    Improve your resume and LinkedIn with personalized feedback

  • Interview prep

    Practice your skills with interactive tools and mock interviews

  • Career support

    Plan your career move with Coursera’s job search guide

¹Lightcast™ Job Postings Report, United States, 7/1/22-6/30/23. ²Based on program graduate survey responses, United States 2021.

Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Professional Certificate - 16 course series

Introduction to Data Engineering

Course 112 hours4.7 (2,623 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Category: Data Management
Category: Databases
Category: Network Security
Category: Big Data
Category: Leadership and Management
Category: SQL

Python for Data Science, AI & Development

Course 225 hours4.6 (37,177 ratings)

What you'll learn

  • Learn Python - the most popular programming language and for Data Science and Software Development.

  • Apply Python programming logic Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas & Numpy, and developing code using Jupyter Notebooks.

  • Access and web scrape data using APIs and Python libraries like Beautiful Soup.

Python Project for Data Engineering

Course 39 hours4.6 (644 ratings)

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement webscraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

Category: Computer Programming
Category: Data Management
Category: Python Programming

Introduction to Relational Databases (RDBMS)

Course 415 hours4.6 (568 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM DB2

Databases and SQL for Data Science with Python

Course 520 hours4.7 (20,098 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Hands-on Introduction to Linux Commands and Shell Scripting

Course 614 hours4.6 (1,253 ratings)

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Relational Database Administration (DBA)

Course 721 hours4.4 (196 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Category: Databases

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 817 hours4.5 (335 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement ETL workflow through bash and Python functions.

  • Describe data pipeline components, processes, tools, and technologies.

Data Warehouse Fundamentals

Course 915 hours4.4 (173 ratings)

What you'll learn

  • Job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • How to design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

What you'll learn

  • Explore the purpose of analytics and Business Intelligence (BI) tools

  • Discover the capabilities of IBM Cognos Analytics and Google Looker Studio

  • Showcase your proficiency in analyzing DB2 data with IBM Cognos Analytics

  • Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio

Introduction to NoSQL Databases

Course 1118 hours4.6 (278 ratings)

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Introduction to Big Data with Spark and Hadoop

Course 1218 hours4.4 (358 ratings)

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Category: Big Data
Category: Distributed Computing Architecture
Category: Data Management
Category: Python Programming

Machine Learning with Apache Spark

Course 1315 hours4.6 (70 ratings)

What you'll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.

  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.

  • Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.

  • Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Data Engineering Capstone Project

Course 1416 hours4.7 (105 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Category: Data Management
Category: Databases
Category: SQL
Category: Data Visualization Software
Category: Data Visualization

Generative AI: Elevate your Data Engineering Career

Course 1512 hours4.9 (10 ratings)

What you'll learn

  • Leverage various generative AI tools and techniques in data engineering processes across industries

  • Implement various data engineering processes such as data generation, augmentation, and anonymization using generative AI tools

  • Practice generative AI skills in hands-on labs and projects for data warehouse schema design and infrastructure setup

  • Evaluate real-world case studies showcasing the successful application of Generative AI for ETL and data repositories

Data Engineering Career Guide and Interview Preparation

Course 1611 hours4.8 (30 ratings)

What you'll learn

  • Describe the role of a data engineer and some career path options as well as the prospective opportunities in the field.

  • Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.

  • Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.

  • Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.

Skills you'll gain

Category: Leadership and Management

Instructors

IBM Skills Network Team
IBM
57 Courses928,279 learners
Muhammad Yahya
IBM
4 Courses59,867 learners
Abhishek Gagneja
IBM
5 Courses133,228 learners

Offered by

IBM

Why people choose Coursera for their career

Placeholder

Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy