IBM
IBM Data Engineering Professional Certificate
IBM

IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Gain the in-demand skills and hands-on experience to get job-ready in less than 5 months.

IBM Skills Network Team
Muhammad Yahya
Abhishek Gagneja

Instructors: IBM Skills Network Team

Sponsored by Cisco Systems India Pvt Ltd.

1,743 already enrolled

Earn a career credential that demonstrates your expertise
4.6

(4,374 reviews)

Beginner level

Recommended experience

Flexible schedule
6 months, 10 hours a week
Learn at your own pace
Earn degree credit

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2 

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Details to know

Earn a career certificate

Add to your LinkedIn profile

Taught in English
Recently updated!

June 2024

See how employees at top companies are mastering in-demand skills

Placeholder

Advance your career with in-demand skills

  • Receive professional-level training from IBM
  • Demonstrate your technical proficiency
  • Earn an employer-recognized certificate from IBM

¹Lightcast™ Job Postings Report, United States, 7/1/22-6/30/23. ²Based on program graduate survey responses, United States 2021.

Placeholder
$132,000+
median U.S. salary for Data Engineering
¹
59,000+
U.S. job openings in Data Engineering
¹
Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Professional Certificate - 16 course series

Introduction to Data Engineering

Course 112 hours4.7 (2,483 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Category: Data Management
Category: Databases
Category: Network Security
Category: Big Data
Category: Leadership and Management
Category: SQL

Python for Data Science, AI & Development

Course 225 hours4.6 (36,095 ratings)

What you'll learn

  • Learn Python - the most popular programming language and for Data Science and Software Development.

  • Apply Python programming logic Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas & Numpy, and developing code using Jupyter Notebooks.

  • Access and web scrape data using APIs and Python libraries like Beautiful Soup.

Python Project for Data Engineering

Course 39 hours4.6 (611 ratings)

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement webscraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

Category: Computer Programming
Category: Data Management
Category: Python Programming

Introduction to Relational Databases (RDBMS)

Course 415 hours4.6 (540 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM DB2

Databases and SQL for Data Science with Python

Course 520 hours4.6 (19,692 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Hands-on Introduction to Linux Commands and Shell Scripting

Course 614 hours4.7 (1,183 ratings)

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Relational Database Administration (DBA)

Course 721 hours4.4 (184 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Category: Databases

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 817 hours4.5 (317 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement ETL workflow through bash and Python functions.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Category: Data Management
Category: Big Data
Category: Distributed Computing Architecture
Category: Computer Programming
Category: Data Analysis
Category: Problem Solving

Data Warehouse Fundamentals

Course 915 hours4.4 (160 ratings)

What you'll learn

  • Job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • How to design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Category: Data Management

What you'll learn

  • Explore the purpose of analytics and Business Intelligence (BI) tools

  • Discover the capabilities of IBM Cognos Analytics and Google Looker Studio

  • Showcase your proficiency in analyzing DB2 data with IBM Cognos Analytics

  • Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio

Introduction to NoSQL Databases

Course 1118 hours4.6 (265 ratings)

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Introduction to Big Data with Spark and Hadoop

Course 1218 hours4.4 (349 ratings)

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Category: Big Data
Category: Distributed Computing Architecture
Category: Data Management
Category: Python Programming

Machine Learning with Apache Spark

Course 1315 hours4.6 (64 ratings)

What you'll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.

  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.

  • Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.

  • Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Data Engineering Capstone Project

Course 1416 hours4.7 (101 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Category: Data Management
Category: Databases
Category: SQL
Category: Data Visualization Software
Category: Data Visualization

What you'll learn

  • Leverage various generative AI tools and techniques in data engineering processes across industries

  • Implement various data engineering processes such as data generation, augmentation, and anonymization using generative AI tools

  • Practice generative AI skills in hands-on labs and projects for data warehouse schema design and infrastructure setup

  • Evaluate real-world case studies showcasing the successful application of Generative AI for ETL and data repositories

Data Engineering Career Guide and Interview Preparation

Course 1611 hours4.5 (15 ratings)

What you'll learn

  • Describe the role of a data engineer and some career path options as well as the prospective opportunities in the field.

  • Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.

  • Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.

  • Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.

Instructors

IBM Skills Network Team
IBM
58 Courses814,243 learners
Muhammad Yahya
IBM
4 Courses54,601 learners
Abhishek Gagneja
IBM
5 Courses116,602 learners

Offered by

IBM

Get a head start on your degree

When you complete this Professional Certificate, you can earn college credit if you are admitted and enroll in one of the following online degree programs.¹

Why people choose Coursera for their career

Placeholder

Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy