Duke University
Applied Python Data Engineering Specialization
Duke University

Applied Python Data Engineering Specialization

Elevate your coding skills with data engineering. Use big data for decision-making, analysis, AI and machine learning

Kennedy Behrman
Matt Harrison
Noah Gift

Instructors: Kennedy Behrman

Sponsored by Abu Dhabi National Oil Company

3,944 already enrolled

Get in-depth knowledge of a subject
4.0

(39 reviews)

Intermediate level

Recommended experience

5 months
at 10 hours a week
Flexible schedule
Learn at your own pace
Get in-depth knowledge of a subject
4.0

(39 reviews)

Intermediate level

Recommended experience

5 months
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Create scalable big data pipelines (Hadoop, Spark, Snowflake, Databricks) for efficient data handling.

  • Build machine learning workflows (PySpark, MLFlow) on Databricks for seamless model development and deployment.

  • Implement DataOps/DevOps to streamline data engineering processes.

  • Formulate and communicate data-driven insights and narratives through impactful visualizations with Python and data storytelling

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

Placeholder

Advance your subject-matter expertise

  • Learn in-demand skills from university and industry experts
  • Master a subject or tool with hands-on projects
  • Develop a deep understanding of key concepts
  • Earn a career certificate from Duke University
Placeholder
Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Specialization - 3 course series

Spark, Hadoop, and Snowflake for Data Engineering

Course 129 hours4.0 (43 ratings)

What you'll learn

  • Create scalable data pipelines (Hadoop, Spark, Snowflake, Databricks) for efficient data handling.

  • Optimize data engineering with clustering and scaling to boost performance and resource use.

  • Build ML solutions (PySpark, MLFlow) on Databricks for seamless model development and deployment.

  • Implement DataOps and DevOps practices for continuous integration and deployment (CI/CD) of data-driven applications, including automating processes.

Skills you'll gain

Category: PySpark
Category: Apache Spark
Category: Data Management
Category: Data Engineering
Category: Data Architecture
Category: Snowflake Schema
Category: MLOps (Machine Learning Operations)
Category: Big Data
Category: Data Pipelines
Category: Databricks
Category: Data Processing
Category: Data Integration
Category: Information Management
Category: Databases
Category: Database Management Systems
Category: Apache Hadoop
Category: Data Infrastructure
Category: Database Systems
Category: DevOps
Category: Extract, Transform, Load

What you'll learn

  • Master virtualization, containerization, and Docker, including Dockerfile creation and multi-container orchestration with Compose and Airflow.

  • Develop expertise in Kubernetes core concepts, cluster architecture, and deployment using cloud environments, GitHub Codespaces, and AI-driven tools.

  • Navigate data scenarios mastering containerization, deploying apps, and addressing production issues with cloud orchestration and SRE practices.

Skills you'll gain

Category: Containerization
Category: DevOps
Category: Cloud Engineering
Category: Cloud-Native Computing
Category: Kubernetes
Category: Devops Tools
Category: Virtualization and Virtual Machines
Category: Virtualization
Category: Data Engineering
Category: Docker (Software)
Category: Continuous Deployment
Category: Apache Airflow
Category: Virtual Machines
Category: Data Pipelines
Category: Microservices
Category: CI/CD
Category: Site Reliability Engineering
Category: Continuous Integration
Category: Amazon Web Services
Category: Development Environment

Data Visualization with Python

Course 39 hours4.1 (17 ratings)

What you'll learn

  • Apply Python, spreadsheets, and BI tooling proficiently to create visually compelling and interactive data visualizations.

  • Formulate and communicate data-driven insights and narratives through impactful visualizations and data storytelling.

  • Assess and select the most suitable visualization tools and techniques to address organizational data needs and objectives.

Skills you'll gain

Category: Statistical Visualization
Category: Data Visualization
Category: Data Presentation
Category: Data Visualization Software
Category: Data Analysis
Category: Interactive Data Visualization
Category: Plot (Graphics)
Category: Plotly
Category: Matplotlib
Category: Descriptive Statistics
Category: Tableau Software
Category: Data Storytelling
Category: Seaborn
Category: Dashboard
Category: Scatter Plots
Category: Histogram
Category: Data Analysis Software
Category: Pandas (Python Package)
Category: Business Intelligence
Category: Business Intelligence Software

Instructors

Kennedy Behrman
Duke University
7 Courses50,075 learners

Offered by

Duke University

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Placeholder

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy