It is becoming harder and harder to maintain a technology stack that can keep up with the growing demands of a data-driven business. Every Big Data practitioner is familiar with the three V’s of Big Data: volume, velocity, and variety. What if there was a scale-proof technology that was designed to meet these demands?
Enter Google Cloud Dataflow. Google Cloud Dataflow simplifies data processing by unifying batch & stream processing and providing a serverless experience that allows users to focus on analytics, not infrastructure. This specialization is intended for customers & partners that are looking to further their understanding of Dataflow to advance their data processing applications.
This specialization contains three courses:
Foundations, which explains how Apache Beam and Dataflow work together to meet your data processing needs without the risk of vendor lock-in
Develop Pipelines, which covers how you convert our business logic into data processing applications that can run on Dataflow
Operations, which reviews the most important lessons for operating a data application on Dataflow, including monitoring, troubleshooting, testing, and reliability.
Praktisches Lernprojekt
This specialization incorporates hands-on labs using Qwiklabs platform. The labs build on the concepts covered in the course modules. Where applicable, we have provided Java and Python versions of the labs. For labs that require adding/updating code, we have provided a recommended solution for your reference.