This talk unveils our design journey to streamline the ingestion of CDC (change data capture) events into a data warehouse, enabling rapid data availability for users. We leverage Qlik to stream CDC events to Kafka, harness the processing power of Google Cloud Dataflow, and store the transformed data in BigQuery for efficient analysis.
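For orientation, a minimal Beam sketch of this flow (Java SDK) might look like the following. The broker address, topic, destination table, and pass-through payload mapping are illustrative assumptions, not the pipeline presented in the talk; a real CDC pipeline would parse and route each event rather than forwarding it verbatim.

```java
import com.google.api.services.bigquery.model.TableRow;
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.io.gcp.bigquery.BigQueryIO;
import org.apache.beam.sdk.io.kafka.KafkaIO;
import org.apache.beam.sdk.options.PipelineOptionsFactory;
import org.apache.beam.sdk.transforms.MapElements;
import org.apache.beam.sdk.values.KV;
import org.apache.beam.sdk.values.TypeDescriptor;
import org.apache.kafka.common.serialization.StringDeserializer;

public class CdcToBigQuery {
  public static void main(String[] args) {
    Pipeline p = Pipeline.create(
        PipelineOptionsFactory.fromArgs(args).withValidation().create());

    p
        // Read the CDC event stream that Qlik publishes to Kafka.
        .apply("ReadCdcEvents", KafkaIO.<String, String>read()
            .withBootstrapServers("kafka:9092")   // hypothetical broker address
            .withTopic("cdc.events")              // hypothetical topic name
            .withKeyDeserializer(StringDeserializer.class)
            .withValueDeserializer(StringDeserializer.class)
            .withoutMetadata())
        // Placeholder transform: wrap the raw payload in a TableRow.
        // A production pipeline would parse the CDC envelope here.
        .apply("ToTableRow", MapElements
            .into(TypeDescriptor.of(TableRow.class))
            .via((KV<String, String> kv) ->
                new TableRow().set("payload", kv.getValue())))
        // Append the transformed rows to the warehouse table.
        .apply("WriteToBigQuery", BigQueryIO.writeTableRows()
            .to("my-project:warehouse.cdc_events") // hypothetical table
            .withCreateDisposition(BigQueryIO.Write.CreateDisposition.CREATE_NEVER)
            .withWriteDisposition(BigQueryIO.Write.WriteDisposition.WRITE_APPEND));

    p.run();
  }
}
```

Because the Kafka source is unbounded, the same code runs as a streaming job on Dataflow, which is what keeps the time between a source-database change and its appearance in BigQuery short.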
We’ll walk through our iterative design process, showcasing how Apache Beam’s flexibility allowed us to address business requirements. We’ll highlight key architectural decisions, performance optimizations, and lessons learned along the way.
This blueprint serves as a valuable resource for others seeking to simplify their CDC ingestion pipelines and accelerate time-to-insight for their data-driven initiatives.