Speaker(s):

Accelerating CDC Data Ingestion with Apache Beam: A Qlik-to-BigQuery Journey

Sep-4 13:30-14:20 in Mariposa Grove
Add to Calendar 09/04/2024 1:30 PM 09/04/2024 2:20 PM America/Los_Angeles AS24: Accelerating CDC Data Ingestion with Apache Beam: A Qlik-to-BigQuery Journey

This talk unveils our design journey to streamline the ingestion of CDC changes into a data warehouse, enabling rapid data availability for users. We leverage Qlik to stream CDC events to Kafka, harness Dataflow’s processing power, and store the transformed data in BigQuery for efficient analysis.

We’ll walk through our iterative design process, showcasing how Apache Beam’s flexibility allowed us to address business requirements. We’ll highlight key architectural decisions, performance optimizations, and lessons learned along the way.

This blueprint serves as a valuable resource for others seeking to simplify their CDC ingestion pipelines and accelerate time-to-insight for their data-driven initiatives.

Mariposa Grove

This talk unveils our design journey to streamline the ingestion of CDC changes into a data warehouse, enabling rapid data availability for users. We leverage Qlik to stream CDC events to Kafka, harness Dataflow’s processing power, and store the transformed data in BigQuery for efficient analysis.

We’ll walk through our iterative design process, showcasing how Apache Beam’s flexibility allowed us to address business requirements. We’ll highlight key architectural decisions, performance optimizations, and lessons learned along the way.

This blueprint serves as a valuable resource for others seeking to simplify their CDC ingestion pipelines and accelerate time-to-insight for their data-driven initiatives.