3.0 and Beyond: The Future of Beam

by Danny McCormick & Kenneth Knowles

Beam has become a core part of the data processing ecosystem through a combination of innovation and hard work from the Beam community. As the data landscape continues to evolve, however, so too must Beam. During this talk, Kenn (Beam PMC chair) and Danny (Beam PMC) will explore some of the opportunities and challenges in front of Beam, culminating in a vision for the future of Beam. Attendees will gain a clear idea of where Beam is headed, how they can leverage Beam even more effectively moving forward, and how they can contribute to helping Beam become the best that it can be.

A Deep Dive into Beam Python Type Hinting

by Jack McCluskey

This session focuses on the mechanics of the Beam Python SDK’s type hinting infrastructure and best practices for users, along with a discussion of current limitations and future improvements.
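
As a small illustration of the kind of hints the session covers, here is a minimal sketch that decorates a DoFn with Beam’s typehints API; the element shapes and transform names are placeholders, not material from the talk.

    import apache_beam as beam
    from apache_beam import typehints


    @typehints.with_input_types(str)
    @typehints.with_output_types(typehints.KV[str, int])
    class ParseEvent(beam.DoFn):
        """Parses a line like 'alice,3' into a (user, count) pair."""
        def process(self, line):
            user, count = line.split(",")
            yield (user, int(count))


    with beam.Pipeline() as p:
        counts = (
            p
            | beam.Create(["alice,3", "bob,5"])
            | beam.ParDo(ParseEvent())
            # Pipeline construction can now check that downstream transforms
            # accept KV[str, int] elements.
            | beam.CombinePerKey(sum)
        )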

A Tour of Apache Beam’s New Iceberg Connector

by Ahmed Abualsaud

Join us for a comprehensive overview of Apache Beam’s Iceberg connector. This session will cover its current features and support across multiple SDKs (Java, Python, YAML, SQL). We’ll delve into key design decisions and share what’s coming next. We’ll also demonstrate how to use the connector in batch, streaming, and multi-language scenarios to build robust and scalable pipelines for your data lakehouse.
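
For readers who want a feel for the connector before the session, here is a minimal write sketch using the Python managed transforms API (one way the Iceberg connector is surfaced); the module path, config keys, and catalog settings below are assumptions to check against the current connector docs, and the warehouse, catalog, and table names are placeholders.

    import apache_beam as beam
    from apache_beam.transforms import managed

    # Placeholder Iceberg catalog/table configuration.
    iceberg_config = {
        "table": "db.events",
        "catalog_name": "local",
        "catalog_properties": {
            "type": "hadoop",
            "warehouse": "gs://my-bucket/warehouse",
        },
    }

    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create([{"id": 1, "name": "a"}, {"id": 2, "name": "b"}])
            # Managed transforms expect schema'd rows, so map dicts to beam.Row.
            | beam.Map(lambda d: beam.Row(id=int(d["id"]), name=str(d["name"])))
            | managed.Write(managed.ICEBERG, config=iceberg_config)
        )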

Architecting Real-Time Blockchain Intelligence with Apache Beam and Apache Kafka

by Vijay Shekhawat

At TRM Labs, we manage petabyte-scale data from over 30 blockchains to deliver customer-facing analytics. Our platform processes high-throughput data to extract actionable intelligence for critical decision-making.

In this session, we will discuss how Apache Beam underpins our architecture by integrating with Apache Kafka for robust data ingestion and deploying on Google Cloud Dataflow to ensure scalability and fault tolerance. We will also delve into the complexities of handling massive volumes of blockchain data—peaking at up to one million events per second—in real time and computing complex metrics.

Become a Contributor: Making Changes, Running a Patched Pipeline, and Contributing Back to Beam

by Yi Hu

Apache Beam, as an open-source project, is built on community contributions. This talk shares how to make changes to Beam, run your pipeline with a patched Beam SDK, and finally contribute your changes back to Beam. The talk also introduces recent efforts to make contributing to, and deploying, a custom Beam build easier.

Reference: https://github.com/apache/beam/blob/master/contributor-docs/code-change-guide.md

Bridging BigQuery and ClickHouse with Apache Beam: A Google Dataflow Template for Batch Ingestion

by Bentsi Leviav

In this talk, I’ll walk through the process of developing and deploying a reusable Dataflow template using ClickHouseIO, the official Beam connector. The template enables users to replicate data from BigQuery to ClickHouse, a high-performance OLAP database increasingly adopted for real-time analytics.

In addition, the session will cover key considerations when working with the ClickHouseIO connector, such as schema mapping, handling large volumes of data, performance tuning, and more.

Build Seamless Data Ecosystems: Real-World Integrations with Apache Beam, Kafka, and Iceberg

by Rajesh Vayyala

Modern data architectures are no longer built around a single tool — they thrive on interoperability and community-driven integration. This session explores how Apache Beam serves as the flexible processing engine that connects streaming platforms like Kafka with modern, ACID-compliant data lakehouse solutions like Apache Iceberg.

Through real-world architecture patterns and practical examples, we’ll dive into how organizations are using Beam to unify disparate data sources, enable real-time and batch analytics, and future-proof their data platforms. You’ll also gain insights into how the open-source community continues to drive innovation across this ecosystem — from new connectors to performance optimizations and beyond.

Choosing The Right Boat For Your Stream

by Kamal Aboul-Hosn

Processing real-time data streams requires navigating a growing landscape of connective tissue that can move and transform data. The choices require careful consideration of tradeoffs across scalability, latency, capability, and operational overhead. These types of considerations also drive decision making in building the tools themselves. In this talk, Kamal will draw upon over a decade of building large-scale streaming services within Google Cloud to help you successfully architect your streaming data pipelines so that you aren’t left wishing you had brought a bigger boat.

Data Quality in ML Pipelines

by Pritam Dodeja

This session demonstrates two approaches for integrating data quality into ML pipelines: a schema-based approach and a UDF-based approach, in which Apache Beam performs the data-quality filtering. Time permitting, it will also show how to integrate data-quality features into the dataset using a PreTransform component that takes in a UDF.
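
To give a flavor of the UDF-based approach, here is a minimal sketch that uses a plain Python predicate with beam.Filter to drop records that fail a quality check; the record fields and thresholds are hypothetical, not the talk’s actual rules.

    import apache_beam as beam


    def passes_quality_check(record):
        """Hypothetical UDF: keep records with a present, non-negative value."""
        return record.get("value") is not None and record["value"] >= 0


    with beam.Pipeline() as p:
        clean = (
            p
            | beam.Create([
                {"id": 1, "value": 4.2},
                {"id": 2, "value": None},   # dropped
                {"id": 3, "value": -1.0},   # dropped
            ])
            | "FilterBadRecords" >> beam.Filter(passes_quality_check)
        )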

Dataflow Cost Calculator

by Svetak Sundhar & Aditya Saraf

A co-presentation by Google and Exabeam, demonstrating how Dataflow APIs and other Google Cloud products were used in conjunction with Beam metrics to minimize infrastructure costs.

Dataflow for Beginners

by Wei Hsia

This presentation is designed for those new to Dataflow, offering a comprehensive introduction to building powerful data pipelines with ease. We will cover Beam YAML, a tool for rapid pipeline development, and its advantages, and walk through examples showing how you can use the Cloud Shell editor to get started quickly. We will then transition from the foundational concepts to more advanced topics, including effective testing strategies, leveraging different providers, and integrating machine learning models. We will also cover Job Builder, a user-friendly interface that streamlines the creation of data movement and processing tasks, and its relationship with Beam YAML, including features for exporting, importing, and directly editing YAML configurations. We will explore a variety of common data processing patterns, such as filtering, mapping, and executing SQL transformations, all while incorporating robust error-handling techniques to ensure your pipelines are resilient and reliable. Leave this session equipped to build, customize, and manage sophisticated data pipelines with newfound speed and simplicity with Dataflow.

Dataflow Streaming Innovations

by Tom Stepp & Ryan Wigglesworth

At this year’s Beam Summit, we will highlight the latest advancements in Dataflow Streaming. Join us to learn about fast job updates, KafkaIO improvements, streaming AI/ML enhancements, and performance and observability improvements.

Efficient LLM-Based Data Transformations via Multiple LoRA Adapters

by Jasper Van den Bossche

While LLM-based data transformations are very powerful for processing unstructured data, organizations often struggle to deploy these tools at scale, particularly when tailoring them to specific custom use cases. This talk will explore how to efficiently serve multiple LoRA (Low-Rank Adaptation) adapters on a single base model, enabling task-specific transformations within Apache Beam pipelines and addressing the scalability challenges head-on.

LoRA adapters enable efficient fine-tuning of large language models by updating only a subset of parameters, making it possible to tailor models for specific tasks without the computational overhead of full fine-tuning. This approach is particularly valuable when working with private data or deploying cost-effective models. We will explore how inference servers like vLLM and NVIDIA NIM can dynamically swap LoRA adapters in real time, optimizing resource utilization while seamlessly integrating with Apache Beam for batch and streaming data processing. This integration ensures scalable, cost-effective, and adaptable workflows for various data sources.

We’ll demonstrate this approach through a real-world implementation where we process different document types, from invoices to legal contracts, using a custom LoRA adapter for each, achieving specialized extraction capabilities while maintaining a single model infrastructure.

The talk will cover: (1) an overview of LoRA adapters and their efficiency benefits, (2) configuring inference servers for dynamic adapter swapping, and (3) implementing a complete Apache Beam pipeline for production-ready unstructured data processing.

Enhancing Data Quality for AI Success

by Aarohi Tripathi

“Enhancing Data Quality for AI Success” focuses on the critical role that high-quality data plays in the effectiveness and accuracy of AI models. Since AI systems learn patterns from data, ensuring that the data is clean, diverse, accurately labeled, and regularly updated is essential for optimal performance. Poor-quality data can lead to inaccurate predictions, biased results, and underperforming models. By implementing strategies like data cleansing, augmentation, and proper annotation, organizations can improve the training process, resulting in more reliable, fair, and effective AI systems. The topic emphasizes that the success of AI initiatives depends as much on the data used as on the algorithms themselves.

Exabyte-scale Streaming Iceberg IO with Beam, Ray, and DeltaCAT

by Patrick Ames

This production case study highlights how Amazon uses Ray and DeltaCAT at exabyte scale to resolve longstanding performance and scale challenges in integrating streaming pipelines with Apache Iceberg. It also shows how the Apache Beam, Ray, Apache Flink, and Apache Spark communities can start bringing the same benefits to their workloads using the DeltaCAT project’s IO source/sink implementations for Apache Beam.

From taming energy market data to hyperparameter hunting at scale: leveraging Apache Beam & BigQuery

by Nicholas Bonfanti & Matteo Pacciani

This session explores how Apache Beam handles the end-to-end workflow for Italian energy market analytics, including forecasting electricity demand, natural gas demand, and renewable generation.

We’ll demonstrate its power as a robust ETL tool for parsing and processing diverse data sources, using Google BigQuery as the central data warehouse. These sources range from millions of XML files detailing point-of-delivery (POD) electricity demand to ECMWF-generated GRIB meteorological files, among others.

Building on this foundation, the session covers Beam’s role in scalable machine learning, detailing its use for distributed Bayesian hyperparameter searches over thousands of models, efficient retraining with optimized parameters over newly-parsed data, and large-scale distributed inference.
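
To make the ETL shape concrete, here is a minimal, hypothetical sketch of the pattern described above: matching XML files, parsing them with a UDF, and writing rows to BigQuery. The XML layout, field names, paths, and table are invented placeholders, not the presenters’ actual schema.

    import xml.etree.ElementTree as ET

    import apache_beam as beam
    from apache_beam.io import fileio


    def parse_pod_xml(readable_file):
        # Hypothetical XML layout; element and attribute names are placeholders.
        root = ET.fromstring(readable_file.read().decode("utf-8"))
        for rec in root.iter("measurement"):
            yield {
                "pod_id": rec.get("pod"),
                "ts": rec.get("ts"),
                "demand_kwh": float(rec.findtext("demand", default="0")),
            }


    with beam.Pipeline() as p:
        _ = (
            p
            | fileio.MatchFiles("gs://my-bucket/pod/*.xml")  # placeholder path
            | fileio.ReadMatches()
            | beam.FlatMap(parse_pod_xml)
            | beam.io.WriteToBigQuery(
                "my-project:energy.pod_demand",  # placeholder table
                schema="pod_id:STRING,ts:TIMESTAMP,demand_kwh:FLOAT",
                write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND,
                create_disposition=beam.io.BigQueryDisposition.CREATE_IF_NEEDED,
            )
        )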

Growing the Apache Beam Community: Resources, Contributions, and Collaboration

by Jana Polianskaja & Danny McCormick

Contribute to the Apache Beam community! This presentation guides developers—from beginners to experts—through a structured path to meaningful community engagement. We’ll cover essential resources, real-world contribution examples, and diverse collaboration opportunities, offering actionable strategies and inspiration for all experience levels.

How Beam serves models with vLLM

by Danny McCormick

Serving ML models at scale is increasingly important, and Beam’s RunInference transform is a great tool to do this. At the same time, models are getting larger and larger, and it can be hard to fit them into your CPU or GPU and to serve them efficiently. In particular, serving large language models efficiently has grown in importance and difficulty as models have continued to grow.

vLLM is an open-source library specifically designed for high-throughput and low-latency LLM inference. It optimizes the serving of LLMs by employing several specialized techniques, including continuous batching.
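
Independently of whatever built-in support the talk covers, here is a hand-rolled sketch of the integration pattern: a custom RunInference ModelHandler that wraps vLLM’s offline LLM API. The handler name, model, and sampling settings are illustrative assumptions.

    import apache_beam as beam
    from apache_beam.ml.inference.base import ModelHandler, PredictionResult, RunInference
    from vllm import LLM, SamplingParams


    class VllmPromptHandler(ModelHandler):
        """Illustrative handler: loads a vLLM model once per worker."""

        def __init__(self, model_name):
            self._model_name = model_name

        def load_model(self):
            return LLM(model=self._model_name)

        def run_inference(self, batch, model, inference_args=None):
            outputs = model.generate(list(batch), SamplingParams(max_tokens=64))
            for prompt, output in zip(batch, outputs):
                yield PredictionResult(prompt, output.outputs[0].text)


    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create(["Summarize: Apache Beam unifies batch and streaming."])
            | RunInference(VllmPromptHandler("facebook/opt-125m"))  # placeholder model
        )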

Integrating LLMs and Embedding models into Beam pipelines using langchain

by Ganesh Sivakumar

Large language models (LLMs) have transformed how we process and generate text. In this session, I’ll talk about Langchain-Beam, an open-source library that integrates LLMs and embedding models into Apache Beam pipelines as transforms using LangChain.

We will explore how the Langchain-Beam transform performs remote LLM inference with OpenAI and Anthropic models: you provide data processing logic as a prompt, and the models transform the data based on that prompt. We’ll also see how to use embedding models to generate vector embeddings for text in the pipeline, and learn about real-world use cases.
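
Langchain-Beam’s own transforms are the subject of the talk; purely as a hand-rolled illustration of the idea (and not the library’s API), the sketch below calls a LangChain embedding model from a DoFn. The model class, model name, and output fields are assumptions.

    import apache_beam as beam
    from langchain_openai import OpenAIEmbeddings  # assumes OPENAI_API_KEY is set


    class EmbedText(beam.DoFn):
        """Illustrative DoFn: embeds each text element with a LangChain model."""

        def setup(self):
            # One embeddings client per worker; the model name is a placeholder.
            self._embeddings = OpenAIEmbeddings(model="text-embedding-3-small")

        def process(self, text):
            yield {"text": text, "embedding": self._embeddings.embed_query(text)}


    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create(["Apache Beam unifies batch and streaming."])
            | beam.ParDo(EmbedText())
        )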

Integration of Batch and Streaming data processing with Apache Beam

by Yoichi Nagai

Mercari utilizes Apache Beam for batch and streaming processing for various purposes, such as transferring data to CRM systems and providing incentives to users. To avoid different departments within the company having to develop similar data pipelines, Mercari built Mercari Pipeline and released it as OSS: a tool that allows users to build pipelines by simply defining the processing in JSON or YAML. (https://github.com/mercari/pipeline)

In this session, we will introduce an example that uses a key feature of Apache Beam, the ability for batch and streaming processes to share the same code, to carry time-series aggregate values that were generated and verified in batch over into streaming processes.

Introduction to the Apache Beam RAG package

by Claude van der Merwe

Use the extensible Apache Beam RAG package to build pipelines that (a minimal sketch follows the list):

  • Generate vector embeddings
  • Ingest embeddings into the desired vector database
  • Perform semantic search
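
The sketch below shows only the first step, embedding generation, using MLTransform with a Hugging Face sentence-transformers config; the module paths and parameter names are assumptions to verify against the current RAG package documentation, and the model and artifact location are placeholders.

    import tempfile

    import apache_beam as beam
    from apache_beam.ml.transforms.base import MLTransform
    from apache_beam.ml.transforms.embeddings.huggingface import SentenceTransformerEmbeddings

    artifact_location = tempfile.mkdtemp()  # placeholder artifact path

    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create([{"text": "Beam pipelines can generate embeddings."}])
            | MLTransform(write_artifact_location=artifact_location).with_transform(
                SentenceTransformerEmbeddings(
                    model_name="all-MiniLM-L6-v2",  # placeholder model
                    columns=["text"],
                )
            )
            # Later steps would ingest the embeddings into a vector database and
            # serve semantic-search queries against it.
        )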

Leveraging Apache Beam for Enhanced Financial Insights

by Raj Katakam, Naresh Kumar Kotha & Venkatesh Poosarla

Credit Karma leverages Apache Beam to address a broad spectrum of data processing requirements, particularly real-time data transformation to bolster machine learning models. Key applications include:

  1. preprocessing data and constructing graphs for live model scoring,
  2. large-scale ETL (Extract, Transform, Load) operations for analytics, and
  3. real-time aggregation of features to furnish near-instantaneous insights to models.

We will walk through one use case from each of these pillars.

Leveraging LLMs for Agentic Workflow Orchestration in Apache Beam YAML Pipelines

by Charles Adetiloye

This session explores how Large Language Models (LLMs) can be integrated into Apache Beam tooling to enable agentic orchestration of YAML-defined workflows. We present a system where LLMs parse, validate, and execute Beam YAML pipelines, acting as autonomous agents that enhance workflow automation and reduce manual intervention. The talk covers architecture, pipeline translation, task planning, and integration strategies for embedding LLMs in declarative workflow environments. Attendees will learn how to build intelligent tooling layers for Beam that support dynamic pipeline generation, error resolution, and adaptive execution—all while maintaining the flexibility and scalability of the Beam programming model.

Managed transforms - power of Beam without maintenance overheads

by Chamikara Jayalath

Apache Beam offers a number of powerful transforms including a set of highly scalable I/O connectors. Usually, you have to regularly keep upgrading Beam to get critical fixes related to such transforms. With the recent addition of Java and Python managed APIs, Beam allows runners to fully manage supported transforms. Google Cloud Dataflow uses this API to manage and automatically upgrade widely used Beam I/O connectors. This allows you to focus on the business logic of your batch and streaming pipelines without worrying about associated maintenance overheads.
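
As an example of what the managed API looks like from user code, here is a minimal Kafka read sketch in Python; the module path, config keys, broker, and topic are assumptions and placeholders rather than a verified configuration.

    import apache_beam as beam
    from apache_beam.transforms import managed

    # Placeholder Kafka configuration; key names should be checked against the
    # managed KafkaIO documentation.
    kafka_config = {
        "bootstrap_servers": "broker-1:9092",
        "topic": "events",
        "format": "RAW",
    }

    with beam.Pipeline() as p:
        _ = (
            p
            | managed.Read(managed.KAFKA, config=kafka_config)
            | beam.Map(print)
        )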

Many Data Formats, One Data Lake

by Peter Wagener

Apache Beam has the flexibility to handle a wide variety of data formats: CSV, Avro, Parquet, Iceberg, … they can all be inputs and/or outputs for your data processing projects. The question quickly becomes: which do you choose?

Our answer is a bit surprising: All of them. If you can define the schema appropriately within the pipelines, you can use the file format that makes the most sense for each use case.

Optimize parallelism for reading from Apache Kafka to Dataflow

by Supriya Koppa

Reading from Apache Kafka into Google Cloud Dataflow can present performance challenges if not configured correctly. This session provides a practical guide to troubleshooting common parallelism issues and implementing best practices for optimal performance. We’ll cover key aspects such as understanding Dataflow’s Kafka source, effectively utilizing maxNumRecords and maxReadTime, and addressing potential bottlenecks. Learn how to diagnose and resolve issues related to uneven parallelism and latency, ensuring your real-time data pipelines operate smoothly and efficiently, with pointers to the official Google Cloud Dataflow documentation.
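
For orientation, here is a bounded-read sketch using the Python KafkaIO wrapper’s max_num_records and max_read_time options (the Python-side counterparts of maxNumRecords/maxReadTime); the broker, group, and topic values are placeholders.

    import apache_beam as beam
    from apache_beam.io.kafka import ReadFromKafka

    with beam.Pipeline() as p:
        _ = (
            p
            | ReadFromKafka(
                consumer_config={
                    "bootstrap.servers": "broker-1:9092",  # placeholder broker
                    "group.id": "my-consumer-group",       # placeholder group
                },
                topics=["events"],
                # Bound the read for testing and debugging; omit these options
                # for an unbounded streaming read.
                max_num_records=10_000,
                max_read_time=60,  # seconds
            )
            | beam.Map(print)
        )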

Real-Time Medical Record Processing

by Austin Bennett

At Azra AI, we help with the treatment journeys of cancer patients. Beam is a core component of our platform, allowing us to process medical records in ‘real time’.

In this talk, we will share how we leverage Beam, along with a mix of other services, to achieve real, impactful results.

Real-Time Predictive Modeling with MLServer, MLFlow, and Apache Beam

by Devon Peticolas & Jeswanth Yadagani

Oden Technologies delivers real-time machine learning to manufacturing environments with Apache Beam. In this session, we’ll demonstrate how Oden aggregates hundreds of sensor streams into real-time tensors for predictive scoring against SKLearn pipelines hosted on MLServer. We’ll also discuss how we use MLFlow for model management and monitoring, along with the infrastructure we’ve developed to coordinate these systems, enabling reliable model testing, deployment, and code updates.

Real-time Threat Detection at Box with Apache Beam

by Abhishek Mishra, Mark Chin & Elango Prasad

Box faces a constant barrage of sophisticated cybersecurity threats. This session dives into how Box leverages the Apache Beam Python SDK, combined with cutting-edge machine learning techniques, to build a real-time threat detection system. We’ll explore the unique challenges of processing high-volume, real-time data streams to identify and mitigate threats before they can impact our customers. The presentation will focus on:

  • The architecture of our Beam-based unified threat detection pipeline, highlighting the integration of machine learning models.
  • How we utilize a transformer-like structure for time-series analysis in ransomware detection. This includes discussing the advantages of transformers for capturing long-range dependencies and contextual information in sequential data (e.g., system logs, network traffic).
  • Specific real-time data challenges encountered at Box’s scale (e.g., data velocity, variety, veracity, late-arriving data, schema evolution) and how they impact model training and inference.
  • Practical techniques and Beam patterns used to address these challenges (e.g., windowing, triggering, state management, handling out-of-order data), ensuring data is prepared effectively for the machine learning model (see the windowing sketch after this list).
  • Lessons learned and best practices for building robust, real-time threat detection systems with Apache Beam and transformer-based models.
  • How the Python SDK of Apache Beam facilitated the integration of machine learning components into the streaming pipeline, especially the effective utilization of RunInference within the Beam pipeline to serve the transformer-based model and perform real-time predictions.
  • Future directions for enhancing our threat detection capabilities, including exploring other advanced machine-learning architectures.
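
As a flavor of the windowing and triggering patterns listed above, here is a generic sketch (not Box’s pipeline) that applies fixed windows with early firings and an allowed-lateness budget before grouping events for downstream scoring; the keys, timestamps, and window sizes are placeholders.

    import apache_beam as beam
    from apache_beam.transforms import trigger, window
    from apache_beam.transforms.window import TimestampedValue

    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create([("host-1", "login"), ("host-1", "file_write")])
            # Assign placeholder event-time timestamps so windowing applies.
            | beam.Map(lambda kv: TimestampedValue(kv, 15))
            | beam.WindowInto(
                window.FixedWindows(60),  # 60-second windows
                trigger=trigger.AfterWatermark(early=trigger.AfterProcessingTime(10)),
                accumulation_mode=trigger.AccumulationMode.DISCARDING,
                allowed_lateness=300,  # tolerate up to 5 minutes of late data
            )
            | beam.GroupByKey()
        )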

Remote LLM Inference with Apache Beam: Practical Guide with Gemini and Gemma on Vertex AI

by Taka Shinagawa

Large Language Models offer powerful capabilities for data transformation, but reliably integrating them at scale into Apache Beam data pipelines presents challenges. Deploying powerful, large models (e.g., Gemma 27B, Llama 70B, DeepSeek R1) directly onto Beam workers via the RunInference API is often infeasible due to resource constraints, multi-GPU complexity, cost, and lack of serving optimizations. Furthermore, many frontier models like Gemini are only available via APIs. Therefore, this session focuses on effective Remote LLM inference integration with Apache Beam.
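
As a rough sketch of the remote-inference pattern (one option among several the session may cover), the DoFn below creates a Vertex AI Gemini client in setup() and calls it per element; the project, location, and model names are placeholders, and production use would add batching, retries, and rate limiting.

    import apache_beam as beam
    import vertexai
    from vertexai.generative_models import GenerativeModel


    class RemoteGeminiDoFn(beam.DoFn):
        """Illustrative remote-inference DoFn; identifiers are placeholders."""

        def __init__(self, project, location, model_name):
            self._project = project
            self._location = location
            self._model_name = model_name

        def setup(self):
            # One client per worker, so we do not re-initialize per element.
            vertexai.init(project=self._project, location=self._location)
            self._model = GenerativeModel(self._model_name)

        def process(self, text):
            response = self._model.generate_content(f"Summarize: {text}")
            yield {"input": text, "summary": response.text}


    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create(["Apache Beam unifies batch and streaming processing."])
            | beam.ParDo(RemoteGeminiDoFn("my-project", "us-central1", "gemini-1.5-flash"))
        )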

Revisiting Splittable DoFn in KafkaIO

by Steven van Rossum

SDF and IO are tricky subjects, doubly so for SDF IO. This session dives deeper into the SDF read transform in KafkaIO and will highlight a few crucial performance issues that have been addressed in recent releases of Apache Beam. The session is aimed at IO contributors with the goal to emphasize the subtle nuances in SDF and IO code that can make or break performance.

Scalable Drug Discovery with Apache Beam: From R-Groups to Crystal Structures

by Joey Tran

In this talk, we’ll share how we use Apache Beam at Schrodinger to power key stages of the drug discovery pipeline, from R-group enumeration in lead optimization to crystal structure determination in drug formulation. Rather than relying on existing runners, we built our own execution engine on top of Beam’s powerful abstraction layer to better serve our domain-specific needs.

Scalable Prompt Optimization in Apache Beam LLM Workflows

by Tomi Ajakaiye

As Large Language Models (LLMs) become integral to data pipelines, optimizing prompts at scale is critical for consistency, cost control, and performance. In this session, you’ll learn how to embed prompt tuning and dynamic prompt generation into an LLM workflow that is executed as an Apache Beam pipeline.

Scaling Real-Time Feature Generation Platform @Lyft

by Rakesh Kumar

At Lyft, real-time feature generation is crucial for powering many business-critical use cases. This session describes how we leveraged Apache Beam to build a robust and scalable real-time feature generation platform for this purpose, capable of generating hundreds of millions of features per minute. We will delve into the critical factors that engineering teams should consider when designing a real-time feature generation platform, such as:

  • Data consistency and accuracy, with a focus on ownership and quality guarantees.
  • Latency requirements.
  • Performance optimization to ensure efficient feature serving.
  • Feature serving and downstream model execution pipelines.
  • Data lineage tools for improved traceability.
  • Strategies for designing for performance and minimizing infrastructure costs.

The presentation will discuss engineering challenges encountered while scaling the Beam pipeline to support our requirements and the lessons we learned along the way.

See the Full Picture: Integrating Beam/Dataflow into Your Distributed Traces

by Radek Stankiewicz, Steven van Rossum & Kenneth Knowles

Achieving holistic observability often hits a wall at asynchronous batch or streaming systems. While OpenTelemetry provides standards for tracing, integrating systems like Apache Beam/Dataflow requires specific considerations. This presentation details the successful integration of Beam pipelines with OpenTelemetry’s tracing APIs. We’ll explore the mechanisms for context propagation across Beam’s distributed workers and stages, enabling pipelines to join traces initiated by upstream services. Discover how spans generated within the Dataflow runner can be exported and visualized alongside the rest of your application traces in Google Cloud Trace, finally delivering the “full picture” of your system’s behavior, including its data processing components.
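
To illustrate one piece of the puzzle, here is a generic, hypothetical sketch (not the presenters’ implementation) of a DoFn that extracts a W3C trace context carried alongside each message and opens a child span for that element’s processing.

    import apache_beam as beam
    from opentelemetry import trace
    from opentelemetry.propagate import extract

    tracer = trace.get_tracer("beam.pipeline.example")  # placeholder tracer name


    class TracedProcess(beam.DoFn):
        """Hypothetical DoFn: each element is a (payload, carrier) pair, where the
        carrier dict holds W3C traceparent headers injected by the producer."""

        def process(self, element):
            payload, carrier = element
            parent_ctx = extract(carrier)  # rebuild the upstream trace context
            with tracer.start_as_current_span("process-element", context=parent_ctx):
                # ... domain logic goes here; the span is exported by whatever
                # OpenTelemetry exporter the worker is configured with ...
                yield payload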

Simplified Streaming Anomaly Detection with Apache Beam's Latest Transform

by Shunping Huang

This talk dives into the practical application of Apache Beam for constructing robust, scalable, and real-time anomaly detection pipelines. We’ll explore Beam’s latest anomaly detection transform, and how to integrate various anomaly detection algorithms (e.g., statistical methods, machine learning models) to identify critical outliers in continuous data streams.
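
The session centers on Beam’s new anomaly detection transform; as a point of comparison, here is a hand-rolled statistical sketch (not the new transform’s API) that flags per-key outliers by z-score within fixed windows, with placeholder data and thresholds.

    import statistics

    import apache_beam as beam
    from apache_beam.transforms import window


    def flag_outliers(kv, threshold=2.0):
        """Yield (key, value) pairs whose z-score within the window exceeds threshold."""
        key, values = kv
        vals = list(values)
        if len(vals) < 2:
            return
        mean = statistics.mean(vals)
        stdev = statistics.pstdev(vals) or 1e-9
        for v in vals:
            if abs(v - mean) / stdev > threshold:
                yield (key, v)


    with beam.Pipeline() as p:
        anomalies = (
            p
            | beam.Create([
                ("sensor-1", 1.0), ("sensor-1", 1.1), ("sensor-1", 0.9),
                ("sensor-1", 1.0), ("sensor-1", 1.2), ("sensor-1", 9.5),
            ])
            | beam.WindowInto(window.FixedWindows(60))
            | beam.GroupByKey()
            | beam.FlatMap(flag_outliers)
        )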

Streaming Databases with Bigtable and Apache Beam

by Christopher Crosbie

Discover how companies, including Google, leverage Apache Beam and Bigtable to instantly enrich data as it’s created. We’ll explore how Bigtable, Google’s powerful key-value database, serves as a perfect real-time data storage solution for Beam’s processing. Learn about the seamless integration between these services and see how you can take advantage of features like large-scale embedding generation and the Beam Enrichment transform with minimal coding.
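
As a pointer to the API involved, the sketch below wires Beam’s Enrichment transform to a Bigtable handler; the module paths and parameter names are assumptions to check against current documentation, and the project, instance, table, and row-key field are placeholders.

    import apache_beam as beam
    from apache_beam.transforms.enrichment import Enrichment
    from apache_beam.transforms.enrichment_handlers.bigtable import BigTableEnrichmentHandler

    # Placeholder Bigtable resources.
    handler = BigTableEnrichmentHandler(
        project_id="my-project",
        instance_id="my-instance",
        table_id="customer-profiles",
        row_key="customer_id",  # input-row field used as the Bigtable row key
    )

    with beam.Pipeline() as p:
        _ = (
            p
            | beam.Create([beam.Row(customer_id="c-42", event="purchase")])
            | Enrichment(handler)  # joins each element with its Bigtable row
        )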

Superpowering Agents with Apache Beam

by Konstantin Buschmeier & Jasper Van den Bossche

Large language models and agentic systems are rapidly changing how exploratory data analytics is performed. Natural language interactions and assisted querying unlock quick prototyping for business experts without data science or programming backgrounds, enabling them to get answers to business questions faster.

The limited context inherent in LLMs prevents direct analysis of large data volumes. Augmenting them with access to powerful tools, such as databases, query engines, and scalable data processing frameworks, overcomes this constraint.

Talk to your pipeline: how to use AI to create dynamic transforms in streaming

by Israel Herraiz

With some design, you can leverage Beam ML to have an LLM transform your questions into dynamic transformations that you can apply to your data. In this talk, we show how to use Gemma to accept natural-language questions about your data and offer an answer in real time, applied to the main stream of data.

The ASF Data Ecosystem: Bridging the Data Stream with Apache Beam

by JB Onofre

During this keynote, JB will provide an overview of the projects in the Apache Software Foundation that form a complete data ecosystem. From storage to analytics, including query engines and table formats, Apache projects are the building blocks of the data ecosystem. We will see the role Apache Beam plays in this ecosystem and how vendors like Dremio are powered by these projects.

Using Apache Beam to power and scale a data engineering transformation at a Financial Exchange

by Conall Bennett

At CME Group we are going through a multi-year transformation to run the world’s leading derivatives exchange on Google’s cloud platform.

This talk will focus on how we have been using Dataflow and Apache Beam as a core technology service to enable and transform how distributed data engineering teams across different business areas collaborate, build, and deliver data products for internal stakeholders and customers, at scale and with a much faster time to market.