These sessions were presented at Beam Summit 2023, held June 13-15 in New York City.

Title | Speaker(s) | Recording | Slides

A Beginners Guide to Avro and Beam Schemas Without Smashing Your Keyboard

Devon Peticolas

Accelerating Machine Learning Predictions with NVIDIA TensorRT and Apache Beam

Shubham Krishna

Apache Beam and Ensemble Modeling: A Winning Combination for Machine Learning

Shubham Krishna

Beam at Talend - the long road from incubator project to cloud-based Pipeline Designer tool

Alexey Romanenko

Beam in Nokia NWDAF Distributed Architecture

Ifat Afek & Sigalit Aliazov

Beam IO: CDAP and SparkReceiver IO Connectors Overview

Alex Kosolapov & Elizaveta Lomteva

Beam Lightning Talks

Pablo Estrada

Beam loves Kotlin: full pipeline with Kotlin and Midgard library

Mazlum Tosun

Beam ML past, present and future

Kerry Donny-Clark & Reza Rokni

Benchmarking Beam pipelines on Dataflow

Pranav Bhandari

Building Fully Managed Service for Beam Jobs with Flink on Kubernetes

Talat Uyarer & Rishabh Kedia

Case study: Using statefulDofns to process late arriving data

Amruta Deshmukh

CI CD for Dataflow with Flex Templates and Cloud Build

Mazlum Tosun

Community Discussion: Future of Beam

Alex Van Boxel

Cross-language JdbcIO enabled by Beam portable schemas

Yi Hu

Dataflow Streaming - What's new and what's coming

Iñigo San Jose Visiers & Tom Stepp

Dealing with order in streams using Apache Beam

Israel Herraiz

Deduplicating and analysing time-series data with Apache Beam and QuestDB

Javier Ramirez

Design considerations to operate a stateful streaming pipeline as a service

Israel Herraiz & Bhupinder Sindhwani

Developing (experimental) Rust SDKs and a Beam engine for IoT devices

Sho Nakatani

Easy cross-language with SchemaTransforms: use your favorite Java transform in Python SDK

Ahmed Abualsaud

Founders' Panel

Robert Bradshaw, Kenneth Knowles, Reuven Lax & Federico Patota

From Dataflow Templates to Beam: Chartboost’s Journey

Austin Bennett & Ferran Fernandez

Getting started with Apache Beam Quest

Svetak Sundhar

Hot Key Detection and Handling in Apache Beam Pipelines

Shafiqa Iqbal & Ikenna Okolo

How many ways can you skin a cat, if the cat is a problem that needs an ML model to solve?

Kerry Donny-Clark

How to balance power and control when using Dataflow with an OLTP SQL Database

Florian Bastin & Leo Babonnaud

How to Fail with Real-time Analytics

Matthew Housley

How to write an IO for Beam

John Casey

Introduction to Clustering in Apache Beam

Jasper Van den Bossche

Large scale data processing Using Apache Beam and TFX libraries

Olusayo Olumayode Akinlaja

Loading Geospatial data to Google BigQuery

Dong Sun & Sean Jensen-Grey

Machine Learning Platform Tooling with Apache Beam on Kubernetes

Charles Adetiloye

Managed Stream Processing through Apache Beam at LinkedIn

Bingfeng Xia, Prateek Maheshwari & Xinyu Liu

Managing dependencies of Python pipelines

Valentyn Tymofieiev

Mapping Data to FHIR with Apache Beam

Alex Fragotsis

Meeting Security Requirements for Apache Beam Pipelines on Google Cloud

Lorenzo Caggioni

ML model updates with side inputs in Dataflow streaming pipelines

Anand Inguva

Multi-language pipelines: a unique Beam feature that will make your team more efficient

Chamikara Jayalath

Oops I *actually* wrote a Portable Beam Runner in Go

Robert Burke

Optimizing Machine Learning Workloads on Dataflow

Alex Chan

Overview of a State Processing Toolkit for Apache Beam

Harish Nagu Sana, Antonio Si & Prema devi Kuppuswamy

Parallelizing Skewed Hbase Regions using Splittable Dofn

Prathap Reddy

Per Entity Training Pipelines in Apache Beam

Jasper Van den Bossche

Power Realtime Machine Learning Feature Engineering with Managed Beam at LinkedIn

David Shao & Yanan Hao

Resolving out of memory issues in Beam Pipelines

Zeeshan

Running Apache Beam on Kubernetes: A Case Study

Sascha Kerbler

Running Beam Multi Language Pipeline on Flink Cluster on Kubernetes

Lydian Lee

Scaling Public Internet Data Collection With Apache Beam

Lior Dadosh

Scaling up the OpenTelemetry Collector with Beam Go

Alex Van Boxel

Simplifying Speech-to-Text Processing with Apache Beam and Redis

Pramod Rao & Prateek Sheel

Streamlining Data Engineering and Visualization with Apache Beam and Power BI: A Real-World Case Stu

Deexith Reddy

Too big to fail - a Beam Pattern for enriching a Stream using State and Timers

Tobias Kaymak & Israel Herraiz

Troubleshooting Slow Running Beam Pipelines

Mehak Gupta

Unbreakable & Supercharged Beam Apps with Scala + ZIO

Aris Vlasakakis & Sahil Khandwala

Use Apache Beam to build Machine Learning Feature System at Affirm

Hao Xu

Using Large Language Models in Data Engineering Tasks

Sean Jensen-Grey & Vince Gonzalez

Workshop: Application Modernization with Kafka and Beam

Sami Ahmed

Workshop: Catch them if you can - Observability and monitoring

Wei Hsia

Workshop: Complex event processing with state & timers

Israel Herraiz & Miren Esnaola

Workshop: Step by step development of a streaming pipeline in Python

Israel Herraiz & Anthony Lazzaro

Workshop: Testing Apache Beam Pipelines

Bipin Upadhyaya

Write your own model handler for RunInference!

Ritesh Ghorse