Tech Week Meetup:


Data in Motion NYC: Messaging, Streaming & Processing

Experience an afternoon of high-level networking and technical insights over a refreshing summer lunch. Discover how to effectively scale your data pipelines with Apache Beam and optimize your messaging integration using Apache ActiveMQ alongside industry peers.


📝 Agenda


11:00 AM – 11:20 AM: Arrival & Networking

11:20 AM – 12:00 PM: Data in Motion with Apache ActiveMQ and Apache Beam | JB Onofré, Principal Software Engineer, Dremio + Director, Apache Foundation

12:00 PM – 1:00 PM: Lunch & Networking (on the rooftop if the weather permits)

1:00 PM – 1:40 PM: GraphFlow & Beam: Pythonic, Scalable GNN Pipelines | Yogesh Tewari, Senior Cloud Data Engineer, Google

1:40 PM – 2:00 PM: Final Networking


đź’­ Talks


Data in Motion with Apache ActiveMQ and Apache Beam By: JB Onofré, Principal Software Engineer, Dremio + Director, Apache Foundation.

Modern data architectures demand more than batch processing — they require reliable, scalable, and flexible pipelines that can handle data as it moves. This session explores the powerful combination of Apache ActiveMQ, a battle-tested message broker for enterprise messaging, and Apache Beam, a unified programming model for both batch and streaming data processing.

GraphFlow & Beam: Pythonic, Scalable GNN Pipelines By: Yogesh Tewari, Senior Cloud Data Engineer at Google.

Learn how GraphFlow, a modular Python toolkit, utilizes Apache Beam to create efficient and scalable data pipelines for Graph Neural Networks (GNNs). We’ll demonstrate how GraphFlow on Beam tackles large-scale graph data challenges, including distributed ingestion from cloud databases, scalable feature normalization, graph sampling, and online model inference.