Generative AI Meetup
By Beam Summit Team

In addition to the Beam Summit program, AICamp will be hosting the “Generative AI” meetup. Generative AI is revolutionizing the landscape for developers and content creators. Come discuss and learn about generative AI as we dive into the world of LLMs, explore the capabilities of ChatGPT, and discover the power of text-to-image technologies. Immerse yourself in captivating discussions, demos, and practical applications led by industry experts. Join us at https://www.aicamp.ai/event/eventdetails/W2023061414

Machine learning design patterns: between Beam and a hard place
By Beam Summit Team

In a recent book entitled Machine Learning Design Patterns, we captured best practices and solutions to recurring problems in machine learning. Many of these design patterns are best implemented using Beam. The obvious example is the Transform design pattern, which allows you to replicate arbitrary operations from the training graph in the serving graph while keeping both training and serving code efficient and maintainable. Indeed, the tf.transform package makes this easy.
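
As a rough sketch (not an excerpt from the book), the Transform pattern can be implemented with tensorflow_transform's Beam integration, closely following that library's getting-started example; the feature name x and the z-score scaling below are illustrative assumptions.

import tensorflow as tf
import tensorflow_transform as tft
import tensorflow_transform.beam as tft_beam
from tensorflow_transform.tf_metadata import dataset_metadata, schema_utils

# The preprocessing function is analyzed over the training data once; the
# resulting transform graph is saved and replayed unchanged at serving time.
def preprocessing_fn(inputs):
    return {'x_scaled': tft.scale_to_z_score(inputs['x'])}

raw_data = [{'x': 1.0}, {'x': 2.0}, {'x': 3.0}]  # toy training data
raw_metadata = dataset_metadata.DatasetMetadata(
    schema_utils.schema_from_feature_spec(
        {'x': tf.io.FixedLenFeature([], tf.float32)}))

with tft_beam.Context(temp_dir='/tmp/tft'):
    transformed_dataset, transform_fn = (
        (raw_data, raw_metadata)
        | tft_beam.AnalyzeAndTransformDataset(preprocessing_fn))

# transform_fn can be attached to the serving graph so that training and
# serving apply exactly the same preprocessing.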

Implementing Cloud Agnostic Machine Learning Workflows with Apache Beam on Kubernetes
By Beam Summit Team

A highly efficient data processing workflow is fast becoming a necessity in every organization implementing and deploying machine learning models at scale. In most cases, ML teams leverage the managed service solutions already offered by their chosen cloud infrastructure provider. While this approach is good enough for most teams to get going, the long-term cost of keeping the platform running may become prohibitively high over time.
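
As a hedged illustration of the cloud-agnostic approach (a sketch, not the speakers' setup), the same Beam pipeline code can be submitted to a self-managed Flink cluster running on Kubernetes instead of a managed service; the endpoints flink-jobmanager:8081 and localhost:50000 are placeholder values for such a deployment.

import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

# Placeholder endpoints for a self-hosted Flink cluster on Kubernetes.
options = PipelineOptions([
    '--runner=FlinkRunner',
    '--flink_master=flink-jobmanager:8081',
    '--environment_type=EXTERNAL',
    '--environment_config=localhost:50000',
])

with beam.Pipeline(options=options) as pipeline:
    (pipeline
     | 'Create' >> beam.Create(['beam', 'on', 'kubernetes', 'beam'])
     | 'Count' >> beam.combiners.Count.PerElement()
     | 'Print' >> beam.Map(print))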

Unified Streaming and Batch Pipelines at LinkedIn using Beam
By Beam Summit Team

Many use cases at LinkedIn require real-time processing and periodic backfilling of data. Running a single codebase for both needs is an emerging requirement. In this talk, we will share how we leverage Apache Beam to unify Samza stream and Spark batch processing. We will present the first unified production use case: Standardization. By leveraging Beam on Spark for backfilling, we reduced the backfilling time by 93% while using only 50% of the resources.
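
To illustrate the single-codebase idea (a hypothetical sketch, not LinkedIn's actual Standardization pipeline), a Beam pipeline can be written once and submitted to different runners for the streaming job and the backfill; the lower-casing transform and placeholder source below are assumptions.

import sys

import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

# One transform chain reused by both the streaming job and the batch backfill.
def apply_standardization(records):
    return records | 'Standardize' >> beam.Map(lambda value: value.lower())

def run(argv):
    # The runner (e.g. a streaming runner, or Spark for the periodic backfill)
    # is chosen at submission time via --runner, so the pipeline code itself
    # stays identical across both jobs.
    options = PipelineOptions(argv)
    with beam.Pipeline(options=options) as pipeline:
        records = pipeline | 'Read' >> beam.Create(['Foo', 'Bar'])  # placeholder source
        apply_standardization(records) | 'Print' >> beam.Map(print)

if __name__ == '__main__':
    run(sys.argv[1:])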
