Optimizing Machine Learning Workloads on Dataflow

Speaker(s):

Optimizing Machine Learning Workloads on Dataflow

Jun-14 15:00-15:25 in Horizon

Add to Calendar 06/14/2023 3:00 PM 06/14/2023 3:25 PM America/New_York BS25: Optimizing Machine Learning Workloads on Dataflow

Trustpilot is is a community-driven platform that hosts reviews of businesses from across the world. It helps people by providing authentic reviews of products and services.

As a reviews platform, Trustpilot handles large volumes of data, particularly text from many languages, making it an ideal use case for machine learning. We use Apache Beam on GCP Dataflow for batch and streaming machine learning workloads.

In this talk, we would like to share our experiences optimizing the running of machine learning workloads on Dataflow including:

‣ Granular resource specification with Dataflow Prime to significantly lower inference costs

‣ Making use of Beam’s RunInference API for loading models

‣ Running multiple large models in a pipeline by GPU sharing with NVIDIA MPS

‣ Accelerating matrix operations in Beam pipelines with the JAX array computation library

Horizon

Download slides

Trustpilot is is a community-driven platform that hosts reviews of businesses from across the world. It helps people by providing authentic reviews of products and services.

In this talk, we would like to share our experiences optimizing the running of machine learning workloads on Dataflow including:

‣ Granular resource specification with Dataflow Prime to significantly lower inference costs

‣ Making use of Beam’s RunInference API for loading models

‣ Running multiple large models in a pipeline by GPU sharing with NVIDIA MPS

‣ Accelerating matrix operations in Beam pipelines with the JAX array computation library

Optimizing Machine Learning Workloads on Dataflow

Alex Chan

Optimizing Machine Learning Workloads on Dataflow