Splittable DoFns in Python: a hands-on workshop

Jul-20 12:30-15:00 UTC
Room: 202

In this workshop we will review the concept of Splittable DoFns and we will write two I/O connectors using this kind of DoFns: one in batch (for reading large files in a given format), and one for streaming (for reading from Kafka topics). We will run some examples on Google Cloud Dataflow after implementing these connectors from scratch during the workshop. All the code will be written in Python.