Speaker(s):

Count-distinct using HLL++ algorithm

Aug-28 17:40-18:00
Add to Calendar 08/28/2020 5:40 PM 08/28/2020 6:00 PM America/Los_Angeles AS24: Count-distinct using HLL++ algorithm

This talk goes through how to write a Beam pipeline to efficiently count the number of distinct elements in a massive data set using the HyperLogLog++ algorithm.

This talk goes through how to write a Beam pipeline to efficiently count the number of distinct elements in a massive data set using the HyperLogLog++ algorithm.