Add to Calendar
08/28/2020 5:40 PM08/28/2020 6:00 PMAmerica/Los_AngelesAS24: Count-distinct using HLL++ algorithm
This talk goes through how to write a Beam pipeline to efficiently count the number of distinct elements in a massive data set using the HyperLogLog++ algorithm.
This talk goes through how to write a Beam pipeline to efficiently count the number of distinct elements in a massive data set using the HyperLogLog++ algorithm.