Talking head
!!Con 2016

This presentation, by Adam Marcus, is licensed under a Creative Commons Attribution ShareAlike 3.0

Large datasets got you down? Have no fear! Make them small! Sketches are probabilistic data structures: they store a rough outline of a dataset in way less space than the dataset itself takes up. We'll sketch out three sketches to determine if an item is missing from your dataset (Bloom Filters!), count how many of an item are in your dataset (Count-min Sketches!), and count how many distinct items are in your dataset (HyperLogLogs!). In the spirit of the sketch, this talk will be hand-drawn (!!!) and leave some details to the imagination!

Rated: Everyone
Viewed 159 times
Tags: There are no tags for this video.