Practical Algorithms for Self Scaling Histograms, orBetter than Average Data Collection
Problem: Accurately Recovering Distribution
Solution: Minimize Available Error
How Histograms recover the distribution
Converting Interpolated CDF to buckets
Effect of bucket boundary on Available Error
What if we are willing to trade off some accuracy for performance?
Computing Multi-modal parameters
Accuracy as a function of Buckets