Distinct Element Estimation (F0)
My draf
Bitmap algorithm
- Whang, K.-Y., Zanden, B.
T. V., and Taylor, H. M. A
linear-time probabilistic counting algorithm
for database applications. TODS 15, 2 (1990), 208–229.
- Cristian Estan, George
Varghese, Mike Fisk. “Bitmap
Algorithms for Counting Active Flows on High Speed Links”, IEEE/ACM Transactions on
Networking, October 2006
Power-2
- Philippe Flajolet and G.
Nigel Martin. “Probabilistic
Counting Algorithms for Data Base
Applications”. Journal of Computer and Systems Sciences,
31:182–209,
1985
- Marianne Durand and
Philippe Flajolet, “Loglog Counting of Large
Cardinalities”, In
the "Engineering and Applications Track" of the 11th Annual European Symposium on
Algorithms (ESA03).
Proceedings published by Springer, Lecture Notes in Computer Science, vol 2832, pp.605-617.
Randomized uniformed
- Ziv Bar-Yossef, T.S.
Jayram, Ravi Kumar, D. Sivakumar, and Luca Trevisan. “Counting
distinct
elements in a data stream”. In Proceedings of the 6th International
Workshop on Randomization and Approximation Techniques in Computer
Science (RANDOM’02), Cambridge, Massachusetts, September 2002. My
analysis of the above algorithm [pdf]
- Noga Alon, Yossi Matias,
and Mario Szegedy. “The Space
Complexity of Approximating the Frequency
Moments”. In Proceedings of the 28th Annual ACM Symposium on the
Theory
of Computing, pages 20–29, Philadelphia, Pennsylvania, May 1996.
- Johnny Chen, naveen Sunkavally and David Woodruff. "Space
efficient algorithms for distinct elements in a data stream".