The near-duplicates problem and novelty detection by fruit flies

Learn from a fruit fly how to spot near-duplicates in your database, with an explanation from our CTO, Cristiano.

If you are interested, have a look at the original fly Bloom filter paper by Dasgupta, Sheehan, Stevens and Navlakha.

Figure: Bar plot with errors on Bloom filters, LSBF and fly Bloom filters.