
How can I quickly match one of 150 million names to a code?


Assuming that you want the results pretty quickly, siskin's approach should work just fine. And the Bloom filter doesn't have to be that big; a 200 MB table should give you roughly a 1% false positive rate, which is a mere 1000 extra hits in your case. This leaves you 800 MB for the (in-process) database cache, which is way more than you need. And yes, if you can spend the whole day on generating the results, you can skip the Bloom filter even if you have a steam-powered 1.5 QPS database engine. 🙂 (There are fancy database engines that can do Bloom filtering all on their own, but I'm not up to date on the Java universe, so I'm not sure how common that is.)
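To sanity-check the 200 MB figure, here is a rough sketch in Python (picked only for brevity; the same idea carries over to Java). It uses the standard Bloom filter sizing formulas, and everything named here (`bloom_parameters`, `BloomFilter`, `lookup`, the dict standing in for the database) is a made-up illustration, not anything from siskin's post. For 150 million names at a 1% false positive rate the formulas come out at about 180 MB and 7 hash functions, so a 200 MB table is in the right ballpark.

```python
import hashlib
import math


def bloom_parameters(n_items, fp_rate):
    """Return (bits, hash_count) from the standard Bloom filter formulas."""
    # m = -n * ln(p) / (ln 2)^2 bits, k = (m / n) * ln 2 hash functions
    bits = math.ceil(-n_items * math.log(fp_rate) / (math.log(2) ** 2))
    hashes = max(1, round(bits / n_items * math.log(2)))
    return bits, hashes


class BloomFilter:
    """Minimal in-memory Bloom filter (illustrative, not tuned for speed)."""

    def __init__(self, n_items, fp_rate):
        self.bits, self.hashes = bloom_parameters(n_items, fp_rate)
        self.table = bytearray((self.bits + 7) // 8)

    def _positions(self, name):
        # Double hashing: derive all bit positions from two 64-bit values.
        digest = hashlib.sha256(name.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        for i in range(self.hashes):
            yield (h1 + i * h2) % self.bits

    def add(self, name):
        for pos in self._positions(name):
            self.table[pos >> 3] |= 1 << (pos & 7)

    def __contains__(self, name):
        return all(
            self.table[pos >> 3] & (1 << (pos & 7))
            for pos in self._positions(name)
        )


def lookup(name, bloom, database):
    """Ask the (slow) database only when the filter says 'maybe present'."""
    if name not in bloom:
        return None            # definitely not a known name, no query needed
    return database.get(name)  # real hit, or one of the ~1% false positives


if __name__ == "__main__":
    bits, hashes = bloom_parameters(150_000_000, 0.01)
    print(f"{bits / 8 / 1e6:.0f} MB, {hashes} hash functions")
```

The filter never gives a false negative, so skipping the database on a miss is safe; the only cost of a false positive is one wasted query.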


Experts123