In your blog post, https://eng.uber.com/hoodie/, you mentioned using bloom filter to store the recordKey->fileid mapping.
But bloom filter can only answer whether this key is definitely not in the set, does that you mean you would still need to linearly scan the files if the answer is possibly in?
In your blog post, https://eng.uber.com/hoodie/, you mentioned using bloom filter to store the recordKey->fileid mapping.
But bloom filter can only answer whether this key is definitely not in the set, does that you mean you would still need to linearly scan the files if the answer is possibly in?