This UDF would allow you to specify a bloom filter name, which it will read from s3, deserialize, and filter rows to those only contained within the filter. We need to consider using multiple bloom filters, i.e. selecting users from a range of dates, where each date has their own bloom filter. This could be an inner join on a "bloom_filter" table, where bloom filters have dates.
You need to log in before you can comment on or make changes to this bug.