Submitted by von-hust t3_11jyrfj in MachineLearning
alushamir t1_jbao8ez wrote
Reply to comment by TikiTDO in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
>BLIP VQA
Thanks for sharing! You can try fastdup. It's free, it scales, and it's very easy to use.
https://github.com/visual-layer/fastdup
Would love to get your feedback. PM me or join our Slack channel. Happy to talk more.