Where are all the multi-modal models? Submitted by ReadSeparate t3_10zcig2 on February 11, 2023 at 4:49 AM in singularity 7 comments 22
MysteryInc152 t1_j85rgjx wrote on February 11, 2023 at 9:02 PM Recently two papers were released that deal with making frozen LLMs multimodal (with code and models released). BLIP-2 - https://arxiv.org/abs/2301.12597 https://huggingface.co/spaces/Salesforce/BLIP2 And FROMAGe - https://arxiv.org/abs/2301.13823 https://github.com/kohjingyu/fromage