Submitted by ReadSeparate t3_10zcig2 in singularity
MysteryInc152 t1_j83uty8 wrote
Reply to comment by adt in Where are all the multi-modal models? by ReadSeparate
Only the 17B and 30B models are multimodal. Still pretty good though for sure.
We also have some recent advances that ground frozen language models to images, namely BLIP-2 and FROMAGe.