Viewing a single comment thread. View all comments

adt t1_j831ml0 wrote

There is an entire world outside of California...

Germany: Luminous 200B multimodal.

China: All of the ERNIE 260B cross-modal stuff.

^(Yeh, you need) ^(The Memo)^(!)

17

ReadSeparate OP t1_j8442mf wrote

This is exactly the comment I was looking for when I made this thread, thanks so much

5

MysteryInc152 t1_j83uty8 wrote

Only the 17b and 30b models are multimodal. Still pretty good though for sure.

We also have some recent advances that ground frozen language models to images. Namely BLIP-2 and fromage.

3