1azytux OP t1_jdv913f wrote on March 27, 2023 at 1:30 PM

Reply to comment by aozorahime in Recent advances in multimodal models: What are your thoughts on chain of thoughts models? [D] by 1azytux

I am actually not using discord for time being, but maybe reddit messaging will work :) I can DM you.

1azytux OP t1_jdcvjki wrote on March 23, 2023 at 2:22 PM

Reply to comment by aozorahime in Recent advances in multimodal models: What are your thoughts on chain of thoughts models? [D] by 1azytux

Yes, I have worked with multimodal models before, but I'm still in nascent stage of discovering the field of NLP. What about you? Are you interested in multimodal models? What's your PhD on?

I was interested in CoT, and more in multimodal ones because of the recent advances of chatgpt as it's able to remember the previous conversations. I hope this is correct.

Yes, I saw the link and wasn't able to find much about CoT in particular, so asked about you.

I can talk about what I've worked on and what I was trying and want to do in future, maybe in DMs .. ?

1azytux OP t1_jd2ho88 wrote on March 21, 2023 at 11:17 AM

Reply to comment by aozorahime in Recent advances in multimodal models: What are your thoughts on chain of thoughts models? [D] by 1azytux

i'm looking for ideas based on the papers given :
- Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering

- Multimodal Chain-of-Thought Reasoning in Language Models

and such .. with general chain of thought idea for language can be looked at this paper.

I'm not sure if the link you provided will work, but as it's huge I might have missed (I've glanced on it) can you point out the parts which you think should be paid attention?

1azytux t1_jadp0aa wrote on February 28, 2023 at 6:17 PM

Reply to comment by [deleted] in [R] Microsoft introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot) by MysteryInc152

do you know which foundation models we can use though, or are open sourced? It seems like every other model is either not available or their weights aren't released yet. It's case with, CoCa, Florence, Flamingo, BEiT3, FILIP, ALIGN. I was able to find weights for ALBEF.

1azytux t1_jadmvbe wrote on February 28, 2023 at 6:03 PM

Reply to [R] Microsoft introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot) by MysteryInc152

can we download the model weights? is it open sourced? or maybe perform zero shot tasks by ourselves?

1azytux t1_irzlrs1 wrote on October 12, 2022 at 5:42 AM

Reply to comment by Tiny_Arugula_5648 in [D] Are there any open-source text summarization model? by CeFurkan

I guess the person meant BART, and you can get more information here.