KingsmanVince t1_irvmgnx wrote
Reply to comment by MohamedRashad in [D] Reversing Image-to-text models to get the prompt by MohamedRashad
In image captioning, to train the model you have to provide text that describes each image. By this definition, "the prompt that made the image" does fall in. One text can produce many images, and one image can be described by many texts, so images and texts have a many-to-many relationship.

For example, to caption a picture of a running dog, people can describe the whole scene. That's still a caption.

Likewise, if I prompt "running dog" and DALL-E 2 draws me a running dog, then yes, that prompt is a caption.
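The many-to-many point above can be sketched in a few lines. This is just an illustration with made-up image IDs and captions, not real training data:

```python
from collections import defaultdict

# Toy (image, caption) pairs illustrating the many-to-many relationship:
# the same caption can describe several images, and the same image can
# carry several captions. All IDs and strings here are invented examples.
pairs = [
    ("img_001", "a running dog"),
    ("img_002", "a running dog"),                         # one caption -> many images
    ("img_001", "a brown dog sprinting across a field"),  # one image -> many captions
    ("img_003", "a cat on a sofa"),
]

captions_per_image = defaultdict(set)
images_per_caption = defaultdict(set)
for image_id, caption in pairs:
    captions_per_image[image_id].add(caption)
    images_per_caption[caption].add(image_id)

# img_001 has two valid captions; "a running dog" matches two images.
print(len(captions_per_image["img_001"]))        # 2
print(len(images_per_caption["a running dog"]))  # 2
```

This is why inverting an image-to-text model can't recover "the" prompt: many prompts map to the same image.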