elbiot t1_jdlgxnz wrote on March 25, 2023 at 7:19 AM

Reply to comment by light24bulbs in [R] Hello Dolly: Democratizing the magic of ChatGPT with open models by austintackaberry

As I understand it, if you have the text, training on next-word prediction isn't a challenge: just keep the learning rate low. Instruction-based fine-tuning gets the focus because that kind of data is harder to come by.

My only experience with this is with a sentence embedding model (using sbert). I trained on a 50/50 mix of my new text and the original training data, and it got better at embedding my text without forgetting what it was originally trained on.
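Roughly, the 50/50 mixing could look like the sketch below. This is only an illustration of the idea, not the code I actually ran: `mixed_batches`, its parameters, and the toy corpora are hypothetical names, and the real training step (sbert or otherwise) is left out. Each batch draws half its examples from the new text and half from the original data.

```python
import random

def mixed_batches(new_texts, original_texts, batch_size, num_batches, seed=0):
    """Yield batches that are half new-domain text and half original training text.

    Hypothetical helper: the names and the sampling strategy are assumptions
    made for illustration, not part of any library.
    """
    if batch_size % 2 != 0:
        raise ValueError("batch_size must be even for an exact 50/50 split")
    rng = random.Random(seed)
    half = batch_size // 2
    for _ in range(num_batches):
        # Sample with replacement so the smaller corpus is never exhausted.
        batch = rng.choices(new_texts, k=half) + rng.choices(original_texts, k=half)
        # Shuffle so the two sources are interleaved within the batch.
        rng.shuffle(batch)
        yield batch

# Toy stand-ins for the two corpora.
new_texts = [f"new-{i}" for i in range(5)]
original_texts = [f"orig-{i}" for i in range(50)]

batches = list(mixed_batches(new_texts, original_texts, batch_size=8, num_batches=3))
```

Each batch would then go to whatever training loop you use, with a low learning rate. Sampling with replacement means a small new corpus gets repeated often, so it is worth watching for overfitting on it.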

light24bulbs t1_jdlrnll wrote on March 25, 2023 at 9:59 AM

That's cool, that's exactly what I want to do. I'm hunting around for a ready-made pipeline to do that on top of a good open-source model.

machineko t1_jdmdvst wrote on March 25, 2023 at 1:59 PM

We are working on adding that as well. Keep an eye on our repo.
