Viewing a single comment thread. View all comments

visarga t1_j6x8zna wrote on February 2, 2023 at 3:23 PM

I think open source implementations will eventually get there. They probably need much more multi-task and RLHF data, or they had too little code in the initial pre-training. Training GPT-3.5 like models is like a recipe, and the formula + ingredients are gradually becoming available.