resdaz t1_j80n64k wrote on February 10, 2023 at 7:25 PM

The architecture for these large language models are no secret. Everyone can see exactly how to implement them to the tiniest detail.

The value lies in how to train and fine tune the data. Which, tellingly, the big players are far less interested in sharing.

Setrict t1_j80ozpi wrote on February 10, 2023 at 7:37 PM

About the only I can see open source could compete is by leveraging large numbers of volunteers to create curated data sets that aren't licensed for use in closed systems. A kind of wikipedia for AI training. Quality over quantity. Filtering out stuff like "TheNitromeFan" data that confused Chatgpt.