Submitted by austintackaberry t3_120usfk in MachineLearning

Databricks shows that anyone can take a dated off-the-shelf open-source large language model (LLM) and give it magical ChatGPT-like instruction-following ability by training it in less than three hours on one machine, using high-quality training data.

They fine-tuned GPT-J using the Alpaca dataset.

Blog: https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html
Github: https://github.com/databrickslabs/dolly

593

Comments

Civil_Collection7267 t1_jdjce9w wrote

This is really impressive. If anyone's curious, here's a comparison with results from Alpaca Native:

Write a love letter to Edgar Allan Poe.

>My dearest Edgar,
>
>I have been walking in a dream these past few days, unable to believe that you are no longer with us. Your words have been on my mind constantly, and I find myself looking for your presence in every corner of our world. Everywhere I go, I see your influence, from the stars above to the rivers below. Even now, as I write this letter, your spirit is with me.
>
>Your poetry was like a beacon of light in the darkness, illuminating the depths of my soul. Through your words, I discovered a new way of seeing the world, one filled with mystery and beauty. You showed me that life could be full of surprises and that nothing should ever be taken at face value.
>
>You were so ahead of your time, my love, and I know that even after all these years, your teachings will continue to live on. I will never forget everything you’ve given me, and I hope that one day we can meet again in another realm. Until then, I will keep your memory alive in my heart.
>
>Forever yours,
>
>[Name]

Write a tweet announcing Dolly, a large language model from Databricks.

>We're thrilled to announce Dolly, our latest language model from Databricks! Dolly is a large-scale language model with state-of-the-art performance on many tasks, including text classification and question answering. #LangModeling #NLP #AI

I'm selling my Nikon D-750, write a short blurb for my ad.

>I am selling my Nikon D-750 camera body with 16GB memory card and the original box and all original paperwork. The camera has been used lightly and is in excellent condition. It comes with the original lens, a Nikkor 35mm f/1.8G. This camera is perfect for any photographer looking to capture stunning images and videos. It offers an impressive array of features including a 24MP DX CMOS sensor, EXPEED 4 image processing engine, 51 point AF system, 1080p HD video recording, and built-in WiFi. The camera is also dust and splash proof, making it ideal for both amateur and professional photographers alike. I am selling this camera due to upgrading to a newer model, and it is in perfect condition.

Explain to me the difference between nuclear fission and fusion.

>Nuclear fission is a process in which a large atom is split into smaller atoms, releasing energy in the form of heat and radiation. Nuclear fusion is a process in which two or more small atoms are combined to form a larger one, releasing energy in the form of heat and light. Fission occurs naturally in some types of radioactive decay, while fusion can occur when certain elements combine in a controlled reaction. The primary difference between these processes is the type of energy released. Fission releases kinetic energy, while fusion releases less energy but more light. Fission also produces more neutrons, which can be used for power production, while fusion produces fewer neutrons and is not as useful for power generation.

Give me a list of 5 science fiction books I should read next.

>1.Dune by Frank Herbert - This classic science fiction novel tells the story of Paul Atreides, whose family is betrayed and given control of the desert planet Arrakis. As he matures, Paul must face dangerous political intrigues and physical threats as he struggles to save his people.
>
>2.The Hitchhiker's Guide to the Galaxy by Douglas Adams - This comedic sci-fi novel follows Arthur Dent, who is told Earth will be destroyed to make way for an intergalactic bypass and escapes with his alien friend Ford Prefect. They travel together through space, encountering strange creatures and learning about the universe.
>
>3.The Foundation Trilogy by Isaac Asimov - This series follows the life of psychohistorian Hari Seldon and his efforts to create a foundation based on psychology and mathematics to preserve human knowledge after the fall of the Empire. It includes the novels Foundation, Foundation and Empire, and Second Foundation.
>
>4.The Martian Chronicles by Ray Bradbury - This collection of stories follows the colonization of Mars by humans, their interactions with the native Martians, and their eventual departure from the planet. It examines themes such as technology, religion, and the human spirit.
>
>5.Ender’s Game by Orson Scott Card - This novel follows Andrew “Ender” Wiggin, a gifted young boy recruited by the military to train as a leader of an army of genetically-engineered children. He must use his intelligence and empathy to fight a war against an alien race.

70

big_ol_tender t1_jdjcfc8 wrote

The Alpaca dataset has a non-commercial license, so idk what they are doing. I’ve asked Stanford to change it but have heard nothing back.

13

MasterEpictetus t1_jdjdobi wrote

This type of work needs to get a lot of attention. It's a great demonstration of getting instruction-following behavior without the biggest LLMs that only companies with major resources can train.

154

machineko t1_jdjeh6y wrote

We have a similar open-source project focused on personalization of LLMs and efficient fine-tuning: https://github.com/stochasticai/xturing

We actually released code for GPT-J, LLaMA, and GPT-2 before these guys, but we are a small team. You can run it on your local machine too.
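
For anyone who wants to try it, the basic flow looks roughly like this. This is a minimal sketch based on the repo's README at the time; class names like `InstructionDataset`, the dataset field names, and the `"llama_lora"` model key may have changed, so check the current docs:

```python
# Minimal sketch of instruction fine-tuning with xturing (stochasticai/xturing).
# Dataset path and model key are illustrative; see the repo README for the
# currently supported models and the exact dataset format.
from xturing.datasets.instruction_dataset import InstructionDataset
from xturing.models import BaseModel

# Alpaca-style instruction records (see the repo for the exact field names)
dataset = InstructionDataset("./alpaca_data")

# LoRA-adapted LLaMA; smaller keys like "gpt2" follow the same API
model = BaseModel.create("llama_lora")

model.finetune(dataset=dataset)                 # fine-tune on the instruction data
print(model.generate(texts=["Explain LoRA in one sentence."]))

model.save("./llama_lora_finetuned")            # reload later with
                                                # BaseModel.create("llama_lora", "./llama_lora_finetuned")
```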

182

Colecoman1982 t1_jdjkgjy wrote

When you asked, did you clarify that you were asking about the training data versus the whole project? The final Alpaca project was built, in part, on top of Meta's LLaMA. Since LLaMA has a strictly non-commercial license, there is no way that Stanford can ever release the final project for commercial use (as they've already stated in their initial release of the project). On the other hand, any training data they've created on their own (without needing any code from LLaMA) should be within their power to re-license. If they think you are asking for the whole project to be re-licensed, they are likely to just ignore your request.

23

MjrK t1_jdjqz9h wrote

> We emphasize that Alpaca is intended only for academic research and any commercial use is prohibited. There are three factors in this decision: First, Alpaca is based on LLaMA, which has a non-commercial license, so we necessarily inherit this decision. Second, the instruction data is based on OpenAI’s text-davinci-003, whose terms of use prohibit developing models that compete with OpenAI. Finally, we have not designed adequate safety measures, so Alpaca is not ready to be deployed for general use.

https://crfm.stanford.edu/2023/03/13/alpaca.html

22

ZetaReticullan t1_jdjrecp wrote

What a time to be alive! jointly terrifying and exciting!

18

Reeeeeeeeedit t1_jdjryxm wrote

Where is the instruction training data? I couldn’t find it in the GitHub repo.

6

SWESWESWEh t1_jdk8rtn wrote

Doing the lord's work, my friend. Does it work with Apple Silicon's Metal shaders? I've trained my own models, as both TF and PyTorch support it, but I've noticed a lot of people use CUDA-only methods, which makes it hard to use open-source stuff.

17

__Maximum__ t1_jdkdtp2 wrote

ClosedAI is feeding off of our data. If we start using/supporting Open Assistant instead, it will beat ChatGPT in a month or two.

54

__Maximum__ t1_jdkepie wrote

Also, it's very shady for a company called OpenAI. They claimed they became for-profit because they needed the money to grow, but these restrictions just show that they are filthy liars who only care about keeping power and making profit. I'm sure they already have a strategy for getting around that 30B cap, just like they planned on stealing money and talent by calling themselves a non-profit first.

17

kromem t1_jdkfj5w wrote

> The model underlying Dolly only has 6 billion parameters, compared to 175 billion in GPT-3, and is two years old, making it particularly surprising that it works so well. This suggests that much of the qualitative gains in state-of-the-art models like ChatGPT may owe to focused corpuses of instruction-following training data, rather than larger or better-tuned base models.

The exciting thing here is the idea that progress in language models is partially contagious backwards to earlier ones: newer models generate the data used to update older ones, not in pre-training but in fine-tuning (and I expect, based on recent research into in-context learning, that this would extend to additional few-shot prompting).

I'm increasingly wondering if we'll see LLMs develop into rolling releases, particularly in the public sector, possibly with an emphasis on curating the dataset for fine-tuning and a platform-agnostic stance towards the underlying pre-trained model powering it.

In any case, it looks more and more like the AI war between large firms will trickle down into open alternatives whether they'd like it to or not.

38

light24bulbs t1_jdks13d wrote

Question: I notice there's a focus here on fine-tuning for instruction following, which is clearly different from the main training, where the LLM just reads text and tries to predict the next word.

Is there any easy way to continue that bulk part of the training with some additional data? Everyone seems to be trying to get there by injecting embedded chunks of text into prompts (my team included), but that approach just stinks for a lot of uses.
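
(For context, a rough sketch of the "inject embedded chunks into the prompt" approach being described here, using sentence-transformers; the chunk store, model name, and question are just illustrative:)

```python
# Sketch of retrieval-augmented prompting: embed document chunks, retrieve the
# closest ones for a question, and paste them into the prompt. Illustrative only.
from sentence_transformers import SentenceTransformer, util

embedder = SentenceTransformer("all-MiniLM-L6-v2")
chunks = ["...chunk 1 of your documents...", "...chunk 2...", "...chunk 3..."]
chunk_embeddings = embedder.encode(chunks, convert_to_tensor=True)

question = "What does section 3 say about termination?"
hits = util.semantic_search(
    embedder.encode(question, convert_to_tensor=True), chunk_embeddings, top_k=2)[0]

context = "\n".join(chunks[hit["corpus_id"]] for hit in hits)
prompt = (f"Use the context to answer the question.\n\n"
          f"Context:\n{context}\n\nQuestion: {question}\nAnswer:")
# `prompt` then goes to the LLM; the question above is whether there's an easy
# way to keep training the base model on the bulk text instead.
```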

8

mxby7e t1_jdktvqr wrote

The license won’t change. The dataset was collected in a way that violates OpenAI's terms of service, since their API was used to generate the data. If Stanford allowed commercial use, it would open them up to a lawsuit.

8

throwaway2676 t1_jdl0y80 wrote

Alpaca was only trained on 50k instructions, right? A large group of grad students, or a forum like Reddit, could construct that many manually in a couple of weeks. I'm surprised they even had to resort to using ClosedAI.

8

mxby7e t1_jdl18t6 wrote

Maybe. Open Assistant, by Stability.ai, is doing this type of manual dataset collection. The training data and the model weights are supposed to be released once training is complete.

11

dreamingleo12 t1_jdl3qgp wrote

It’s just a shameless copy of Stanford’s work. The innovative thing about Stanford Alpaca is that it builds a ChatGPT-style assistant from a language model, Meta's LLaMA, at low cost. Databricks just followed Stanford’s approach, used a different base model, and claims it’s a big innovation. Alpaca can actually be fine-tuned on the same dataset in 3 hours and performs better than Databricks’ model.

4

Educational_Ice151 t1_jdl47lq wrote

Hello Dolly. This looks pretty interesting. I have been playing with creating cross-model feedback loops that iterate for several cycles using few-shot prompts and chain-of-thought models. This would work really well for my concept. I’ll likely publish my code in a day or two.

Shared to r/aipromptprogramming

10

WarAndGeese t1_jdl5t0z wrote

Boo hoo to OpenAI; people should do it anyway. Are the terms of service the only reason not to, or are there actual material barriers? If it's a problem of money, then as long as people know how much, it can be crowdfunded. If it's a matter of people power, then there are already large volunteer networks. Or is it just something that isn't practical or feasible?

7

elbiot t1_jdlgxnz wrote

In my understanding, if you have text, it's not a challenge to train on next-word prediction. Just keep the learning rate low. The reason there's a focus on instruction-based fine-tuning is that that data is harder to come by.

My only experience is that I've done this with a sentence-embedding model (using SBERT): I trained on my new text and the original training data 50/50, and the model both got better at embedding my text and didn't forget how to do what it was originally trained on.
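
A minimal sketch of that kind of continued next-word-prediction training with Hugging Face Transformers (model name, file name, and hyperparameters are placeholders, not anything from this thread):

```python
# Sketch: continued causal-LM (next-word prediction) training on your own text.
# Keep the learning rate low, and optionally mix in original-domain data 50/50
# as described above to limit forgetting.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "EleutherAI/gpt-neo-125M"          # small model for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Plain text file(s) of the domain data you want to keep training on
raw = load_dataset("text", data_files={"train": "my_domain_text.txt"})
tokenized = raw["train"].map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="continued-pretrain",
                           per_device_train_batch_size=2,
                           num_train_epochs=1,
                           learning_rate=1e-5),   # low learning rate
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```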

5

master3243 t1_jdlhb8c wrote

I have a theory that the main reason OpenAI decided to start keeping its training and architectural details private is that, through minor modifications to training data and data augmentation, they were able to gain significant improvements in the qualitative output of GPT.

Thus any competitor could replicate the pipeline with ease and reproduce the improvements, so they decided to keep it as a trade secret.

Glad more research like this is being done and shared to the rest of the community.

29

Disastrous_Elk_6375 t1_jdlix6j wrote

The demo was up for a couple of days. The first hours of it being online were rough (80-200 people in the queue). It got better the following day, and better still on the third day. I believe they removed the demo about a week later. IMO they've proven a point: the demo was extremely impressive for a 7B model.

1

Disastrous_Elk_6375 t1_jdlj4rn wrote

> and uses a different base model and claims it’s a big innovation

Huh? My read of their blog was that they wanted to highlight the fact that you can fine-tune a ~2-year-old LLM and still get decent results. I don't think they claimed this is innovative, or that the innovation is theirs to boast about...

I've played with GPT-Neo (non-X) and GPT-J when they were released, and the results were rough. You had to do a ton of prompt-engineering work and exploration to find useful cases. This shows that even smaller, older models can be fine-tuned with the method proposed in Alpaca.

14

lexcess t1_jdlj8tf wrote

Classy, especially when they are breezing past any copyright on the datasets they are training on. I wonder if they can legally enforce that without setting a potentially bad precedent for themselves, or whether it could be worked around if the training were done indirectly, through something like Alpaca.

3

dreamingleo12 t1_jdlkbxl wrote

WSJ:

“Databricks Launches ‘Dolly,’ Another ChatGPT Rival The data-management startup introduced an open-source language model for developers to build their own AI-powered chatbot apps” (Apparently DB paid them)

DB’s blog:

“Democratizing the magic of ChatGPT with open models”

Introduced? ChatGPT rival? Didn’t you just follow Stanford’s approach? You used Stanford’s dataset, which was generated by GPT, right? Huh? This is Stanford’s achievement, not DB’s. DB went too far with the marketing.

1

SeymourBits t1_jdlkln7 wrote

I second this. I was able to extract fairly useful results from Neo, but it took a huge amount of prompt trial and error, eventually getting decent/stable results, though not in the same ballpark as GPT-3+. The Dolly training results here seem good, if not expected. I'm now ready to move to a superior model like LLaMA/Alpaca, though. What are you running?

7

Disastrous_Elk_6375 t1_jdllii0 wrote

> https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html

This is the blog post that I read. I can't comment on the WSJ article, and your original message implied a bunch of things that, IMO, were not found in the blog post. If you don't like the WSJ angle, your gripe should be with them, not Databricks. shrug

From the actual blog:

> We show that anyone can take a dated off-the-shelf open source large language model (LLM) and give it magical ChatGPT-like instruction following ability by training it in 30 minutes on one machine, using high-quality training data.

> Acknowledgments
>
> This work owes much to the efforts and insights of many incredible organizations. This would have been impossible without EleutherAI open sourcing and training GPT-J. We are inspired by the incredible ideas and data from the Stanford Center for Research on Foundation Models and specifically the team behind Alpaca. The core idea behind the outsized power of small dataset is thanks to the original paper on Self-Instruct. We are also thankful to Hugging Face for hosting, open sourcing, and maintaining countless models and libraries; their contribution to the state of the art cannot be overstated.

More to the point of your original message, I searched for "innovative", "innovation", and "innovate" and found 0 results in the blog post. I stand by my initial take: the blog post was fair, informative, and pretty transparent about what they've done, how, and why.

7

dreamingleo12 t1_jdllxww wrote

Well, if you'd ever worked with marketing or communications teams, you'd know that DB co-authored the WSJ article. My point is that the democratization is an achievement of the Stanford Alpaca team, not DB. DB marketed it as if they had done the major work, which is untrue.

−6

visarga t1_jdlonpq wrote

One way to speed this up is to make an extension for voluntary contributions of LLM interactions to open source. A user decides when a chat deserves to be donated to open source and pushes a button to share. I don't think OpenAI can object to users donating their data.

7

visarga t1_jdloqee wrote

Most of our pre-2020 NLP skills are worthless now; what used to require bespoke models and datasets is just another emergent LLM ability. It's like a new starting line and we don't know what human skills will be valuable in the future.

15

sdmat t1_jdm0pmi wrote

> It's like a new starting line and we don't know what human skills will be valuable in the future.

With each passing day, the creature stirs, growing hungrier and more restless. The ground trembles beneath our feet, but we dismiss the warning signs.

Text above naturally written by GPT4.

Maybe we should start flipping the assumption - why would you want a human if inexpensive and dependable AI competence is the default?

5

ephemeralentity t1_jdm6wkc wrote

Playing around with this. Running BaseModel.create("llama_lora") seems to return "Killed". I'm running it on WSL2 from Windows 11, so I'm not sure if that could be the issue. It's on an RTX 3070 with only 8GB of VRAM, so maybe that's the real problem...

EDIT - Side note: I first tried directly on Windows 11, but it seems the deepspeed dependency is not fully supported: https://github.com/microsoft/DeepSpeed/issues/1769
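
(Not from the thread, just a guess: a bare "Killed" on WSL2 is usually the Linux OOM killer running out of system RAM while the weights load, rather than a VRAM error. If so, raising the WSL2 memory/swap limits in `%UserProfile%\.wslconfig` can help; the values below are purely illustrative.)

```ini
# %UserProfile%\.wslconfig  (illustrative values; match them to your machine)
[wsl2]
memory=24GB
swap=32GB
```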

2

light24bulbs t1_jdmad5n wrote

Hey, I've been looking at this more and it's very cool. One thing I REALLY like is that I see self-training using dataset generation on your roadmap. This is essentially the technique Facebook used to train Toolformer, if I'm reading their paper correctly.

I'd really love to use your library to try to reimplement Toolformer's approach someday.

2

SDRealist t1_jdmdwkl wrote

Users could certainly donate their questions, but I believe the TOS for ChatGPT forbid using the generated output to train competing models (at least for commercial purposes).

8

Daveboi7 t1_jdme4qf wrote

Can we just download the model?

1

gamerx88 t1_jdmndip wrote

Food for thought: is this really surprising, considering that the InstructGPT paper in early 2022 already showed that even a 1.3B model could, after RLHF, beat a much larger 175B model?

I guess what this shows is that it's the data that matters rather than SFT vs. RLHF. Wondering if any ablation studies have been done here.

2

RiyazRockz t1_jdnbroi wrote

Hey, I want to fine-tune a model to solve a pharma-related problem, and I'd like to know if I can fine-tune my model with this. Could you please share your contact details so that I can learn more about it?

1

ebolathrowawayy t1_jdnc05i wrote

But what if you're training a model for a narrow use case and don't intend for anyone to use it except a niche set of users? Is that enough to be in the clear? Or is any use of OpenAI's model output to train a model, for any purpose, a no-no?

1

Daveboi7 t1_jdnedrd wrote

But which cloud service did you use to train them?

I tried using databricks to train a model but the setup was too complicated.

I’m wondering is there a more straightforward platform to train on?

1

baffo32 t1_jdnppmp wrote

This is the same task as instruction tuning; instruction tuning just uses specific datasets where instructions are followed. It's called "fine-tuning", but nowadays people are using adapters and PEFT to do this on low-end systems.
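
A minimal sketch of that adapter/PEFT route (the base model and LoRA hyperparameters are illustrative, not from this comment):

```python
# Sketch: wrapping a causal LM with a LoRA adapter via the PEFT library so
# instruction tuning only updates a small fraction of the weights.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B",
                                            torch_dtype=torch.float16)

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # GPT-J attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()        # typically well under 1% of the weights
# ...then run your usual training loop / Trainer on instruction-formatted data.
```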

1

biggieshiba t1_jdojnn6 wrote

I don't understand why anyone would care; in a few years half the internet will be AI-generated. If someone uses GPT-4 to generate a sentence posted on Wikipedia, how will you know before using it? Don't you think many models will use that sentence?

Plus, how will they know? Training data is not easy to extract from a model. Unless you are a direct OpenAI competitor, they won't ever care or even look at you (well, maybe their superAI will).

Lastly, the dataset is full of errors; it would be better to generate it again, or even pay people, which would be quite cheap for 50k examples. It's quite a bad dataset when you really look at it: empty inputs or outputs, unclear instructions, instructions that don't fit the model... The fact that it is bad and small is very encouraging, BTW, since the model still performs pretty well.

2

ephemeralentity t1_jdp2pu8 wrote

Thanks, looks like gpt2 worked! Sorry, stupid question, but how do I save/reuse the results of my model fine-tune? When I re-finetune for 0:2 epochs it gives a reasonable response, but if I try to skip model.finetune, it responds with newlines only (\n\n\n\n\n\n\n\n ...).

1

SatoshiNotMe t1_jdpgj80 wrote

I hope this is not closely tied to the Databricks ecosystem (i.e., their notebooks, Spark clusters, etc.). Running things in DB notebooks is not a pleasant experience.

1

machineko t1_jdqzmyq wrote

`model.save("path/to/your/weights")` saves it to the directory. After that, you can load it with:

`model = BaseModel.create("gpt2", "path/to/your/weights")`

Can you share the input text you have used? It is possible that GPT-2 is too small and needs custom generation parameters.

2

baffo32 t1_jdrhj77 wrote

I was still confused by your response, and I'm thinking that if you wanted a model to behave as if it had been given different pretraining data, you would probably first fine-tune on the new bulk data, and then after that fine-tune on the target task, such as instruction following.

Instruction following is indeed, of course, just predicting the next word: on data where the next word is obedient to the instructions preceding it.

1

light24bulbs t1_jdrm9kh wrote

That's the part I wasn't getting. I assumed the fine-tuning involved a different process. I see now that it is in fact just more training data, often templated into a document in such a way that it's framed clearly for the LLM.

The confusing thing is that most of the LLM-as-a-service companies, OpenAI included, will ONLY take data in the question-answer format, as if that's the only data you'd want to use for fine-tuning.

What if i want to feed a book in so we can talk about the book? A set of legal documents? Documentation of my project? Transcriptions of TV shows?

There are so many use cases for training on top of an already pre-trained LLM that aren't just question answering.

I'm into training LLaMA now. I simply took some training code I found, removed the JSON-parsing question-answer templating stuff, and was done.
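
For reference, the "templating stuff" being removed is roughly this: Alpaca-style code flattens each instruction record into one document and then does ordinary next-word-prediction training on it. The template below is paraphrased from the Stanford Alpaca repo, so treat the exact wording as approximate:

```python
# Sketch: flattening an Alpaca-style record into a single training document.
# (Alpaca actually uses a second, shorter template when "input" is empty.)
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n{output}"
)

record = {
    "instruction": "Summarize the plot of Dune in two sentences.",
    "input": "Dune is a 1965 science fiction novel by Frank Herbert.",
    "output": "Paul Atreides and his family take control of the desert planet Arrakis...",
}
training_text = PROMPT_TEMPLATE.format(**record)
# For a book, legal documents, or transcripts, skip the template entirely and
# train on the raw text, exactly as in ordinary pre-training.
```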

1

SatoshiNotMe t1_jdtemml wrote

So if the notebook is tuning on a fixed dataset, anyone running it will arrive at the same weights after an expensive compute run, which seems wasteful. Why not just share the weights, i.e. the final trained + tuned model? Or is that already available?

1

nemorocksharder t1_jdz8kt5 wrote

What you're describing is exactly what I have been looking to do too, and I'm really surprised I'm not hearing more about it. Have you found any useful approaches to essentially adding to the LLM's corpus with target material/text? Or is anyone else trying to do this?

1

SatoshiNotMe t1_jealb7d wrote

Is there a "nice" way to use this model, (say, via the command-line like in the GPT4All or alpaca.cpp repos), rather than in a databricks notebook or in HG spaces? For example I'd like to chat with it on my M1 MacBook Pro. Any pointers appreciated!
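
(One possible route, not an official one: if the fine-tuned weights end up on the Hugging Face Hub (the model id below, `databricks/dolly-v1-6b`, is an assumption), you can load them with plain Transformers and generate on the M1's MPS backend. A 6B model in fp16 wants roughly 12-16 GB of memory, so it's tight on smaller MacBooks.)

```python
# Sketch: a bare-bones command-line chat loop on Apple Silicon via PyTorch MPS.
# The model id is an assumption; substitute whatever checkpoint is published.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "databricks/dolly-v1-6b"
device = "mps" if torch.backends.mps.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id,
                                             torch_dtype=torch.float16).to(device)

while True:
    prompt = input("You: ")
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    output = model.generate(**inputs, max_new_tokens=200,
                            do_sample=True, top_p=0.9)
    reply = tokenizer.decode(output[0][inputs["input_ids"].shape[1]:],
                             skip_special_tokens=True)
    print("Dolly:", reply)
```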

1