What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions]
Submitted by Destiny_Knight t3_118svv7 in singularity
My guy, correct me if I'm wrong, but doesn't it outperform humans, in everything but social sciences?...
Yes, and it does it with only 0.4% the size of GPT3, possibly enough to run on a single graphics card.
It uses language and pictures together instead of just language.
Fucking wow!
Yeah it's fucking nuts.
What is the "catch" here? It sounds too good to be true
The catch is that it only outperforms large models in a narrow domain of study. It's not a general purpose tool like the really large models. That's still impressive though.
Can It be fine tuned ?
You can tune it to another data set and probably get good results, but you have to have a nice, high quality data set to work with.
I’m working on one that’s trained on JFK speeches and Bachlorette data to help people with conversation skills.
I can't tell if this is a joke or real
It’s real. Gonna launch after GME moons
Sounds like a viable AI implementation to me. I'll be your angel investor and throw some Doge your way or something.
I don't think that's true, but I do believe it was finetuned on the specific dataset to achieve the SOTA result they did.
It chooses the correct answer from multiple choices. it isn't actually comparable to chatGtp.
Where can I get one? I'll take 20
Around 4GB vram, maybe 2GB to run it.
That’s so cool!! That’s how humans remember things, too
amazing.
does that prove that parameters aren't everything?
It was shown recently that for LLMs ~0.01% of parameters explain >95% of performance.
But higher parameters allow for broader knowledge right? You can't have a 6-20B model have broad knowledge as a 100B+ model, right?
At this point we don't really know what is bottlenecking. More params is an easyish way to capture more knowledge if you have the architecture and the $$... but there are a lot of other techniques available that increase the efficiency of the parameters.
Yes but how many parameters must you actually have to store all the knowledge you realistically need. Maybe a few billion parameters is enough to store the basics of every concept known to man and more specific details can be stored in an external file that the neural net can access with API calls.
You mean like a LoRA?
We already knew parameters aren't everything, or else we'd just be using really large feedforward networks for everything. Architecture, data, and other tricks matter too.
Its much small enough to run on a single graphics card
[deleted]
It's outperforming religion for some now.
https://www.businessinsider.com/rabbi-chat-gpt-ai-sermon-deathly-afraid-2023-2
Atleast AI can make accurate predictions for the next character in a line of text, which is better than any religion has predicted 🤣
Amen.
Religion is just anything you have faith ("belief") in without understanding the belief justification chains (or even that there is such a thing as different kinds of links in belief justification chains).
Thus, modern atheists are religious as well as they don't actually understand the Science (tm) and "logic" that shapes their beliefs, and they end up in culture war battles no different from early religious wars.
Modern science can no longer even predict what a man or woman is, which is just as simple as predicting what color the sky is. As an atheist myself, it's important to acknowledge the win religion has on this one.
Great post. I was about to counter that religion would require some kind of worship, but there's religions such Buddhism that requires no such worship.
There is a lot of worship is SEA tho.
Whst makes you think that atheists don’t understand the logic behind their beliefs? Religion is based off of myth and atheism is evidence based and logical.
Whst makes you think that atheists don’t understand the logic behind their beliefs? Religion is based off of myth and atheism is evidence based and logical.
"ChatGPT might be really great at sounding intelligent, but the question is, can it be empathetic? And that, not yet at least, it can't," added Franklin.
He admitted there's a chance.
It's good at faking empathy, just like humans.
Anyone who's a staunch opponent of the idea of philosophical zombies (to which I am more or less impartial) could very well be open to the idea that ChatGPT is empathetic. If prompted well enough, it can mimic an empathetic person with great realism. And as long as you don't let it forget the previous conversations it's had nor exceed its memory window, it will stay in character and remember past events.
yup and correcr me if I'm wrong, but those aren't average humans either, those are experts in their fields
You are wrong. It’s not experts. It’s randos on mechanical Turk.
rip, they should've included expert performance as well then
You are setting the bar as anything less than perfect is failure.
By that standard, most humans would fail. And most experts are only going to be an expert in one field, not every field, so they would also fail by your standards.
Wtf are you talking about. It's a benchmark, it's to compare performance. I'm not setting any bar, and I'm not expecting it to beat human experts immediately.
Agreed. Stage one was "cogent", stage two was "as good as a human", stage three is "better than all humans". We have already passed stage 2 which could be called AGI. We will soon hit stage 3 which is ASI.
we are a million miles away from AGI.
hey buddy, you might want to check this link -> Dunning-Kruger effect
Is this implying that I don't know anything about AI or that the average person is not knowledge enough to be useful?
But then they wouldn’t be able to say that the AI beats them and it wouldn’t be as flashy of a publication. Don’t you know how academia works?
No. I haven't seen anyone talking about it because it beat humans, it was always about it beating GPT-3 with less than 1B parameters. Beating humans was just the cherry on top. The paper is "flashy" enough, including experts wouldn't change that. Many papers do include expert performance as well, it's not a stretch to expect it.
The human performance number is not from this paper, it is from the original ScienceQA paper. They are they ones that did the benchmarking.
Are you joking or serious ?
Serious, read the paper.
My disappointment is unmeasurable and my day is ruined.
Really? So the time has come where a small-scale AI model being smarter than "ordinary" humans is not impressive.
Awe is so last December - impatience is the new mode. They teased us with the future, now we expect it ASAP!
It's not ordinary humans, it's people on mechanical turk who are paid to do them as fast as possible and for as little money as possible. They are not motivated to actually think that hard.
That's prejudice. You don't know that.
No it is economics, they make less money the longer they stop and think about it.
[deleted]
[deleted]
Only because ai isn't as morally flexible as most ppl are
Viewing a single comment thread. View all comments