FoxlyKei t1_jciyxpz wrote

Wait, so Alpaca is better than GPT-3 and I can run it on a mid-range gaming rig like Stable Diffusion? Where would it stand relative to GPT-3, 3.5, or 4?

65

pokeuser61 t1_jcj294w wrote

You don't even need a gaming rig: https://github.com/ggerganov/llama.cpp

42

FoxlyKei t1_jcj30yc wrote

How much VRAM do I need, then? I look forward to a larger model trained on GPT-4; I can only imagine what the next month will bring. I'm excited and scared at the same time.

19

bemmu t1_jcj6zrc wrote

You can try Alpaca out super easily. I heard about it last night, followed the instructions, and had it running in five minutes on my GPU-less old Mac mini:

Download the file ggml-alpaca-7b-q4.bin, then in terminal:

git clone https://github.com/antimatter15/alpaca.cpp  
cd alpaca.cpp  
make chat  
./chat

49

XagentVFX t1_jcl71ht wrote

Dude, thank you so much. I was trying to download LLaMA a different way but it flopped, then resorted to GPT-2. This was super easy.

6

R1chterScale t1_jcj4i3i wrote

Not GPU, CPU, so it uses normal RAM, not VRAM. It takes about 8 GB to itself.

26

FoxlyKei t1_jcj6xmh wrote

Oh? So this only uses RAM? I'd understood that Stable Diffusion requires VRAM, but I guess that's just because it's processing images. Most people have plenty of RAM. Nice.

14

R1chterScale t1_jcjgd0x wrote

Models can use either VRAM or RAM depending on whether they're accelerated with a GPU. It has nothing to do with what they're actually processing; it's just different implementations.

19

iiioiia t1_jckjt70 wrote

Any rough idea what the performance difference is vs. a GPU (of various powers)?

And does more RAM help?

3

Straight-Comb-6956 t1_jcj7fn3 wrote

llama.cpp runs on the CPU and uses plain RAM.

I've managed to launch Facebook's 7B LLaMA with 5GB memory consumption and the 65B model with just 43GB.
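Those figures roughly match back-of-the-envelope math for 4-bit quantized weights. Here's a sketch of the arithmetic (rough estimate only; the parameter counts are nominal, and real files add per-block scale factors, context/KV-cache memory, and runtime overhead on top):

```shell
# Approximate RAM needed just to hold 4-bit quantized weights.
# bytes = params * bits / 8; divide by 1e9 for GB (integer math).
params_7b=7000000000
params_65b=65000000000
bits=4
echo "7B:  $(( params_7b  * bits / 8 / 1000000000 )) GB"
echo "65B: $(( params_65b * bits / 8 / 1000000000 )) GB"
```

The quantized weights alone account for most of the footprint; the gap up to the observed 5GB / 43GB is the extra overhead mentioned above.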

18

KingdomCrown t1_jckiexy wrote

Alpaca has similar quality to GPT-3, not better. For more complex questions it's closer to GPT-2.

14