Submitted by Destiny_Knight t3_11tab5h in singularity
Straight-Comb-6956 t1_jcj7fn3 wrote
Reply to comment by FoxlyKei in Those who know... by Destiny_Knight
0. llama.cpp runs on CPU and uses plain RAM.
I've managed to launch 7B Facebook LLAMA with 5GB memory consumption and 65B model with just 43GB.
Viewing a single comment thread. View all comments