dreaming_geometry
dreaming_geometry t1_je7vmov wrote
Reply to comment by Business-Lead2679 in [D] Training a 65b LLaMA model by Business-Lead2679
If you're having trouble with Vast.ai, you can ask for help on the Discord. Sounds like your desired use case is a good fit.
dreaming_geometry t1_j3hnyvd wrote
Reply to comment by coumineol in [P] searchthearxiv.com: Semantic search across more than 250,000 ML papers on arXiv by universal_explainer
Data not yet collected. Why don't you try some side-by-side comparisons and report back?
dreaming_geometry t1_je7wweh wrote
Reply to [R] LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention by floppy_llama
I've been thinking about trying something like this. Everything is moving so fast in ML now that I feel like nearly every new idea I have gets published before I even find the time to get started.