
I'm getting >30 tokens/sec using it with ollama and an M2 Pro. That might be a little slow though because I have a background finetuning job running.
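For reference, ollama's /api/generate responses report eval_count (tokens generated) and eval_duration (nanoseconds), so you can compute tokens/sec yourself rather than eyeballing it. A quick sketch — the numbers below are made up, not real measurements:

```python
def tokens_per_sec(eval_count: int, eval_duration_ns: int) -> float:
    """Throughput from ollama's /api/generate stats:
    eval_count tokens generated over eval_duration_ns nanoseconds."""
    return eval_count / (eval_duration_ns / 1e9)

# Hypothetical stats pulled from a response (illustrative only):
print(tokens_per_sec(120, 4_000_000_000))  # 120 tokens over 4 s -> 30.0
```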


Bit of a tangential question here, but any recommendations on how to get started fine-tuning this model (or ones like it)? I feel like there are a million different tutorials and ways of doing it when I google.


https://github.com/OpenAccess-AI-Collective/axolotl

This is a wrapper around many training methods, and it has yielded many excellent community finetunes already.
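For a flavor of what axolotl expects, here's a minimal QLoRA-style config sketch. Field names follow axolotl's published examples, but check the repo's docs for your version; the model name and dataset path are placeholders:

```yaml
# Minimal QLoRA fine-tune sketch for axolotl (values are illustrative)
base_model: meta-llama/Llama-2-7b-hf   # placeholder base model
load_in_4bit: true
adapter: qlora

datasets:
  - path: ./data/my_dataset.jsonl      # placeholder dataset
    type: alpaca

sequence_len: 2048
micro_batch_size: 2
gradient_accumulation_steps: 4
num_epochs: 3
learning_rate: 0.0002

lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_modules:
  - q_proj
  - v_proj

output_dir: ./qlora-out
```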


Take a look at the QLoRA repo https://github.com/artidoro/qlora/ which has an example finetuning Llama. Made by the authors of the QLoRA paper.




