Hacker News new | past | comments | ask | show | jobs | submit login

Oh, that means its a llama architecture model!

Is the tokenizer the same? It may "work" without actually working optimally until llama.cpp patches it in.

And the instruct model was just uploaded.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: