Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
vessenes
10 months ago
|
parent
|
context
|
favorite
| on:
The Llama 4 herd
well you don't. but the power of gradient descent if properly managed will split them up for you. But you might get more mileage out of like 200 specialist models.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: