Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
rvnx
11 months ago
|
parent
|
context
|
favorite
| on:
DeepScaleR: Surpassing O1-Preview with a 1.5B Mode...
It is great discovery, it could even open a next step in AI with MoM "Mixture of Models", where small fine-tuned models take each part of a task (instead of the current MoE)
mluo
11 months ago
[–]
Check out one of my prior work:
https://stylus-diffusion.github.io/
This work scales up selection/routing over many models/LoRAs
rvnx
11 months ago
|
parent
[–]
Love it, will check, thank you for showing / sharing all of that!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: