Hacker News
jandrese on May 31, 2023 | on: Nvidia DGX GH200: 100 Terabyte GPU Memory System
I have to wonder how much improvement you would get with a 100 trillion parameter model. There seem to be diminishing returns in model size. That effort could almost certainly be better spent.