Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The metric of run/not-run is too simplistic. You have to divide out the total throughout the system gives to all concurrent users (which we don't know). Like a golf-cart can get you from New York to LA same as a train, but the unit economics of the train are a lot more favorable, despite its increased cost. The minimum deployment scale is not irrelevant, it may make it infeasible to run an on-prem solution for most customers for eg, but if you are selling tokens via a big cloud API it doesn't really matter.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: