Even cached input is only half price with openAI (and they don't even offer it for o1-pro).
Further we also aren't counting input here which can get long since it includes the previous output, which for the last request will be 33,750 + reasoning + any prompts, which will increase your cost quite a bit there.
But yes that is more reasonable than I'd expect I must admit, but I still think it needs to be at least an order of magnitude cheaper to compete against the other models out there.
I'm not sure I know a lot of employees who would allocate that sort of constant funding to a consumables tool for an employee, given that's your usual monthly cost for a typical saas product.
Been a minute since I touched audio code but isn't PCM quite basic? Really hard to beat $18 though! Even the 2hrs it'd take a decent SWE would be easily 30x that.
- 80 chars per line, 30 occupied (avg'd across 300 KLOC in codebase)
- 500 lines of code
- 15000 characters
- 4 chars / token
- 3750 tokens output
- 10 full iterations, and don't apply cached token pricing that's 90% off
- 37,500 tokens req'd in output
- $600 / 1M tokens
- $0.60 / 1K tokens
- $18