This RouteLLM framework sounds really promising, especially for cost optimizatio...

Terretta · on July 11, 2024

Addendum to parent to make link clickable:

https://github.com/pulzeai-oss/knn-router

// HN doesn't handle squared circle as MD.

antupis · on July 11, 2024

Cost is a plus but at least what I see is that getting good response time is even bigger. Something like OpenAI Azure instances are inconsistent and it is far too normal to get a 40sec lag with responses with gpt4-o.