Hacker News
anonzzzies
8 months ago
on:
Show HN: Open-source load balancer for llama.cpp
Does it do queuing? I didn't see it in the README. I haven't seen (though that says nothing at all) an open-source solution that queues requests when all instances are busy and lets me show a countdown of people in the queue, like the closed ones do.
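To make the ask concrete, here is a minimal sketch of the behavior I mean: a fixed number of slots, a FIFO queue for everyone else, and a position lookup that a UI could poll for the countdown. It is not taken from any of the projects in this thread; `SlotQueue` and its method names are hypothetical, and a real proxy would also need timeouts, disconnect handling, and thread safety.

```python
from collections import deque


class SlotQueue:
    """Hypothetical sketch: hand out a fixed number of backend slots,
    queue the remaining requests, and report each waiter's position."""

    def __init__(self, slots: int) -> None:
        self.free = slots        # backend slots currently idle
        self.active = set()      # request ids holding a slot
        self.waiting = deque()   # request ids waiting, oldest first

    def enqueue(self, request_id: str) -> int:
        """Return 0 if a slot was granted, else the 1-based queue position."""
        if self.free > 0:
            self.free -= 1
            self.active.add(request_id)
            return 0
        self.waiting.append(request_id)
        return len(self.waiting)

    def position(self, request_id: str) -> int:
        """0 means running; N means N-th in line. Raises KeyError if unknown."""
        if request_id in self.active:
            return 0
        try:
            return self.waiting.index(request_id) + 1
        except ValueError:
            raise KeyError(request_id) from None

    def release(self, request_id: str) -> None:
        """Free a slot and promote the oldest waiter, if there is one."""
        self.active.remove(request_id)
        if self.waiting:
            self.active.add(self.waiting.popleft())
        else:
            self.free += 1
```

With one slot, `enqueue("a")` is granted immediately, `enqueue("b")` and `enqueue("c")` get positions 1 and 2, and after `release("a")` the position of `"c"` drops to 1. That number is what the countdown would display.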
ritonlajoie
8 months ago
I stumbled on this yesterday; it seems to have a queue concept:
https://github.com/ParisNeo/ollama_proxy_server
mcharytoniuk
8 months ago
In progress. I added that to the readme; I need the feature myself. :)
friendly_chap
8 months ago
I actually open-sourced one recently that supports queues. It's both a desktop app and a daemon:
https://github.com/singulatron/singulatron
3abiton
8 months ago
So many interesting projects. I just wish AI hardware were readily available.