Hacker News new | past | comments | ask | show | jobs | submit login
Milvus Lite: The Lightweight Version of Milvus (milvus.io)
56 points by stephen37 8 months ago | hide | past | favorite | 17 comments



This is awesome. Does Milvus lite also support binary embeddings?


Maybe I don’t have enough ‘AI’ experience to understand, but I’m not getting the future of vector databases. 90% of the use cases I’ve encountered also benefit from keyword search, faceting, etc. and therefore a more traditional search engine like Elastic, Meilisearch, or even Postgres makes more sense than something that is purely focused on the vector index. At this point every search engine has a solid vector and hybrid search implementation.


I have been playing with Milvus but as my use case evolves, I think PGVector may be a better fit . I currently store a lot of enriched data in PG and embeddings in Milvus. Consolidating them into one DB makes sense to me.


We're using PG vector alongside our other dat. It's has pros and cons. I've found checking to be really slow, so we don't index vectors. We just make sure the query filters down on a small enough subset where a direct comparison is good enough.

The other thing we've encountered is that vectors take up a lot of storage space compared to the normal columns (easily a couple kb per row). You can fill up a db really quickly, especially if you're embedding really small chunks


You should take a look at qdrant then. Might fit your use case


I dont think any (maybe Milvus cool-aid drinkers) would disagree. I recently used Milvus(for the first time) it made sense because it was quick to implement, purpose built, and is working exactly as I intended. Doesn't mean Ill go around blindly using Milvus everywhere. I also like Neo, Duck, Postgres, Parquet, etc etc. Just tools.


definitely overhyped, but not useless. Consider one of your examples, Elastic. It's often employed in situations where the db could handle what it's doing just fine, but it survives, Largely because of optimizations it is free to make knowing it is targeted at a narrow set of tasks.


Scaling, accuracy and search quickness is very important for vector dbs. Do other general purpose databases scale as well as specialized ones?

Because ideally they hold massive, well optimised indexes in their memory to be able to search quickly and not miss any vectors.


Vector databases are just the easiest way to make a search engine in a demo.


This is the right answer. When it's time to make a product, you either use a managed VectorDB or ditch it for a more traditional datastore with careful indexing.


Disagree on using a managed vector db. That's just the same thing except you're paying someone else money? "Traditional datastore" could mean anything. Info retrieval and search have very established players like the Lucene ecosystem.


It’ll be nice when the AI hype settles down a bit, so many of these “re-invent the wheel, with more AI sprinkles” projects popping up.

So many existing DBs can already do vector search, do we really need one dedicated to just that?


So like chromadb


Not really. This is more like SQLite or DuckDB for vector databases (on disk.) Chroma is more like redis for vector databases (in memory.)

We have seen similar products in the olap space, as well, ie. Clickhouse local.


Chroma used DuckDB at some point, might not be the case anymore though


Doesn't DuckDB already do vector search?


They do have an extension for vector similarity search (https://duckdb.org/docs/extensions/vss).

But Milvius might propose more features, as they have been in this specific space for longer.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: