Milvus Lite: The Lightweight Version of Milvus

nutanc · 2024-06-02T17:33:40 1717349620

This is awesome. Does Milvus lite also support binary embeddings?

ko_pivot · 2024-06-02T13:59:08 1717336748

Maybe I don’t have enough ‘AI’ experience to understand, but I’m not getting the future of vector databases. 90% of the use cases I’ve encountered also benefit from keyword search, faceting, etc. and therefore a more traditional search engine like Elastic, Meilisearch, or even Postgres makes more sense than something that is purely focused on the vector index. At this point every search engine has a solid vector and hybrid search implementation.

manishsharan · 2024-06-02T14:31:30 1717338690

I have been playing with Milvus, but as my use case evolves, I think pgvector may be a better fit. I currently store a lot of enriched data in PG and embeddings in Milvus. Consolidating them into one DB makes sense to me.

mind-blight · 2024-06-03T03:18:22 1717384702

We're using pgvector alongside our other data. It has pros and cons. I've found indexing to be really slow, so we don't index vectors. We just make sure the query filters down to a small enough subset that a direct comparison is good enough.
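A minimal sketch of that filter-then-compare approach, in plain Python rather than SQL. The row layout (`id`, `tenant_id`, `embedding`) and the `top_k_in_subset` helper are hypothetical names for illustration, not anything from pgvector itself. In pgvector the same idea would be a `WHERE` clause plus an `ORDER BY` on the `<=>` cosine-distance operator, with no vector index involved.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_in_subset(rows, query, tenant_id, k=3):
    """Filter rows with an ordinary predicate, then compare every survivor directly."""
    # Step 1: the cheap, conventional filter (hypothetical tenant_id column).
    candidates = [r for r in rows if r["tenant_id"] == tenant_id]
    # Step 2: exhaustive comparison over the small remaining subset.
    scored = [(cosine_similarity(r["embedding"], query), r["id"]) for r in candidates]
    scored.sort(reverse=True)
    return [row_id for _, row_id in scored[:k]]


rows = [
    {"id": 1, "tenant_id": 1, "embedding": [1.0, 0.0]},
    {"id": 2, "tenant_id": 1, "embedding": [0.0, 1.0]},
    {"id": 3, "tenant_id": 2, "embedding": [1.0, 0.0]},
]
nearest = top_k_in_subset(rows, query=[1.0, 0.0], tenant_id=1, k=2)
```

The cost is linear in the size of the filtered subset, so this only holds up while the filter keeps that subset small.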

The other thing we've encountered is that vectors take up a lot of storage space compared to normal columns (easily a couple of KB per row). You can fill up a DB really quickly, especially if you're embedding really small chunks.
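A rough back-of-the-envelope estimate of that storage cost, assuming each dimension is stored as a 4-byte float32 and ignoring per-row and index overhead. The dimension counts and row count below are illustrative assumptions, not figures from this thread.

```python
BYTES_PER_FLOAT32 = 4  # assumption: single-precision components


def vector_bytes(dimensions):
    """Raw size of one embedding, excluding any per-row overhead."""
    return dimensions * BYTES_PER_FLOAT32


def total_bytes(row_count, dimensions):
    """Raw size of the embedding column for a whole table."""
    return row_count * vector_bytes(dimensions)


per_row_768 = vector_bytes(768)            # 3072 bytes, i.e. 3 KiB per row
per_row_1536 = vector_bytes(1536)          # 6144 bytes, i.e. 6 KiB per row
million_rows = total_bytes(1_000_000, 1536)
print(f"{million_rows / 1024**3:.2f} GiB for 1M rows at 1536 dimensions")
```

Smaller chunks mean more rows for the same source text, so the embedding column grows while the text it describes stays the same size.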

paul-tharun · 2024-06-02T16:01:50 1717344110

You should take a look at qdrant then. Might fit your use case

spacecadet · 2024-06-02T14:25:40 1717338340

I don't think anyone (except maybe Milvus Kool-Aid drinkers) would disagree. I recently used Milvus for the first time, and it made sense because it was quick to implement, purpose-built, and is working exactly as I intended. Doesn't mean I'll go around blindly using Milvus everywhere. I also like Neo, Duck, Postgres, Parquet, etc. Just tools.

menacingly · 2024-06-02T14:36:32 1717338992

Definitely overhyped, but not useless. Consider one of your examples, Elastic. It's often employed in situations where the DB could handle what it's doing just fine, but it survives, largely because of optimizations it is free to make knowing it is targeted at a narrow set of tasks.

mewpmewp2 · 2024-06-02T15:34:50 1717342490

Scaling, accuracy, and search speed are very important for vector DBs. Do general-purpose databases scale as well as specialized ones?

Because ideally they hold massive, well optimised indexes in their memory to be able to search quickly and not miss any vectors.

sa-code · 2024-06-02T17:04:21 1717347861

Vector databases are just the easiest way to make a search engine in a demo.

ukuina · 2024-06-02T20:08:48 1717358928

This is the right answer. When it's time to make a product, you either use a managed VectorDB or ditch it for a more traditional datastore with careful indexing.

sa-code · 2024-06-13T14:47:56 1718290076

Disagree on using a managed vector db. That's just the same thing except you're paying someone else money? "Traditional datastore" could mean anything. Info retrieval and search have very established players like the Lucene ecosystem.

mikl · 2024-06-02T16:11:14 1717344674

It’ll be nice when the AI hype settles down a bit, so many of these “re-invent the wheel, with more AI sprinkles” projects popping up.

So many existing DBs can already do vector search, do we really need one dedicated to just that?

syntaxfree · 2024-06-02T15:25:30 1717341930

So like chromadb

gkapur · 2024-06-02T18:32:02 1717353122

Not really. This is more like SQLite or DuckDB for vector databases (on disk). Chroma is more like Redis for vector databases (in memory).

We have seen similar products in the OLAP space as well, e.g. ClickHouse Local.

valstu · 2024-06-03T08:06:36 1717401996

Chroma used DuckDB at some point, might not be the case anymore though

mhuffman · 2024-06-02T18:55:07 1717354507

Doesn't DuckDB already do vector search?

Kydlaw · 2024-06-02T19:09:20 1717355360

They do have an extension for vector similarity search (https://duckdb.org/docs/extensions/vss).

But Milvus might offer more features, as they have been in this specific space for longer.