Anthropic found a similar result for retrieval: combining embeddings with BM25 keyword search (a variant of TF-IDF) produced significantly better results than embeddings alone.

https://www.anthropic.com/engineering/contextual-retrieval
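
The fusion step itself can be tiny. Here is a minimal sketch of reciprocal rank fusion, one common way to combine the two rankings (the chunk ids and the k=60 constant are illustrative, not taken from the article):

    def rrf(rankings, k=60):
        # rankings: lists of chunk ids, each ordered best-first
        scores = {}
        for ranking in rankings:
            for rank, chunk_id in enumerate(ranking):
                scores[chunk_id] = scores.get(chunk_id, 0.0) + 1.0 / (k + rank + 1)
        # highest fused score first
        return sorted(scores, key=scores.get, reverse=True)

    bm25_hits = ["c3", "c1", "c7"]        # ranking from keyword search
    vector_hits = ["c1", "c9", "c3"]      # ranking from embedding search
    print(rrf([bm25_hits, vector_hits]))  # c1 and c3 rise to the top

A chunk that appears high in both rankings beats one that dominates only a single ranking, which is why the hybrid tends to outperform either retriever on its own.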

They also found improvements from augmenting the chunks with Haiku, having it prepend a short summary that situates each chunk within the surrounding document.

That seems to benefit both the keyword search and the embeddings by acting as a form of keyword expansion. (Though it's unclear to me whether they tried plain keyword expansion and how that would fare.)
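
For concreteness, the augmentation step looks roughly like this. A sketch assuming the anthropic Python SDK; the prompt is my paraphrase of the idea, not Anthropic's exact prompt:

    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

    def contextualize(chunk: str, full_doc: str) -> str:
        prompt = (
            f"<document>\n{full_doc}\n</document>\n"
            f"Here is a chunk from the document:\n<chunk>\n{chunk}\n</chunk>\n"
            "Write a short context situating this chunk within the overall "
            "document, to improve search retrieval of the chunk. "
            "Answer with only the context."
        )
        msg = client.messages.create(
            model="claude-3-haiku-20240307",
            max_tokens=150,
            messages=[{"role": "user", "content": prompt}],
        )
        # the generated context is prepended before embedding/BM25 indexing
        return msg.content[0].text + "\n\n" + chunk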

---

Anyway, what stands out to me most here is what a Rube Goldberg machine it is: embeddings, keywords, fusion, contextual augmentation, reranking... each stage adding marginal gains.

But then the whole thing somehow works really well together (~1% failure rate on most benchmarks, worse for code retrieval).

I have to wonder how this would look if it weren't a bunch of existing solutions taped together but a fully integrated system.
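
(The reranking stage at the end is similarly small in code. A sketch using the sentence-transformers CrossEncoder API; the model name is just one public example, not the one from the article:

    from sentence_transformers import CrossEncoder

    reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

    def rerank(query: str, chunks: list[str], top_k: int = 5) -> list[str]:
        # score each (query, chunk) pair jointly, then keep the best few
        scores = reranker.predict([(query, c) for c in chunks])
        ranked = sorted(zip(chunks, scores), key=lambda p: p[1], reverse=True)
        return [c for c, _ in ranked[:top_k]]

The cross-encoder reads the query and chunk together, so it can catch relevance signals the first-pass retrievers miss, at the cost of running a model over every candidate.)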





Thanks for sharing! I am working on a RAG engine, and that document provides great guidance.

And, agreed, each individual technique seems marginal, but they really add up. What seems to be missing is an automated layer that determines the best way to chunk documents before embedding them. My use case is mostly normalized, mostly technical documents, so I have a pretty clear idea of how to chunk them to preserve semantics. But I imagine it is a lot trickier for general documents.
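
For documents like that, the chunking can be as simple as splitting on section headings. A hypothetical sketch; the heading heuristic and size cap are illustrative, not a general recipe:

    def chunk_by_heading(text: str, max_chars: int = 2000) -> list[str]:
        # split on markdown-style headings so each section stays intact
        chunks, current = [], []
        for line in text.splitlines():
            if line.startswith("#") and current:
                chunks.append("\n".join(current))
                current = []
            current.append(line)
        if current:
            chunks.append("\n".join(current))
        # fall back to a hard size cap for oversized sections
        out = []
        for chunk in chunks:
            while len(chunk) > max_chars:
                out.append(chunk[:max_chars])
                chunk = chunk[max_chars:]
            out.append(chunk)
        return out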



