Common failure modes I've seen with embedding-only search: 1) many redundant results, 2) empty docs, 3) glitch text getting a high score, 4) matches on topic, style, or form that are completely irrelevant.
But we still need embeddings. Here's why:
a) We need ranking models that can be supervised for our task; otherwise expecting good results is wishful thinking. Embeddings are easily supervised.
b) We need to capture matches with documents that have little to no text overlap with the query. You can't do that with BM25 alone.
c) We need to design technology that's future-proof. LLMs can be converted into embedding models, so embeddings will inherit the benefits of future pre-trained models.
d) We need search to be fast. Agents are powerful but slow. It's hard to imagine a high-quality single-step search that isn't based on embeddings.
Whether you use single or multi-vector is a separate question. Having spent a lot of time modeling compositionality and ambiguity, I find that multi-vector approaches make a lot of sense. If you want to go deep in this space, check out Gaussian Embeddings and Multi-Sense/Facet Embeddings.
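Here's a minimal sketch of why multi-vector representations help with ambiguity. The word, senses, and vectors below are my own toy assumptions: one vector per word sense versus a single averaged vector, scored by taking the best-matching sense.

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

# Hypothetical sense vectors for the ambiguous word "bank".
bank_senses = {
    "finance": [0.9, 0.1, 0.0],
    "river":   [0.0, 0.2, 0.9],
}

# A single-vector model has to average the senses together,
# pulling the representation toward neither meaning.
single_vector = [sum(vals) / len(vals) for vals in zip(*bank_senses.values())]

query_river = [0.1, 0.1, 0.95]  # a query about river banks

# Multi-vector scoring: take the best-matching sense.
multi_score = max(cosine(query_river, v) for v in bank_senses.values())
single_score = cosine(query_river, single_vector)

print(multi_score > single_score)  # True: the sense-level match is sharper
```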
Of course, using late interaction models is a no-brainer. The community is active, the toolchain is there, and the results speak for themselves.
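For readers new to late interaction, the core scoring rule (MaxSim, in the ColBERT style) is simple: each query token embedding picks its best-matching document token embedding, and the document score is the sum of those maxima. The per-token vectors below are hand-made stand-ins for a real encoder, just to show the mechanics.

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

# Toy per-token embeddings (a real system would get these from an encoder).
query_tokens = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
]
doc_tokens = [
    [0.9, 0.1, 0.0],
    [0.1, 0.9, 0.1],
    [0.0, 0.0, 1.0],
]

# MaxSim: sum over query tokens of the max similarity to any doc token.
def maxsim(q_tokens, d_tokens):
    return sum(max(cosine(q, d) for d in d_tokens) for q in q_tokens)

score = maxsim(query_tokens, doc_tokens)
print(score)
```

Because the document tokens can be embedded and indexed offline, this keeps most of the expressiveness of token-level matching while staying fast at query time.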
(Correction: everywhere I wrote "embeddings" above, read "dense embeddings".)
BM25 is an approach that leverages sparse embeddings!
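To make the sparse-embedding view explicit, here's a minimal BM25 sketch over a toy corpus of my own (k1 and b are the usual defaults): the document side becomes a sparse vector of BM25 term weights, the query side a sparse indicator vector, and scoring is just their dot product.

```python
import math
from collections import Counter

docs = [
    "the cat sat on the mat".split(),
    "dogs chase cats in the park".split(),
    "a treatise on sparse retrieval".split(),
]
k1, b = 1.5, 0.75
N = len(docs)
avgdl = sum(len(d) for d in docs) / N
df = Counter(t for d in docs for t in set(d))  # document frequency per term

def idf(term):
    return math.log(1 + (N - df[term] + 0.5) / (df[term] + 0.5))

def doc_sparse_vector(doc):
    # The document's "sparse embedding": one weighted dimension per term.
    tf = Counter(doc)
    return {
        t: idf(t) * tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(doc) / avgdl))
        for t in tf
    }

def query_sparse_vector(query):
    # Query side: a sparse indicator vector over its terms.
    return {t: 1.0 for t in query}

def dot(u, v):
    return sum(u[t] * v.get(t, 0.0) for t in u)

# BM25 scoring == dot product of the two sparse vectors.
q = query_sparse_vector("sparse retrieval".split())
scores = [dot(q, doc_sparse_vector(d)) for d in docs]
print(scores.index(max(scores)))  # doc 2 wins: it contains both query terms
```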