2d ago

Embedding-Only Search Suffers Redundancy, Glitches, And Irrelevant Matches

0
Original post

Common failure modes I've seen with embedding-only search: 1. Many redundant results 2. Empty docs 3. Glitch text getting a high score 4. Matching on topic, style, form yet completely irrelevant But, we need embeddings. Here's why:

9:08 PM · May 14, 2026 View on X

Common failure modes I've seen with embedding-only search: 1. Many redundant results 2. Empty docs 3. Glitch text getting a high score 4. Matching on topic, style, form yet completely irrelevant

But, we need embeddings. Here's why:

4:08 AM · May 15, 2026 · 676 Views

b) We need to capture matches with documents that have little to no text overlap with the query. You can't do that with BM25 alone.

Andrew DrozdovAndrew Drozdov@mrdrozdov

a) We need ranking models that can be supervised for our task, otherwise the chance of good results is wishful thinking. Embeddings are easily supervised.

4:08 AM · May 15, 2026 · 146 Views
4:08 AM · May 15, 2026 · 119 Views

a) We need ranking models that can be supervised for our task, otherwise the chance of good results is wishful thinking. Embeddings are easily supervised.

Andrew DrozdovAndrew Drozdov@mrdrozdov

Common failure modes I've seen with embedding-only search: 1. Many redundant results 2. Empty docs 3. Glitch text getting a high score 4. Matching on topic, style, form yet completely irrelevant But, we need embeddings. Here's why:

4:08 AM · May 15, 2026 · 676 Views
4:08 AM · May 15, 2026 · 146 Views

c) We need to design technology that's future proof. LLMs can be converted into embeddings, so embeddings will capture the benefits of future pre-trained models.

Andrew DrozdovAndrew Drozdov@mrdrozdov

b) We need to capture matches with documents that have little to no text overlap with the query. You can't do that with BM25 alone.

4:08 AM · May 15, 2026 · 119 Views
4:08 AM · May 15, 2026 · 53 Views

d) We need search to be fast. Agents are powerful but slow. It's hard to imagine a high quality single-step search that isn't based on embeddings.

Andrew DrozdovAndrew Drozdov@mrdrozdov

c) We need to design technology that's future proof. LLMs can be converted into embeddings, so embeddings will capture the benefits of future pre-trained models.

4:08 AM · May 15, 2026 · 53 Views
4:08 AM · May 15, 2026 · 54 Views

Whether you use single or multi-vector is a separate question. Having spent a lot of time modeling compositionality and ambiguity, multi-vector approaches make a lot of sense. If you want to go deep in this space check out Gaussian Embeddings and Multi-Sense/Facet Embeddings.

Andrew DrozdovAndrew Drozdov@mrdrozdov

d) We need search to be fast. Agents are powerful but slow. It's hard to imagine a high quality single-step search that isn't based on embeddings.

4:08 AM · May 15, 2026 · 54 Views
4:08 AM · May 15, 2026 · 59 Views

Of course, using late interaction models is a no brainer. The community is active, the toolchain is there, and the results speak for themselves.

Andrew DrozdovAndrew Drozdov@mrdrozdov

Whether you use single or multi-vector is a separate question. Having spent a lot of time modeling compositionality and ambiguity, multi-vector approaches make a lot of sense. If you want to go deep in this space check out Gaussian Embeddings and Multi-Sense/Facet Embeddings.

4:08 AM · May 15, 2026 · 59 Views
4:08 AM · May 15, 2026 · 97 Views

s/embeddings/dense embeddings/g

BM25 is an approach that leverages sparse embeddings!

Andrew DrozdovAndrew Drozdov@mrdrozdov

Of course, using late interaction models is a no brainer. The community is active, the toolchain is there, and the results speak for themselves.

4:08 AM · May 15, 2026 · 97 Views
4:10 AM · May 15, 2026 · 99 Views
Embedding-Only Search Suffers Redundancy, Glitches, And Irrelevant Matches · Digg