Adds explanation on when to use dense or sparse embeddings by kosabogi · Pull Request #6727 · elastic/docs-content

kosabogi · 2026-05-27T08:40:04Z

Summary

This PR adds an explanation of when to use dense and sparse vectors to the "Tutorial: Dense and sparse workflows using ingest pipelines" page.

Related issue: https://github.com/elastic/docs-content-internal/issues/854

Generative AI disclosure

Did you use a generative AI (GenAI) tool to assist in creating this contribution?

Yes
No

Claude in Cursor

github-actions · 2026-05-27T08:41:41Z

🔍 Preview links for changed docs

solutions/search/vector/dense-versus-sparse-ingest-pipelines.md

github-actions · 2026-05-27T08:41:51Z

Vale Linting Results

Summary: 1 suggestion found

💡 Suggestions (1)

| File | Line | Rule | Message |
|---|---|---|---|
| solutions/search/vector/dense-versus-sparse-ingest-pipelines.md | 54 | Elastic.FirstPerson | Use caution when using first-person pronouns such as 'my.' |

The Vale linter checks documentation changes against the Elastic Docs style guide.

To use Vale locally or report issues, refer to Elastic style guide for Vale.

seanhandley

Looks almost ready @kosabogi - just a couple of thoughts.

seanhandley · 2026-05-27T08:51:08Z

+- [Natural language Q&A](/explore-analyze/machine-learning/nlp/ml-nlp-text-emb-vector-search-example.md): Match questions like "How do I reset my password?" to FAQ entries, product documentation, or policy pages.
+- [Recommendations and similarity](knn.md): Find related articles, products, or media. For example, you can surface articles like the current one or visually similar product images.
+
+Dense embeddings are a good choice when you need multilingual retrieval or a specific third-party embedding model you have already evaluated on your data.


I'd remove this sentence. Not all dense embedding models are multilingual. We support Jina Embeddings v3, which is English only, for example.

Also, the need to use a model that's already been evaluated for a use case could apply to a sparse embedding model too. It's a question of previous technical decisions and the need to accommodate them.

Could maybe say something like:

Dense embeddings are ideal when you care more about the semantic meaning of search terms than exact keyword matches. They excel at retrieving relevant results based on synonyms and paraphrases of the original query, returning results that reflect the user's intentions.

seanhandley · 2026-05-27T08:55:18Z

+
+Common use cases include:
+
+- [Retrieval augmented generation (RAG)](../rag.md): Retrieve document passages that answer a user's question, even when the question and the source text use different words.


Could be an opportunity here to mention Context Engineering more prominently than RAG?

RAG is still a useful concept but I think we're positioning ourselves in the market as a broader solution for context engineering as a whole.

++ there are some good definitions in this Anthropic blog post: https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents

This could be a good spot to link to Agent Builder for an OOTB toolkit.

Per the glossary definition:

"Agent Builder combines LLM reasoning with skills, tools, and best practices for context engineering and retrieval, so responses are accurately and efficiently grounded in your data."

Adds explanation on when to use dense or sparse embeddings

e1cc154

kosabogi requested a review from seanhandley May 27, 2026 08:40

kosabogi requested a review from a team as a code owner May 27, 2026 08:40

github-actions Bot deployed to docs-preview May 27, 2026 08:43

seanhandley reviewed May 27, 2026

kosabogi wants to merge 1 commit into main from sparse-dense

leemthompo May 27, 2026

3 participants

