[ML] Expose the Input Type option for the text_embedding task in the Inference API #117856

Closed
davidkyle opened this issue Dec 2, 2024 · 2 comments · Fixed by #122638
Assignees
Labels
>enhancement, Feature:GenAI (Features around GenAI), :ml (Machine learning), Team:ML (Meta label for the ML team)

Comments

@davidkyle
Member

Description

Many of the integrated inference services have an input type option for text embeddings, which tailors the resulting embedding to a specific use case. The Inference API has an InputType class for internal use, and depending on where the text embedding call is made it will pick either the search or the ingest input type.

When documents are ingested through an ingest pipeline or into a semantic_text field, the ingest type is used; when the embedding is generated for a query, the search type is used. Because the choice is automatic, users don't have to worry about selecting the right type and get consistent results.
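As a sketch of the automatic selection described above (the books index, summary field, and my-endpoint inference endpoint are hypothetical names):

PUT books
{
  "mappings": {
    "properties": {
      "summary": { "type": "semantic_text", "inference_id": "my-endpoint" }
    }
  }
}

Indexing into the semantic_text field embeds the text with the ingest input type:

POST books/_doc
{
  "summary": "The sky above the port was the color of television tuned to a dead channel."
}

A semantic query embeds the query string with the search input type:

GET books/_search
{
  "query": {
    "semantic": { "field": "summary", "query": "dystopian cityscape" }
  }
}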

Exposing an input_type option in the POST _inference API is a natural extension that gives users more control over their embeddings. This change should not alter the existing search or ingest behaviours.

@davidkyle davidkyle added :ml Machine learning >enhancement labels Dec 2, 2024
@elasticsearchmachine
Collaborator

Pinging @elastic/ml-core (Team:ML)

@elasticsearchmachine elasticsearchmachine added the Team:ML Meta label for the ML team label Dec 2, 2024
@ymao1
Contributor

ymao1 commented Feb 10, 2025

@davidkyle So this is updating the Perform Inference API to accept

POST _inference/text_embedding/my-endpoint
{
  "input": "The sky above the port was the color of television tuned to a dead channel.",
  "input_type": "ingest"
}

while still accepting the previous format of

POST _inference/text_embedding/my-endpoint
{
  "input": "The sky above the port was the color of television tuned to a dead channel.",
  "task_settings": {
    "input_type": "ingest"
  }
}

Is that correct? Should we return a validation error if both are specified? Or always use the input type in the task settings?
