Skip to content

[CI] OpenAiServiceUpgradeIT testOpenAiEmbeddings {upgradedNodes=1} failing #118156

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
elasticsearchmachine opened this issue Dec 6, 2024 · 5 comments
Labels
medium-risk An open issue or test failure that is a medium risk to future releases :ml Machine learning Team:ML Meta label for the ML team >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

elasticsearchmachine commented Dec 6, 2024

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:inference:qa:rolling-upgrade:v8.12.2#bwcTest" -Dtests.class="org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT" -Dtests.method="testOpenAiEmbeddings {upgradedNodes=1}" -Dtests.seed=C96FD505C3401EF2 -Dtests.bwc=true -Dtests.locale=zgh-MA -Dtests.timezone=Africa/Freetown -Druntime.java=23

Applicable branches:
8.x

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.AssertionError: model_id not found in config: [{service_settings={similarity=dot_product, url=http://127.0.0.1:46075, dimensions=2}, service=openai, task_settings={model=text-embedding-ada-002}, model_id=old-cluster-embeddings, task_type=text_embedding}]

Issue Reasons:

  • [8.x] 3 consecutive failures in step 8.12.2_bwc
  • [8.x] 5 failures in test testOpenAiEmbeddings {upgradedNodes=1} (3.3% fail rate in 151 executions)
  • [8.x] 3 failures in step 8.12.2_bwc (100.0% fail rate in 3 executions)
  • [8.x] 3 failures in pipeline elasticsearch-periodic (100.0% fail rate in 3 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :ml Machine learning >test-failure Triaged test failures from CI Team:ML Meta label for the ML team needs:risk Requires assignment of a risk label (low, medium, blocker) labels Dec 6, 2024
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/ml-core (Team:ML)

@maxhniebergall maxhniebergall added medium-risk An open issue or test failure that is a medium risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Dec 9, 2024
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 13 consecutive failures in step 8.13.4_bwc
  • [8.x] 12 consecutive failures in step 8.12.2_bwc
  • [8.x] 31 failures in test testOpenAiEmbeddings {upgradedNodes=1} (5.1% fail rate in 611 executions)
  • [8.x] 13 failures in step 8.13.4_bwc (100.0% fail rate in 13 executions)
  • [8.x] 12 failures in step 8.12.2_bwc (100.0% fail rate in 12 executions)
  • [8.x] 6 failures in step 8.14.3_bwc (54.5% fail rate in 11 executions)
  • [8.x] 12 failures in pipeline elasticsearch-periodic (100.0% fail rate in 12 executions)

Build Scans:

elasticsearchmachine added a commit that referenced this issue Dec 9, 2024
@maxhniebergall
Copy link
Contributor

Fixed by #118624

elasticsearchmachine pushed a commit that referenced this issue Dec 13, 2024
#118664

```
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testRerank {upgradedNodes=1}
  issue: #116973
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=1}
  issue: #116974
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=2}
  issue: #116975

- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=1}
  issue: #118156
- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testElser {upgradedNodes=1}
  issue: #118127
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=1}
  issue: #118162
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=2}
  issue: #118163
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=2}
  issue: #118204

- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testHFEmbeddings {upgradedNodes=1}
  issue: #118197
```
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 12 consecutive failures in step 8.12.2_bwc
  • [8.x] 11 consecutive failures in step 8.13.4_bwc
  • [8.x] 28 failures in test testOpenAiEmbeddings {upgradedNodes=1} (5.1% fail rate in 547 executions)
  • [8.x] 12 failures in step 8.12.2_bwc (100.0% fail rate in 12 executions)
  • [8.x] 5 failures in step 8.14.3_bwc (45.5% fail rate in 11 executions)
  • [8.x] 11 failures in step 8.13.4_bwc (100.0% fail rate in 11 executions)
  • [8.x] 11 failures in pipeline elasticsearch-periodic (100.0% fail rate in 11 executions)

Build Scans:

elasticsearchmachine added a commit that referenced this issue Dec 13, 2024
maxhniebergall added a commit to maxhniebergall/elasticsearch that referenced this issue Dec 16, 2024
elastic#118664

```
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testRerank {upgradedNodes=1}
  issue: elastic#116973
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=1}
  issue: elastic#116974
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=2}
  issue: elastic#116975

- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=1}
  issue: elastic#118156
- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testElser {upgradedNodes=1}
  issue: elastic#118127
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=1}
  issue: elastic#118162
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=2}
  issue: elastic#118163
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=2}
  issue: elastic#118204

- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testHFEmbeddings {upgradedNodes=1}
  issue: elastic#118197
```
maxhniebergall pushed a commit to maxhniebergall/elasticsearch that referenced this issue Dec 16, 2024
elasticsearchmachine pushed a commit that referenced this issue Dec 17, 2024
```
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiEmbeddings {upgradedNodes=1}
  issue: #118156
- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testHFEmbeddings {upgradedNodes=1}
  issue: #118197
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=2}
  issue: #118163
- class: org.elasticsearch.xpack.application.OpenAiServiceUpgradeIT
  method: testOpenAiCompletions {upgradedNodes=1}
  issue: #118162
- class: org.elasticsearch.xpack.application.HuggingFaceServiceUpgradeIT
  method: testElser {upgradedNodes=1}
  issue: #118127
- class: org.elasticsearch.xpack.application.CohereServiceUpgradeIT
  method: testCohereEmbeddings {upgradedNodes=1}
  issue: #116974
```
@maxhniebergall
Copy link
Contributor

fixed by backport of #118624

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
medium-risk An open issue or test failure that is a medium risk to future releases :ml Machine learning Team:ML Meta label for the ML team >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

2 participants