Add Bounded Window to Inference Models for Rescoring to Ensure Positive Score Range #125694


Merged

Conversation

markjhoy
Contributor

@markjhoy markjhoy commented Mar 26, 2025

This PR allows LTR (learning to rank) model inference to eliminate the negative scores that can be produced by models trained with XGBoost or other frameworks. Negative scores cause issues when used in Lucene, where they are forbidden.

The implementation here checks the model's predictive bounds (when possible) to see whether the minimum predicted value can fall below zero, and if so, slides all predicted values upward so that the minimum becomes 0, preserving both the rankings and the relative distances between scores.
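A minimal sketch of the sliding adjustment described above (illustrative only; `ScoreSlider` and its method are hypothetical names, not the actual `BoundedWindowInferenceModel` code):

```java
// Illustrative sketch: if a model's minimum predicted value is negative,
// shift every score upward by the same amount so the minimum maps to 0.
// Rankings and the relative gaps between scores are preserved.
public class ScoreSlider {
    public static double[] slideToNonNegative(double[] scores, double minPredictedValue) {
        // Only adjust when the model's predictive lower bound is below zero.
        double adjustment = minPredictedValue < 0 ? -minPredictedValue : 0.0;
        double[] adjusted = new double[scores.length];
        for (int i = 0; i < scores.length; i++) {
            adjusted[i] = scores[i] + adjustment;
        }
        return adjusted;
    }
}
```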

@markjhoy markjhoy requested a review from afoucret March 26, 2025 22:00
@markjhoy markjhoy added >bug :Search Relevance/Ranking Scoring, rescoring, rank evaluation. v8.19.0 labels Mar 26, 2025
@markjhoy markjhoy marked this pull request as ready for review March 26, 2025 22:04
@elasticsearchmachine elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Mar 26, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/es-search-relevance (Team:Search Relevance)

this(model, model.getMinPredictedValue(), model.getMaxPredictedValue());
}

public BoundedWindowInferenceModel(BoundedInferenceModel model, double minPredictedValue, double maxPredictedValue) {
Contributor

The absolute minimum of 0 is only for LTR models, but we can imagine situations where we want to scale the result of a regression model into [-1, 1].

I would personally remove the adjustment from the constructor, because it does not make sense there:

public BoundedWindowInferenceModel(BoundedInferenceModel model, double minPredictedValue, double maxPredictedValue) {
    this.model = model;
    this.minPredictedValue = minPredictedValue;
    this.maxPredictedValue = maxPredictedValue;
}

Then you can create a static method (scaleLtrModel ?) that would do the following:

public static BoundedWindowInferenceModel scaleLtrModel(BoundedInferenceModel model) {
    double adjustment = LTR_MIN_PREDICTED_VALUE - model.getMinPredictedValue();
    return new BoundedWindowInferenceModel(model, model.getMinPredictedValue() + adjustment, model.getMaxPredictedValue() + adjustment);
}

Then you can use the formula of the POC to scale the prediction:

double predictedValue = ((Number) regressionInferenceResults.predictedValue()).doubleValue();
// First we scale the data to [0 ,1]
predictedValue = (predictedValue - model.getMinPredictedValue()) / (model.getMaxPredictedValue() - model.getMinPredictedValue());

// Then we scale the data to the desired interval
predictedValue = predictedValue * (getMaxPredictedValue() - getMinPredictedValue()) + getMinPredictedValue();
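For reference, the two-step scaling above can be sketched as a standalone method (a hypothetical helper, with the source and target bounds passed in explicitly rather than read from the model):

```java
// Min-max scaling sketch: normalize the raw prediction to [0, 1] using the
// source model's predicted bounds, then stretch it to the target interval.
public class MinMaxScaler {
    public static double scale(double predictedValue,
                               double sourceMin, double sourceMax,
                               double targetMin, double targetMax) {
        // First we scale the data to [0, 1]...
        double normalized = (predictedValue - sourceMin) / (sourceMax - sourceMin);
        // ...then we scale it to the desired interval.
        return normalized * (targetMax - targetMin) + targetMin;
    }
}
```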

Also, I would rename the class to something like MinMaxScaledInferenceModel.

WDYT?

Contributor Author

The absolute minimum of 0 is only for LTR models, but we can imagine situations where we want to scale the result of a regression model into [-1, 1].

I thought we had decided with @jimczi that we would not perform scaling, but rather slide the scores up to ensure they are all positive (only if the minimum score was negative). Correct?

The absolute minimum of 0 is only for LTR models, but we can imagine situations where we want to scale the result of a regression model into [-1, 1].

That makes sense. Is there a need for this now to fix this bug though? I was under the impression that the bug was only about the negative scores being returned and us having to deal with that if true.

Contributor

I proposed "sliding" the score because we can apply it on the minimum and maximum value for a model entirely (instead of per query). This means that scores will still be comparable between queries.
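To illustrate the difference (with hypothetical numbers): sliding by the model's global minimum applies the same adjustment to every query, whereas sliding by each query's own minimum would shift queries by different amounts and make their scores incomparable:

```java
// Sketch contrasting model-wide vs. per-query sliding (hypothetical helper,
// not code from this PR).
public class SlidingComparison {
    public static double slide(double score, double min) {
        // Shift upward only when the given minimum is negative.
        return min < 0 ? score - min : score;
    }
}
```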

Contributor Author

Thanks @jimczi - what are your thoughts on the PR as-is then?

Contributor

Sounds good. Let's use the new minPredictedValue and maxPredictedValue from the BoundedInferenceModel directly, no need to make it configurable for now.

Contributor Author

@markjhoy markjhoy Apr 1, 2025

@afoucret -

Then you can use the formula of the POC to scale the prediction:
double predictedValue = ((Number) regressionInferenceResults.predictedValue()).doubleValue();
// First we scale the data to [0, 1]
predictedValue = (predictedValue - model.getMinPredictedValue()) / (model.getMaxPredictedValue() - model.getMinPredictedValue());

Looking at this here -- would scaling in this method (normalizing to [0, 1] first) fall victim to exactly what we're trying to avoid? That is, it may compress the score space and lose precision, causing closely scored items to end up with equal rank.

I think it's a good idea overall, and certainly more flexible for the developer though... so, I'm on the fence about it. cc: @jimczi

Contributor Author

@afoucret - the more I think about this, for the time being let's go with this solution; if we decide to add your suggestion, we can do that afterwards so we can unblock your work.

@markjhoy markjhoy requested a review from afoucret March 27, 2025 13:47
@jimczi jimczi added the :ml Machine learning label Mar 28, 2025
@elasticsearchmachine elasticsearchmachine added the Team:ML Meta label for the ML team label Mar 28, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/ml-core (Team:ML)

Contributor

@jimczi jimczi left a comment

The approach looks good to me.
Pinging @elastic/ml-core for validation here. We're enforcing that all retrievers/queries return positive scores, and this PR handles the tree inference models.

@markjhoy markjhoy requested a review from a team April 1, 2025 13:15
@markjhoy markjhoy requested a review from jimczi April 1, 2025 21:42
@markjhoy markjhoy merged commit e77bf80 into elastic:main Apr 2, 2025
17 checks passed
@markjhoy markjhoy added the auto-backport Automatically create backport pull requests when merged label Apr 2, 2025
@markjhoy
Contributor Author

markjhoy commented Apr 2, 2025

💚 All backports created successfully

Status Branch Result
8.x

Questions ?

Please refer to the Backport tool documentation

markjhoy added a commit to markjhoy/elasticsearch that referenced this pull request Apr 2, 2025
…ve Score Range (elastic#125694)

* apply bounded window inference model

* linting

* add unit tests

* [CI] Auto commit changes from spotless

* add additional tests

* remove unused constructor

---------

Co-authored-by: elasticsearchmachine <[email protected]>
(cherry picked from commit e77bf80)
andreidan pushed a commit to andreidan/elasticsearch that referenced this pull request Apr 9, 2025
…ve Score Range (elastic#125694)

* apply bounded window inference model

* linting

* add unit tests

* [CI] Auto commit changes from spotless

* add additional tests

* remove unused constructor

---------

Co-authored-by: elasticsearchmachine <[email protected]>
markjhoy added a commit that referenced this pull request Apr 23, 2025
…ve Score Range (#125694) (#126149)

* apply bounded window inference model

* linting

* add unit tests

* [CI] Auto commit changes from spotless

* add additional tests

* remove unused constructor

---------

Co-authored-by: elasticsearchmachine <[email protected]>
(cherry picked from commit e77bf80)
markjhoy added a commit to markjhoy/elasticsearch that referenced this pull request Apr 24, 2025
…ve Score Range (elastic#125694)

* apply bounded window inference model

* linting

* add unit tests

* [CI] Auto commit changes from spotless

* add additional tests

* remove unused constructor

---------

Co-authored-by: elasticsearchmachine <[email protected]>
(cherry picked from commit e77bf80)
@markjhoy
Contributor Author

💚 All backports created successfully

Status Branch Result
9.0

Questions ?

Please refer to the Backport tool documentation

markjhoy added a commit that referenced this pull request Apr 24, 2025
…ve Score Range (#125694) (#127345)

* apply bounded window inference model

* linting

* add unit tests

* [CI] Auto commit changes from spotless

* add additional tests

* remove unused constructor

---------

Co-authored-by: elasticsearchmachine <[email protected]>
(cherry picked from commit e77bf80)
Labels
auto-backport Automatically create backport pull requests when merged >bug :ml Machine learning :Search Relevance/Ranking Scoring, rescoring, rank evaluation. Team:ML Meta label for the ML team Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch v8.19.0 v9.1.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants