Query sets
A query set is a collection of queries used in experiments for search relevance evaluation. Search Relevance Workbench offers several sampling techniques for creating query sets from real user data that adheres to the User Behavior Insights (UBI) specification. Alternatively, you can import a query set manually.
Creating query sets
If you’re tracking user behavior with the UBI specification, you can choose from different sampling methods that create query sets based on real user queries stored in the `ubi_queries` index.
Search Relevance Workbench supports three sampling methods:
- Random: Takes a random sample of all queries.
- Probability-Proportional-to-Size Sampling (PPTSS): Takes a frequency-weighted sample of all queries to obtain a representative sample.
- Top N: Takes the N most frequent queries.
Endpoints
POST _plugins/_search_relevance/query_sets
PUT _plugins/_search_relevance/query_sets
Request body fields
The following table lists the available request body fields.
| Field | Data type | Description |
| :--- | :--- | :--- |
| `name` | String | The name of the query set. The maximum length is 50 characters. |
| `description` | String | A short description of the query set. The maximum length is 250 characters. |
| `sampling` | String | Defines which sampler to use. Valid values are `pptss` (Probability-Proportional-to-Size Sampling), `random`, `topn` (most frequent queries), and `manual`. |
| `querySetSize` | Integer | The target number of queries in the query set. Depending on the number of unique queries in `ubi_queries`, the resulting query set may contain fewer queries. Must be a positive integer. |
| `querySetQueries` | List | The queries to include when `sampling` is set to `manual`. Each element is an object with a `queryText` field. |
Example request: Sampling 20 queries with the Top N sampler
POST _plugins/_search_relevance/query_sets
{
"name": "Top 20",
"description": "Top 20 most frequent queries sourced from user searches.",
"sampling": "topn",
"querySetSize": 20
}
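Example request: Sampling queries with the PPTSS sampler
The same endpoint accepts the other sampler values. The following request, in which the name, description, and size are illustrative placeholders, draws a frequency-weighted sample of up to 50 queries:
POST _plugins/_search_relevance/query_sets
{
  "name": "Representative 50",
  "description": "Frequency-weighted sample of user queries.",
  "sampling": "pptss",
  "querySetSize": 50
}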
Example request: Uploading a query set manually
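To upload your own queries, use the `PUT` method and provide the queries in the `querySetQueries` array instead of specifying a `querySetSize`: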
PUT _plugins/_search_relevance/query_sets
{
"name": "TVs",
"description": "TV queries",
"sampling": "manual",
"querySetQueries": [
{
"queryText": "tv"
},
{
"queryText": "led tv"
}
]
}
Query set formats
Search Relevance Workbench supports two formats for query sets, each designed for different use cases. Both formats contain a collection of user queries, but they differ in whether each query includes an expected answer.
- Basic query set: A list of user queries without any additional information. This format is useful for general relevance testing where no specific answer is expected.
- Query set with reference answers: A list of user queries in which each query is paired with its expected answer. This format is particularly useful for evaluating applications designed to provide a specific answer, such as question-answering systems.
Fields
All query sets comprise one or more entries, each specified as a JSON object on its own line. Each entry contains the following fields.
| Field | Data type | Description |
| :--- | :--- | :--- |
| `queryText` | String | The user query string. Required. |
| `referenceAnswer` | String | The expected or correct answer to the user query. This field is used for generating judgments, especially with large language models (LLMs). Optional. |
Basic query set example
A basic query set contains only the `queryText` field for each entry. It is suitable for general relevance tests where no single “correct” answer exists.
Example query set without reference answers
{"queryText": "t towels kitchen"}
{"queryText": "table top bandsaw for metal"}
{"queryText": "tan strappy heels for women"}
{"queryText": "tank top plus size women"}
{"queryText": "tape and mudding tools"}
Query set with reference answers example
This format includes the `referenceAnswer` field alongside the `queryText` field. It is ideal for evaluating applications designed to provide specific answers, such as chatbots or question-answering systems.
Example query set with reference answers
{"queryText": "What is the capital of France?", "referenceAnswer": "Paris"}
{"queryText": "Who wrote 'Romeo and Juliet'?", "referenceAnswer": "William Shakespeare"}
{"queryText": "What is the chemical symbol for water?", "referenceAnswer": "H2O"}
{"queryText": "What is the highest mountain in the world?", "referenceAnswer": "Mount Everest"}
{"queryText": "When was the first iPhone released?", "referenceAnswer": "June 29, 2007"}
The `referenceAnswer` field is particularly useful when using LLMs to generate judgments. The LLM can use the reference answer as a ground truth to compare against the retrieved search results, allowing it to accurately score the relevance of the response.
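For example, a judgment generation request can reference the query set containing the reference answers. The following request is an illustrative sketch only: the IDs are placeholders, and the judgments API documentation defines the full set of supported fields.
PUT _plugins/_search_relevance/judgments
{
  "name": "LLM judgments",
  "type": "LLM_JUDGMENT",
  "querySetId": "<query_set_id>",
  "searchConfigurationList": ["<search_configuration_id>"],
  "size": 5,
  "modelId": "<model_id>"
}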
Managing query sets
You can retrieve or delete query sets using the following APIs.
Retrieve query sets
This API retrieves available query sets.
Endpoints
GET _plugins/_search_relevance/query_sets
GET _plugins/_search_relevance/query_sets/<query_set_id>
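Example request
The following request retrieves the query set shown in the example response that follows:
GET _plugins/_search_relevance/query_sets/bb45c4c4-48ce-461b-acbc-f154c0a17ec9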
Example response
{
"took": 2,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 1,
"relation": "eq"
},
"max_score": null,
"hits": [
{
"_index": "search-relevance-queryset",
"_id": "bb45c4c4-48ce-461b-acbc-f154c0a17ec9",
"_score": null,
"_source": {
"id": "bb45c4c4-48ce-461b-acbc-f154c0a17ec9",
"name": "TVs",
"description": "Some TVs that people might want",
"sampling": "manual",
"timestamp": "2025-06-11T13:43:26.676Z",
"querySetQueries": [
{
"queryText": "tv"
},
{
"queryText": "led tv"
}
]
},
"sort": [
1749649406676
]
}
]
}
}
Path parameters
The following table lists the available path parameters.
| Parameter | Data type | Description |
| :--- | :--- | :--- |
| `query_set_id` | String | The ID of the query set to retrieve. If not specified, all query sets are retrieved. |
Delete a query set
You can delete a query set using the query set ID.
Endpoint
DELETE _plugins/_search_relevance/query_sets/<query_set_id>
Example request
DELETE _plugins/_search_relevance/query_sets/bb45c4c4-48ce-461b-acbc-f154c0a17ec9
Example response
{
"_index": "search-relevance-queryset",
"_id": "bb45c4c4-48ce-461b-acbc-f154c0a17ec9",
"_version": 2,
"result": "deleted",
"forced_refresh": true,
"_shards": {
"total": 2,
"successful": 1,
"failed": 0
},
"_seq_no": 17,
"_primary_term": 1
}