Add e2e tests for embedding raw flag #16940

SamMalayek · 2025-11-02T16:00:13Z

🧩 Summary

This PR adds a CI workflow for end-to-end embedding CLI tests (none exist today). It establishes a small, fast, reproducible baseline for validating embedding behavior (dimensions + determinism) using tiny GGUF models.

Discussion / design context: See the companion RFC in Discussions for the longer-term plan to add a native server endpoint: #16957

⚙️ What this PR includes

A GitHub Actions job (embeddings.yml) that runs E2E embedding CLI tests with cached tiny models (e.g., TinyLlama).
Checks output dimensions and deterministic behavior.
Keeps runs lightweight and fast; an optional large-model stress test can be added later.

CISC · 2025-11-02T19:54:54Z

While testing is great I have issues with this PR:

The CI only runs when the example code change, not when actual embedding code changes
The tests are not testing anything likely to break, nor indeed even anything useful

To sum up; overall as a CI job this will not run when it matters, and when it is run it will not catch any problems of consequence.

Lastly, why create a new PR instead of arguing your case in the original PR and maybe iron out concerns there?

SamMalayek · 2025-11-02T22:02:27Z

While testing is great I have issues with this PR:

The CI only runs when the example code change, not when actual embedding code changes

The tests are not testing anything likely to break, nor indeed even anything useful

To sum up; overall as a CI job this will not run when it matters, and when it is run it will not catch any problems of consequence.

I kept them in examples alongside the CLI code as a pragmatic, incremental, small first step. The intent was to expand scope and relocate to tests/e2e/embedding/ once the embedding path itself was integrated into the server runtime (or simply commonized as an intermediary step). That said, I can move them now and broaden coverage to better match expectations (done).

Lastly, why create a new PR instead of arguing your case in the original PR and maybe iron out concerns there?

That PR was closed with minimal comment. There appeared to be no room for discussion.

SamMalayek · 2025-11-02T23:35:32Z

Note: Empty force-pushes are to re-trigger the CI workflow following an intermittent failure.

SamMalayek · 2025-11-03T22:33:57Z

Putting this in draft while work on the RFC continues.

SamMalayek requested a review from CISC as a code owner November 2, 2025 16:00

github-actions bot added examples python python script changes devops improvements to build systems and github actions labels Nov 2, 2025

SamMalayek mentioned this pull request Nov 2, 2025

Add e2e tests for embedding raw flag #16923

Closed

SamMalayek force-pushed the feature/test-embedding-raw branch from 015351e to 075c324 Compare November 2, 2025 18:39

Add e2e tests for embedding raw flag

c1c3d99

SamMalayek force-pushed the feature/test-embedding-raw branch from 075c324 to c1c3d99 Compare November 2, 2025 18:55

SamMalayek requested a review from ggerganov as a code owner November 2, 2025 22:01

SamMalayek force-pushed the feature/test-embedding-raw branch from e5a5b26 to 82dbeee Compare November 2, 2025 22:12

github-actions bot added the testing Everything test related label Nov 2, 2025

Increase scope of embedding cli tests

2de1e68

SamMalayek force-pushed the feature/test-embedding-raw branch from 82dbeee to 2de1e68 Compare November 2, 2025 23:01

Update test and workflow to match new RFC

5ce810e

SamMalayek force-pushed the feature/test-embedding-raw branch from 6a65869 to 5ce810e Compare November 3, 2025 08:35

DajanaV mentioned this pull request Nov 3, 2025

UPSTREAM PR #16940: Add e2e tests for embedding raw flag auroralabs-loci/llama.cpp#50

Closed

SamMalayek marked this pull request as draft November 3, 2025 09:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add e2e tests for embedding raw flag #16940

Add e2e tests for embedding raw flag #16940

SamMalayek commented Nov 2, 2025 •

edited

Loading

Uh oh!

CISC commented Nov 2, 2025

Uh oh!

SamMalayek commented Nov 2, 2025 •

edited

Loading

Uh oh!

SamMalayek commented Nov 2, 2025

Uh oh!

SamMalayek commented Nov 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add e2e tests for embedding raw flag #16940

Are you sure you want to change the base?

Add e2e tests for embedding raw flag #16940

Conversation

SamMalayek commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🧩 Summary

⚙️ What this PR includes

Uh oh!

CISC commented Nov 2, 2025

Uh oh!

SamMalayek commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SamMalayek commented Nov 2, 2025

Uh oh!

SamMalayek commented Nov 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SamMalayek commented Nov 2, 2025 •

edited

Loading

SamMalayek commented Nov 2, 2025 •

edited

Loading