Name	Name	Last commit message	Last commit date
Latest commit History 236 Commits
.beads	.beads
.cargo	.cargo
.githooks	.githooks
.github/workflows	.github/workflows
.qlty	.qlty
docs	docs
openspec	openspec
skills	skills
tooltest-cli	tooltest-cli
tooltest-core	tooltest-core
tooltest-test-support	tooltest-test-support
.gitattributes	.gitattributes
.gitignore	.gitignore
AGENTS.md	AGENTS.md
Cargo.lock	Cargo.lock
Cargo.toml	Cargo.toml
README.md	README.md
install.sh	install.sh
skills.lock.json	skills.lock.json

tooltest

Conformance testing for MCP servers — fast enough for the CLI, solid enough for CI.

tooltest runs your MCP server like a real client/agent would (connect → list tools → call tools) and reports protocol / schema / runtime issues in a way that’s easy to act on.

Use it to:

sanity-check locally while you’re developing
gate releases in CI with repeatable integration tests
turn failures into fixes by handing the report to a coding agent

Quick start

Install

Latest prebuilt binaries (auto-detect OS/arch):

curl -fsSL https://raw.githubusercontent.com/lambdamechanic/tooltest/main/install.sh | bash

The installer verifies downloaded binaries against the *.sha256 checksums when sha256sum or shasum is available.

Optionally set an install directory (default /usr/local/bin, fallback to ~/.local/bin):

INSTALL_DIR="$HOME/.local/bin" \
  curl -fsSL https://raw.githubusercontent.com/lambdamechanic/tooltest/main/install.sh | bash

Direct downloads (stable URLs):

https://github.com/lambdamechanic/tooltest/releases/download/latest/tooltest-linux-x86_64
https://github.com/lambdamechanic/tooltest/releases/download/latest/tooltest-linux-aarch64
https://github.com/lambdamechanic/tooltest/releases/download/latest/tooltest-macos-arm64
https://github.com/lambdamechanic/tooltest/releases/download/latest/tooltest-windows-x86_64.exe

Build from source:

cargo install --path tooltest-cli

Test a stdio MCP server

tooltest stdio --command ./path/to/your-mcp-server
# optional: --arg ..., --env KEY=VALUE, --cwd /somewhere

Test a Streamable HTTP MCP endpoint

tooltest http --url http://127.0.0.1:8080/mcp
# optional: --auth-token "Bearer …"

Output

Human-readable output on stdout by default; pass --json for JSON output (including error messages).

JSON Schema patterns

Tooltest treats JSON Schema pattern values as ECMAScript regexes and relies on rslint_regex plus regex_syntax/proptest to keep generation aligned with ECMA-262 semantics (e.g., ASCII-only \d, \w).

Exit codes

0 = success
1 = run failure
2 = argument/validation error

Use it in CI / tests

Treat tooltest as an integration test: run it against your server build, and fail the job if it reports problems.

Example (shell):

set -euo pipefail
tooltest stdio --command ./target/release/my-mcp-server

CLI examples

Migration note (legacy generator removal)

Tooltest now runs only the state-machine generator. The --generator-mode flag and GeneratorMode API are removed. If you previously relied on legacy random generation, expect stricter input sourcing by default. To allow schema-based generation for required fields when the corpus is empty, set --lenient-sourcing or provide --state-machine-config '{"lenient_sourcing":true}'.

Simple run against a hosted MCP endpoint:

cargo run -p tooltest-cli --bin tooltest -- \
  --cases 100 \
  http --url https://pymcp.app.lambdamechanic.com/kev/mcp

Simple run against a local stdio MCP server:

cargo run -p tooltest-cli --bin tooltest -- \
  --cases 100 \
  stdio --command ./target/debug/my-mcp-server

State-machine sourcing

State-machine mode is strict by default: it only uses values mined from the corpus when satisfying required schema fields. If your server needs schema-based generation (for example, on the very first call), enable lenient sourcing.

State-machine runs always track a corpus and coverage counts; this adds overhead compared to the legacy generator and can grow with response size.

You can set this in the JSON config:

--state-machine-config '{"lenient_sourcing":true}'

Or override it on the CLI:

--lenient-sourcing
--no-lenient-sourcing

CLI flags take precedence over the JSON config.

Seed data

Seed the corpus with known values (strings or numbers) using inline JSON:

tooltest stdio --command ./target/debug/my-mcp-server \
  --state-machine-config '{"seed_strings":["alpha"],"seed_numbers":[42]}'

Text mining

If your MCP server only emits textual content, you can mine whitespace-delimited tokens into the corpus:

--state-machine-config '{"mine_text":true}'

Or override it on the CLI:

--mine-text

Corpus debugging

Dump the final corpus as JSON (stderr in human mode, inline in --json mode):

--dump-corpus

Log newly mined corpus values after each tool response (stderr):

--log-corpus-deltas

Hosted MCP integration tests

By default the hosted MCP integration test runs and exercises the three public MCP servers used for validation. To skip it:

SKIP_HOSTED_MCP_TESTS=1 cargo test -p tooltest-core --test hosted_mcp_tests

Verbose rmcp tracing

The tests install a tracing subscriber that emits to stderr. Use RUST_LOG plus --nocapture to see the full interaction.

RUST_LOG=rmcp=trace cargo test -p tooltest-core --test hosted_mcp_tests -- --nocapture

External Rust test example

This is an example integration test in another crate that uses tooltest-core to exercise a hosted MCP HTTP endpoint.

use rmcp::transport::streamable_http_client::StreamableHttpClientTransportConfig;
use rmcp::transport::StreamableHttpClientTransport;
use tooltest_core::{SessionDriver, ToolInvocation};

#[tokio::test]
async fn calls_hosted_tool() {
    let transport = StreamableHttpClientTransport::from_config(
        StreamableHttpClientTransportConfig::with_uri("https://pymcp.app.lambdamechanic.com/attack/mcp"),
    );
    let driver = SessionDriver::connect_with_transport(transport)
        .await
        .expect("connect");

    let invocation = ToolInvocation {
        name: "some_tool".into(),
        arguments: None,
    };

    let trace = driver.send_tool_call(invocation).await.expect("call tool");
    assert_eq!(trace.response.is_error, Some(false));
}

Agent-assisted “fix loop” prompt examples

Pick one of these, paste it into Codex/Claude (with repo access), and let it iterate until tooltest is clean.

Codex prompt

You are working in this repository.
Goal: make the repository’s MCP server(s) conform to the MCP spec as exercised by tooltest.

Figure out how to run the MCP server from this repo (stdio or HTTP).

Run tooltest against it (examples below).

When tooltest reports failures, fix the underlying issues in the smallest reasonable patch.

Re-run tooltest and repeat until it exits 0.

Don’t rename tools or change schemas unless required; prefer backward-compatible fixes.

Add/adjust tests if needed.

Commands (choose the right one):

stdio: tooltest stdio --command "<command that starts the repo’s MCP server>"

http: tooltest http --url "<server mcp url>"

Return a short summary of what you changed and why, plus the final passing tooltest output snippet.

Claude prompt

You have access to this repo and can run commands.
Please make the MCP server(s) in this repository pass tooltest with zero failures.

Process:

Identify how to start the MCP server from the repo (stdio or streamable HTTP).

Run tooltest against it.

Fix the issues reported (protocol violations, tool schema mismatches, error handling, etc.).

Re-run tooltest until it exits successfully.

Use minimal, targeted changes. Avoid breaking tool names/schemas unless necessary.

Tips

If you want deeper coverage, increase the number of generated cases / run modes (when available).
If a failure is intermittent, keep the smallest reproduction from the report and turn it into a regression test.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

tooltest

Quick start

Install

Test a stdio MCP server

Test a Streamable HTTP MCP endpoint

Output

JSON Schema patterns

Exit codes

Use it in CI / tests

CLI examples

Migration note (legacy generator removal)

State-machine sourcing

Seed data

Text mining

Corpus debugging

Hosted MCP integration tests

Verbose rmcp tracing

External Rust test example

Agent-assisted “fix loop” prompt examples

Codex prompt

Claude prompt

Tips

About

Uh oh!

Releases 1

Packages

Languages

lambdamechanic/tooltest

Folders and files

Latest commit

History

Repository files navigation

tooltest

Quick start

Install

Test a stdio MCP server

Test a Streamable HTTP MCP endpoint

Output

JSON Schema patterns

Exit codes

Use it in CI / tests

CLI examples

Migration note (legacy generator removal)

State-machine sourcing

Seed data

Text mining

Corpus debugging

Hosted MCP integration tests

Verbose rmcp tracing

External Rust test example

Agent-assisted “fix loop” prompt examples

Codex prompt

Claude prompt

Tips

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages