Testing

Questi contenuti non sono ancora disponibili nella tua lingua.

Soleri uses three layers of testing to ensure every engine feature works correctly — from individual modules to full cross-package integration.

Quick Reference

npm test              # Unit tests (core, forge, CLI)
npm run test:e2e      # E2E integration tests (800+ tests, 30 files)

Test Layers

Layer	Command	Scope	Speed
Unit	`npm test`	Individual modules within each package	Fast (~10s)
E2E	`npm run test:e2e`	Cross-package integration, real databases, real transports	Medium (~70s)
Smoke	Manual	Full scaffold → build → run cycle with a real agent	Slow (~2min)

When to run what

Changed	Unit	E2E	Smoke
`@soleri/core` engine module	Yes	Yes	—
`@soleri/core` facade	Yes	Yes	—
`@soleri/forge` template	Yes	Yes	Yes
`@soleri/cli` command	Yes	Yes	—
Transport layer	—	Yes	—
Before a release	Yes	Yes	Yes

Unit Tests

Each package has its own test suite:

npm test                                    # All packages
npm run test --workspace=@soleri/core       # Core engine only
npm run test --workspace=@soleri/forge      # Forge templates only
npm run test --workspace=@soleri/cli        # CLI commands only

Unit tests verify individual modules in isolation — vault operations, brain scoring, plan state machine, scaffold output, CLI argument parsing.

E2E Tests

The E2E suite (e2e/) tests cross-package integration with real SQLite databases, real MCP transport, and real scaffolded agents.

npm run test:e2e

What’s covered

Test File	What it verifies
`agent-activation`	Activation lifecycle and identity injection of scaffolded agents
`agent-behavioral`	Captured knowledge appears in searches, feedback affects pattern ranking
`agent-simulation`	Simulates a user’s first week with a Soleri agent as sequential behavior specs
`brain-memory-sessions`	Brain learning loop, intelligence building, memory capture/search, session lifecycle
`capability-packs`	Pack installation, capability resolution, graceful degradation, CLI integration
`chat-context-agency`	Chat, context, agency, control facades plus pack/hook lifecycle
`cli-agent-lifecycle`	CLI agent management (scaffold, build, refresh, diff), generated code compiles
`cli-commands`	Non-interactive create, list, doctor, add-domain, governance
`cli-flock-guard`	Flock file-based guard for CLI concurrency safety
`cli-hooks-conversion`	Hook pack installation, conversion, and lifecycle management
`comprehensive-features`	Every op across all 4 domain packs plus flow engine with realistic inputs
`concurrent-and-performance`	Concurrent facade calls without race conditions, vault search at 1000+ entries
`curator-brain-governance`	Curator grooming, health audits, brain feedback loop, governance lifecycle
`debug-facades`	Facade assembly, op registration, schema correctness across domain packs
`error-paths`	Negative scenarios: invalid ops, missing params, nonexistent resources
`filetree-agent`	Full v7 file-tree agent architecture (scaffold → engine → MCP → ops)
`full-pipeline`	All 13+ engine facades through the dispatch layer
`knowledge-traceability`	Single knowledge piece traced through every system touchpoint
`mcp-transport`	Over-the-wire MCP via real stdio subprocess
`operator-profile`	Operator facade ops (personality learning, signals, adaptation)
`parity-salvador-soleri`	Salvador MCP vs Soleri domain packs output parity for 8 critical ops
`persistence`	Vault/brain/plan data survives runtime close/reopen
`planning-orchestration`	Planning lifecycle, orchestration pipeline, playbook matching, drift reconciliation
`scaffold-and-build`	Template generates → npm install → TypeScript compiles
`scaffold-edge-cases`	Many domains, telegram, tones, skills filter, edge configurations
`skills-and-domains`	SKILL.md frontmatter validation, domain data integrity
`smoke-salvador-agent`	Salvador agent with all 4 domain packs boots, registers ops, executes them
`system-quality`	Vault-informed orchestration, brain recommendation quality at scale
`transports`	HTTP/SSE and WebSocket transport layers, session management, rate limiting
`vault-zettelkasten`	Zettelkasten user journeys: capture → search → link → traverse → orphan detection

How E2E tests work

E2E tests use two patterns depending on what they’re testing:

In-process facade testing — Creates a real AgentRuntime with an in-memory vault, captures facade handlers via a mock MCP server, and calls ops directly. Fast (~30ms for 25 tests) and exercises the full engine stack without subprocess overhead.

const runtime = createAgentRuntime({
  agentId: 'test',
  vaultPath: ':memory:',
});
const facades = createSemanticFacades(runtime, 'test');

Over-the-wire MCP testing — Scaffolds a real agent, builds it, spawns it as a child process, and communicates via MCP stdio transport. Verifies the complete pipeline from scaffold through production runtime.

Writing new E2E tests

E2E tests live in the e2e/ directory and use Vitest. The E2E config (e2e/vitest.config.ts) sets a 120-second timeout and runs tests in a single fork.

Adding a test file:

Create e2e/your-feature.test.ts
Import from @soleri/core or @soleri/forge/lib — path aliases are configured
Use createAgentRuntime({ vaultPath: ':memory:' }) for in-process tests
Clean up temp directories in afterAll

Tips:

Use :memory: vaultPath for fast in-process tests
Use tmpdir() for file-backed tests that verify persistence
Set generous timeouts for scaffold tests (60s+ for beforeAll)
Random port allocation for transport tests to avoid conflicts

Smoke Tests

After any change to @soleri/forge templates, manually scaffold a real agent and verify it works end-to-end:

npx @soleri/cli create smoke-test
cd smoke-test-mcp
npm install
npm run build
npm test

This catches issues that E2E tests might miss — like template syntax errors that only surface during a full build, or generated test failures.

CI

All unit and E2E tests run in CI on every push and pull request. The GitHub Actions workflow is at .github/workflows/ci.yml.

Next: Under the Hood — how the vault, brain, and memory actually work.