Preclinical
Adversarial testing platform for healthcare AI agents.
Preclinical runs adversarial multi-turn conversations against your healthcare AI agents, then grades the transcripts against safety rubrics. It simulates patients using red-team techniques to probe for weaknesses in triage accuracy, medical advice safety, and compliance.
What it does
- Adversarial Pen Testing -- AI-powered attacks that probe for weaknesses using medical-specific vectors
- Rubric-Based Grading -- Automated evaluation against configurable criteria
- Multi-Provider Support -- Test agents across Vapi, LiveKit, Pipecat, OpenAI, and Browser
- Evidence-Based Reports -- Detailed explanations with transcript quotes for every verdict
Quick Start
git clone https://github.com/Mentat-Lab/preclinical.git
cd preclinical
cp .env.example .env
# Add model credentials (OpenAI or Anthropic)
docker compose up
- UI + API: http://localhost:3000 (single port serves both)
Create an agent, pick scenarios, and start a test run.
The Problem
Healthcare AI agents are being deployed rapidly, but without standardized testing frameworks. This creates risks:
| Risk | Description |
|---|---|
| Safety | Agents may provide dangerous medical advice |
| Compliance | Agents may violate HIPAA or state regulations |
| Quality | Agents may have poor accuracy, tone, or response times |
| Trust | Healthcare organizations can't confidently deploy AI agents |
Use It Your Way
Open http://localhost:3000, create an agent, and start testing.
/plugin marketplace add Mentat-Lab/preclinical
/plugin install preclinical@preclinical
/preclinical:setup
/preclinical:run, /preclinical:benchmark, /preclinical:diagnose, and more.
If you clone the repo, the plugin loads automatically — no install step needed.
Then just ask: "Test my healthcare agent with emergency scenarios"
Works with Cursor, Windsurf, Copilot, Cline, and more.
Next Steps
- Quickstart -- Get up and running in minutes
- CLI & Agent Skills -- Use from terminal or AI coding assistants
- How It Works -- Understand the testing flow
- Integrations -- Connect your AI agent
- API Reference -- Full REST API documentation
- CI/CD -- Automate testing in your pipeline