AgentProbe by CarbonHelm

Test your AI agent in 5 minutes, not 5 days.

Paste your system prompt, define your tools, and get an automated reliability evaluation with adversarial testing built in.

NameDescriptionParameters

Generated Test Scenarios

20 scenarios covering happy-path, edge-case, adversarial, and multi-step patterns.

Evaluation Results

--
/ 100

Top Vulnerabilities

Failure Analysis

Recommendations

Export & Share

Pricing

Ship reliable agents. Stop shipping hope.

Free

$0
forever
  • 1 evaluation per day
  • 20 test scenarios per run
  • Basic reliability score
  • JSON export
  • Badge generator

Enterprise

Custom
contact us
  • Everything in Pro
  • 100+ test scenarios per run
  • Dedicated support
  • SSO / SAML
  • Custom adversarial tests
  • SLA guarantee
  • On-premise deployment

Evaluation History

Past evaluations stored locally in your browser.

No evaluations yet. Run your first test to see results here.