Documentation Index
Fetch the complete documentation index at: https://docs.xhipai.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
AccuracyEval uses an LLM judge to score agent responses against expected answers on a 0.0–1.0 scale.
Quick Start
Configuration
| Option | Type | Default | Description |
|---|---|---|---|
name | string | required | Name of the evaluation |
agent | Agent | required | Agent to evaluate |
judge | ModelProvider | required | Model used for scoring |
cases | EvalCase[] | required | Test cases with input/expected |
threshold | number | 0.7 | Minimum score to pass |
timeoutMs | number | 30000 | Timeout per case |