Prerequisites
Before you begin, make sure you have:
- Completed the Installation
- An OpenAI API key (get one here)
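If you want to confirm the key is visible to your shell before starting, a minimal sketch like the one below can help. It assumes the key is exposed through the `OPENAI_API_KEY` environment variable, which is the name OpenAI's own SDKs read by default; whether the simulator reads the same variable is an assumption, so check your agent configuration. The helper name `api_key_status` is hypothetical.

```python
import os


def api_key_status(env=None, var_name="OPENAI_API_KEY"):
    """Report whether the key variable is set, without printing the key itself.

    var_name defaults to OPENAI_API_KEY (an assumption about how the key is
    supplied); pass a different name if your setup uses one.
    """
    env = os.environ if env is None else env
    value = env.get(var_name, "").strip()
    if not value:
        return f"{var_name} is not set"
    # Only the length is reported so the secret never reaches the terminal.
    return f"{var_name} is set ({len(value)} characters)"


if __name__ == "__main__":
    print(api_key_status())
```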
Run Your First Evaluation
- Binary
- Docker
Set up the example agent
Navigate to the insurance customer service example and configure the agent:
What’s included:
- Pre-configured agent settings
- Sample knowledge base for scenario building and evaluation
- Ready-to-use runtime configuration
- A prompt-based agent using OpenAI’s Chat Completions API
Run the simulator
The simulation will take a few minutes to complete. You’ll see progress updates in your terminal.
View your results
Once complete, find your evaluation results in the results/ folder:
| File | Description |
|---|---|
| results/evaluation/final_report.html | Visual summary with aggregated metrics and insights |
| results/conversation/ | Individual conversation logs with message-by-message details |
| results/evaluation/ | Granular per-turn and per-conversation scores |
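To check that a run produced the outputs listed above, a small script along these lines may be useful. It is a sketch, not part of the simulator: the function names are hypothetical, and it assumes you run it from the directory that contains the results/ folder.

```python
from pathlib import Path
import webbrowser

# Paths taken from the table above, relative to the results/ folder.
EXPECTED = [
    "evaluation/final_report.html",
    "conversation",
    "evaluation",
]


def missing_outputs(results_dir="results"):
    """Return the expected outputs that do not exist under results_dir."""
    root = Path(results_dir)
    return [rel for rel in EXPECTED if not (root / rel).exists()]


def open_report(results_dir="results"):
    """Open final_report.html in the default browser; return False if absent."""
    report = (Path(results_dir) / "evaluation" / "final_report.html").resolve()
    if not report.is_file():
        return False
    return webbrowser.open(report.as_uri())


if __name__ == "__main__":
    missing = missing_outputs()
    if missing:
        print("Missing outputs:", ", ".join(missing))
    else:
        open_report()
```

If anything is reported missing, the run most likely did not finish, so check the terminal output from the simulator first.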
Open final_report.html in your browser to see your agent’s performance!
Troubleshooting
Missing config file error
The simulator looks for agent_config.json in the agent setup directory. Make sure you’ve renamed the config file to agent_config.json.
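If you are unsure whether the rename happened, the sketch below checks a directory for agent_config.json and, when it is absent, lists the other JSON files there as rename candidates. The function name and the directory you pass in are placeholders; substitute the path to your own agent setup directory.

```python
from pathlib import Path


def check_agent_config(setup_dir):
    """Return (found, candidates) for the given agent setup directory.

    found is True when agent_config.json exists; otherwise candidates lists
    the other .json files in the directory that might need renaming.
    """
    folder = Path(setup_dir)
    if (folder / "agent_config.json").is_file():
        return True, []
    candidates = sorted(p.name for p in folder.glob("*.json"))
    return False, candidates


if __name__ == "__main__":
    # "." is a placeholder; point this at your agent setup directory.
    found, candidates = check_agent_config(".")
    if found:
        print("agent_config.json found")
    else:
        print("agent_config.json missing; other JSON files:", candidates)
```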
Docker can't find files
Use an absolute path for the bind mount to ensure the container sees your local files:
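One way to avoid relative-path mistakes is to resolve the local directory first and build the mount argument from the result. The sketch below does that for Docker's standard `-v host:container` flag. The helper name and the container path `/app/data` are hypothetical; use the container path and image name from your own setup.

```python
from pathlib import Path


def bind_mount_arg(local_dir, container_dir):
    """Build a `-v` argument for `docker run` with an absolute host path."""
    # resolve() turns "." or "../example" into a full path, which is what
    # the bind mount needs in order to point at the intended directory.
    host = Path(local_dir).expanduser().resolve()
    return ["-v", f"{host}:{container_dir}"]


if __name__ == "__main__":
    # "/app/data" is a placeholder container path, not taken from the docs.
    print(" ".join(bind_mount_arg(".", "/app/data")))
```

Place the printed argument in your `docker run` command before the image name.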