Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.arklex.ai/llms.txt

Use this file to discover all available pages before exploring further.

Prerequisites

Option A: Test your own agent

If you already have an agent running, use arksim init to scaffold a starter config and scenarios file, then point it at your endpoint.
1

Install and set your API key

pip install arksim
export OPENAI_API_KEY="your-api-key"
For other providers: pip install "arksim[anthropic]" or pip install "arksim[google]".
2

Scaffold a starter config

arksim init
This creates three files in the current directory:
  • config.yaml pointing at ./my_agent.py with sensible defaults
  • scenarios.json with four domain-agnostic starter scenarios (happy path, out of scope, ambiguous intent, multi-step)
  • my_agent.py with a BaseAgent subclass ready to fill in (no server needed)
Open my_agent.py and replace the execute() body with your agent logic. All files include inline comments explaining each field.
For HTTP or A2A agents, use arksim init --agent-type chat_completions or arksim init --agent-type a2a instead. Use --force to re-scaffold if files already exist.
3

Run simulation and evaluation

arksim simulate-evaluate config.yaml
4

View results

Open results/final_report.html in your browser for an interactive report with scores, failure categories, and full conversation transcripts.

Option B: Explore a pre-built example

If you want to see ArkSim in action before connecting your own agent, try one of the included examples.
1

Install and set your API key

pip install arksim
export OPENAI_API_KEY="your-api-key"
2

Download examples

arksim examples
This creates an examples/ folder with ready-to-run projects (bank-insurance, e-commerce, customer-service, openclaw).
3

Run simulation and evaluation

cd examples/bank-insurance
arksim simulate-evaluate config.yaml
4

View results

Open results/evaluation/final_report.html in your browser for scores, failure analysis, and full conversation transcripts.

Using other LLM providers

ArkSim uses OpenAI by default for both the simulated user and the evaluator. To use Anthropic or Google instead, set the provider in your config.yaml:
pip install "arksim[anthropic]"
export ANTHROPIC_API_KEY="your-api-key"
model: claude-opus-4-6
provider: anthropic

Next Steps

Now that you’ve run your first simulation and evaluation, here’s where to go next.