Overview
This example walks through running Arksim against an OpenClaw personal AI assistant. OpenClaw is an open-source assistant that runs on your own hardware and can manage tasks, smart home devices, messaging, calendar, and files. You use this setup to simulate and evaluate conversations against your own OpenClaw deployment. The agent is exposed via a Chat Completions-compatible endpoint; the exampleagent_config.json is already configured for the OpenClaw gateway.
Prerequisites
Before getting started:- OpenClaw installed and configured: openclaw.ai
- OpenClaw gateway running with the HTTP Chat Completions endpoint enabled
- OpenAI API key: used by Arksim to power the simulated user
- OpenClaw gateway token: for authenticating to your OpenClaw instance (see
~/.openclaw/openclaw.jsonundergateway.auth.token)
Scenarios
The example ships with pre-built scenarios inscenarios.json representing realistic interactions with a personal AI assistant. Each scenario defines a simulated user with a distinct persona, goal, and background knowledge (files, smart home setup, contacts, tasks, etc.).
The scenario goals are:
- Manage reminders and to-do lists for files and storage (cleanup Downloads, cloud folders, storage alerts, important documents).
- Control and automate smart home devices and scenes (lights, thermostat, blinds, locks, scenes like Good Morning, Movie Night, Leaving Home, Goodnight).
- Send and manage messages across WhatsApp, iMessage, Slack, email, and Telegram using contacts and recent conversations.
- Manage calendar and schedule (e.g. call dentist to confirm appointment, schedule car maintenance, recurring reminders, upcoming tasks).
- Organize files and documents across local folders and cloud storage (cleanup Downloads, duplicate photos, Trash, important items and shared folders).
scenarios.json in the example directory and can be edited or extended to reflect your assistant’s capabilities.
Setup
Enable Chat Completions in OpenClaw
In
~/.openclaw/openclaw.json, enable the HTTP Chat Completions endpoint:Set environment variables
~/.openclaw/openclaw.json under gateway.auth.token.Verify the OpenClaw connection
Before running simulation, confirm the gateway is responding:You should receive a valid JSON response from the gateway.
Configuration
The example uses a single config file for both simulation and evaluation. Key settings:simulated_user_prompt_template (Jinja) that uses scenario.agent_context, scenario.goal, scenario.knowledge, and simulation.profile to drive the simulated user.
The example directory contains
config.yaml, scenarios.json, and agent_config.json. There are no separate simulate or evaluate config files.Output
Results are written under the example directory:| Location | Contents |
|---|---|
./results/simulation/simulation.json | Simulated conversations from the simulation step |
./results/evaluation/evaluation.json | Evaluation results (per-turn and per-conversation scores, unique errors) |
./results/evaluation/final_report.html | Interactive HTML report for browsing and sharing results |
Example Files
| File | Description |
|---|---|
config.yaml | Simulate and evaluate configuration (single pipeline). |
scenarios.json | Pre-built scenarios for the personal assistant. |
agent_config.json | OpenClaw agent config: Chat Completions endpoint, port 18789, ${OPENCLAW_TOKEN}, and x-openclaw-agent-id: main. |
Adapting to Your Own Assistant
To use this example as a starting point for your own personal assistant:- Scenarios: Edit or add scenarios in
scenarios.jsonso goals and knowledge reflect your assistant’s capabilities. - Agent config: If your endpoint, port, or auth headers differ from the OpenClaw defaults, update
agent_config.jsonaccordingly.