OfflineTestResult
The outcome of an offline test run, returned by `client.offlineTests.run()`.
The outcome of an offline test run, returned by client.offlineTests.run().
Properties
testRunId
:string
The test run id.
status
:string
Final run status.
uiResultsUrl
:string | undefined
Link to the results page in the dashboard.
results
:OfflineExampleResult[]
One entry per dataset example, with per-judge scorer rows.
agentOfflineTraceIds
:Record<string, string>
Mapping of example id to the agent's offline trace (agent testing only).
passed
:boolean | null
Whether every row passed its pass condition; null when none was evaluated.
OfflineScorerData
One judge's result for one example.
OfflineTestRunner
Executes the offline-test lifecycle for a test config (the TypeScript port of the Python `OfflineTestRunner`): resolve the dataset version, optionally run the agent to produce offline traces, create the test run, wait for terminal status, fetch results, evaluate the pass condition, and report successes.
Last updated on