v0.11 Release Notes (Sep 16, 2025)

2025-09-16
v0.11.0

New Features

Select multiple scorers when creating tests

Test creation now supports selecting multiple scorers at once instead of one at a time. The dialog includes search filtering to quickly find the scorers you need, and the system validates compatibility between your dataset type and selected scorers.

Run tests directly from dataset tables

Dataset tables now include action buttons that let you run tests directly from a dataset. No more navigating to the tests page and hunting for the right dataset.

Broader OpenTelemetry compatibility

The trace ingestion endpoint now accepts both JSON and Protobuf formats, automatically detecting the content type and parsing accordingly. This expands compatibility with different OpenTelemetry clients and language SDKs beyond just Python.

Fixes

No bug fixes in this release.

Improvements

Faster, more efficient exports

Trace exports now stream directly to disk instead of buffering in memory, making it possible to download massive datasets without browser memory issues.

Better data consistency and validation

Dataset examples now return in consistent chronological order. The Dataset.add_examples() method includes type validation to catch incorrect usage of data types earlier. Project activity timestamps now accurately reflect the latest activity across test runs, traces, and datasets.

Updated Terms of Use

Replaced the concise Terms of Service with a comprehensive Terms of Use document covering Customer Obligations, Customer Data, Fees and Payment Terms, and AI Tools usage. Effective September 4, 2025.