v0.11 Release Notes (Sep 16, 2025)
New Features
Select multiple scorers when creating tests
Test creation now supports selecting multiple scorers at once instead of one at a time. The dialog includes search filtering to quickly find the scorers you need, and the system validates compatibility between your dataset type and selected scorers.
Run tests directly from dataset tables
Dataset tables now include action buttons that let you run tests directly from a dataset. No more navigating to the tests page and hunting for the right dataset.
Broader OpenTelemetry compatibility
The trace ingestion endpoint now accepts both JSON and Protobuf formats, automatically detecting the content type and parsing accordingly. This expands compatibility with different OpenTelemetry clients and language SDKs beyond just Python.
Fixes
No bug fixes in this release.
Improvements
Faster, more efficient exports
Trace exports now stream directly to disk instead of buffering in memory, making it possible to download massive datasets without browser memory issues.
Better data consistency and validation
Dataset examples now return in consistent chronological order. The Dataset.add_examples()
method includes type validation to catch incorrect usage of data types earlier. Project activity timestamps now accurately reflect the latest activity across test runs, traces, and datasets.
Updated Terms of Use
Replaced the concise Terms of Service with a comprehensive Terms of Use document covering Customer Obligations, Customer Data, Fees and Payment Terms, and AI Tools usage. Effective September 4, 2025.