v0.8 Release Notes (Aug 25, 2025)
New Features
Manage custom scorers in the UI
View and manage all your custom scorers directly in the platform. We've added a new tabbed interface that separates Prompt Scorers and Custom Scorers, making it easier to find what you need. Each custom scorer now has a dedicated page where you can view the code and dependencies in read-only format—perfect for team members who want to understand scoring logic without diving into codebases.
Track success rates and test history
The Tests dashboard now includes an interactive success rate chart alongside your existing scorer metrics. See how often your tests pass over time and quickly identify regressions. You can also customize the view to show the past 30, 50, or 100 tests, with smart time axis formatting that adjusts based on data density (month/day for sparse data, down to minute/second for high-frequency testing).
Better navigation throughout the platform
We've added back buttons to nested pages (Tests, Datasets, Annotation Queue, and Scorers) so you can navigate more intuitively. The sidebar now includes an enhanced support menu that consolidates links to documentation, GitHub, Discord, and support in one convenient dropdown.
Fixes
Registration error handling
Registration now shows clear error messages when you try to use an existing email.
Latency chart consistency
Latency charts display consistent units across the Y-axis and tooltips.
Improvements
Enhanced security
Migrated email templates to Jinja2 with autoescaping to prevent HTML injection.
Improved trace tables
You can now sort your traces by Name, Created At, Status, Tags, Latency, and LLM Cost.
Small platform enhancements
Click outside the trace view popout to dismiss it. Rules interface sections now expand and collapse smoothly, and Slack integration status is clearer with direct links to settings when not connected.