PromptScorer

Evaluate agent behavior based on a rubric you define and iterate on the platform.