Judgeval Python SDKResponse Types

ScoringResult

Complete evaluation results for a single example

Contains the output of one or more scorers applied to a single example. Represents the complete evaluation results for one input with its actual output, expected output, and all applied scorer results.

successrequired:bool
Whether the evaluation was successful. True when all scorers applied to this example returned a success.
scorers_data:List[ScorerData]

List of individual scorer results for this evaluation

data_object:Example

The original example object that was evaluated

name:str

Optional name identifier for this scoring result

trace_id:str

Unique identifier linking this result to trace data

run_duration:float

Time taken to complete the evaluation in seconds

evaluation_cost:float
Estimated cost of running the evaluation (e.g., API costs)

Usage Examples

from judgeval import JudgmentClient

client = JudgmentClient()
results = client.evaluate(examples=[...], scorers=[...])

for result in results:
    if result.success:
        print(f"Evaluation succeeded in {result.run_duration:.2f}s")
        for scorer_data in result.scorers_data:
            print(f"  {scorer_data.name}: {scorer_data.score}")
    else:
        print("Evaluation failed")

On this page