ScoringResult
The combined result of running scorers against a single example.
Returned by Evaluation.run(). Inspect scorers_data for per-scorer
values, errors, and metadata.
results = evaluation.run(
    examples=examples,
    scorers=["faithfulness", "answer_relevancy"],
    eval_run_name="nightly",
)
for result in results:
    for scorer in result.scorers_data:
        print(f"{scorer.name}: {scorer.value}")
Attributes
scorers_data: List[ScorerData]
Per-scorer results (see ScorerData).
data_object: Union[TraceSpan, Example]
The Example or TraceSpan that was scored.
name: Optional[str]
The evaluation run name. Default: None
trace_id: Optional[str]
Associated trace ID, if applicable. Default: None
run_duration: Optional[float]
How long the evaluation took, in seconds. Default: None
evaluation_cost: Optional[float]
Total cost in USD. Default: None
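Because run_duration and evaluation_cost default to None, code that aggregates across results should skip missing values rather than sum them directly. The following is a minimal sketch under stated assumptions: StubResult is a hypothetical stand-in that mirrors only the documented optional attribute names (it is not the library's ScoringResult class), and summarize is an illustrative helper, not part of the API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class StubResult:
    """Hypothetical stand-in exposing the documented optional attributes."""
    name: Optional[str] = None
    trace_id: Optional[str] = None
    run_duration: Optional[float] = None
    evaluation_cost: Optional[float] = None


def summarize(results: List[StubResult]) -> dict:
    """Total cost and duration, ignoring results where the field is None."""
    costs = [r.evaluation_cost for r in results if r.evaluation_cost is not None]
    durations = [r.run_duration for r in results if r.run_duration is not None]
    return {
        "total_cost_usd": sum(costs),
        "total_duration_s": sum(durations),
        "missing_cost": len(results) - len(costs),
    }


results = [
    StubResult(name="nightly", run_duration=1.5, evaluation_cost=0.25),
    StubResult(name="nightly"),  # cost and duration left at their None defaults
    StubResult(name="nightly", run_duration=0.5, evaluation_cost=0.5),
]
summary = summarize(results)
print(summary)
```

Reporting how many results lacked a cost makes it clear when a total is a lower bound rather than a complete figure.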
to_dict()
def to_dict() -> APIScoringResult:
Returns
APIScoringResult
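A common reason to call to_dict() is to persist or transmit a result. The sketch below assumes that the returned APIScoringResult behaves like a JSON-serializable mapping, which this page does not confirm. StubScorerData and StubScoringResult are hypothetical stand-ins carrying only a few of the documented field names, and their to_dict() is illustrative rather than the library's implementation.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class StubScorerData:
    """Hypothetical stand-in for ScorerData (name and value only)."""
    name: str
    value: float


@dataclass
class StubScoringResult:
    """Hypothetical stand-in for ScoringResult."""
    scorers_data: List[StubScorerData] = field(default_factory=list)
    name: Optional[str] = None

    def to_dict(self) -> Dict[str, Any]:
        # Assumption: the real method returns a plain, JSON-friendly mapping.
        return asdict(self)


result = StubScoringResult(
    scorers_data=[StubScorerData(name="faithfulness", value=0.9)],
    name="nightly",
)

# Serialize to JSON and read it back to confirm the round trip.
payload = json.dumps(result.to_dict(), sort_keys=True)
restored = json.loads(payload)
print(payload)
```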