EvalResults.scores_by_variant()
Aggregate mean scores per variant per dimension.
Usage
EvalResults.scores_by_variant()
Returns
dict[str, dict[str, float]] - Mapping of variant name -> dimension name -> mean score.
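The shape of the returned mapping can be illustrated with a minimal sketch. This is not the library's implementation: the helper name mean_scores_by_variant, the (variant, dimension, score) record layout, and the sample data are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import fmean


def mean_scores_by_variant(records):
    """Hypothetical stand-in that mimics the documented return shape.

    `records` is assumed to be an iterable of (variant, dimension, score)
    tuples; the real EvalResults object may store its data differently.
    """
    # Collect raw scores under variant -> dimension.
    grouped = defaultdict(lambda: defaultdict(list))
    for variant, dimension, score in records:
        grouped[variant][dimension].append(score)

    # Reduce each list of scores to its arithmetic mean.
    return {
        variant: {dim: fmean(vals) for dim, vals in dims.items()}
        for variant, dims in grouped.items()
    }


# Made-up sample data, for illustration only.
records = [
    ("baseline", "accuracy", 0.5),
    ("baseline", "accuracy", 1.0),
    ("baseline", "fluency", 0.75),
    ("candidate", "accuracy", 1.0),
]
means = mean_scores_by_variant(records)
print(means["baseline"]["accuracy"])
```

Under these assumptions, a single score is read with two lookups, variant first and then dimension, for example means["baseline"]["accuracy"].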