Auto Evaluator
Overview
Example Use
from RAGchain.benchmark import AutoEvaluator
pipeline = <your pipeline>
questions: list[str] = <your list of questions>
evaluator = AutoEvaluator(pipeline, questions)
result = evaluator.evaluate()
# print result summary (mean values)
print(result.results)
# print result DataFrame
print(result.each_results)Last updated