Method of Model Evaluation which assesses AI system performance against a predefined task, such as mapping AI system outputs to a dataset of prompts and responses.
Last Updated:
Source:
Google DeepMind, Socio-technical Evaluations of Generative AI Systems
Additional source:
Related content:
Schedule a call with one of our experts