Benchmarking

A method of Model Evaluation that assesses an AI system's performance on a predefined task, for example by comparing the system's outputs against a dataset of prompts and expected responses.
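As a rough illustration of the prompt-and-response case, the sketch below scores a model by exact-match accuracy over a small dataset. Everything in it is an assumption made for the example: the function name `exact_match_accuracy`, the toy dataset `BENCHMARK`, the stand-in `toy_model`, and the choice of exact match as the metric. Real benchmarks often use other metrics and much larger datasets.

```python
from typing import Callable

# Hypothetical toy benchmark: (prompt, expected response) pairs.
BENCHMARK: list[tuple[str, str]] = [
    ("2 + 2 =", "4"),
    ("Capital of France?", "Paris"),
    ("Opposite of hot?", "cold"),
]


def exact_match_accuracy(
    model: Callable[[str], str],
    dataset: list[tuple[str, str]],
) -> float:
    """Return the fraction of prompts whose output matches the reference.

    Comparison ignores case and surrounding whitespace (an assumption of
    this sketch, not a standard requirement).
    """
    if not dataset:
        raise ValueError("benchmark dataset is empty")
    correct = sum(
        model(prompt).strip().lower() == reference.strip().lower()
        for prompt, reference in dataset
    )
    return correct / len(dataset)


def toy_model(prompt: str) -> str:
    """Stand-in for an AI system; answers two of the three prompts."""
    answers = {"2 + 2 =": "4", "Capital of France?": "paris"}
    return answers.get(prompt, "unknown")


score = exact_match_accuracy(toy_model, BENCHMARK)
print(f"exact-match accuracy: {score:.2f}")
```

Because the dataset and metric are fixed in advance, the same call can be repeated for different systems and the resulting scores compared directly.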
