LLM evaluation

BSC Group: Computer Sciences Software

This technology integrates a list of benchmarks for LLMs and computes their performance. Includes medical benchmarks, general purpose as well as bias and toxicity ones.

Software Author: 

Pablo Martin, Daniel Hinjos, Anna Arias

License: 

Apache License (Version 2.0)

Primary tabs