BeClaude
Back to News
Release2024-11-19

Judge Arena: Benchmarking LLMs as Evaluators

Source: Hugging Face

open-sourcemodelsbenchmark