Research2026-04-30

A Scoping Review of LLM-as-a-Judge in Healthcare and the MedJUDGE Framework

arXiv:2604.25933v1 Announce Type: cross Abstract: As large language models (LLMs) increasingly generate and process clinical text, scalable evaluation has become critical. LLM-as-a-Judge (LaaJ), which uses LLMs to evaluate model outputs, offers a scalable alternative to costly expert review, but...

Read Original Article on Arxiv CS.AI

arxivpapers