BeClaude
Research2026-05-11

Evaluating Large Language Models in Scientific Discovery

Source: Arxiv CS.AI

arXiv:2512.15567v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly applied to scientific research, yet prevailing science benchmarks probe decontextualized knowledge and overlook the iterative reasoning, hypothesis generation, and observation interpretation that drive...

arxivpapers