BeClaude
Back to News
Release2022-10-19

Scaling laws for reward model overoptimization

Source: OpenAI

openaigpt