BeClaude
Back to News
Release2024-07-24

Improving Model Safety Behavior with Rule-Based Rewards

Source: OpenAI

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.

openaigptsafety