Release 2020-09-04
Learning to summarize with human feedback
Source: OpenAI
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.