Models

Reward Models

1 article in archive

Scaling laws for reward model overoptimization

OpenAI Blog · 1249d ago