RLHF

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

May 1, 2025

© 2025 Me. This work is licensed under CC BY NC ND 4.0.
