DeepMind's Crome: A Breakthrough in AI Reward Models
Google DeepMind, McGill University, and MILA introduce Crome, a novel framework to enhance large language model alignment with human feedback. By leveraging causal understanding and synthetic data, Crome addresses 'reward hacking'—a key challenge in AI training—while improving safety and reasoning capabilities.
