Each January, while many students enjoy a well-earned winter break, a select group of Harvard undergraduates embarks on a journey that transcends traditional learning. Founded by Dominic Mao,…
In domains as diverse as mastering video games, controlling robotic limbs, and finetuning ChatGPT, a family of approaches known collectively as “reinforcement learning” (RL) has revolutionized the field…