In this episode, Tom gives us a lesson on all things feedback, mostly where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us, on another great blast from the past on The Retort!
We also have brought you video this week!

The Retort AI Podcast
Distilling the major events and challenges in the world of artificial intelligence and machine learning, from Thomas Krendl Gilbert and Nathan Lambert.
Distilling the major events and challenges in the world of artificial intelligence and machine learning, from Thomas Krendl Gilbert and Nathan Lambert.Listen on
Substack App
Apple Podcasts
Spotify
YouTube
Overcast
Pocket Casts
RSS Feed
Recent Episodes
Share this post