Argmax
A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm
Argmax
1: Reward is Enough
•
Vahe Hagopian, Taka Hasegawa, Farrukh Rahman
•
Season 1
•
Episode 1
This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.
Todays paper: Reward is Enough
Summary of the paper
The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.
Highlights of discussion
- High level overview of Reinforcement Learning
- How evolution can be encoded as a reward maximization problem
- What is the one reward signal we are trying to optimize?