The reward hypothesis

Created time: Sep 25, 2022 11:50 AM
Main Box
Tags: AI Alignment, Human values, Machine Learning, Philosophy
The reward hypothesis states that all of what we mean by goals and purposes can be well thought of as maximization of the expected value of the cumulative sum of a received scalar signal (reward).
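
One way to write this statement in symbols is sketched below. The notation is an assumption for illustration and does not come from the note itself: R_t is the scalar reward received at time step t, T is the last step of an episode, G is the cumulative sum of rewards, and \pi is the agent's policy. The sketch uses a finite, undiscounted sum because the statement speaks only of a "cumulative sum"; discounted or infinite-horizon variants would change the definition of G but not the overall shape of the objective.

```latex
% Assumed notation (not in the original note):
%   R_t : scalar reward received at time step t
%   T   : final time step of an episode
%   G   : cumulative sum of received rewards
%   \pi : the agent's policy (its rule for choosing actions)
G = \sum_{t=1}^{T} R_t
\qquad\text{and}\qquad
\pi^{*} \in \arg\max_{\pi} \; \mathbb{E}_{\pi}\!\left[ G \right]
```

Read this way, the hypothesis claims that any goal or purpose can be captured by a suitable choice of the reward signal R_t, so that pursuing the goal amounts to choosing behavior that maximizes the expected value of G.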