Bellman Equations

We all learn through reward systems. Our bodies have also adapted through ‘reward’ systems.

I went through Reinforcement Learning: Theory and Algorithms, by Alekh Agarwal, Nan Jiang, Sham Kakade, and Wen Sun. The book was too abstract for me, and I was drowning in an ocean of equations. The Bellman equation is introduced at the very beginning.

The Bellman equation is fundamental to reinforcement learning algorithms. It tells you how to decompose the value function into the immediate reward plus the discounted value of future returns. Because of this recursive structure, we can apply dynamic programming to compute the value function. Taking the maximum over actions in the same recursion gives the Bellman optimality equation.
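To make the decomposition concrete, here is a minimal sketch in Python. For a fixed policy, the recursion can be written as V(s) = R(s) + gamma * sum over s' of P(s, s') V(s'), and the optimality version replaces the fixed policy with a max over actions. The two-state MDP, the array names (P_a, R_a), and the helper functions below are all made up for illustration; they do not come from the book or from any library.

```python
import numpy as np


def policy_evaluation(P, R, gamma, tol=1e-10, max_iters=10_000):
    """Repeat the Bellman backup V <- R + gamma * P @ V for a fixed policy.

    P: (S, S) transition matrix under the policy, R: (S,) expected reward.
    """
    V = np.zeros(len(R))
    for _ in range(max_iters):
        V_new = R + gamma * P @ V
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new
    return V


def value_iteration(P_a, R_a, gamma, tol=1e-10, max_iters=10_000):
    """Repeat the Bellman optimality backup V <- max_a [R_a + gamma * P_a @ V].

    P_a: (A, S, S) transition matrices per action, R_a: (A, S) rewards.
    """
    V = np.zeros(P_a.shape[1])
    for _ in range(max_iters):
        Q = R_a + gamma * P_a @ V  # shape (A, S): value of each action in each state
        V_new = Q.max(axis=0)      # act greedily in every state
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new
    return V


# Hypothetical toy MDP: two states, two actions (0 = stay, 1 = switch).
# Staying in state 1 pays reward 1; everything else pays 0.
gamma = 0.9
P_a = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # stay: remain in the current state
    [[0.0, 1.0], [1.0, 0.0]],  # switch: move to the other state
])
R_a = np.array([
    [0.0, 1.0],  # reward for "stay" in state 0 and state 1
    [0.0, 0.0],  # reward for "switch" in state 0 and state 1
])

V_stay = policy_evaluation(P_a[0], R_a[0], gamma)  # policy: always stay
V_star = value_iteration(P_a, R_a, gamma)
print("always-stay policy:", V_stay)
print("optimal:", V_star)
```

In this toy setup, the always-stay policy should give values close to 0 and 10, since state 1 collects 1 + 0.9 + 0.9^2 + ... = 10. The optimal values should be close to 9 and 10, because from state 0 it pays to switch first and then stay.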

Needless to say, I needed to read other references for a better understanding. After going through several online videos and webpages (listed below), I came to appreciate the beauty and logic of this equation a little bit :-(

Other References

Wiki: https://en.wikipedia.org/wiki/Bellman_equation
DEEP REINFORCEMENT LEARNING EXPLAINED: https://towardsdatascience.com/the-bellman-equation-59258a0d3fa7
RL theory by Csaba Szepesvári at the University of Alberta

Deep reinforcement learning series
