Here is an informative PhD thesis on reinforcement learning. Section 2.2.3 does a particularly good job of explaining how the Boltzmann distribution is used with Q-values.
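
For anyone who wants the gist before opening the thesis: Boltzmann (softmax) exploration picks action a with probability proportional to exp(Q(s, a) / T), where T is a temperature. Below is a minimal sketch of that idea in Python. It is not taken from the thesis; the function names `boltzmann_probs` and `sample_action` and the default temperature are my own assumptions for illustration.

```python
import math
import random


def boltzmann_probs(q_values, temperature=1.0):
    """Turn a list of Q-values into Boltzmann (softmax) action probabilities.

    Hypothetical helper for illustration; `temperature` must be positive.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the max Q-value before exponentiating to avoid overflow.
    # This does not change the resulting probabilities.
    q_max = max(q_values)
    exps = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(q_values, temperature=1.0, rng=random):
    """Sample an action index according to the Boltzmann probabilities."""
    probs = boltzmann_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]
```

A high temperature makes the choice close to uniform (more exploration), while a temperature near zero makes it close to greedy selection of the highest Q-value.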