Nov 26, 2020
Incorrect :) What you get using DQN is exactly the probabilities to select each action. These probabilities are based on the predicted Q-Values.
Incorrect :) What you get using DQN is exactly the probabilities to select each action. These probabilities are based on the predicted Q-Values.
Lives in Tel-Aviv, Israel 🇮🇱 See me on shakedzy.xyz