Why do we use np.random.choice? #265
Unanswered
pavolkacej asked this question in Q&A
Replies: 0 comments
I know that np.random.choice() takes a probability distribution, so an action with a larger weight has a higher probability of being chosen. However, don't we want to pick the best action anyway? Shouldn't we just use the action with the highest value here?
https://github.com/suragnair/alpha-zero-general/blob/master/Coach.py#L63
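To make the comparison concrete, here is a minimal sketch of the two options I mean. The policy vector `pi` and its values are made up for illustration and are not taken from Coach.py; the fixed seed is only there so the run is repeatable.

```python
import numpy as np

# Hypothetical policy vector over 4 actions (illustrative values only,
# not taken from Coach.py).
pi = np.array([0.1, 0.6, 0.2, 0.1])

np.random.seed(0)  # fixed seed so the sketch is repeatable

# Option 1: sample an action index in proportion to its probability.
sampled_action = np.random.choice(len(pi), p=pi)

# Option 2: always take the action with the highest probability.
greedy_action = int(np.argmax(pi))

# Over many draws, sampling picks each action roughly in proportion to pi,
# while argmax would return the same index every time.
draws = np.random.choice(len(pi), size=10_000, p=pi)
empirical = np.bincount(draws, minlength=len(pi)) / draws.size

print("sampled:", sampled_action)
print("greedy:", greedy_action)
print("empirical frequencies:", empirical)
```

So with sampling, lower-weighted actions still get chosen some of the time, whereas the argmax version never picks them.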