In tutorial1, qlearn_mod_random.py, line 32:

if random.random() < self.epsilon:
    minQ = min(q)
    mag = max(abs(minQ), abs(maxQ))
    # add random values to all the actions, recalculate maxQ
    q = [q[i] + random.random() * mag - .5 * mag for i in range(len(self.actions))]
    maxQ = max(q)

Why use this (versus qlearn.py)?
I restructured your code to make it more configurable, if you will pardon me. The link is mycode. The question above is still bothering me, and I would appreciate it very much if you could give an interpretation.
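One possible reading, not confirmed by the author: the snippet explores by adding uniform noise in [-mag/2, mag/2) to every Q-value and then taking the maximum, so exploration favours actions whose Q-values are already close to the best one. The sketch below contrasts that with plain epsilon-greedy, which is assumed here to be what qlearn.py does. The function names `choose_uniform` and `choose_noisy` and the example Q-values are hypothetical, and ties are broken by taking the first maximum rather than a random one, which is a simplification.

```python
import random
from collections import Counter


def choose_uniform(q, epsilon, rng):
    """Plain epsilon-greedy: with probability epsilon, pick any action uniformly."""
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return q.index(max(q))


def choose_noisy(q, epsilon, rng):
    """Noisy variant from the snippet: perturb every Q-value, then act greedily."""
    if rng.random() < epsilon:
        mag = max(abs(min(q)), abs(max(q)))
        # each value is shifted by uniform noise in [-mag/2, mag/2)
        q = [v + rng.random() * mag - 0.5 * mag for v in q]
    return q.index(max(q))


rng = random.Random(0)
q_values = [10.0, -10.0, 9.0]  # action 1 looks clearly bad, action 2 is a close second

# epsilon = 1.0 so that every call takes the exploration branch
uniform_counts = Counter(choose_uniform(q_values, 1.0, rng) for _ in range(10_000))
noisy_counts = Counter(choose_noisy(q_values, 1.0, rng) for _ in range(10_000))

print("uniform exploration:", dict(uniform_counts))
print("noisy exploration:  ", dict(noisy_counts))
```

With these values mag is 10, so the noise can shift a Q-value by less than 5 in either direction. Action 1 (Q = -10) can never overtake action 0 (Q = 10) and is never explored, while action 2 (Q = 9) is tried often. Uniform exploration would pick the clearly bad action about a third of the time. Whether this was the author's motivation is a guess.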