about the code #2

mazzzystar · 2017-04-15T03:38:25Z

In tutorial1, qlearn_mod_random.pyline 32:

if random.random() < self.epsilon:
            minQ = min(q)
            mag = max(abs(minQ), abs(maxQ))
            # add random values to all the actions, recalculate maxQ
            q = [q[i] + random.random() * mag - .5 * mag for i in range(len(self.actions))]
            maxQ = max(q)

why use this(versus qlearn.py)?

The text was updated successfully, but these errors were encountered:

mazzzystar · 2017-04-27T14:07:13Z

I reconstructed your code in a more configurable way if your pardon. The link is mycode, and the question above is still bother me, I appreciate so much if you can give an interpretation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about the code #2

about the code #2

mazzzystar commented Apr 15, 2017 •

edited

Loading

mazzzystar commented Apr 27, 2017

about the code #2

about the code #2

Comments

mazzzystar commented Apr 15, 2017 • edited Loading

mazzzystar commented Apr 27, 2017

mazzzystar commented Apr 15, 2017 •

edited

Loading