Skip to content

Commit e50392c

Browse files
authored
Update reinforcement_q_learning.py (#2867)
1 parent caa78dd commit e50392c

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

intermediate_source/reinforcement_q_learning.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -227,7 +227,7 @@ def forward(self, x):
227227
# This cell instantiates our model and its optimizer, and defines some
228228
# utilities:
229229
#
230-
# - ``select_action`` - will select an action accordingly to an epsilon
230+
# - ``select_action`` - will select an action according to an epsilon
231231
# greedy policy. Simply put, we'll sometimes use our model for choosing
232232
# the action, and sometimes we'll just sample one uniformly. The
233233
# probability of choosing a random action will start at ``EPS_START``

0 commit comments

Comments
 (0)