Back to Freecodecamp

Reinforcement Learning With Q-Learning: Example

curriculum/challenges/english/blocks/tensorflow/5e8f2f13c4cdbe86b5c72da5.md

latest395 B
Original Source

--questions--

--text--

Fill in the blanks to complete the following Q-Learning equation:

py
Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])

--answers--

A: state

B: action

C: next_state


A: state

B: action

C: prev_state


A: state

B: reaction

C: next_state

--video-solution--

1