0 / 60 seg.

What is Q-learning reinforcement learning?