Popular "q-learning" questions

Although I know that SARSA is on-policy while Q-learning is off-policy, when looking at their formulas it's hard (to me) …

artificial-intelligence reinforcement-learning q-learning sarsa

I'm trying to get an agent to learn the mouse movements necessary to best perform some task in a reinforcement …

algorithm machine-learning reinforcement-learning q-learning

ϵ-greedy policy I know the Q-learning algorithm should try to balance between exploration and exploitation. Since I'm a beginner in …

machine-learning reinforcement-learning q-learning

How is Q-learning different from value iteration in reinforcement learning? I know Q-learning is model-free and training samples are transitions (…

machine-learning artificial-intelligence reinforcement-learning q-learning

I'm in a course called "Intelligent Machines" at the university. We were introduced with 3 methods of reinforced learning, and with …

machine-learning reinforcement-learning q-learning temporal-difference

I understand that epsilon marks the trade-off between exploration and exploitation. At the beginning, you want epsilon to be high …

machine-learning reinforcement-learning q-learning

What is the difference between deep reinforcement learning and reinforcement learning? I basically know what reinforcement learning is about, but …

machine-learning reinforcement-learning q-learning

I'm using the DQN algorithm to train an agent in my environment, that looks like this: Agent is controlling a …

tensorflow deep-learning reinforcement-learning q-learning

Is the classic Q-learning algorithm, using lookup table (instead of function approximation), equivalent to dynamic programming?

machine-learning dynamic-programming reinforcement-learning q-learning

The difference between Q-learning and SARSA is that Q-learning compares the current state and the best possible next state, whereas …

reinforcement-learning q-learning sarsa

Top "Q-learning" questions