Top "Reinforcement-learning" questions

Reinforcement learning is an area of machine learning and computer science concerned with how an agent should select actions in each state of an environment so as to maximize a cumulative numerical reward.

What is the difference between value iteration and policy iteration?

In reinforcement learning, what is the difference between policy iteration and value iteration? As far as I understand, in value …

machine-learning reinforcement-learning markov-models value-iteration
What is the difference between Q-learning and SARSA?

Although I know that SARSA is on-policy while Q-learning is off-policy, when looking at their formulas it's hard (for me) …

artificial-intelligence reinforcement-learning q-learning sarsa
Training a Neural Network with Reinforcement learning

I know the basics of feedforward neural networks, and how to train them using the backpropagation algorithm, but I'm looking …

algorithm language-agnostic machine-learning neural-network reinforcement-learning
What is a policy in reinforcement learning?

I've seen descriptions such as: A policy defines the learning agent's way of behaving at a given time. Roughly speaking, …

machine-learning terminology reinforcement-learning markov-decision-process
How can I apply reinforcement learning to continuous action spaces?

I'm trying to get an agent to learn the mouse movements necessary to best perform some task in a reinforcement …

algorithm machine-learning reinforcement-learning q-learning
TensorFlow and Multiprocessing: Passing Sessions

I have recently been working on a project that uses a neural network for virtual robot control. I used TensorFlow …

python parallel-processing multiprocessing tensorflow reinforcement-learning
Optimal epsilon (ϵ-greedy) value

ϵ-greedy policy: I know the Q-learning algorithm should try to balance exploration and exploitation. Since I'm a beginner in …

machine-learning reinforcement-learning q-learning
What is the way to understand Proximal Policy Optimization Algorithm in RL?

I know the basics of Reinforcement Learning, but which terms do I need to understand to be able to read the arXiv PPO …

machine-learning reinforcement-learning
What is the difference between Q-learning and Value Iteration?

How is Q-learning different from value iteration in reinforcement learning? I know Q-learning is model-free and training samples are transitions (…

machine-learning artificial-intelligence reinforcement-learning q-learning