Optimal Policy Archives - EITCA Academy

How does the Q-learning algorithm work?

Monday, 03 June 2024 by asadeghp

Q-learning is a type of reinforcement learning algorithm that was first introduced by Watkins in 1989. It is designed to find the optimal action-selection policy for any given finite Markov decision process (MDP). The goal of Q-learning is to learn the quality of actions, which is represented by the Q-values. These Q-values are used to

Published in Artificial Intelligence, EITC/AI/ARL Advanced Reinforcement Learning, Introduction, Introduction to reinforcement learning

Tagged under: Artificial Intelligence, Bellman Equation, Model-Free, Optimal Policy, Q-learning, Reinforcement Learning

EITCA Academy

SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

FORGOT YOUR DETAILS?

CREATE ACCOUNT

How does the Q-learning algorithm work?