Reinforcement Learning and Sequential Decision-making - AIDA - AI Doctoral Academy

Reinforcement Learning and Sequential Decision-making

Level

Intermediate, Broad, Algorithmic, Methodological.

This topic covers the study and design of machine learning algorithms for online learning, multi-armed bandits and reinforcement learning (RL).

Reinforcement Learning and Sequential Decision-making

Learning outcomes

Content /
Knowledge

<img data-src='https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png' class='lazyload' src='data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw=='><noscript><img src="https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png">

Students should be able to:

Understand the difference between online and batch learning.
Describe the main online learning algorithms and understand the analysis of their performance.
Understand the multi-armed bandit problem, describe the main algorithms, and understand the analysis of their performance.
Understand the goal of reinforcement learning and the mathematical MDP model.
Describe the basic evaluation criteria for RL: finite, infinite, and discounted horizon.
Describe the main algorithms for model-based RL and understand their performance guarantees.
Describe the main algorithms for model-free RL and understand their performance guarantees.
Understand value function approximation and deep RL.

Methodological
skills

<img data-src='https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png' class='lazyload' src='data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw=='><noscript><img src="https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png">

Students should be able to:

Design RL solutions for new problems using a correct MDP abstraction.
Implement RL algorithms taking advantage of available libraries and simulation environments.
Evaluate the accuracy of the derived solutions in a systematic way, using available benchmarks and considering different performance metrics.

Transferrable/
Application

<img data-src='https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png' class='lazyload' src='data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw=='><noscript><img src="https://www.i-aida.org/wp-content/themes/twentytwentyone-child/images/car-next.png">

Students should be able to:

Work effectively with others in an interdisciplinary and/or international team.
Design and manage individual projects.
Clearly and succinctly communicate their ideas to technical audiences.

AIDA courses and other online courses covering this subject

2024. 09. 10

Computational Intelligence – Deep Reinforcement Learning

PREVIOUS: Machine Learning Theory

NEXT: Distributed and Federated Learning

AIDA may use cookies to memorize your login data, collect statistics to optimize the website’s functionality and to carry out marketing actions based on your interests.You can customize used cookies in .