RL 2023: difference between revisions

Revision as of 12:34, 13 November 2023

Contents

1 Lecturers and Seminarists
2 About the course
3 Grading
4 Course materials
5 Recommended literature
6 Homeworks
7 Projects

Lecturers and Seminarists

Lecturer	Alexey Naumov	[anaumov@hse.ru]	T924
Seminarist	Sergey Samsonov	[svsamsonov@hse.ru]	T926

About the course

This page contains materials for the Mathematical Foundations of Reinforcement Learning course in the 2022/2023 academic year, an optional course for 2nd-year Master's students of the Math of Machine Learning program (HSE and Skoltech).

Grading

The final grade consists of 2 components (each is a non-negative real number from 0 to 10, without any intermediate rounding):

O_HW for the homework assignments
O_Project for the course project

The formula for the final grade is

O_Final = 0.6*O_HW + 0.4*O_Project

with the usual (arithmetic) rounding rule.
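
The formula above can be sketched in a few lines of Python. This is only an illustration, not an official grading script: the function name `final_grade` is hypothetical, and it assumes that the "usual" rounding rule means rounding halves up to the nearest integer (Python's built-in `round` rounds halves to the nearest even number, so it is not used here).

```python
from decimal import Decimal, ROUND_HALF_UP


def final_grade(o_hw: float, o_project: float) -> int:
    """Compute O_Final = 0.6*O_HW + 0.4*O_Project, rounded half up.

    Assumption: the final grade is rounded to an integer, with .5 going up.
    """
    for name, value in (("o_hw", o_hw), ("o_project", o_project)):
        if not 0 <= value <= 10:
            raise ValueError(f"{name} must be between 0 and 10, got {value}")

    # Decimal avoids binary floating-point error near the .5 boundary.
    raw = (Decimal("0.6") * Decimal(str(o_hw))
           + Decimal("0.4") * Decimal(str(o_project)))

    # No intermediate rounding: only the weighted sum is rounded.
    return int(raw.quantize(Decimal("1"), rounding=ROUND_HALF_UP))
```

For example, with O_HW = 7.5 and O_Project = 8 the weighted sum is 0.6*7.5 + 0.4*8 = 7.7, which rounds to 8 under this assumption.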

Course materials

Recommended literature

Sébastien Bubeck, Nicolò Cesa-Bianchi. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems. Chapter 2. http://sbubeck.com/SurveyBCB12.pdf
Richard S. Sutton, Andrew G. Barto. Reinforcement Learning: An Introduction. Chapter 2. http://incompleteideas.net/book/the-book-2nd.html
Botao Hao et al. Bootstrapping Upper Confidence Bound. https://arxiv.org/abs/1906.05247
Aleksandrs Slivkins. Introduction to Multi-Armed Bandits. Chapter 1. https://arxiv.org/abs/1904.07272

@@ Line 19: / Line 19: @@
 * O<sub>Final</sub> = 0.6*O<sub>HW</sub> + 0.4*O<sub>Project</sub>
 with the usual (arithmetical) rounding rule.
-[https://docs.google.com/spreadsheets/d/1MPWVIkgxyotHU-P5cE7Gik4C6RTWxTnAVK8Btl7Fw3Y/edit?usp=sharing '''Table with grades''']
 == Course materials ==
