Reinforcement learning 2021 2022: differences between revisions

Revision as of 23:12, 9 November 2021

Contents

1 Lecturers and Seminarists
2 About the course
3 Grading
4 Lectures
5 Seminars
6 Homeworks
7 Projects
8 Recommended literature (1st term)

Lecturers and Seminarists

Lecturer	Naumov Alexey	[anaumov@hse.ru]	T924
Lecturer	Denis Belomestny	[dbelomestny@hse.ru]	T924
Seminarist	Samsonov Sergey	[svsamsonov@hse.ru]	T926
Seminarist	Maxim Kaledin	[mkaledin@hse.ru]	T926

About the course

This page contains materials for the Mathematical Foundations of Reinforcement Learning course in the 2021/2022 academic year, an optional course for 2nd-year Master's students of the Math of Machine Learning program (HSE and Skoltech).

Grading

The final grade consists of 2 components (each is a non-negative real number from 0 to 10, without any intermediate rounding):

O_HW for the homework assignments
O_Project for the course project

The formula for the final grade is

O_Final = 0.5*O_HW + 0.5*O_Project

with the usual (arithmetic) rounding rule.
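As an illustration, the formula above can be sketched in Python. This is a minimal sketch under two assumptions not stated on this page: that the final grade is rounded to an integer, and that "arithmetic rounding" means halves are rounded up. The function name `final_grade` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def final_grade(o_hw: float, o_project: float) -> int:
    """Combine O_HW and O_Project with equal weights, then round half up.

    Assumption: the final grade is an integer and halves round up.
    """
    for name, value in (("O_HW", o_hw), ("O_Project", o_project)):
        if not 0 <= value <= 10:
            raise ValueError(f"{name} must lie in [0, 10], got {value}")
    # Decimal(str(...)) avoids binary floating-point artifacts near .5 boundaries.
    raw = Decimal("0.5") * Decimal(str(o_hw)) + Decimal("0.5") * Decimal(str(o_project))
    # The built-in round() rounds halves to the nearest even integer,
    # so the rounding mode is set explicitly here.
    return int(raw.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


print(final_grade(8.0, 9.0))  # raw value 8.5, prints 9
```

Note that the components themselves are not rounded before they are combined, which matches the "without any intermediate rounding" rule above.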

Table with grades

Lectures

Lecture 09.11

Seminars

Seminar 09.11; Seminar 09.11, Video; Seminar 09.11, Notebook

Homeworks

Projects

Recommended literature (1st term)

http://www.statslab.cam.ac.uk/~james/Markov/ - Cambridge lecture notes on discrete-time Markov chains
https://link.springer.com/book/10.1007%2F978-3-319-97704-1 - book by E. Moulines et al.; chapters 1, 2, 7 and 9 are the most relevant (the book can be downloaded through the HSE network)
https://link.springer.com/book/10.1007%2F978-3-319-62226-2 - Stochastic Calculus by P. Baldi, a good overview of conditional probabilities and expectations (part 4; also accessible through the HSE network)
https://link.springer.com/book/10.1007%2F978-1-4419-9634-3 - Probability for Statistics and Machine Learning by A. Dasgupta, chapter 19 (MCMC); also accessible through the HSE network

@@ Line 30: / Line 30: @@
 == Seminars ==
-*[https://www.dropbox.com/s/i5g7a1pnbsnwclm/Seminar_18_09.pdf?dl=0 '''Seminar 09.11'''], [https://www.dropbox.com/s/i5g7a1pnbsnwclm/Seminar_18_09.pdf?dl=0 '''Seminar 09.11, Video''']
+*[https://www.dropbox.com/s/wc951vseud1q1p2/Seminar_09_11_RL.pdf?dl=0 '''Seminar 09.11'''], [https://www.dropbox.com/s/2h83vbjgew1inen/Seminar_1_RL.mp4?dl=0 '''Seminar 09.11, Video'''], [https://www.dropbox.com/s/bxa8h9vjrnegsql/Bandit_intro_strategies_09_11_2021.ipynb?dl=0 '''Seminar 09.11, Notebook''']
 ==Homeworks ==
