Reinforcement learning 2021 2022

Contents

1 Lecturers and Seminarists
2 About the course
3 Grading
4 Lectures
5 Seminars
6 Homeworks
7 Projects
8 Recommended literature (1st term)

Lecturers and Seminarists

Lecturer	Naumov Alexey	[anaumov@hse.ru]	T924
Lecturer	Denis Belomestny	[dbelomestny@hse.ru]	T924
Seminarist	Samsonov Sergey	[svsamsonov@hse.ru]	T926
Seminarist	Maxim Kaledin	[mkaledin@hse.ru]	T926

About the course

This page contains materials for the Mathematical Foundations of Reinforcement Learning course in the 2021/2022 academic year, an optional course for 2nd-year Master's students of the Math of Machine Learning program (HSE and Skoltech).

Grading

The final grade consists of 2 components (each is a real number from 0 to 10, with no intermediate rounding):

O_HW for the homework assignments
O_Project for the course project

The formula for the final grade is

O_Final = 0.5*O_HW + 0.5*O_Project

with the usual (arithmetic) rounding rule.
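The grading formula can be sketched in a few lines of Python. This is an illustration only, not an official grade calculator: the function name `final_grade` is hypothetical, and it assumes that "usual arithmetic rounding" means rounding halves up to the nearest integer on the 10-point scale. Python's built-in `round` rounds halves to the nearest even number, so the sketch uses the `decimal` module instead.

```python
from decimal import Decimal, ROUND_HALF_UP


def final_grade(o_hw: float, o_project: float) -> int:
    """Compute O_Final = 0.5*O_HW + 0.5*O_Project, rounded half up.

    Assumption: the final grade is an integer and ties (x.5) round up.
    """
    for name, value in (("O_HW", o_hw), ("O_Project", o_project)):
        if not 0 <= value <= 10:
            raise ValueError(f"{name} must lie in [0, 10], got {value}")

    # No intermediate rounding: both components enter the sum as given.
    raw = (Decimal("0.5") * Decimal(str(o_hw))
           + Decimal("0.5") * Decimal(str(o_project)))

    # Round only once, at the very end.
    return int(raw.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


if __name__ == "__main__":
    print(final_grade(6, 7))      # raw value 6.5
    print(final_grade(6.4, 6.5))  # raw value 6.45
```

Under these assumptions, components of 6 and 7 give a raw value of 6.5 and a final grade of 7, while 6.4 and 6.5 give 6.45 and a final grade of 6.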

Table with grades

Lectures

Seminars

Seminar 09.11 (Video, Notebook)

Homeworks

Projects

Recommended literature (1st term)

http://www.statslab.cam.ac.uk/~james/Markov/ - Cambridge lecture notes on discrete-time Markov chains
https://link.springer.com/book/10.1007%2F978-3-319-97704-1 - book by E. Moulines et al.; chapters 1, 2, 7 and 9 are the most relevant (available for download through the HSE network)
https://link.springer.com/book/10.1007%2F978-3-319-62226-2 - Stochastic Calculus by P. Baldi; a good overview of conditional probabilities and expectations (part 4; also available through the HSE network)
https://link.springer.com/book/10.1007%2F978-1-4419-9634-3 - Probability for Statistics and Machine Learning by A. DasGupta; chapter 19 (MCMC) (also available through the HSE network)
