Reinforcement learning 2022 2023 — различия между версиями

Материал из Wiki - Факультет компьютерных наук
Перейти к: навигация, поиск
Строка 22: Строка 22:
 
[https://docs.google.com/spreadsheets/d/1MPWVIkgxyotHU-P5cE7Gik4C6RTWxTnAVK8Btl7Fw3Y/edit?usp=sharing '''Table with grades''']
 
[https://docs.google.com/spreadsheets/d/1MPWVIkgxyotHU-P5cE7Gik4C6RTWxTnAVK8Btl7Fw3Y/edit?usp=sharing '''Table with grades''']
  
== Lectures ==
+
== Course materials ==
 
*[https://www.dropbox.com/s/a69ql9duo5jf5gt/Math%20of%20RL%20Lecture%201.pdf?dl=0 ''' Lecture 09.11''']
 
*[https://www.dropbox.com/s/a69ql9duo5jf5gt/Math%20of%20RL%20Lecture%201.pdf?dl=0 ''' Lecture 09.11''']
 
*[https://www.dropbox.com/s/7zkirk1xykua890/Math_of_RL_Le%20cture_2.pdf?dl=0 ''' Lecture 16.11''']
 
*[https://www.dropbox.com/s/7zkirk1xykua890/Math_of_RL_Le%20cture_2.pdf?dl=0 ''' Lecture 16.11''']

Версия 19:05, 11 ноября 2022

Lecturers and Seminarists

Lecturer Alexey Naumov [anaumov@hse.ru] T924
Seminarist Sergey Samsonov [svsamsonov@hse.ru] T926

About the course

This page contains materials for Mathematical Foundations of Reinforcement learning course in 2022/2023 year, optional one for 2nd year Master students of the Math of Machine Learning program (HSE and Skoltech).

Grading

The final grade consists of 2 components (each is non-negative real number from 0 to 10, without any intermediate rounding) :

  • OHW for the hometasks
  • OProject for the course project

The formula for the final grade is

  • OFinal = 0.6*OHW + 0.4*OProject

with the usual (arithmetical) rounding rule.

Table with grades

Course materials

Seminars

Recommended literature

Lecture and seminar 09.11

Lecture and seminar 16.11

Homeworks

Projects