Machine Learning 1 DSBA 2025/2026 — различия между версиями

Материал из Wiki - Факультет компьютерных наук
Перейти к: навигация, поиск
(Useful links)
(Useful links)
Строка 32: Строка 32:
 
== Useful links ==
 
== Useful links ==
  
# [https://edu.hse.ru/course/view.php?id=252964 '''Smart LMS''']: for posting weekly material, additional videos, calendar, and for posting/collecting/grading quizzes, tests, and HW, etc.
+
# [https://edu.hse.ru/course/view.php?id=252964 '''Smart LMS''']: for posting weekly material, grading quizzes, tests, and HW, etc.
 
# [https://drive.google.com/drive/folders/1IKcKL5JxnW_XPDFXA-DUFiuzCwUzphiB?usp=sharing '''Google Drive''']: for release of seminar Colab notebooks, lecture presentation slides, Starter Colab files for Kaggle competitions.
 
# [https://drive.google.com/drive/folders/1IKcKL5JxnW_XPDFXA-DUFiuzCwUzphiB?usp=sharing '''Google Drive''']: for release of seminar Colab notebooks, lecture presentation slides, Starter Colab files for Kaggle competitions.
 
# [https://colab.research.google.com/ '''Google Colaboratory''']: for individual manual-graded assignments, group Kaggle assignments and reproducible seminar’s notebooks.
 
# [https://colab.research.google.com/ '''Google Colaboratory''']: for individual manual-graded assignments, group Kaggle assignments and reproducible seminar’s notebooks.

Версия 13:12, 30 августа 2025

Course Syllabus

DSBA Machine Learning 1, ICEF Machine Learning 2025-2026. This syllabus is shared by 2 programs (differences specified where necessary):

Teachers and Assistants

Role DSBA 231 DSBA 232 DSBA 233 DSBA 234 ICEF
Lecturers Alexey Boldyrev, Maksim Karpov, Lecturers' TA Georgiy Solovev
Seminarists Kirill Bykov Sara Ali Aleksandr Khizhik Vsevolod Ovchinnikov Majid Sohrabi
Teaching Assistants Nikolay Dvoryanchikov Adamey Laipanov Stanislav Ryazanov Maria Sudakova Isa Gadaev

Useful links

  1. Smart LMS: for posting weekly material, grading quizzes, tests, and HW, etc.
  2. Google Drive: for release of seminar Colab notebooks, lecture presentation slides, Starter Colab files for Kaggle competitions.
  3. Google Colaboratory: for individual manual-graded assignments, group Kaggle assignments and reproducible seminar’s notebooks.
  • We require the use of LaTeX and Markdown syntax for all write ups.
  1. Kaggle.com: for data science competitions in teams of 1-3 students.
  • Ensure that your name & surname match exactly to those in Smart LMS or we can lose you in grade matching. Update the profile with your presentable photo to help us authenticate you and credit your effort.
  1. Telegram Сhat: for the course announcements (HW, timetable changes, etc.).

Course Description

This course introduces the students to the elements of machine learning, including supervised and unsupervised methods such as linear and logistic regressions, splines, decision trees, support vector machines, bootstrapping, random forests, boosting, regularized methods.

  • The first two modules (Sep-Dec`25) DSBA and ICEF students apply Python programming language and popular packages to investigate/visualize datasets and develop machine learning models that solve theoretical and data-driven problems.
  1. The course aims to help the students to develop an understanding of learning from data, to familiarize them with a wide variety of algorithmic and model based methods to extract information from data, teach to apply and evaluate suitable methods to various datasets by model selection and predictive performance evaluation.
  2. DSBA and ICEF students: the course is designed to prepare DSBA/ICEF students for the upcoming University of London (UoL) examination.

Grading System

Grades:

First Semester Grade = 0.15 * Final Test + 0.15 * Hackathon (Exam) + 0.25 * Kaggle competitions + 0.1 * Home assignments + 0.15 * Midterm Test + 0.2 * Quizzes

Relevant grading formulas are presented on the official course home page (DSBA Machine Learning 1, ICEF Machine Learning). Moodle’s gradebook shows your up to date performance, including your current constituent and aggregate grades. If you suspect an error in grade calculation, please let us know ASAP. We use natural grade aggregation in LMS. Rounding to the nearest integer is used to report 0-10 scale grades to HSE. There are no blocking grading components at the DSBA program, at the ICEF program the Exam in April is blocking.

Homework (HW) Assignment

  • HW is released via Moodle/Smart LMS and is also announced in the Telegram Channel.
  • Individual: These are individualized (not in groups!) assignments of two kinds
    • Auto-graded assignments (thanks to STACK plugin with Maxima CAS backend). Some written responses will be selectively hand-graded by TAs.
    • Quick Maxima syntax is provided below and in each assignment.
    • Manual-graded assignments include analysis of datasets, analytical and conceptual problems, and programming assignments.
    • Submit HW via Moodle/Smart LMS as both shared links to Google Colab and derived PDF.
      • All text explanations must be written in Markdown cells directly in Google Colab notebook.
      • Graders leave feedback in PDF and execute Google Colab to reproduce your results.
  • Group: Students are self-assigned into groups to compete on Kaggle.com. Teamwork is evaluated by instructors. Team’s grade is awarded to everyone on a team.

Midterm Test and Exam

  1. We will have a cumulative in-class midterm test/exam in the middle and at the end of the semester (during the HSE examination sessions). Do not book travel tickets that conflict with test dates.
  2. Moodle/Smart LMS based.
  3. Tests and exams are individual, i.e. no collaboration. Generative models, web searching, lecture and/or seminar materials, and textbooks are not allowed.
  4. Test questions are drawn from quiz banks, not HW. HW deepens your understanding, but a test measures it.
  5. There is a free navigation between questions, i.e. you can move back or forward the test questions.
  6. Results will be announced after the midterm test/exam within 5 working days.

University of London (UoL), Course ST3189 (ML)

1. Coursework Project in Python (or R) programming language is for DSBA/ICEF students only and is administered by LSE/UoL. It is released around January and is due around April 1\. Although students are given a 3-4 months window, this exercise is meant to be completed in a few days. Typically, students work on it in Feb/Mar. More details on the UoL site.

5-10 minute Quizzes during seminar in LMS \- biweekly

  1. Only students present in the classroom during the seminar are allowed to take the quiz. At the beginning of the seminar the seminar assistant will remove absent student(s) from the group list in moodle settings for the current quiz.
  2. Quizzes are based on lectures, seminars, textbooks, posted videos, and other material delivered via our course.
  3. Quizzes individualized (shuffled and sampled from question banks) for each student. Most questions are auto-generated.
  4. All choice questions are multiple-choice (regardless of singular/plural formulation). Incorrect answers lower your score to prevent guessing.
  5. Numeric answers are typically accepted with 0.01=1% of error, i.e. round to at least 4 decimal places, if needed. Please do not round any intermediate calculations. It’s best to use as many decimals as fits in the answer box.
  6. The quiz answers are released after all groups write the quiz .
  7. We always use natural logarithm (inverse of exp()) in this course.

Deadline Extensions and Makeup

  1. Only valid verifiable excuses are accepted for 1-2 day extensions.
    • DSBA and ICEF students: submit your doctor's note via student services.
    • Contact the instructors in discourse regarding your illness in order to verify it via study office
  2. If you miss a deadline (with a valid/verifiable excuse), contact instructors ASAP in a private post to arrange a new deadline.
  3. Submissions are not allowed after the solutions have been released.
  4. Any test/exam retakes will be rescheduled as per university policy (see also dedicated rubric below)
  5. Note: accommodating exceptions is difficult and time consuming. Typically, a verifiable medical emergency is a valid reason, but travel and conferences are not. It is the student's responsibility to start their work early, so as to hedge against any unforeseeable life event.