Modern Data Analysis 2021 2022 — различия между версиями

Материал из Wiki - Факультет компьютерных наук
Перейти к: навигация, поиск
(Новая страница: «== Course: Modern Data Analysis (2021–2022) == '''Lecturer:''' Dmitry Ignatov '''TA:''' TBA All the material are available via our telegram channel. '''Fin…»)
 
(Homeworks)
 
(не показаны 3 промежуточные версии этого же участника)
Строка 14: Строка 14:
 
* Homework 1: Classification
 
* Homework 1: Classification
  
 +
'''Announced''': 11.10.2021
  
=== Lecture 1===
+
'''Soft deadline''': changed to 08.11.2021 due to pandemic regulations (before it was 03.11.2021)
 +
 
 +
'''Hard deadline''': 10.11.2021
 +
 
 +
=== Lectures ===
 +
 
 +
==== Lecture 1====
  
 
Intro slides. Course plan. Assessment criteria. ML&DM libraries. What to read and watch?
 
Intro slides. Course plan. Assessment criteria. ML&DM libraries. What to read and watch?
Строка 21: Строка 28:
 
Practice: demonstration with Orange.
 
Practice: demonstration with Orange.
  
=== Lecture 2===
+
==== Lecture 2====
  
 
Classification. One-rule. Naïve Bayes. kNN. Logistic Regression. Train-test split and cross-validation. Quality Metrics (TP, FP, TN, FN, Precision, Recall, F-measure, Accuracy).
 
Classification. One-rule. Naïve Bayes. kNN. Logistic Regression. Train-test split and cross-validation. Quality Metrics (TP, FP, TN, FN, Precision, Recall, F-measure, Accuracy).
Строка 27: Строка 34:
 
Practice: demonstration with Orange.
 
Practice: demonstration with Orange.
  
=== Lecture 3===
+
==== Lecture 3====
  
 
Classification (continued). Quality metrics. ROC curves.  
 
Classification (continued). Quality metrics. ROC curves.  
Строка 33: Строка 40:
 
Practice: demonstration with Orange.
 
Practice: demonstration with Orange.
  
=== Seminar 1===
+
==== Seminar 1====
  
 
Classification  
 
Classification  
Строка 40: Строка 47:
  
  
=== Lecture 4===
+
==== Lecture 4====
  
 
Introduction to Clustering. Taxonomy of clustering methods. K-means. K-medoids. Fuzzy C-means. Types of distance metrics. Hierarchical clustering. DBScan
 
Introduction to Clustering. Taxonomy of clustering methods. K-means. K-medoids. Fuzzy C-means. Types of distance metrics. Hierarchical clustering. DBScan
  
 
Practice: DBScan Demo.
 
Practice: DBScan Demo.

Текущая версия на 14:34, 1 ноября 2021

Course: Modern Data Analysis (2021–2022)

Lecturer: Dmitry Ignatov

TA: TBA

All the material are available via our telegram channel.

Final mark formula: FM = 0.8 Homeworks + 0.2 Exam (under voting)


Homeworks

  • Homework 1: Classification

Announced: 11.10.2021

Soft deadline: changed to 08.11.2021 due to pandemic regulations (before it was 03.11.2021)

Hard deadline: 10.11.2021

Lectures

Lecture 1

Intro slides. Course plan. Assessment criteria. ML&DM libraries. What to read and watch?

Practice: demonstration with Orange.

Lecture 2

Classification. One-rule. Naïve Bayes. kNN. Logistic Regression. Train-test split and cross-validation. Quality Metrics (TP, FP, TN, FN, Precision, Recall, F-measure, Accuracy).

Practice: demonstration with Orange.

Lecture 3

Classification (continued). Quality metrics. ROC curves.

Practice: demonstration with Orange.

Seminar 1

Classification

Practice: scikit-learn.


Lecture 4

Introduction to Clustering. Taxonomy of clustering methods. K-means. K-medoids. Fuzzy C-means. Types of distance metrics. Hierarchical clustering. DBScan

Practice: DBScan Demo.