Into to DataMining and Machine Learning 2020 2021 — различия между версиями
Machine (обсуждение | вклад) (→Practice on 4 April 2021) |
Machine (обсуждение | вклад) (→Practice on 6 April 2021) |
||
Строка 58: | Строка 58: | ||
=== Practice on 6 April 2021 === | === Practice on 6 April 2021 === | ||
− | Practice. Scikit-learn tutorial on kNN, Decision Trees, Logistic Regression, SVM etc. | + | Practice. Scikit-learn tutorial on kNN, Decision Trees, NaÏveBayes, Logistic Regression, SVM etc. |
Версия 19:02, 11 июня 2021
Lecturer: Dmitry Ignatov
TA: Stefan Nikolić
Содержание
Homeworks
- Homework 1: Spectral Clustering
- Homework 2:
- Homework 3: Recommender Systems
Lecture on 16 January 2021
Intro slides. Course plan. Assessment criteria. ML&DM libraries. What to read and watch?
Practice: demonstration with Orange.
Lecture on 26 January 2021
Classification (continued). Quality metrics. ROC curves.
Practice: demonstration with Orange.
Lecture on 2 February 2021
Introduction to Clustering. Taxonomy of clustering methods. K-means. K-medoids. Fuzzy C-means. Types of distance metrics. Hierarchical clustering. DBScan
Practice: DBScan Demo.
Lecture on 09 February 2021
- Introduction to Clustering (continued). Density-based techniques. DBScan and Mean-shift.
- Graph and spectral clustering. Min-cuts and normalized cuts. Laplacian matrix. Fiedler vector. Applications.
Practice on 16 Feb 2021
Clustering with scikit-learn (k-means, hierarchical clustering, DBScan, MeanShift, Spectral Clustering).
Lecture on 2 March 2021
Practice: Spectral clustering.
Lecture: Decision tree learning. ID3. Information Entropy. Information gain. Gini coefficient and index. Overfitting and pruning. Decision trees for numeric data. Oblivious decision trees. Regression trees.
Lecture on 9 March 2021
Frequent Itemsets. Association Rules. Algorithms: Apriori, FP-growth. Interestingness measures. Closed and maximal itemsets.
Lecture + Practice on 16 March 2021
Frequent Itemset Mining (continued). Applications: 1) Taxonomies of Website Visitors and 2) Web advertising.
Exercises. Frequent Itemsets. FP-growth. Closed itemsets.
Practice. Orange, SPMF, Concept Explorer.
Practice on 6 April 2021
Practice. Scikit-learn tutorial on kNN, Decision Trees, NaÏveBayes, Logistic Regression, SVM etc.