Data Science for Business 2020
Содержание
About the Course
Data Science for Business. MAGoLEGO course.
Spring 2020. Module 4.
Department of Data Analysis and Artificial Intelligence, School of Computer Science.
Join our telegram channel Data science for business.
Instructors
Links
- Alternative Course website [weblink]
- Lectures link https://zoom.us/j/7723819319 Fridays, 6.10pm - 7.30pm
- Seminars link https://zoom.us/j/636910206 Fridays, 7.40pm - 9.00pm
Course outline
- Introduction to data science
- Data mining, statistics, machine learning, optimization
- Case studies
- Increasing business impact
Content
№ | Date | Title | Abstract |
---|---|---|---|
1 | 10.04.2020 | Introduction to data science. | Introduction to data science and its role in industry. Examples of real world use cases. |
2 | 17.04.2020 | Working with data. | Data cleaning and preparation. ETL process. Basic data analysis and visualization. |
3 | 24.04.2020 | Data mining, machine learning, statistics | Types of ML algorithms, applicability, training and testing, solution quality. |
4 | 15.05.2020 | Case study 1: Pricing | The goal of the case is to compute price elasticity.
Algorithms: Supervised learning: linear and non-linear regression, predicting continuous variable. Dimensionality reduction: PCA. |
5 | 22.05.2020 | Case study 2: Churn modeling | The goal of the case is to predict which customers are going to leave the service within a given time.
Algorithms: Supervised learning. Classification: Logistic regression, Decision trees, Random forest. |
6 | 29.05.2020 | Case study 3: Customer segmentation | The goal of the case is to group customers into clusters based on some customer similarity metrics.
Algorithms: clustering – k-means, agglomerative, dimensionality reduction - PCA. |
7 | 05.06.2020 | Case study 4: Personalizaton | The goal of the case is to build a recommender system.
Algorithms: association rules and collaborative filtering. |
8 | 12.06.2020 | Case study 5. Demand forecasting | The goal of the case is to develop demand forecasting model.
Algorithms: ARIMA, sliding window regression. |
9 | 19.06.2020 | Case study 6. Fraud detection | The goal of the case is to find abnormal customer transactions.
Algorithms: anomaly detection. |
10 | 26.06.2020 | Impacting the business | How to create a visible impact on business with analytics |
Seminar's materials
Seminar 2. video., RM processes COVID regression, COVID, Fisher's Iris, COVID days since 50 confirmed cases, Iris depivot example
Home assignments
1. Analyze COVID dataset
- Home assignment description.
- Starter process.
- Total Cases, Deaths, Recovered on April 19, 2020.
- Google form to submit your solution.
Textbooks
- Provost, Foster, Fawcett, Tom. Data Science for Business: What you need to know about data mining and data-analytic thinking. O'Reilly Media, Inc.", 2013.
- James, G. et al. An introduction to statistical learning. Springer, 2013.
- Siegel, E. Predictive analytics: The power to predict who will click, buy, lie, or die. John Wiley & Sons, 2016.
Software
- For online lectures and seminars. zoom
- Modelling package. RapidMiner
Apply for educational version https://rapidminer.com/get-started-educational/
- Email: Enter your university email (end with @edu.hse.ru)
- Job Function: Student
- University: Higher School of Economics
- Course Name: Data Science for Business
- Course Number: https://www.hse.ru/edu/courses/341840822
- Course Term: Summer Term
- Professor: Leonid Zhukov