SAS Technologies for Data Mining

Материал из Wiki - Факультет компьютерных наук
Перейти к: навигация, поиск

Schedule

Link for all lectures and seminars: https://teams.microsoft.com/l/meetup-join/19%3abt3YFk3CdUF4vwyWQsW_Jb6FdHUCzgLG_hAumdyaxfw1%40thread.tacv2/1641804643575?context=%7b%22Tid%22%3a%2221f26c24-0793-4b07-a73d-563cd2ec235f%22%2c%22Oid%22%3a%22afe1666c-56bf-4dfa-88bb-62652330bdca%22%7d


Schedule from 10.01.2022 to 28.02.2022 (incl.):

every Monday, 14:40 - 16:00 - lecture (online), Dmitriy Ilvovskiy.

every Monday, 16:20 - 17:40 - seminar (online), Dmitriy Ilvovskiy.


Schedule from 05.03.2022 to 02.04.2022 (incl.):

every Saturday, 14:40 - 16:00 - lecture (online), Nataliya Titova.

every Saturday, 16:20 - 17:40 - seminar (online), Nataliya Titova.


About this course

This course is aimed to study the basics of data analysis using SAS software. These knowledge and skills are necessary in the professional activities of specialists in mathematical modeling and computer science.

As a result of studying the course, students should:

  • Know the basics of the SAS Base language and be able to write and understand simple programs in this language;
  • Study the basics of macro programming in SAS Base;
  • Understand the principles of application of the main statistical methods of data analysis using SAS Software;
  • Be able to launch and analyze the results of the implementation of the main statistical methods of data analysis on the SAS platform;
  • Know the list of basic data analysis methods implemented on the SAS platform.


Course description (HSE) - https://www.hse.ru/edu/courses/470995302

Tutors Dmitriy Ilvovskiy, Nataliya Titova
Credits 5
Supervised study (h.) 60
Self-study (h.) 130
Year 3
Format full-time


additional links:

Curriculum

# Topic
1 SAS analytical platform. Technology review.
2 SAS BASE programming language
2.1 SAS BASE programming
2.2 Macros, SQL
3 Statistical analysis using SAS STAT
3.1 SAS STAT intro, ANOVA
3.2 Linear regression
3.3 Logistic regression
3.4 Generalized linear models
3.5 Data visualization
3.6 Clustering methods
3.7 Decision tree
3.8 Ensemble methods. Random forest, boosting.
3.9 Non-linear models. Neural networks.

Educational material

10.01.2022, lecture - https://drive.google.com/drive/folders/1GnyXdsWVD3-CH1ikGjx7Y_JG31kBy5Bf?usp=sharing

17.01.2022, lecture - https://drive.google.com/drive/folders/1enuEVq9WSnF_5OI84yPt8IzsgQVUK50M?usp=sharing

17.01.2022, seminar - https://drive.google.com/drive/folders/1H6ncIQ-5J6aOT9MOLbFSYRPaEvIVNUOp?usp=sharing

24.01.2022, lecture - https://drive.google.com/drive/folders/1haGmYEdCjEri1PJLySY8pg74Y_oJxu9w?usp=sharing

24.01.2022, seminar - https://drive.google.com/drive/folders/1PCOrUHDhRbPFqhsKOEr83UWoC0d-aZs8?usp=sharing

31.01.2022, lecture - https://drive.google.com/drive/folders/1pJ3A1KC96oad8H5Xds1crX0fa2Tevu7h?usp=sharing

31.01.2022, seminar - https://drive.google.com/drive/folders/1_nCn2tx9-WCKBysS8I9VwoPT-wdCUyA1?usp=sharing

07.02.2022, lecture - https://drive.google.com/drive/folders/11S4cbFa4YfKM9ofcDBIIfy3neIcIp58e?usp=sharing

07.02.2022, seminar - https://drive.google.com/drive/folders/1xQxOvC4aQYm3KGibsFLC8bdg6OoY6l0-?usp=sharing

14.02.2022, lecture - https://drive.google.com/drive/folders/1t-f2WGNl5nvO13lzxE35AdRMOVChveJF?usp=sharing

14.02.2022, seminar - https://drive.google.com/drive/folders/17jc61xRwWdoiMHwaEHIBzLfnvKNuIrhx?usp=sharing

21.02.2022, lecture - https://drive.google.com/drive/folders/1JXQRpAT9mj7xQPwz1CGn27i0-TNApSfU?usp=sharing

21.02.2022, seminar - https://drive.google.com/drive/folders/1RuMCB0VZrGnODQP7JsX9rnU1PAEByuyr?usp=sharing

SAS software

SAS environment: students are supposed to do tasks with SAS Viya. All software tools are cloud-based, it is enough to have an Internet browser.

SAS environment instruction and your personal access login and password will be sent to your student e-mails (in the "edu.hse.ru" domain).

Students who in-time and successfully do all tasks using SAS, may attend to free online courses on additional topics.

If you are interested in it, please, write to Nataliya Titova or send a request to the Telegram-chat.

If a student have successfully done all homeworks (with excellence), he or she may receive:

  • SAS academic program certificate
  • Acclaim e-badge, confirming successful passing of this course (with the list of SAS technologies used).

It is required to have a final grade at least 9 points.


The following online courses are freely available:

  • SAS Base programming link
  • Statistical analysis using SAS link

Moreover, students who are willing to spend extra time learning to program in SAS can try to pass a professional certification within the program SCYP (SAS® Software Certified Young Professionals) for free link.

Assessment criteria

The course provides several forms of knowledge control:

  • 4 homeworks;
  • Exam.

Criteria

  • All homework assignments are graded on a 10-point scale, minimum passing grade - 4.
  • Exam grade is also given on a 10-point scale.


Final grade

Final grade formula:

О_final = 0,5·О_exam + 0,5·О_HW

где

  • О_exam – grade for the exam on a ten-point scale.
  • О_HW – average grade for all homework assignments on a ten-point scale.

Rounding occurs only at the very end - in the final assessment. Rounding is arithmetic.

Each task and exam is evaluated on a 10-point scale (fractional marks are allowed for tasks).

The exam is held in the form of homework defense: students answer the questions on the tasks they have completed and, if necessary, perform an additional written task. If a student for some reason cannot or refuses to defend himself or herself, he or she must pass a written exam.

Homework

Homework №1

To be announced later


Homework №2

To be announced later.


Homework №3

To be announced later


Homework №4

To be announced later.


Contacts

Questions may be addressed in Telegram-chat, to Dmitry Ilvovsky @dilv_ru dilvovsky@hse.ru or SAS-HSE department manager Tatiana Lobok (@tatianalobok) tlobok@hse.ru. Be sure to add the tag [PMI FKN HSE / PAD FKN HSE / MIEF FKN HSE / Ek FEN HSE / Ext FEN HSE] to the title of the letter, and also indicate your last name and first name.


Telegram channel for announcements: https://t.me/+gI3IA1ucpok3MGUy

Telegram chat for discussions: https://t.me/+hSmHk-PTzXszOTk6


All announcements and materials will be available in Telegram chat!

All files provided are intended for use by students during their studies and are updated throughout the year. For found typos, inaccuracies, malfunctions of the page, please write to e-mail tlobok@hse.ru.

Additional materials

Course documents NB: the content may be updated!

Bibliography

Materials


Literature