Data Analysis in Journalism and Political Science 2020-2021
Содержание
About the course
The course is conducted for students of Bachelor’s Programme 'HSE and University of London Parallel Degree Programme in International Relations'.
Abstract: In this intermediate Python course, you will learn how to apply data science methods and techniques to political science and journalism. This course will provide you with knowledge and skills in data mining, exploratory data analysis, and visualization. The practical classes are project-oriented and cover the basic topics of data science applications used in the media: topic modeling, sentiment analysis, social network analysis, and classical machine learning approaches. By the end of the course, you will be able to perform your own projects in political science and journalism in Python.
Syllabus: open
Required Software
- Anaconda
or separately installed:
- Python version >= 3.6
- Jupyter Notebook
- pip3 for installing Python libraries during the course
Materials
Presentations and all materials will be available immediately after each practice class. Additional materials will be used in quizzes at each next seminar.
Github with the materials from our practical classes: https://github.com/anamarina/Data_Analysis_in_Journalism_and_Political_Science
Week | Topic | Slides | Tutorial | Additional Materials | Assignment | Deadline |
---|---|---|---|---|---|---|
1 | Introduction | Intro | Part1, Part 2 | |||
2 | Parsing | - | sem2_parsing.ipynb | |||
3 | Exploratory Data Analysis & Visualization | - | sem3_eda.ipynb | |||
4 | Social network analysis | - | sem4_graphs.ipynb | 1. Getting started with graph analysis in pandas and networks (Fraud detection example) 2. Coursera.Applied Social Network Analysis in Python |
||
5 | ||||||
6 | ||||||
7 |
Assignments
The course consists of two home assignments, each of them performed individually. Assignment 1 will be published after Week 3. It is based on Week 1 - Week 3. Assignment 2 - after Week 5. It is based on Week 2 - Week 5.
Each task is checked for plagiarism. Matching more than 25% of the code will be considered plagiarism and will result in 1 point out of 10 with the right to appeal. If the code matches more than 40%, the job will be canceled (0 points) without the right to appeal. After the deadline for each assignment, during the next week, each student will be offered a convenient time for her/him for participating in a conference in Zoom with a lecturer and TA to answer questions on code and explanations of solutions.
Assignment title standard: Please, name your files with solutions in this format: Assignment # _ # Number # _ # Group number # _ # Name # _ # Surname #. Example: Assignment_1_BMOL182_Morty_Smith
Github with assignments: https://github.com/anamarina/Data_Analysis_in_Journalism_and_Political_Science
Links for submitting your assignments (Dropbox links):
Group | Assignment 1 | Assignment 2 | Final Project |
---|---|---|---|
BMOL182 | Link | Link | Link |
BMOL183 |
Communication
All course materials, assignments, deadlines will be published on this page.
Important announcements will be sent in Telegram channel and sometimes duplicated to group emails: https://t.me/joinchat/AAAAAER0YXnyxzVHeH0Eaw
The group with 24/7 online support in Telegram for Q&A, discussions, technical issues, and moral support: https://t.me/joinchat/Eb8LXxFniDA5jaVbLRW5Cg
Group | Teacher | Teaching Assistant | Schedule |
---|---|---|---|
BMOL182 | Marina Ananyeva Email Telegram | Nikita Budilovskiy Email Telegram | Saturday 13:00 - 14:20 (September), Saturday 13:00 - 16:00 (October) |
BMOL183 | Marina Ananyeva | Nikita Budilovskiy Email Telegram | Saturday 14:40 - 16:00 (September), Saturday 16:20 - 19:30 (October) |
Feedback
We’ll much appreciate it if you help us to make this course better by sharing your ideas and feedback. Feel free to do it!
Anonymous feedback form: click_here
Grading
Final Grade = 0.2*(in-class quizzes + extra points) + 0.2*(home assignment 1) + 0.2* (home assignment 2) + 0.4*(group project)
In-class quizzes. At the beginning of each practical class, students are asked to pass a test to check their knowledge based on the additional reading materials and previous seminars. Each quiz task is evaluated out of 1 or 2 points for correct answers. All points for 6 quizzes will be summed up and normalized to 10 points grade. A student can get the extra points in case if he/she showed some good activity during the seminar (e.g., answered the teacher's questions, programmed a small task using a projector for demonstration instead of the teacher, submitted a bonus task).
Home assignments will be provided during the course with detailed instructions and assignment evaluation criteria. Deadlines cannot be violated. If so, the assignment will not be evaluated by your TA. All assignments are checked for plagiarism. Please, see Assignments above this page.
Cheating and honor
You must abide by the Honor Code.
Please don’t cheat - the rumor has it HSE has quite severe penalties.
To avoid being accused of plagiarism in “grey cases”, please disclose with whom and how you have collaborated on each assignment, except for the final group project. If you warn us, the worst thing that can happen to you after a good-faith mistake is to ask you to complete another version of the task, without disciplinary action and without notifying the HSE administration.