GESIS Training Courses
Search

Scientific Coordination

Verena Kunz

Administrative Coordination

Noemi Hartung

Introduction to Computational Social Science with Python

About
Location:
Online via Zoom
Course duration:
13:00-16:30 (CEST / UTC+2)
General Topics:
Course Level:
Format:
Software used:
 
Duration:
Language:
Fees:
Students: 550 €
Academics: 825 €
Commercial: 1650 €
Keywords
Additional links
Lecturer(s): John McLevey

About the lecturer - John McLevey

Course description

The digital revolution has produced unprecedented amounts of data that are relevant for researchers in the social sciences, from online surveys to social media user data, travel and access data, and digital or digitized text data. How can these masses of raw data be turned into understanding, insight, and knowledge? The goal of this course is to introduce you to topics, methods, and workflows in Computational Social Science (CSS) with Python, a powerful programming language that offers a wide variety of tools, used by journalists, data scientists, and researchers alike. Unlike many introductions to programming, e.g., in computer science, the focus of this course is on how to explore, obtain, wrangle, visualize, model, and communicate data to address challenges in the social sciences. The course emphasizes the theoretical and ethical aspects of CSS while covering topics such as web scraping (i.e. obtaining data from the internet), data cleaning (i.e. getting raw data into a rectangular or otherwise easy-to-analyze format), and visualization (i.e. drawing bar, line, scatter plots and more from data), computational text analysis (i.e. using the computer to find patterns in text or sort documents into categories), machine learning (i.e. training algorithms on annotated data and generalizing patterns to unseen data), network analysis (i.e. examining relationships among entities, such as persons, organizations, or documents), and the basics of probabilistic modelling. The course will be held in an online blended learning format with video lectures focused on theoretical background and demonstrations accompanied by live sessions where participants can ask questions and work through projects together.
 
For additional details on the course and a day-to-day schedule, please download the full-length syllabus.
 
Organizational Structure of the Course
The course will take place in a blended learning format. That means that you will need to (1.) read the literature listed under each session (if any); (2.) watch the video lecture; (3.) finish the exercises before each live group session. This means that participants will be on roughly the same level of knowledge during the live sessions, and we will be able to focus on open discussion, the answering of questions and small group exercises.


Target group

You will find the course useful if:
  • you have taken, e.g., a statistics course, know a little bit of Python, and now want to explore and get an overview of computational methods, data science, or one of the approaches listed above.


Learning objectives

By the end of the course you will:
  • be able to define what constitutes the field of computational social science
  • have a high-level overview of the approaches utilized in computational social science, including advantages and shortcomings
  • have basic knowledge and hands-on experience of how to apply the approaches and what tools are considered state-of-the-art
  • be equipped to deepen your knowledge on the theory and practice of computational social science.


Prerequisites

Working knowledge of Python is an asset but is not required. There is an optional “Introduction to Python” module that you should review before beginning this course. If you need more help, you should take the “Introduction to Python” course that takes place online from 25-28 August.
 
Software and Hardware Requirements
The computing in this course will be done on a remote server that you will access from your computer. You do not need to install any software on your personal machine! The only thing you need to do before class is sign up for a free GitHub account.


Schedule