Advanced Social Data Science I

Course content

The course introduces students to advanced quantitative social science methods, supervised machine learning and formal models of networks. The social sciences have developed a number of methods and approaches to inferring causal relations and testing theory based on observational data and ‘found’ data. At the same time, machine learning methods are becoming ever more prominent, both for measurement and analysis. The first part of the course introduces advanced regression models and key research designs for causal identification from observational data in the social sciences, including regression-discontinuity, difference-in-difference, event studies and instrumental variables. The second part of the course introduces the basic approaches to and methods of supervised machine learning in a social science context. This includes linear models, tree-based classification and (cross)validation. We also introduce the intersection of machine learning and social science empirical methods and outline challenges in (re)interpreting machine learning results through a social science lens, with a focus on machine learning model explainability and interpretability. Finally, the course introduces basic network concepts and measures.

Education

Mandatory course on MSc programme in Social Data Science at University of Copenhagen.

The course is only open for students enrolled in the MSc programme in Social Data Science.

Learning outcome

After completing the course, the student is expected to be able to:

 

Knowledge:

  • Show familiarity with advanced regression methods and different research designs for casual inference in the social sciences
  • Describe cores concepts and methods in supervised machine learning, including linear models, tree-based classification, overfitting, bias/variance trade-off and cross-validation.
  • Be able to define empirical issues at the intersection between machine learning and social science and describe challenges of interpretability of machine learning models.
  • Define key concepts in the analysis of complex networks.

 

Skills:

  • Implement common social science identification strategies to handle problems of endogeneity and selection.
  • Set up and execute simple supervised machine learning models for measurement and prediction in Python.
  • Identify challenges in applying and learning from machine learning in a social science context.
  • Structure network data in Python, as well as to construct and extract various network measures.

 

Competences:

  • Design and carry out basic analyses of complex social science networks.
  • Evaluate and implement appropriate modelling approaches given data set and objective, i.e. whether the goal is to evaluate a policy, make a model with best fit of the data or construct new measures.
  • Critically assess how carious research designs and identification strategies can or cannot be applied to questions of causal relationships in observational and “found” data and use vhis to develop data collection strategies.
  • Account for the possibilities and limitations in the use of machine learning within the social sciences and reflect upon contemporary (mis)use of application and machine learning in policy and research contexts.

Teaching combines lectures and classes, with a heavy emphasis on hands-on work with data in Python. Classes will present students with opportunities to apply their knowledge of programming and data handling and structuring as taught in Social Data Science Base Camp and Elementary Social Data Science to more advanced concepts and problems.

ECTS
7,5 ECTS
Type of assessment
Home assignment, 7 days
Type of assessment details
Written exam project. The exam is written individually. The project will involve conducting one or more empirical analyses using insights from the course.
Examination prerequisites

To be eligible for the exam in Advanced Social Data Science I, it is a requirement that students have completed a number of compulsory problem sets based on a social science question combining knowledge of social science research design with methods from the course. The problem sets must be approved by a member of the teaching team.

Aid
All aids allowed

ChatGPT and other large language model tools are permitted as a dedicated source, meaning text copied verbatim needs to be quoted, the tool cited, and generally the specific use made of them needs to be described in the submitted exam.

Marking scale
7-point grading scale
Censorship form
No external censorship
Exam period

Exam information:

The examination date can be found in the exam schedule     here

The exact time and place will be available in Digital Exam from the middle of the semester. 

Re-exam

Same as the ordinary exam.

Reexam info:

The reexamination date/period can be found in the reexam schedule     here

Criteria for exam assessment

Students are assessed on the extent to which they master the learning outcome for the course.

 

To obtain the top grade “12”, the student must with no or only a few minor weaknesses be able to demonstrate an excellent performance displaying a high level of command of all aspects of the relevant material and can make use of the knowledge, skills and competencies listed in the learning outcomes.

 

To obtain the passing grade “02”, the student must in a satisfactory way be able to demonstrate a minimal acceptable level of the knowledge, skills and competencies listed in the learning outcomes.

  • Category
  • Hours
  • Lectures
  • 28
  • Class Instruction
  • 42
  • Preparation
  • 112
  • Exam
  • 24
  • English
  • 206

Kursusinformation

Language
English
Course number
ASDK20004U
ECTS
7,5 ECTS
Programme level
Full Degree Master
Duration

1 block

Placement
Block 3
Studyboard
Social Data Science
Contracting department
  • Social Data Science
Contracting faculty
  • Faculty of Social Sciences
Course Coordinator
  • Nikolaj Arpe Harmon   (14-716c6e726f646d316b64757072714368667271316e7831676e)
Saved on the 12-05-2025

Are you BA- or KA-student?

Are you bachelor- or kandidat-student, then find the course in the course catalog for students:

Courseinformation of students