Topics in Statistical Genetics
Introduction to topics in Statistical Genetics, that is, the application of statistical methods for modelling and drawing inferences from genetic data, in particular DNA data. Genetics have been of statistical interest for more than 100 years.
The course discusses mathematical theory and statistical models to understand how genetic data vary in populations and how we can draw infererence from genetic data. Mathematical and statistical theory underlies vast progress and claims made in recent years about human's relationship to Neanderthals, the origin and spread of diseases (such a Covid19), and how the world was populated.
Random variables modelling genetic data from individuals in a population are highly correlated ("exchangeable random variables") and standard asymptotic theory does not apply. The theory and models are based on Markov chains/processes, in discrete and continuous time. Inference procedures are adhoc or advanced, and often based on models with latent variables.
Key mathematical/statistical concepts are ancestral processes, the coalescent process, the age and frequency of alleles (genetic types) in populations, and inference for genetic data based on such processes. Relatedness between indivduals is desribed in terms of a stochastic graph.
MSc Programme in Statistics
At the end of the course the student will have knowledge about the use of statistics in genetics, how genetic variation is modelled, ancestral processes, and how inference can be made from such processes.
The student will have the knowledge to explain
- population genetic models, like the simple Wright-Fisher model,
- the coalescent process and Ewens sampling formula
- the frequency distribution of alleles (types)
- statistical methods for inference on genetic data in different situations
- the use of Markov chains to model genetic variation
The student will acquire the skills to analysis simple genetic data sets, and to extract basic mathematical properties about ancestral processes.
At the end of the course the students will have the competence to
- carry out inference for (simple) genetic data sets
- extract relevant mathematical properties of genetic models
- extract biological insight from mathematical/statistical models
Four hours of lectures and three hours of exercises per week for 7 weeks.
Course literature to be decided, but will likely be a mix of research papers and extracts from books.
Basic mathematical statistics and probability based on measure
theory such as 2nd year courses or equivalent.
Academic qualifications equivalent to a BSc degree is recommended.
Students receive feedback at the exercise sessions.
- 7,5 ECTS
- Type of assessment
Written assignment, 27 hoursWritten take-home assignment
- All aids allowed
- Marking scale
- 7-point grading scale
- Censorship form
- No external censorship
One internal examiner
Criteria for exam assessment
The student must in a satisfactory way demonstrate that he/she has mastered the learning outcome of the course.
Single subject courses (day)
- Theory exercises
- Practical exercises
- Course number
- 7,5 ECTS
- Programme level
- Full Degree Master
- Block 3
- No limit
The number of seats may be reduced in the late registration period
- Study Board of Mathematics and Computer Science
- Department of Mathematical Sciences
- Faculty of Science
- Carsten Wiuf (4-7a6c7869437064776b316e7831676e)
Are you BA- or KA-student?
Courseinformation of students