Python Programming for Data Science

Course content

This course is an introduction to programming in Python, with focus on data processing and analysis. It includes basic programming concepts such as data types, conditionals, loops, functions, object oriented programming, pattern matching (regular expressions), and computational complexity. In addition, it also provides technical skills relevant to the data science pipeline such as the ability to log on to an external server, and to navigate a Unix shell.

Education

BSc Programme in Biochemistry
BSc Programme in Biology
BSc Programme in Molecular Biomedicine
MSc Programme in Bioinformatics

Learning outcome

Knowledge:
After completing the course, the student will master key programming concepts such as data-types, variables, conditionals, loops, and functions, and have an understanding of the central concepts in object oriented programming, pattern matching and computational complexity. Finally, the student will have acquired an understanding of a Unix/Linux environment, including basic navigation, pipes, networking, and process control.

Skills:
The student is capable of solving small to medium sized programming tasks in Python, in particular tasks related to data processing and analytics. The student can produce programs that are well-written, well-structured, and well-commented. Finally, the student knows how to execute scripts on a remote server, and navigate using a Unix command line interface on such a server.

Competences:
After completing the course, the student is capable of solving the many small to medium size programming tasks that arise in Data Science disciplines, and is able to write well-structured and maintainable programs in Python. The student is also capable of running programs both locally and on remote servers, and be able to navigate in a Unix environment.

The student

  • can explain the differences between various data-types in Python and can select the relevant type for a given programming task
  • can give a detailed description of conditionals and loops, and is able to explain how loops relate to the complexity of a program
  • can motivate the concepts of function and module, and give examples of how these tools should be used to structure code
  • can explain the basic concepts of Object Oriented Programming, and give examples of appropriate uses of classes and object
  • can identify problems for which regular expressions are well suited, and is able to construct an appropriate regular expression for a given pattern matching problem
  • can give examples of how to handle errors in a program
  • is capable of independently finding online information about external Python modules, and applying this information to solve a specific task

Lectures and exercises mixed (6-9 hours per week). Some of the exercises will involve simple Biological examples, but a Biological background is not required for taking the course.

See Absalon.

This is an introductory programming course: no prior programming experience is required. We do assume knowledge corresponding to a basic University-level mathematics introduction course (e.g. Introduction to the Mathematics for the Chemical Sciences).

Participants are expected to bring a laptop equipped with a network card to class (contact the teacher if not possible).

Written
Oral
Individual
Collective
Continuous feedback during the course of the semester
ECTS
7,5 ECTS
Type of assessment
Written assignment, 5 days/20 hours
Individual, written take-home exam.
Aid
All aids allowed
Marking scale
7-point grading scale
Censorship form
No external censorship
Several internal examiners.
Criteria for exam assessment

See learning outcome

Single subject courses (day)

  • Category
  • Hours
  • Lectures
  • 21
  • Preparation
  • 139
  • Practical exercises
  • 21
  • Exam
  • 25
  • English
  • 206