SCI 2000 – Introduction to Data Science (Winter 2021)

Professor: Maxime Turgeon

Learning objectives:

The main goal of the course is to give students the tools and skills to do data analysis as part of a research project. Specifically, we’ll learn about:

 

  • data visualization
  • how to extract data from relational databases
  • how to scrape data from the web
  • how to use regular expressions to clean text data
  • how to aggregate and summarise data.

In parallel to the above, an emphasis on statistical thinking will be the focus: how to think about variability, what biases can occur in the data, and how to perform simple statistical procedures.

The course aims to be “hands-on”. The course will be taught using R, but students will be allowed to submit their work in both R and Python.

Register