SCI 2000 – Introduction to Data Science (Winter 2021)
Professor: Maxime Turgeon
Learning objectives:
The main goal of the course is to give students the tools and skills to do data analysis as part of a research project. Specifically, we’ll learn about:
- data visualization
- how to extract data from relational databases
- how to scrape data from the web
- how to use regular expressions to clean text data
- how to aggregate and summarise data.
In parallel to the above, an emphasis on statistical thinking will be the focus: how to think about variability, what biases can occur in the data, and how to perform simple statistical procedures.
The course aims to be “hands-on”. The course will be taught using R, but students will be allowed to submit their work in both R and Python.