This module builds on knowledge that students have gained in GV900 and starts with a refresher on linear regression before delving into more advanced methods, both theoretically and practically. We will start with the assumptions that go into linear regressions, learn to analyse whether each assumption holds and consider how our results are impacted when they do not. We will focus on this both from a theoretical standpoint and learn what diagnostic tests can be used in R to test our regression assumptions. For linear regressions, we will also discuss interaction terms and fixed effects estimation.
The remainder of the module will focus on more advanced statistical methods with a focus on two broad categories. The first will be models for analysing non-continuous dependent variables, including binary, categorical and ordinal dependent variables. As before, we will also focus on how to analyse such data in R.
We will also spend some time introducing students to causal inference methods and analysis, using both observational and experimental data.
Along with the various new statistical techniques that students will learn, they will continue to build their coding skills in R throughout the module. In addition to implementing the statistical models, they will learn more advanced coding skills including writing their own (simple) functions, writing for-loops and nested for-loops, and they will learn to work with messy data so that they gain practical skills that are useful for both academic and non-academic spheres.