Postgraduate: Level 7
Thursday 08 October 2020
Friday 18 December 2020
14 July 2020
Requisites for this module
MA216, MA304, MA317, MA318, MA319
The module contains a brief early introduction to the basics of probability and random variables. Major topics are distribution theory, estimation and Maximum Likelihood estimators, hypothesis testing, basic linear regression and multiple linear regression implemented in R. The module concludes with a brief review of Generalised Linear Models with special reference to the logistic model, odeviance and Wald statistics using examples in R.
The module aims to cover part of the CS1 (Actuarial Statistics) IFOA syllabus. The module follows the Graduate (Level 6) standards int he Statistics of he Royal Statistical Society, see under http://www.rss.org.uk/Images/PDF/pro-dev/2016/rss-level6-standards.pdf
On completion of the module students should be able to:
1. Use the principles of probability as a function over sets of events and manipulate conditional probabilities via Bayes theorem.
2. Manipulate Random variables, PDFs, distribution functions and Expectation especially w.r.t. the moments of a PDF, Mean and variance.
3. Define and be able to apply appropriately the discrete distributions: binomial, Poisson and uniform and be capable of working with the continuous distributions: normal, exponential, chi-square, t, F and uniform.
4. Use the one-to-one correspondence between an mgf and a pdf for sums of RVs.
5. Perform operations with bivariate distributions, joint, marginal , conditional distributions and freely use independence arguments.
6. Apply the central limit theorem.
7. Determine maximum likelihood and least squares estimates of unknown
parameters. Be able to define the terms: bias and mean squared error. Determine efficiency w.r.t. the Cramer-Rao lower bound for unbiased estimators.
8. Determine confidence intervals for means, variances and differences
9. To work with the concepts of random sampling, statistical inference and sampling distribution, Hypothesis tests. Null and alternative hypotheses, type I and type II errors, test statistic, critical region, level of significance, probability-value and power of a test. Use tables of the t-, F-, and chi-squared distributions.
10. Investigate linear relationships between variables using regression analysis. Use the correlation coefficient for bivariate data and the coefficient of determination. Explain what is meant by response and explanatory variables. Derive and calculate the least squares estimates of the slope and intercept parameters in a simple linear regression model. Perform multiple linear regression using R and interpret output.
11. Justify and use a Generalised Linear Model including inference arguments using deviance and Wald statistics from R output. [RSS level 7 standards developing from level 6 standards]
12. Use R [R Core Team (2017), R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ ] for the data analysis examples of the module.
Sample space for an experiment, an Event and definition of probability as a set function on a collection of events. The basic properties satisfied by probability.Conditional probability and Bayes' Theorem for events. Definition of independence for two events, Random variables, means, variances and moments.
Standard distributions and their use in modelling, including Bernoulli, binomial, Poisson, discrete uniform, Normal, exponential, gamma, continuous, uniform and multivariate Normal. Expectation, variance and generating functions. Sums of IID random variables, weak law of large numbers, central limit theorem.
Joint, marginal and conditional distributions. Independence. Covariance and correlation.
Moment generating functions to find moments of the PDF and distributions of sums of random variables.
Sampling distributions. Bias in estimators and mean squared error.
Efficiency and the Cramer-Rao lower bound for unbiased estimators. Maximum likelihood estimation and finding estimators analytically. The mean and variance of a sample mean. The distribution of the t-statistic for random samples from a normal distribution. The F distribution for the ratio of two sample variances from independent samples taken from normal distributions. Chi Square distributions for the sum of squared standard normal variates
Hypothesis testing and Confidence intervals
Confidence intervals for means, variances and differences between means. Hypothesis tests concerning means and variances. Null and alternative hypotheses, type I and type II errors, test statistic, critical region, level of significance, probability-value and power of a test. Use tables of the t-, F-, and chi-squared distributions.
linear relationships between variables using regression analysis. The correlation coefficient for bivariate data and the coefficient of determination. Response and explanatory variables and the least squares estimates of the slope and intercept parameters in a simple linear regression model.. Multiple linear regression with IID normal errors implemented in R.
Generalised Linear Models
Definition of a Generalised Linear Model. Inference arguments using deviance and Wald statistics using data examples within R.
Use R to implement the methods discussed above.
No prerequisites, however the module requires basic mathematical knowledge of algebra, permutations, combinations and summation of series with notation. Competent knowledge of differential and integral calculus, including partial derivatives and double integrals, is certainly required.
Teaching will be delivered in a way that blends face-to-face classes, for those students that can be present on campus, with a range of online lectures, teaching, learning and collaborative support.
This module does not appear to have any essential texts. To see non-essential items, please refer to the module's reading list.
Assessment items, weightings and deadlines
|Coursework / exam
||180 minutes during Summer (Main Period) (Main)
Module supervisor and teaching staff
Dr Hongsheng Dai, email: email@example.com.
Dr Hongsheng Dai & Dr Yassir Rabhi
Dr Hongsheng Dai (firstname.lastname@example.org), Dr Yassir Rabhi (email@example.com)
Dr Dimitrina Dimitrova
Cass Business School, City, University of London
Available via Moodle
Of 39 hours, 39 (100%) hours available to students:
0 hours not recorded due to service coverage or fault;
0 hours not recorded due to opt-out by lecturer(s).
* Please note: due to differing publication schedules, items marked with an asterisk (*) base their information upon the previous academic year.
Disclaimer: The University makes every effort to ensure that this information on its Module Directory is accurate and up-to-date. Exceptionally it can
be necessary to make changes, for example to programmes, modules, facilities or fees. Examples of such reasons might include a change of law or regulatory requirements,
industrial action, lack of demand, departure of key personnel, change in government policy, or withdrawal/reduction of funding. Changes to modules may for example consist
of variations to the content and method of delivery or assessment of modules and other services, to discontinue modules and other services and to merge or combine modules.
The University will endeavour to keep such changes to a minimum, and will also keep students informed appropriately by updating our programme specifications and module directory.
The full Procedures, Rules and Regulations of the University governing how it operates are set out in the Charter, Statutes and Ordinances and in the University Regulations, Policy and Procedures.