MA334-7-SP-CO:
Data analysis and statistics with R
2020/21
Mathematical Sciences
Colchester Campus
Spring
Postgraduate: Level 7
Current
Monday 18 January 2021
Friday 26 March 2021
15
10 September 2020
Requisites for this module
(none)
(none)
(none)
(none)
(none)
MSC G305JS Applied Data Science
The module will introduce concepts from data analysis and statistics and show how they can be applied effectively via the R language. It will cover a wide introduction to statistics and provide practical experience of real-world examples of how statistics is used to gain insights.
Throughout these examples, and many more, we will teach programming techniques that will enable students to apply statistical approaches to real-world applications.
This module assumes no previous exposure to statistics.
The purpose of this module is to introduce:
Data analysis.
Statistics.
The use of R for data analysis and statistics.
A. A systematic, extensive and comparative knowledge and understanding of the use of R for carrying out statistical analysis
B. A systematic, extensive and comparative knowledge and understanding of data analysis methods
C. A systematic, extensive and comparative knowledge and understanding of statistical methods
Basic ideas of probability and statistical distributions
Random variables, means, covariance and variance
Variance of a sample mean and confidence intervals for means, variances and differences between means
Conditional probability and independence
Probability distribution theory
Standard distributions and their use in modelling: including Bernoulli, Binomial, Poisson
Estimation and Maximum Likelihood estimators
Hypothesis tests concerning means and variances
Null and alternative hypotheses
Type I and type II errors
Test statistic and critical region
Probability value and level of significance
Using tables of the t, F and chi-squared distributions
Introduction to linear regression
The least square estimates of the intercept and the slope of a simple linear regression
Confidence intervals for the slope parameter and prediction intervals for response
Coefficient of determination and the sample correlation coefficient
This module has 35 contact hours that will be structured as follows:
Lectures: 20 hours
Classes: 10 hours
Computer labs: 5 hours
Assessment items, weightings and deadlines
Coursework / exam |
Description |
Deadline |
Weighting |
Coursework |
Final Project & Presentation |
30/04/2021 |
|
Overall assessment
Reassessment
Module supervisor and teaching staff
Dr Andrew Harrison, email: harry@essex.ac.uk.
Dr Andrew Harrison, Dr Osama Mahmoud & Dr Mario Gutierrez-Roig
Dr Andrew Harrison (harry@essex.ac.uk), Dr Osama Mahmoud (o.mahmoud@essex.ac.uk), Dr Mario Gutierrez-Roig (mario.gutierrez-roig@essex.ac.uk)
Yes
Yes
Yes
No external examiner information available for this module.
Available via Moodle
Of 2377 hours, 0 (0%) hours available to students:
2377 hours not recorded due to service coverage or fault;
0 hours not recorded due to opt-out by lecturer(s).
Disclaimer: The University makes every effort to ensure that this information on its Module Directory is accurate and up-to-date. Exceptionally it can
be necessary to make changes, for example to programmes, modules, facilities or fees. Examples of such reasons might include a change of law or regulatory requirements,
industrial action, lack of demand, departure of key personnel, change in government policy, or withdrawal/reduction of funding. Changes to modules may for example consist
of variations to the content and method of delivery or assessment of modules and other services, to discontinue modules and other services and to merge or combine modules.
The University will endeavour to keep such changes to a minimum, and will also keep students informed appropriately by updating our programme specifications and module directory.
The full Procedures, Rules and Regulations of the University governing how it operates are set out in the Charter, Statutes and Ordinances and in the University Regulations, Policy and Procedures.