MA331-7-SP-CO:
Programming and Text Analytics with R

The details
2020/21
Colchester Campus
Spring
Postgraduate: Level 7
Current
Monday 18 January 2021
Friday 26 March 2021
15
10 September 2020

 

Requisites for this module
(none)
(none)
(none)
(none)

 

(none)

Key module for

MSC G305JS Applied Data Science,
MSC G306JS Data Science and its Applications

Module description

The module will introduce the underlying principles and basic concepts of programming with the R language. It will cover a wide range of analytics, provide practical experience of powerful R tools, and present real-world examples of how data and analytics are used to gain insights and to improve a business or industry. These examples include text analytics, Twitter, and IBM Watson.

Throughout these examples, and many more, we will teach programming techniques that will enable students to apply advanced data science approaches to real-world applications.

This module assumes no prior programming skills.

Module aims

The purpose of this module is to introduce:
Fundamental concepts of programming.
The key aspects of programming using the R language.
Powerful R tools for text analytics.

Module learning outcomes

At the end of this module a student will be able to demonstrate:
A. A systematic, extensive and comparative knowledge and understanding of different objects and data types in R, including character, numeric, factor and logical data.
B. A systematic, extensive and comparative knowledge and understanding of functions in R, and the ability to create their own functions.
C. A comprehensive knowledge of, and familiarity with, R control structures, conditional expressions, and looping techniques.
D. A comprehensive knowledge of, and familiarity with, sentiment analysis of free-form text, including extracting insights and applying string processing methods.

Module information

Introduction to R
What is R? A brief overview of the concepts and features of the R statistical programming environment.
Help systems in R: A description of how to use different sources of R help.
Data types: A brief introduction to different data types in R including numeric, complex, character, factor, and logical data.
Data structures: A summary of data structures in R including vectors, matrices, arrays, data frames and lists.
Importing data: Describing how to import, edit, save, and export data in different formats using R, including Excel, SPSS, Stata, and SAS data files.
Data manipulation: A description of how to use logical operators to manipulate data.
Missing values: Describing how R handles missing values.
Visualisation: Creating, editing, and saving graphics in various formats using R.
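As an illustration of the "Introduction to R" topics above, here is a minimal base-R sketch covering data types, a data frame, logical operators, and missing values. All names and values (scores, students, and so on) are hypothetical and are not taken from the module materials.

```r
# Hypothetical example data (not from the module materials).
scores   <- c(72, 85, NA, 64)              # numeric vector with one missing value
grade    <- factor(c("B", "A", NA, "C"))   # factor (categorical data)
student  <- c("Ana", "Ben", "Cy", "Di")    # character vector
z        <- 1 + 2i                         # complex number

# Inspect the data type of each object.
types <- c(class(scores), class(grade), class(student), class(z))

# A data frame combines columns of different types.
students <- data.frame(name = student, score = scores, grade = grade,
                       stringsAsFactors = FALSE)

# Missing values: NA propagates unless it is handled explicitly.
mean_with_na    <- mean(scores)                # NA, because one score is missing
mean_without_na <- mean(scores, na.rm = TRUE)  # mean of the three observed scores
n_missing       <- sum(is.na(scores))          # number of missing scores

# Logical operators: keep rows with a recorded score of at least 70.
high <- students[!is.na(students$score) & students$score >= 70, ]
print(high)
```

The filter checks is.na() first because a comparison such as NA >= 70 is itself NA, which would otherwise produce a row of missing values in the subset.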
Programming using R
Functions: What is an R function? How are functions structured and used? How can one understand a function's parameters, and how can we create our own functions?
Control Structures: Describing how we include control structures in R code.
Conditional expressions: Using "if" and "ifelse" structures in R.
Loops: Introducing looping techniques in R, with particular focus on "for", "repeat" and "while" statements.
"apply" family: using "apply", "lapply", "tapply", "mapply" and "sapply" in R.
Text analytics using R
Text as data: Understanding opinions and intelligence.
Case study: Analysis of tweets on Twitter to understand sentiment and public perception.
Sentiment analysis.
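The programming and text analytics topics above can be tied together in a small sketch: a user-defined function, a conditional expression, a "for" loop, and sapply() applied to a toy sentiment task. This is only an assumed, simplified lexicon-counting approach in base R; the word lists, the example texts, and the function names sentiment_score and label_score are hypothetical, and the module's own case study may use different tools and real Twitter data.

```r
# Toy lexicon (hypothetical word lists, not a published sentiment dictionary).
positive_words <- c("good", "great", "love", "happy")
negative_words <- c("bad", "poor", "hate", "slow")

# Made-up example texts standing in for tweets.
texts <- c("I love this great service!",
           "Slow delivery and poor support.",
           "It arrived on Monday.")

# Function: score one text as (positive matches) minus (negative matches).
sentiment_score <- function(text) {
  clean <- gsub("[^a-z ]", " ", tolower(text))  # lower-case, replace punctuation with spaces
  words <- strsplit(clean, " +")[[1]]           # split on runs of spaces
  sum(words %in% positive_words) - sum(words %in% negative_words)
}

# Conditional expression: turn a single score into a label.
label_score <- function(score) {
  if (score > 0) {
    "positive"
  } else if (score < 0) {
    "negative"
  } else {
    "neutral"
  }
}

# "for" loop: score each text in turn.
scores_loop <- numeric(length(texts))
for (i in seq_along(texts)) {
  scores_loop[i] <- sentiment_score(texts[i])
}

# "apply" family: the same computation with sapply(), without an explicit loop.
scores <- sapply(texts, sentiment_score, USE.NAMES = FALSE)
labels <- sapply(scores, label_score)

print(data.frame(text = texts, score = scores, label = labels))
```

Counting lexicon matches ignores negation and context ("not good" still counts as positive), so it should be read as a starting point for the string processing steps rather than as a complete sentiment method.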

Learning and teaching methods

This module has 35 contact hours, structured as follows:
Lectures: 15 hours
Computer labs: 20 hours

Bibliography

(none)

Assessment items, weightings and deadlines

Coursework / exam   Description     Deadline   Weighting
Coursework          Lab Test                   20%
Coursework          Final Project              80%

Overall assessment

Coursework Exam
100% 0%

Reassessment

Coursework Exam
100% 0%
Module supervisor and teaching staff
Dr Osama Mahmoud, email: o.mahmoud@essex.ac.uk.
Dr Osama Mahmoud & Dr Joe Bailey
Dr Osama Mahmoud (o.mahmoud@essex.ac.uk), Dr Joe Bailey (jbailef@essex.ac.uk)

 

Availability
Yes
Yes
No

External examiner

No external examiner information available for this module.
Resources
Available via Moodle
Of 1973 hours, 2 (0.1%) hours available to students:
1971 hours not recorded due to service coverage or fault;
0 hours not recorded due to opt-out by lecturer(s).

 

Further information

Disclaimer: The University makes every effort to ensure that the information in its Module Directory is accurate and up to date. Exceptionally, it can be necessary to make changes, for example to programmes, modules, facilities or fees. Reasons for such changes might include a change of law or regulatory requirements, industrial action, lack of demand, departure of key personnel, change in government policy, or withdrawal or reduction of funding. Changes to modules may, for example, consist of variations to the content and method of delivery or assessment of modules and other services, the discontinuation of modules and other services, and the merging or combining of modules. The University will endeavour to keep such changes to a minimum, and will keep students appropriately informed by updating its programme specifications and module directory.

The full Procedures, Rules and Regulations of the University governing how it operates are set out in the Charter, Statutes and Ordinances and in the University Regulations, Policy and Procedures.