CE807-7-QS-:
Text Analytics

The details
2020/21
Computer Science and Electronic Engineering (School of)
Spring - Partner
Postgraduate: Level 7
Current
Sunday 17 January 2021
Friday 26 March 2021
15
20 November 2020

 

Requisites for this module
(none)
(none)
(none)
(none)

 

(none)

Key module for

(none)

Module description

This module will provide an understanding of text analytics and its applications. Students will study state of the art matters for supervised and unsupervised text mining. Methods include rule based traditional machine learning as well as deep neural networks.

Module aims

The aim of this module is to provide students with an understanding of basic and advanced methods of text analytics and its applications. Students will learn about state of the art methods for unsupervised and supervised text mining including text preprocessing, structured data extraction, clustering of documents and classification of documents using different techniques. The methods taught include rule-based approaches, traditional machine learning techniques as well as modern Deep Neural Networks.

Module learning outcomes

After completing this module, students will be expected to be able to:

1. Have knowledge about methods for text preprocessing.
2. Understand and use techniques for structured data extraction.
3. Understand and use various techniques for statistical text analysis.
4. Apply text analysis methods on data extracted from the web such as social media , websites and others.
5. Be empowered to independently develop systems for text analytics.

Module information

Outline Syllabus:

1. Text preprocessing techniques
2. Structured data extraction (such as entities, records)
3. Statistical methods for text clustering (unsupervised learning)
4. Statistical methods for text classification (supervised learning)
5. Deep Learning for text analysis (supervised and unsupervised)

Learning and teaching methods

2 hours of lectures per week, 2 hours of laboratory time per week.

Bibliography

  • Manning, Christopher D.; Raghavan, Prabhakar; Schütze, Hinrich. (2008) Introduction to information retrieval, New York: Cambridge University Press.
  • Deng, Li; Liu, Yang. (2018) Deep Learning in Natural Language Processing, Singapore: Springer Verlag, Singapore.
  • Coelho, Luis Pedro; Richert, Willi. (2015) Building machine learning systems with Python, Birmingham: Packt Publishing.
  • Goodfellow, Ian; Bengio, Yoshua; Courville, Aaron. (2016) Deep learning, Cambridge, Massachusetts: The MIT Press.
  • Witten, I. H.; Frank, Eibe; Hall, Mark A.; Pal, Christopher J. (2017) Data mining: practical machine learning tools and techniques, Boston: Elsevier.

The above list is indicative of the essential reading for the course. The library makes provision for all reading list items, with digital provision where possible, and these resources are shared between students. Further reading can be obtained from this module's reading list.

Assessment items, weightings and deadlines

Coursework / exam Description Deadline Coursework weighting

Additional coursework information

This assignment involves developing a text categorization system--e.g., for sentiment analysis of Twitter data. The assessment is going to be based in part on the code, in part on the report. In the new coursework-only version of the course, in the report students will also be asked to answer theoretical questions. Assignment 1 is to be handed out in week 20 and submitted to FASer in week 22. This assignment involves the development of a system for, e.g., named entity resolution, or disambiguation to Wikipedia of query logs. The assessment is based in part on the code produced, in part on report, which, in the new version of the module, will also require the students to answer some theoretical questions. Assignment 2 will be handed out in week 23, to be submitted to FASer in week 25.

Exam format definitions

  • Remote, open book: Your exam will take place remotely via an online learning platform. You may refer to any physical or electronic materials during the exam.
  • In-person, open book: Your exam will take place on campus under invigilation. You may refer to any physical materials such as paper study notes or a textbook during the exam. Electronic devices may not be used in the exam.
  • In-person, open book (restricted): The exam will take place on campus under invigilation. You may refer only to specific physical materials such as a named textbook during the exam. Permitted materials will be specified by your department. Electronic devices may not be used in the exam.
  • In-person, closed book: The exam will take place on campus under invigilation. You may not refer to any physical materials or electronic devices during the exam. There may be times when a paper dictionary, for example, may be permitted in an otherwise closed book exam. Any exceptions will be specified by your department.

Your department will provide further guidance before your exams.

Overall assessment

Coursework Exam
100% 0%

Reassessment

Coursework Exam
100% 0%
Module supervisor and teaching staff
Dr Mohammed Jameel, email: shoaib.jameel@essex.ac.uk.
Dr Shoaib Jameel, Dr Alba Garcia Seco De Herrera
School Office, e-mail csee-schooloffice (non-Essex users should add @essex.ac.uk to create full e-mail address), Telephone 01206 872770.

 

Availability
No
No
No

External examiner

No external examiner information available for this module.
Resources
Available via Moodle
No lecture recording information available for this module.

 

Further information

Disclaimer: The University makes every effort to ensure that this information on its Module Directory is accurate and up-to-date. Exceptionally it can be necessary to make changes, for example to programmes, modules, facilities or fees. Examples of such reasons might include a change of law or regulatory requirements, industrial action, lack of demand, departure of key personnel, change in government policy, or withdrawal/reduction of funding. Changes to modules may for example consist of variations to the content and method of delivery or assessment of modules and other services, to discontinue modules and other services and to merge or combine modules. The University will endeavour to keep such changes to a minimum, and will also keep students informed appropriately by updating our programme specifications and module directory.

The full Procedures, Rules and Regulations of the University governing how it operates are set out in the Charter, Statutes and Ordinances and in the University Regulations, Policy and Procedures.