RST Discourse Treebank

The `RST Discourse Treebank' from the Linguistic Data Consortium (LDC).

Availability

LDC/rst_discourse_treebank in the `usual place': /ufs/corpora/ (unix), \\corpora\corpora\ (Windows) (see the Introduction).

Documentation

See LDC/rst_discourse_treebank/index.html in the `usual place': /ufs/corpora/ (unix), \\corpora\corpora\ (Windows) (see the Introduction).

Comment

This is the Rhetorical Structure Theory Discourse Treebank publication, produced by the Linguistic Data Consortium (LDC), catalog number LDC2002T07, ISBN 21-58563-223-6.

The RST Discourse Treebank contains a selection of 385 Wall Street Journal articles from the Penn Treebank (http://www.ldc.upenn.edu/Catalog/LDC95T7.html) that have been annotated with discourse structure in the framework of Rhetorical Structure Theory (RST). In addition, the corpus includes a number of human-generated extracts and abstracts associated with the original documents.

Contact: poesio@essex.ac.uk (Massimo Poesio, Computing Science)