RST Discourse Treebank

the `RST Discourse Treebank' from the Linguistic Data Consortium (LDC)


LDC/rst_discourse_treebank in the `usual place': /ufs/corpora/ (unix), \\corpora\corpora\ (Windows) (see the Introduction).


see LDC/rst_discourse_treebank/index.html in the `usual place': /ufs/corpora/ (unix), \\corpora\corpora\ (Windows) (see the Introduction).


This is the Rhetorical Structure Theory Discourse Treebank Publication, produced by the Linguistic Data Consortium (LDC) catalog number LDC2002T07 and isbn 21-58563-223-6.

The RST Discourse Treebank contains a selection of 385 Wall Street Journal articles from the Penn Treebank: which have been annotated with discourse structure in the framework of Rhetorical Structure Theory (RST). In addition, the corpus includes a number of humanly-generated extracts and abstracts associated with the original documents.

Contact: (Massimo Poesio, Computing Science)