Corpora from the Association of Computational Linguistics Data Collection Initiative, CD-ROM 1, Sept 1991
CED --- Collins English Dictionary
WSJ --- Wall St. Journal (1987, 88, 89)
DOE --- Scientific Abstracts from (US) the Dept of Energy
TREEBANK -- an early version of the PENN Treebank
-- ACL-DCI in the `usual place': /ufs/corpora/ (unix), \\corpora\corpora\ (Windows) (see the Introduction)..
no special tools at present
This is mostly old stuff, we'll replace it with more up-to-date material in due course.