ICL Home >> Reading & Resources |
Introduction to Computational LinguisticsReading and ResourcesNLTKThe course will use the Natural Language Toolkit (NLTK-Lite), developed at Univ of Pennsylvania by Steven Bird and Edward Loper as an open source project at Sourceforge. This year, we will be using Version 0.6 of the toolkit. NLTK-Lite is provided as a Python package and modules from the package can therefore be imported into Python programs. For more details, including documentation, see the Linguistic Corpus Resources on DICEFor general information about language and speech data on DICE, have a look at the corpora web page. A variety of corpora are also included as part of the NLTK-Lite distribution, and can be found /usr/share/nltk-data. Recommended textbookDaniel Jurafsky and James H. Martin. Speech and Language Processing. Prentice-Hall, 2000. (Errata) New and revised chapters available online! Speech and Language Processing, 2nd Ed. Python ResourcesThe following books and other resources are not required, but may prove useful as references for the programming portions of the course.
Other reading
|
Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk Please contact our webadmin with any comments or corrections. Logging and Cookies Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh |