Computational morphosyntax (5r)
The objectives of this course are twofold. On the theoretical side, the objective is to learn the main techniques used in the computational treatment of morphology and syntax (i.e., in the treatment of words and strings of words that form sentences). On the practical side, the goal is to get acquainted with the basics of Python as a script programming language and to know how to manage text files to be processed. In the course the student will learn, write and use text tokenisers, morphosyntactic taggers and syntactic processors.
Requirement: if you are new to programming or to Python it is required that you acquire some knowledge of the Python programming language (Python 3) before the start of the course (contact your tutor or the course instructor for further details on tutorials for this training).
- Basic items:
- Jurafsky, Daniel & Martin, James H. (2009), Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. 2nd edition. Prentice Hall (a new edition is currently being prepared)
- Bird, Steven; Klein, Ewan & Loper, Edward (2014), Natural Language Processing with Python. Analyzing Text with the Natural Language Toolkit. http://www.nltk.org/book/. (There is a previous, printed version of this book published by O’Reilly in 2009, based on Python2) • Jurafsky, Daniel & Manning, Christopher D. (2014), Natural Language Processing lectures, You Tube https://www.youtube.com/playlist?list=PLQiyVNMpDLKnZYBTUOlSI9mi9wAErFtFm...
- Python Tutorials: There is plenty of Python tutorials on the web. Here are a couple of tutorials that may be useful: https://www.pythonprogramming.net/python-fundamental-tutorials/ (for beginners) http://www.diveintopython3.net/ (advanced content)
- Other recommended readings:
- Manning, Christopher D. & Schütze, Hinrich (1999), Foundations of Statistical Natural Language Processing. The MIT Press.