Natural Language Processing and Text Mining

Basic information

Workload: 

45 hours

Syllabus: 

Information extraction, morphosyntactic markup, syntax and semantics, statistical models and models based on rules, linguistic modeling, clustering, learning, machine translation, software for PLN.

Bibliography

Mandatory: 

•    Feldman, R., & Sanger, J. (2007). The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press.
•    Finegan, E. (2007). Language: Its Structure and Use (5th ed.). Wadsworth.
•    Manning, C. D., & Schütze, H. (1999). Foundations Of Statistical Natural Language Processing. MIT Press.
•    Pereira, F. C. N. (1994). Natural Language Processing. MIT Press.
•    Klavans, J. L., & Resnik, P. (Eds.). (1996). The Balancing Act: Combining Symbolic and Statistical Approaches to Language. MIT Press.
•    Young, S., & Bloothooft, G. (Eds.). (1997). Corpus-Based Methods in Language and Speech. Springer.