Skip to main content

Machine learning for natural language processing

Master’s level
7,5 credits (ECTS)


The course gives an introduction to machine learning models and architectures used in modern natural language processing (NLP) systems.

Rapid developments in machine learning have revolutionized the field of NLP, including for commerically important applications such as translation, summarization, and information extraction. However, natural language data exhibit a number of peculiarities that make them more challenging to work with than many other types of data commonly encountered in machine learning: natural language is discrete,structured, and highly ambiguous. It is extremely diverse: not only are there thousands of languages in the world, but in each language there is substantial variation in style and genre.
Furthermore, many of the phenomena encountered in language follow long-tail statistical distributions, which makes the production of training data more costly. For these reasons, machine learning architectures for NLP applications tend to be quite different from those used in other fields.

Prerequisites and selection


To be eligible to the course, the student should have a Bachelor's degree in any subject.
In addition, the course requires:
7.5 credits of courses in programming or equivalent.
A course including probability and statistics, such as DIT862 Statistical Methods for Data Science or MSG810 Mathematical Statistics and Discrete mathematics,
A first course in machine learning, such as DIT866 Applied Machine Learning,DIT381 Algorithms for Machine Learning and Inference, or MSA220 StatisticalLearning for Big Data,
English 6/English B or the equivalent level of an internationally recognized test, for example TOEFL, IELTS