Natural Language Processing (EN)

Kurzem Vás provede Jiří Materna

Je specialista na strojové učení se zkušenostmi s jeho aplikacemi v průmyslu od roku 2007. Mezi lety 2008 a 2017 pracoval ve společnosti Seznam.cz, z…

Základní info

Description

This course is focused on the analysis and processing of text data.

We are expecting knowledge of basic principles of machine learning in the same extent as the Introduction to Machine Learning course provides. A special attention will be aimed to text preprocessing and vectorization, which is crucial for NLP. We will further focus on text classification, language modeling and text synthesis.

Contents

  • Introduction to natural language processing
  • Chapters from computational linguistics (corpus, tokenization, morphological, syntactic and semantic analysis, entropy, perplexity)
  • Text document vectorization (bag of words, one-hot encoding, TF-IDF)
  • Practical taks on text classification
  • Word embedding (word2vec, GloVe)
  • Introduction to language modelling (n-gram models, smoothing, neural network based language models)
  • Practical task on language modelling (implementation of a language detection algorithm based on language models)
  • Neural network based text generator

Prerequisites

  • Basic knowledge of programing in Python
  • High school level of mathematics
  • Basics of machine learning on the level of our course Introduction to  machine Learning

Followups

Time Series

Natural Language Processing (EN)

Vybraný termín:

 Praha

Cena
4 990 Kč + 21% DPH

Kontaktovat dodavatele


Kontrola proti spamu. Kolik je dvě a dvě ? Součet zapište číslicemi.