Lecture 1. Introduction — различия между версиями
Материал из Wiki - Факультет компьютерных наук
Polidson (обсуждение | вклад) (Новая страница: «''' Lecture 1. Introduction ''' == Brief history of NLP == * January 7, 1954 — Georgetown experiment. Russian to English machine translation; * 1957 — Noam…») |
Polidson (обсуждение | вклад) (→NLP techniques) |
||
Строка 33: | Строка 33: | ||
== NLP techniques == | == NLP techniques == | ||
+ | |||
+ | * The level of characters: | ||
+ | ** Word segmentation | ||
+ | ** Sentence breaking | ||
+ | * The level of words — morphology: | ||
+ | ** Part of speech (POS) tagging | ||
+ | ** Word sense disambiguation | ||
+ | * The level of sentences — syntax: | ||
+ | ** Parsing | ||
+ | * The level of senses — semantics: | ||
+ | ** Coreference resolution | ||
+ | ** Discourse analysis | ||
+ | ** Semantic role labeling | ||
+ | ** Synonymy detection | ||
== Main problems == | == Main problems == | ||
== About this course == | == About this course == |
Версия 23:29, 22 августа 2015
Lecture 1. Introduction
Содержание
Brief history of NLP
- January 7, 1954 — Georgetown experiment. Russian to English machine translation;
- 1957 — Noam Chomsky introduced "universal grammar";
- since 1961 — Brown Corpus;
- the late 1960's — ELIZA, a simulation of a psychotherapist;
- 1975 — Vector Space Model by Salton;
- up to the early 1980's — rule based approaches;
- after the early 1980's — machine learning, corpus linguistics;
- 1998 — Language Model by Ponte and Croft;
- since 1999 — topic modeling (LSI, pLSI, LDA, etc);
- 1999 — "Foundations of Statistical Natural Language Processing" by Manning and Shuetze;
- 2009 — "Natural Language Processing with Python" by Bird, Klein, and Loper.
Major tasks of NLP
- Machine Translation
- Text classification
- Sentiment analysis
- Spam filtering
- Classification by topic or by genre
- Text clustering
- Named entity recognition
- Question answering
- Automatic summarization
- Natural language generation
- Speech recognition
- Spell checking
- User study design and evaluation
NLP techniques
- The level of characters:
- Word segmentation
- Sentence breaking
- The level of words — morphology:
- Part of speech (POS) tagging
- Word sense disambiguation
- The level of sentences — syntax:
- Parsing
- The level of senses — semantics:
- Coreference resolution
- Discourse analysis
- Semantic role labeling
- Synonymy detection