Chapter 6-NLP Basics
Chapter 6-NLP Basics
By Misganu T.
What is natural language processing?
Natural language processing (NLP) refers to the
branch of computer science and more specifically, the
branch of artificial intelligence concerned with giving
computers the ability to understand text and spoken
words in much the same way human beings can.
Natural language processing strives to build machines
that understand and respond to text or voice data and
respond with text or speech of their own.
What is natural language processing? Cont..
When the text has been provided, the computer will utilize
algorithms to extract meaning associated with every sentence and
collect the essential data from them.
For example, word sense disambiguation helps distinguish the meaning of the
verb 'make' in ‘make the grade’ (achieve) vs. ‘make a bet’ (place).
voice data into text data. Speech recognition is required for any application that
follows voice commands or answers spoken questions.
What makes speech recognition especially challenging is the way people talk quickly,
slurring words together, with varying emphasis and intonation, in different accents,
The study of how the context of text affects the meaning of an expression,
and what information is necessary to infer a hidden or presupposed
meaning.
Discourse analysis
The study of how narratives and other textual styles are constructed to
make larger textual compositions.
Ambiguity
• We say some input is ambiguous if there are multiple
alternative linguistic structures that can be built for it.
Example:
– I made her duck.
• Possible interpretations:
1. I cooked waterfowl for her
2. I cooked waterfowl belonging to her
3. I created (plaster?) duck she owns.
4. I caused her to quickly lower her head or body.
5. I waived my magic want and turned her into undifferentiated waterfowl.
Ambiguity Cont.
• These different meanings are caused by a
number of ambiguities.
1. First, the words duck and her are morphologically
or syntactically ambiguous in their part-of-speech.
• Duck can be a verb or a noun, while
• her can be a dative pronoun or a possessive
pronoun.
Duck (webster.com)
1duck noun, often attributive \ˈdək\, plural ducks
1 or plural duck
a : any of various swimming birds (family Anatidae, the duck family) in which the neck and legs
are short, the feet typically webbed, the bill often broad and flat, and the sexes usually
different from each other in plumage
b : the flesh of any of these birds used as food
2 : a female duck — compare DRAKE
3 chiefly British : DARLING —often used in plural but singular in construction
4 : PERSON, CREATURE
possessive pronoun
I gave her book back to her.
Ambiguity Cont.
2. Second, the word make is semantically
ambiguous; it can mean create or cook.
3. Finally, the verb make is syntactically ambiguous
in a different way.
Make can be transitive, that is, taking a single direct
object (2), or
it can be intransitive, that is, taking two objects (5),
meaning that the first object (her) got made into the
second object (duck).
– Finally, make can take a direct object and a verb,
meaning that the object (her) got caused to
perform the verbal action (duck).
Approaches for Disambiguation
• The models and algorithms as ways to
resolve or disambiguate these ambiguities.
• For example:
– deciding whether duck is a verb or a noun can be
solved by part-of-speech tagging.
– deciding whether make means “create” or “cook”
can be solved by word sense disambiguation.
• Resolution of part-of-speech and word sense
ambiguities are two important kinds of lexical
disambiguation.
– A wide variety of tasks can be framed as lexical
disambiguation problems. For example,
• A text-to-speech synthesis system reading the word lead
needs to decide whether it should be pronounced as in
lead pipe or as in lead me on.
Some NLP applications
• Spelling and grammar checking
• Optical character recognition (OCR)
• Screen readers
• Machine aided translation
• Information retrieval
• Document classification
• Document clustering
• Information extraction
• Augmentative and alternative communication
Some NLP applications Cont.
• Question answering
• Summarization
• Text segmentation
• Exam marking
• Report generation
• Machine translation
• Email understanding
• Dialogue systems
Thank You
27