AI_UNIT-5

The document discusses machine learning approaches, specifically passive and active learning, highlighting their differences in data acquisition methods. It also covers Natural Language Processing (NLP), detailing its components, challenges, and steps involved in processing natural language. Additionally, the document explains speech recognition, focusing on the roles of acoustic and language models in converting spoken language into text.

Machine learning is a subfield of artificial intelligence that deals with the creation of algorithms that can learn and improve themselves without explicit programming. One of the most critical factors contributing to the success of a machine learning model is the quality and quantity of the data used to train it. Passive learning and active learning are two approaches used in machine learning to acquire data.
Passive Learning:
Passive learning, also known as batch learning, is a method of acquiring data
by processing a large set of pre-labeled data. In passive learning, the
algorithm uses all the available data to learn and improve its performance.
The algorithm does not interact with the user or request additional data to
improve its accuracy.
Example: An example of passive learning is training a machine learning model to classify emails as spam or not spam. The algorithm is fed a large dataset of labeled emails and uses it to learn how to identify spam emails. Once the training is complete, the algorithm can accurately classify new emails without any further input from the user.
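As a sketch of this batch setting, the toy Naive Bayes spam classifier below is fit once on the full pre-labeled dataset and then used without any further user input. The dataset, the tokenizer, and the add-one smoothing are illustrative assumptions, not part of the text above:

```python
from collections import Counter
import math

def tokenize(text):
    return text.lower().split()

def train_batch(dataset):
    """Passive learning: consume the whole labeled dataset at once."""
    counts = {"spam": Counter(), "ham": Counter()}
    priors = Counter()
    for text, label in dataset:
        priors[label] += 1
        counts[label].update(tokenize(text))
    return counts, priors

def classify(text, counts, priors):
    """Score each class with add-one smoothing; return the best one."""
    vocab = set(counts["spam"]) | set(counts["ham"])
    best, best_score = None, float("-inf")
    for label in ("spam", "ham"):
        total = sum(counts[label].values())
        score = math.log(priors[label] / sum(priors.values()))
        for w in tokenize(text):
            score += math.log((counts[label][w] + 1) / (total + len(vocab)))
        if score > best_score:
            best, best_score = label, score
    return best

dataset = [
    ("win a free prize now", "spam"),
    ("claim your free money", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch with the team", "ham"),
]
counts, priors = train_batch(dataset)
print(classify("free prize money", counts, priors))    # → spam
print(classify("monday team meeting", counts, priors)) # → ham
```

Note that after `train_batch` returns, no further labels are ever requested; this is the defining property of the passive setting.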
Active Learning:
Active learning is a method of acquiring data where the algorithm interacts
with the user to acquire additional data to improve its accuracy. In active
learning, the algorithm starts with a small set of labeled data and requests
the user to label additional data. The algorithm uses the newly labeled data to
improve its performance and may continue to request additional data until a
satisfactory level of accuracy is achieved.
Example: An example of active learning is training a machine learning model to recognize handwritten digits. The algorithm may start with a small set of labeled data and ask the user to label additional examples that the algorithm is uncertain about. The algorithm uses the newly labeled data to improve its accuracy, and the process repeats until the algorithm can accurately recognize most handwritten digits.
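The query loop described above can be sketched as a tiny pool-based active learner using uncertainty sampling. The 1-D feature, the threshold model, and the simulated oracle are illustrative assumptions standing in for a real annotator and classifier:

```python
def fit_threshold(labeled):
    """Tiny model: decision threshold halfway between the class means."""
    zeros = [x for x, y in labeled if y == 0]
    ones = [x for x, y in labeled if y == 1]
    return (sum(zeros) / len(zeros) + sum(ones) / len(ones)) / 2

def most_uncertain(pool, threshold):
    """Uncertainty sampling: the unlabeled point nearest the boundary."""
    return min(pool, key=lambda x: abs(x - threshold))

oracle = lambda x: 1 if x >= 0.5 else 0   # the human annotator (simulated)

labeled = [(0.1, 0), (0.9, 1)]            # small initial labeled set
pool = [0.2, 0.45, 0.55, 0.8]             # unlabeled pool

for _ in range(3):                        # query loop
    t = fit_threshold(labeled)
    x = most_uncertain(pool, t)
    pool.remove(x)
    labeled.append((x, oracle(x)))        # ask the "user" for a label

print(sorted(x for x, _ in labeled))
```

Each iteration spends one label on the example the current model is least sure about, which is why active learning can reach a given accuracy with fewer labels than passive learning.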
Difference Between Passive Learning and Active Learning:
The following table summarizes the differences between passive learning and
active learning:

| Passive Learning | Active Learning |
| --- | --- |
| Uses a large set of pre-labeled data to train the algorithm | Starts with a small set of labeled data and requests additional data from the user |
| The algorithm does not interact with the user | The algorithm interacts with the user to acquire additional data |
| Does not require user input after training is complete | May continue to request additional data until a satisfactory level of accuracy is achieved |
| Suitable for applications where a large dataset is available | Suitable for applications where labeled data is scarce or expensive to acquire |
Conclusion:
In conclusion, passive learning and active learning are two approaches used
in machine learning to acquire data. Passive learning uses a large set of pre-
labeled data to train the algorithm, while active learning starts with a small
set of labeled data and requests additional data from the user to improve
accuracy. The choice between passive learning and active learning depends on
the availability of labeled data and the application’s requirements.

AI - NATURAL LANGUAGE PROCESSING

Natural Language Processing (NLP) refers to the AI method of communicating with an intelligent system using a natural language such as English.

Processing of natural language is required when you want an intelligent system, such as a robot, to perform as per your instructions, or when you want to hear a decision from a dialogue-based clinical expert system, etc.

The field of NLP involves making computers perform useful tasks with the natural languages humans use. The input and output of an NLP system can be

• Speech
• Written Text
Components of NLP

There are two components of NLP as given −

Natural Language Understanding (NLU)

Understanding involves the following tasks −

• Mapping the given input in natural language into useful representations.
• Analyzing different aspects of the language.

Natural Language Generation (NLG)

It is the process of producing meaningful phrases and sentences in the form of natural language from some internal representation.

It involves −

• Text planning − It includes retrieving the relevant content from the knowledge base.
• Sentence planning − It includes choosing the required words, forming meaningful phrases, and setting the tone of the sentence.
• Text Realization − It is mapping the sentence plan into sentence structure.

NLU is harder than NLG.

Difficulties in NLU

NL has an extremely rich form and structure.

It is very ambiguous. There can be different levels of ambiguity −

• Lexical ambiguity − It is at a very primitive level, such as the word level. For example, should the word “board” be treated as a noun or a verb?
• Syntax-level ambiguity − A sentence can be parsed in different ways. For example, “He lifted the beetle with red cap.” − Did he use the cap to lift the beetle, or did he lift a beetle that had a red cap?
• Referential ambiguity − Referring to something using pronouns. For example, Rima went to Gauri. She said, “I am tired.” − Exactly who is tired?
• One input can have different meanings.
• Many inputs can mean the same thing.
NLP Terminology
• Phonology − It is the study of organizing sounds systematically.
• Morphology − It is the study of the construction of words from primitive meaningful units.
• Morpheme − It is the primitive unit of meaning in a language.
• Syntax − It refers to arranging words to make a sentence. It also involves determining the structural role of words in the sentence and in phrases.
• Semantics − It is concerned with the meaning of words and how to combine words into meaningful phrases and sentences.
• Pragmatics − It deals with using and understanding sentences in different situations and how the interpretation of the sentence is affected.
• Discourse − It deals with how the immediately preceding sentence can affect the interpretation of the next sentence.
• World Knowledge − It includes general knowledge about the world.
Steps in NLP

There are five general steps −

• Lexical Analysis − It involves identifying and analyzing the structure of words. The lexicon of a language means the collection of words and phrases in that language. Lexical analysis divides the whole chunk of text into paragraphs, sentences, and words.
• Syntactic Analysis (Parsing) − It involves analysis of the words in the sentence for grammar, and arranging the words in a manner that shows the relationships among them. A sentence such as “The school goes to boy” is rejected by an English syntactic analyzer.

• Semantic Analysis − It draws the exact meaning or the dictionary meaning from the text. The text is checked for meaningfulness. This is done by mapping syntactic structures to objects in the task domain. The semantic analyzer disregards sentences such as “hot ice-cream”.
• Discourse Integration − The meaning of any sentence depends upon the meaning of the sentence just before it. In addition, it also brings about the meaning of the immediately succeeding sentence.
• Pragmatic Analysis − During this step, what was said is re-interpreted as what it actually meant. It involves deriving those aspects of language which require real-world knowledge.
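As a toy illustration of the lexical-analysis step above, the following sketch splits raw text into sentences and then into words. The regular expressions are deliberately simplistic assumptions, not a full tokenizer:

```python
import re

def lexical_analysis(text):
    """Split a chunk of text into sentences, then each sentence into words."""
    sentences = [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]
    return [re.findall(r"[A-Za-z]+", s) for s in sentences]

text = "The school bell rang. The boy goes to school!"
print(lexical_analysis(text))
# → [['The', 'school', 'bell', 'rang'], ['The', 'boy', 'goes', 'to', 'school']]
```

Real NLP toolkits handle abbreviations, punctuation, and non-Latin scripts far more carefully; this only shows where lexical analysis sits in the pipeline.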

Describe Speech Recognition in terms of Language
Model and Acoustic Model.

What is speech recognition?


Speech recognition, or speech-to-text, is the ability of a machine or program to
identify words spoken aloud and convert them into readable text. Rudimentary
speech recognition software has a limited vocabulary and may only identify words
and phrases when spoken clearly. More sophisticated software can handle natural
speech, different accents and various languages.

Speech recognition uses a broad array of research in computer science, linguistics and computer engineering. Many modern devices and text-focused programs have speech recognition functions in them to allow for easier or hands-free use of a device.

Speech recognition and voice recognition are two different technologies and should not be confused:

• Speech recognition is used to identify words in spoken language.
• Voice recognition is a biometric technology for identifying an individual's voice.
How does speech recognition work?
Speech recognition systems use computer algorithms to process and interpret
spoken words and convert them into text. A software program turns the sound a
microphone records into written language that computers and humans can
understand, following these four steps:

1. analyze the audio;

2. break it into parts;

3. digitize it into a computer-readable format; and

4. use an algorithm to match it to the most suitable text representation.


Speech recognition software must adapt to the highly variable and context-specific
nature of human speech. The software algorithms that process and organize audio
into text are trained on different speech patterns, speaking styles, languages,
dialects, accents and phrasings. The software also separates spoken audio from
background noise that often accompanies the signal.

To meet these requirements, speech recognition systems use two types of models:

• Acoustic models. These represent the relationship between linguistic units of speech and audio signals.
• Language models. Here, sounds are matched with word sequences to distinguish between words that sound similar.

Acoustic Model

The acoustic model is responsible for translating audio signals into phonetic units or
phonemes (the basic sounds of a language).

1. Function: It maps the audio signal, which consists of waveforms or spectral features,
to probabilities of phonetic units.
2. Training: It is trained using a large dataset of audio recordings and their
corresponding transcriptions. Techniques like Hidden Markov Models (HMMs) or
Deep Neural Networks (DNNs) are commonly used.
3. Output: The output of the acoustic model is a sequence of phonemes or probability
distributions over phonemes.
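A toy stand-in for this mapping can be sketched as a single linear layer plus softmax over phonemes for one frame of audio features. The feature vector, the weights, and the phoneme set are invented for illustration; real acoustic models (HMMs, DNNs) are far larger:

```python
import math

PHONEMES = ["/b/", "/ʊ/", "/k/"]
WEIGHTS = [[0.2, 1.5], [1.2, 0.1], [0.4, 0.3]]  # one row per phoneme

def softmax(scores):
    """Turn raw scores into a probability distribution."""
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def acoustic_model(frame):
    """Map one frame's feature vector to phoneme probabilities."""
    scores = [sum(w * x for w, x in zip(row, frame)) for row in WEIGHTS]
    return dict(zip(PHONEMES, softmax(scores)))

probs = acoustic_model([0.9, 0.1])  # one frame of (made-up) features
print(max(probs, key=probs.get))    # the most probable phoneme
```

The output is exactly the "probability distribution over phonemes" described in point 3, computed frame by frame across the utterance.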

Language Model

The language model is responsible for using linguistic knowledge to construct probable word
sequences from the recognized phonetic units.

1. Function: It predicts the likelihood of a sequence of words. This helps in determining the most probable word sequences given the phonetic inputs from the acoustic model.
2. Training: It is trained on a large corpus of text data to learn the statistical properties
of the language, including word frequencies and contextual word patterns.
3. Types: Common types of language models include N-gram models, Recurrent Neural
Networks (RNNs), and Transformers.
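As one concrete instance of the N-gram models mentioned above, here is a minimal bigram language model. The toy corpus and the absence of smoothing are illustrative assumptions:

```python
from collections import Counter, defaultdict

corpus = ["i need a book", "i need a break", "i read a book"]

# Count bigram occurrences, with <s> marking the start of a sentence.
bigrams = defaultdict(Counter)
for sent in corpus:
    words = ["<s>"] + sent.split()
    for prev, cur in zip(words, words[1:]):
        bigrams[prev][cur] += 1

def sequence_prob(sentence):
    """P(sentence) as a product of bigram probabilities (no smoothing)."""
    words = ["<s>"] + sentence.split()
    p = 1.0
    for prev, cur in zip(words, words[1:]):
        total = sum(bigrams[prev].values())
        p *= bigrams[prev][cur] / total if total else 0.0
    return p

print(sequence_prob("i need a book"))    # plausible sequence, high probability
print(sequence_prob("i knee dab hook"))  # implausible sequence, probability 0
```

This is exactly the statistical knowledge the language model contributes: it prefers word sequences that actually occur in the training corpus over acoustically similar nonsense.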

How They Work Together

1. Feature Extraction: The audio signal is first converted into a set of features that
capture the important characteristics of the speech signal.
2. Phoneme Prediction: The acoustic model processes these features to produce a
sequence of phoneme probabilities.
3. Word Hypothesis: The language model then takes these phoneme sequences and
predicts the most likely sequence of words by evaluating different possible word
sequences and selecting the one with the highest probability.
4. Decoding: A decoder integrates the acoustic and language models to produce the final
transcription. It combines the phoneme probabilities from the acoustic model and the
word probabilities from the language model to find the most likely transcription of the
spoken input.
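The decoding step above can be sketched with invented log scores for two competing hypotheses; the numbers and the `lm_weight` knob are illustrative assumptions, not real model outputs:

```python
acoustic_score = {            # log P(audio | words), from the acoustic model
    "I need a book": -2.0,
    "I knee dab hook": -1.8,  # slightly better acoustic fit, by assumption
}
language_score = {            # log P(words), from the language model
    "I need a book": -1.0,
    "I knee dab hook": -9.0,  # near-nonsense word sequence
}

def decode(candidates, lm_weight=1.0):
    """Pick the hypothesis with the best combined log score."""
    return max(candidates,
               key=lambda w: acoustic_score[w] + lm_weight * language_score[w])

print(decode(list(acoustic_score)))  # → I need a book
```

With `lm_weight=0.0` the decoder would pick "I knee dab hook" on acoustics alone, which is precisely why the language model is needed in the combination.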

Example

Suppose someone says, "I need a book."

1. Acoustic Model: Converts the audio features into phoneme probabilities like /ai/ /n/
/iː/ /d/ /ə/ /b/ /ʊk/.
2. Language Model: Evaluates sequences of words that could correspond to these
phonemes, determining that "I need a book" is a more likely sequence than
alternatives like "I knee dab hook."
3. Decoder: Integrates outputs from both models to produce the final recognized text: "I
need a book."
