Introduction to Linguistics
Instructor: Prof. Dr. Siusana Kweldju
Computational Linguistics
Computational linguistics is an interdisciplinary scientific and engineering field
concerned with the computational modelling of natural language, as well
as the study of appropriate computational approaches to linguistic questions. It is
concerned with understanding written and spoken language from a computational
perspective. Since language is our most natural and most versatile means of
communication, linguistically competent computers would greatly facilitate our
interaction with machines and software of all sorts, and put at our fingertips, in
ways that truly meet our needs, the vast textual and other resources of the internet.
In short, computational linguistics seeks to develop the computational machinery
needed for an agent to exhibit various forms of linguistic behavior. Such agents
may be human beings or artificial agents such as computer programs, and the
machinery comprises computer programs as well as the linguistic knowledge they
contain.
In general, computational linguistics draws upon linguistics, computer
science, artificial intelligence, mathematics, logic, philosophy, cognitive
science, cognitive psychology, psycholinguistics, anthropology and neuroscience,
among others.
The theoretical goals of computational linguistics include:
(a) formulating grammatical and semantic frameworks that enable computationally
tractable implementations of syntactic and semantic analysis (see the parsing
sketch after this list)
(b) discovering processing techniques and learning principles that exploit both the
structural and the statistical (distributional) properties of language
(c) developing cognitively and neuroscientifically plausible computational models of
how language processing and learning might occur in the brain.
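As an illustration of goal (a), the toy example below parses a sentence with a
small context-free grammar. This is a minimal sketch assuming the open-source
NLTK library is installed; the grammar and sentence are invented for illustration
and stand in for the much richer frameworks the field actually develops.

    import nltk

    # A toy context-free grammar; real grammatical frameworks are far larger.
    grammar = nltk.CFG.fromstring("""
    S -> NP VP
    NP -> Det N
    VP -> V NP
    Det -> 'the' | 'a'
    N -> 'linguist' | 'sentence'
    V -> 'parses'
    """)

    parser = nltk.ChartParser(grammar)

    # Enumerate every parse tree the grammar licenses for the sentence.
    for tree in parser.parse("the linguist parses a sentence".split()):
        print(tree)
    # (S (NP (Det the) (N linguist)) (VP (V parses) (NP (Det a) (N sentence))))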
The practical goals of the field are broad and varied, for example:
(a) efficient text retrieval on some desired topic, as in search engines (see the
retrieval sketch after this list)
(b) effective machine translation (MT)
(c) question answering (QA), ranging from simple factual questions to ones requiring
inference and descriptive or discursive answers (perhaps with justifications)
(d) text summarization
(e) analysis of texts or spoken language for topic, sentiment, or other psychological
attributes
(f) dialogue agents for accomplishing particular tasks (purchases, technical
troubleshooting, trip planning, schedule maintenance, medical advising, etc.)
(g) creation of computational systems with human-like competency in dialogue, in
acquiring language, and in gaining knowledge from text
(h) speech recognition systems
(i) text-to-speech synthesizers
(j) text editors
(k) language instruction materials
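To make practical goal (a) concrete, the sketch below ranks a handful of documents
against a query using TF-IDF vectors and cosine similarity. It is a minimal
illustration assuming the scikit-learn library; the documents and query are
invented, and real search engines add indexing, link analysis and many other
ranking signals.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    # A tiny invented corpus standing in for a document collection.
    docs = [
        "the moon rocks were returned by the Apollo missions",
        "machine translation converts text between human languages",
        "statistical language models assign probabilities to word sequences",
    ]
    query = ["probabilities of word sequences"]

    vectorizer = TfidfVectorizer()
    doc_vectors = vectorizer.fit_transform(docs)   # learn vocabulary, vectorize docs
    query_vector = vectorizer.transform(query)     # map the query into the same space

    # Rank documents by cosine similarity to the query, best match first.
    scores = cosine_similarity(query_vector, doc_vectors)[0]
    for score, doc in sorted(zip(scores, docs), reverse=True):
        print(f"{score:.3f}  {doc}")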
Computational linguistics and knowledge representation are subdisciplines of
artificial intelligence (AI). AI also encompasses machine learning, which today is
often equated with statistical pattern recognition but historically referred to a
variety of learning and inference algorithms, many of them non-statistical, that
were inspired as much by research in psychology and cognitive science as by
probability and information theory. This was because the goals of early AI
research centred on the development of thinking machines: the computer was
expected to perform in a manner that corresponded to human cognition, although
there was a great deal of debate as to how close and at what level that
correspondence had to be. As a result, a computational linguist today is typically
expected to be conversant with machine learning, deep learning, AI, cognitive
computing and neuroscience.
Today a computational linguist typically needs a master's or doctoral degree in a
computer science-related field, or a bachelor's degree combined with work
experience developing natural language software. Software companies such as
Microsoft typically hire computational linguists to work on natural language
processing (NLP), helping programmers create voice user interfaces that let
humans communicate with computing devices as if they were talking to another
person. Indeed, the terms computational linguistics and NLP are often used
interchangeably.
Applications of CL typically include the following:
Machine translation. This is the process of using AI to translate text from one
human language to another.
Document clustering. This is the process of automatically grouping similar
documents or passages of text together.
Sentiment analysis. This approach to NLP identifies the emotional tone behind a
body of text (see the sketch after this list).
Chatbots. These are software programs that simulate human conversation or
chatter through text or voice interactions.
Knowledge extraction. This is the creation of knowledge from structured and
unstructured text.
Natural language interfaces. These are computer-human interfaces where
words, phrases or clauses act as user interface controls.
Content filtering. This process blocks various language-based web content from
reaching end users.
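The sketch below shows the simplest possible form of sentiment analysis: counting
words from small hand-made positive and negative lexicons. It uses only the
Python standard library; the word lists and example sentences are invented, and
real systems rely on large lexicons or trained classifiers.

    # Tiny invented sentiment lexicons; real ones contain thousands of entries.
    POSITIVE = {"good", "great", "excellent", "happy", "love"}
    NEGATIVE = {"bad", "terrible", "awful", "sad", "hate"}

    def sentiment(text: str) -> str:
        """Classify text by counting positive versus negative words."""
        words = text.lower().split()
        score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
        if score > 0:
            return "positive"
        if score < 0:
            return "negative"
        return "neutral"

    print(sentiment("The support team was great and I love the product"))  # positive
    print(sentiment("Terrible experience and the device is awful"))        # negative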
History of computational linguistics
Although the concept of computational linguistics is often associated with AI, CL
predates AI's development, according to the Association for Computational
Linguistics. One of the first instances of CL was a 1950s attempt to translate
text from Russian into English. The thought was that computers could make
systematic
calculations faster and more accurately than a person, so it would not take long to
process a language. However, the complexities found in languages were
underestimated, taking much more time and effort to develop a working program.
Two programs developed in the early 1970s had more sophisticated syntactic and
semantic mapping rules. SHRDLU, an early natural language understanding program,
was developed in 1971 by computer scientist Terry Winograd at MIT. It combined
human linguistic models with reasoning methods, which was a major accomplishment
for natural language processing research.
Also in 1971, the Lunar system, developed by William Woods at BBN for NASA, was
demonstrated at a space convention. The Lunar system answered convention
attendees' questions about the composition of the rocks returned from the Apollo
moon missions.
Before these systems, translating languages was a difficult task, as a program
had to understand both the grammar and the syntax in which words were used.
Since then, strategies for implementing CL have moved away from procedural
approaches toward ones that are more linguistic, understandable and modular. In
the late 1980s, computing power increased substantially, which led to a shift
toward statistical methods in CL; corpus-based statistical approaches were also
developed around this time.
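The core idea of a corpus-based statistical approach fits in a few lines: the
sketch below estimates bigram probabilities by counting adjacent word pairs in a
tiny invented corpus, the mechanism underlying early statistical language models.

    from collections import Counter

    # A tiny invented corpus; real models are trained on millions of sentences.
    corpus = [
        "the cat sat on the mat",
        "the dog sat on the rug",
        "the cat chased the dog",
    ]

    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        words = sentence.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))

    def bigram_prob(w1: str, w2: str) -> float:
        """Maximum-likelihood estimate of P(w2 | w1) = count(w1 w2) / count(w1)."""
        return bigrams[(w1, w2)] / unigrams[w1]

    print(bigram_prob("the", "cat"))  # 0.333...: 2 of 6 "the" tokens precede "cat"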
Modern CL relies on many of the same tools and processes as NLP. These systems
may use a variety of tools, including AI, ML, deep learning and cognitive
computing. As an example, GPT-3, or the third-generation Generative Pre-trained
Transformer, is a neural network machine learning model that produces text based
on user input. It was released by OpenAI in 2020 and was trained using internet
data to generate any type of text. The program requires only a small amount of
input text to generate large volumes of relevant text. GPT-3 has over 175
billion machine learning parameters; by comparison, the largest trained language
model before it, Microsoft's Turing-NLG, had only 17 billion parameters.
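GPT-3 itself is available only through OpenAI's hosted API, but the same
prompt-in, text-out behaviour can be tried locally with its freely available
predecessor, GPT-2. The sketch below assumes the Hugging Face transformers
library; the prompt is invented, and the generation settings should be checked
against the library's current documentation.

    from transformers import pipeline

    # Load a small, freely available generative model (GPT-2, GPT-3's predecessor).
    generator = pipeline("text-generation", model="gpt2")

    # The model continues a short prompt, as GPT-3 does at far larger scale.
    prompt = "Computational linguistics is the study of"
    results = generator(prompt, max_new_tokens=40, num_return_sequences=1)

    print(results[0]["generated_text"])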