
Datamining 1

Uploaded by

Meenakshi Sharma
Copyright
© All Rights Reserved

Text Mining

PRESENTED BY
•Apu Baidya (11500222121)
•Meenakshi Sharma (11500321028)
•Srinjoy Debnath (11500321091)
•Disha Das (11500222120)
What is Text Mining?
• It is a component of data mining that deals specifically with unstructured text data.

• It combines language science and computer science with the help of statistical and ML techniques.

• It involves the use of NLP techniques.

• It can serve as a pre-processing step and also as a standalone process.

Sources of Text Data

Websites, books, emails, reviews, articles, logs, and many more.
Why is Text Mining Important?
• Most of the human-recorded information in the world is in the form of written text.

• No single person can read and interpret this tsunami of written material alone.

• Once again, we need to turn to computers to do the job for us.

• Sadly, however, natural language doesn't come as "natural" to computers as it does to humans.

Deriving meaning and filtering out the unimportant from the important is still something a human is better at than any machine.
Text Mining Process
• Unstructured information is gathered from various sources.

• Pre-processing and data cleansing tasks are performed on the unstructured information.

• Processing and controlling tasks are applied to review and further clean the dataset.

• Pattern analysis is carried out in a Management Information System.
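
The steps above can be sketched in a few lines of Python. This is only a minimal illustration, not a prescribed implementation: the sample documents, the stop-word list, and the function names (`clean`, `tokenize`, `term_counts`) are all assumptions made for the example, and "pattern analysis" is reduced here to simple term counting.

```python
import re
from collections import Counter

# Hypothetical stand-in for text gathered from various sources.
RAW_DOCUMENTS = [
    "<p>Great phone, great battery!</p>",
    "Battery died after 2 days...   ",
    "great camera; battery is OK",
]

# Tiny illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"is", "after", "the", "a"}


def clean(text):
    """Pre-processing: strip HTML tags, lowercase, drop non-letters."""
    text = re.sub(r"<[^>]+>", " ", text)
    text = text.lower()
    text = re.sub(r"[^a-z\s]", " ", text)
    return " ".join(text.split())  # collapse repeated whitespace


def tokenize(text):
    """Further cleaning: split into words and drop stop words."""
    return [word for word in text.split() if word not in STOP_WORDS]


def term_counts(documents):
    """Pattern analysis in its simplest form: count terms across documents."""
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(clean(doc)))
    return counts


counts = term_counts(RAW_DOCUMENTS)
print(counts.most_common(3))
```

In a real project each stage would likely be far richer (language detection, spelling correction, a proper stop-word list), but the order of the stages stays the same.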
Common Methods for Analyzing Text

• Text Summarization

  • To extract part of a text's content as a summary

• Text Categorization

  • To assign a category to the text among categories predefined by users

• Text Clustering

  • To segment texts into several clusters
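
Two of the methods above, summarization and categorization, can be sketched with plain Python. This is a rough illustration under stated assumptions: the summarizer is a simple frequency-based extractive one, the categorizer just counts keyword overlap, and the `CATEGORIES` keyword sets and all function names are hypothetical. Clustering is left out because it normally needs a numeric text representation and a dedicated algorithm.

```python
import re
from collections import Counter


def words(text):
    """Lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def split_sentences(text):
    """Naive sentence splitter on '.', '!' and '?' (a simplification)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def summarize(text, max_sentences=1):
    """Extractive summary: keep the sentences whose words are most frequent."""
    sentences = split_sentences(text)
    freq = Counter(words(text))

    def score(sentence):
        tokens = words(sentence)
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original sentence order
    return " ".join(sentences[i] for i in keep)


# Hypothetical user-defined categories with illustrative keywords.
CATEGORIES = {
    "sports": {"match", "team", "goal"},
    "finance": {"bank", "stock", "loan"},
}


def categorize(text, categories):
    """Assign the predefined category sharing the most keywords, if any."""
    tokens = set(words(text))
    overlaps = {name: len(tokens & keywords)
                for name, keywords in categories.items()}
    best = max(overlaps, key=overlaps.get)
    return best if overlaps[best] > 0 else None


TEXT = "Text mining finds patterns in text. It uses NLP. Text mining helps decisions."
print(summarize(TEXT, max_sentences=1))
print(categorize("The team scored a late goal", CATEGORIES))
```

Practical systems usually replace both heuristics with trained models, but the inputs and outputs have the same shape: text in, a shorter text or a label out.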
Text Mining Techniques

• Information Retrieval

• Pattern recognition

• Analytical processes

Processes like tokenization of the document or the stemming process

• Information Extraction

• Feature Extraction: we try to develop some new features.

• Feature Selection: we try to reduce the dimensionality of the dataset.
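
Tokenization, stemming, feature extraction, and feature selection from the list above can be illustrated together. The sketch below is an assumption-laden toy: the suffix-stripping `stem` function is deliberately naive (established stemmers such as Porter's use many more rules), the features are plain bag-of-words counts, and selection simply keeps terms that appear in at least `min_docs` documents. The sample `DOCS` and all names are hypothetical.

```python
import re
from collections import Counter

# Hypothetical sample documents.
DOCS = [
    "Mining text reveals patterns",
    "Text miners mined the reviews",
    "Patterns in reviews",
]


def tokenize(text):
    """Tokenization: split a document into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def stem(word):
    """Very naive stemming: strip one common suffix, keep at least 3 letters."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def extract_features(docs):
    """Feature extraction: one bag-of-words Counter of stems per document."""
    return [Counter(stem(t) for t in tokenize(d)) for d in docs]


def select_features(bags, min_docs=2):
    """Feature selection: keep stems found in at least min_docs documents."""
    doc_freq = Counter()
    for bag in bags:
        doc_freq.update(bag.keys())
    return sorted(term for term, n in doc_freq.items() if n >= min_docs)


def vectorize(bags, vocab):
    """Turn each bag into a fixed-length count vector over the vocabulary."""
    return [[bag[term] for term in vocab] for bag in bags]


bags = extract_features(DOCS)
vocab = select_features(bags, min_docs=2)
vectors = vectorize(bags, vocab)
print(vocab)
print(vectors)
```

Dropping terms that occur in only one document shrinks the vocabulary from eight stems to four here, which is the dimensionality reduction that feature selection aims for.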
Text Mining Applications

• Digital Library: Text mining processes perform different activities such as document collection, determination, enhancement, removing data, handling substances, and producing summarization.

• Academic and Research Field: Text mining in the research field helps to discover and arrange research papers and relevant material from various fields on one platform.

• Life Science: Used in the life science and healthcare industries.

• Social Media: Used for dissecting and analyzing web-based media.

• Business Intelligence: Helps different organizations analyze their customers and competitors to make better decisions.
Advantages of Text Mining
• Large Amounts of Data: It can extract insights from large amounts of unstructured text data.

• Variety of Applications: Text mining has a wide range of applications.

• Improved Decision Making

• Cost-effective: It reduces the need for manual data entry.


Disadvantages of Text Mining

• Complexity
• Quality of Data
• High Computational Cost
• Limited to Text Data
• Noise in text mining results
• Lack of transparency
Conclusion & References

Text mining extracts valuable insights from unstructured text, aiding decision-making across diverse fields. Despite challenges, its applications in academia, healthcare, business, and more demonstrate its significance in converting textual data into actionable knowledge.

• Content:
• https://www.geeksforgeeks.org/text-mining-in-data-mining/
• https://youtu.be/99CSwf8xwaU?si=W0N7QBjy8ROC1gR3

• Images:
• www.freepik.com
