EXPLORING NLP AND
LLMS: A TECHNICAL
LEARNING JOURNEY
PRANAV R MALLIA
DEPARTMENT OF COMPUTER SCIENCE
UNDER THE GUIDANCE OF DR. JIMSON MATHEW
AGENDA
• Knowledge graphs
• MAMS for ABSA: TF-IDF vs BERT-CapsNet accuracy comparison
• Emotion classification using the SemEval dataset (multiple emotions)
• Twitter sentiment analysis (TweetEval dataset)
• Mini LLMs for binary sentiment classification (SST)
• Top-10 product recommendation using Amazon review sentiments
KNOWLEDGE GRAPHS
Learning from GitHub: rasbt/LLMs-from-scratch
Used to understand the inner mechanics of LLMs, from tokenization to output.
Studied:
•Token embedding lookup
•QKV attention → Multi-Head Self-Attention
•Positional encodings
•Feed-forward layers
•Training pipeline (loss, optimizer, batching)
•LLM architecture
•Fine-tuning
Treated the repo as a conceptual knowledge graph of LLM design.
Helped ground later projects. Created internal diagrams of token flow and attention behavior.
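The QKV attention bullet can be sketched as single-head scaled dot-product attention in NumPy. This is an illustrative sketch with random toy data, not the repo's actual code:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (seq, seq) similarity scores
    scores -= scores.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # each row sums to 1
    return weights @ V, weights

# Toy "token embeddings" projected to Q, K, V with random weight matrices
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))                  # 4 tokens, 8-dim embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out, attn = scaled_dot_product_attention(x @ Wq, x @ Wk, x @ Wv)
```

Multi-head attention repeats this with several independent Q/K/V projections and concatenates the per-head outputs.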
ABSA ON MAMS DATASET
(Multi-Aspect Sentiment Analysis on the MAMS Dataset)
Aspect-Based Sentiment Analysis & Model Comparison
•Multi-aspect product reviews labeled by aspect (e.g. price, quality, service)
•BERT encoder for contextualized token representations
•GloVe embeddings to inject static semantic priors
•Capsule Network to preserve aspect–sentiment spatial relationships
•TF-IDF + traditional classifier as a bag-of-words baseline
•BERT + CapsNet for fine-grained, context-sensitive inference
PROJECT SUMMARY

Model            Accuracy
TF-IDF           ~72%
BERT + CapsNet   ~90%
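The bag-of-words baseline side of the comparison can be sketched with scikit-learn. The texts and labels below are toy stand-ins, not the MAMS data, and the accuracies in the table come from the actual experiments, not this snippet:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy stand-in for MAMS reviews (the real data is labeled per aspect)
texts = ["the price was great but service slow",
         "excellent quality and friendly staff",
         "terrible service, overpriced food",
         "loved the ambience and the dessert"]
labels = [0, 1, 0, 1]  # 0 = negative, 1 = positive

# TF-IDF features + a traditional linear classifier
baseline = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
baseline.fit(texts, labels)
preds = baseline.predict(["friendly staff and great quality"])
```

A baseline like this ignores word order and context, which is exactly what the BERT + CapsNet model adds back.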
EMOTION CLASSIFICATION (SEMEVAL TASK 3)
• Task: classify utterances into 7 emotions (joy, anger, etc.)
• Data: SemEval 2024 Task 3 (utterance IDs, dialog context)
• Model: BERT fine-tuned with attention masks
• Performance: ~82% accuracy
• Lesson: CapsNet underperforms on fine-grained, sentence-level emotions
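The role of the attention masks mentioned above (telling BERT to ignore padding tokens) can be sketched with a masked softmax in NumPy. This is an illustrative sketch of the mechanism, not the SemEval training code:

```python
import numpy as np

def masked_softmax(scores, attention_mask):
    """Push scores at padded positions to -inf so pads receive ~zero attention
    weight, mirroring what Hugging Face's attention_mask does inside BERT."""
    scores = np.where(attention_mask[None, :] == 1, scores, -1e9)
    scores = scores - scores.max(axis=-1, keepdims=True)
    w = np.exp(scores)
    return w / w.sum(axis=-1, keepdims=True)

# 5 token positions; the last two are padding (mask = 0)
mask = np.array([1, 1, 1, 0, 0])
rng = np.random.default_rng(1)
weights = masked_softmax(rng.normal(size=(5, 5)), mask)
```

Without the mask, the model would attend to meaningless pad embeddings and dilute the real signal.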
TWITTER SENTIMENT ANALYSIS (TWEETEVAL DATASET)
•Used pretrained twitter-roberta-base-emotion for classifying tweet emotions.
•Preprocessed input tweets and ran inference using Hugging Face Transformers.
•Applied softmax to extract emotion probabilities for a single sentence.
•Used mapping.txt to convert label IDs to one of 7 emotions (e.g., surprise, anger) and displayed the top 4 predicted emotions with their confidence scores.
•Also explored masked word prediction (fill-mask) to see how context affects word choice.
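The softmax-plus-mapping step can be sketched as below. The logits and the id-to-label dictionary are illustrative stand-ins (the real mapping comes from the model's mapping.txt), not output from twitter-roberta-base-emotion:

```python
import numpy as np

# Hypothetical stand-in for mapping.txt: label ID -> emotion name
id2label = {0: "anger", 1: "joy", 2: "optimism", 3: "sadness",
            4: "surprise", 5: "fear", 6: "love"}

def top_k_emotions(logits, k=4):
    """Softmax over raw model logits, then return the top-k (label, prob) pairs."""
    z = logits - logits.max()                  # stabilize before exponentiating
    probs = np.exp(z) / np.exp(z).sum()
    order = np.argsort(probs)[::-1][:k]        # indices sorted by descending prob
    return [(id2label[i], float(probs[i])) for i in order]

# Example logits as a model might emit for one tweet (made-up values)
logits = np.array([0.2, 3.1, 1.4, -0.5, 2.0, -1.0, 0.3])
top4 = top_k_emotions(logits, k=4)
```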
MINI LLMS FOR BINARY SENTIMENT CLASSIFICATION (SST DATA)
•Dataset: Stanford Sentiment Treebank (SST), containing movie reviews labeled positive/negative in a tree-like structure.
•Used lightweight DistilBERT to classify movie-review sentiments on SST (positive vs. negative), achieving strong results with faster inference than full BERT.
•Outcome: competitive accuracy with drastically reduced model size and computation.
TOP-10 PRODUCT RECOMMENDATION USING AMAZON REVIEW SENTIMENT
PROJECT SUMMARY
Goal: surface the top 10 products by average positive sentiment
Pipeline:
1. Crawl Amazon reviews
2. Predict review sentiment with Microsoft's MiniLM
3. Aggregate scores per product
4. Rank and select the top 10
•Application: dynamic "Top-10" widget on e-commerce platforms
•Benefit: data-driven, customer-centric product discovery
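Steps 3 and 4 of the pipeline can be sketched in plain Python, assuming step 2 yields a positive-sentiment probability per review. The review data below is made up for illustration:

```python
from collections import defaultdict

def top_products(reviews, n=10):
    """Average per-product sentiment scores, then rank and keep the top n."""
    totals = defaultdict(lambda: [0.0, 0])    # product_id -> [score sum, count]
    for product_id, score in reviews:
        totals[product_id][0] += score
        totals[product_id][1] += 1
    avg = {p: s / c for p, (s, c) in totals.items()}
    return sorted(avg, key=avg.get, reverse=True)[:n]

# (product_id, positive-sentiment probability) pairs from the sentiment model
reviews = [("A", 0.9), ("A", 0.8), ("B", 0.4), ("B", 0.6), ("C", 0.95)]
ranking = top_products(reviews, n=2)
```

Averaging keeps a product with many reviews from dominating one with a few excellent reviews; a production ranker would also weight by review count.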
SUMMARY
• Explored knowledge graphs to ground LLMs with real-world facts beyond text.
• Built a multi-aspect sentiment analyzer (MAMS) using BERT + GloVe + CapsNet, outperforming TF-IDF baselines.
• Fine-tuned BERT for emotion detection on SemEval, achieving ~82% accuracy across 7 emotions.
• Applied RoBERTa on TweetEval, handling noisy Twitter data (hashtags, slang) effectively and surfacing the top 4 emotions per tweet.
• Used DistilBERT on SST for sentiment classification.
• Developed a top-10 product recommender from Amazon review sentiments using MiniLM.
THANK YOU