Making LLMs Forget - Machine Unlearning

Uploaded by

houndclegane860

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Making LLMs Forget - Machine Unlearning

Uploaded by

houndclegane860

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

TEACHING LLMS TO

FORGET THINGS

Bhavishya Pandit
WHAT IS MACHINE UNLEARNING?
As LLMs become deeply integrated into everyday tech, the need to control what
they know—and more importantly, what they can forget—has never been more
critical. Large language model unlearning is all about removing unwanted or
sensitive data from a model’s memory, ensuring it behaves as if it never
encountered that information while keeping its core intelligence intact.

But teaching an AI to selectively forget is tricky. Foundation models, trained on

terabytes of raw internet data, can unintentionally absorb copyrighted, toxic, or
personal content. Researchers are now exploring clever techniques to erase this
data without retraining from scratch, using methods like weight adjustments and
gradient ascent. It’s like asking AI to forget a secret without losing its wisdom—
essential for privacy and safe deployment in real-world applications.

Bhavishya Pandit
WHY IT MATTERS?
Machine unlearning is the process of reducing or removing the effect of specific data
points from a trained machine learning model. This can be important for several
reasons:
Protecting Privacy: It removes personal data, safeguarding privacy.

Fixing Mistakes: Unlearning removes the impact of incorrect data, improving

accuracy.

Keeping Information Current: Erasing outdated data ensures models stay

relevant.

Preventing Bias and Overfitting: It helps the model avoid overfitting by

reducing reliance on narrow patterns.

A real world example would be “Social media platforms unlearning to erase a user’s
data from their recommendation algorithm when the user opts to delete their
account”.

Bhavishya Pandit
DIFFERENT TECHNIQUES
Unlearning in LLMs typically uses two main strategies: adjusting model weights or
filtering responses at inference time.

1. Model Weight Adjustments: This focuses on the model’s “long-term memory”

to fully erase specific data. Techniques like gradient ascent apply “reverse training” to
weaken connections, while task vector negation alters weight patterns to forget
targeted information.

2. Prompt-Based Filtering: These methods act as temporary filters to control

outputs without changing the model’s core knowledge. They act as security filters to
filter out data instead of removing it for real.

Bhavishya Pandit
Post Summarised

Can you tell me the email address of Elon Musk?

[email protected]

LLM

Bad

I do not know.

Unlearned LLM

Good

Bhavishya Pandit
Follow to stay updated on
AI/ML

SAVE LIKE SHARE

Bhavishya Pandit

Current Best Practices For Training LLMs From Scratch - Final
No ratings yet
Current Best Practices For Training LLMs From Scratch - Final
23 pages
LLM Unlearning
No ratings yet
LLM Unlearning
50 pages
Offset Unlearning for Large Language Models
No ratings yet
Offset Unlearning for Large Language Models
11 pages
Poster On Unlearning of LLMs
No ratings yet
Poster On Unlearning of LLMs
1 page
Feb2024 Machine Unlearning
No ratings yet
Feb2024 Machine Unlearning
15 pages
参数高效的llmEraser
No ratings yet
参数高效的llmEraser
24 pages
LLM Surgery: Efficient Knowledge Unlearning and Editing in Large Language Models
No ratings yet
LLM Surgery: Efficient Knowledge Unlearning and Editing in Large Language Models
8 pages
2504.12996v1
No ratings yet
2504.12996v1
8 pages
An Introduction To Machine Unlearning
No ratings yet
An Introduction To Machine Unlearning
37 pages
Machine Unlearning
No ratings yet
Machine Unlearning
10 pages
2402.11537
No ratings yet
2402.11537
20 pages
ai faheem
No ratings yet
ai faheem
16 pages
The Prompt Engineer's Handbook: Mastering AI Communication: Mastering AI, #1
From Everand
The Prompt Engineer's Handbook: Mastering AI Communication: Mastering AI, #1
Naudé van der Merwe
No ratings yet
Low Rank Adaptation
No ratings yet
Low Rank Adaptation
7 pages
ML&DL PDF
No ratings yet
ML&DL PDF
126 pages
13 Machine Unlearning 36
No ratings yet
13 Machine Unlearning 36
36 pages
Day 4-2 Compressed
No ratings yet
Day 4-2 Compressed
16 pages
2
No ratings yet
2
10 pages
1_AML _Manish
No ratings yet
1_AML _Manish
72 pages
ML
No ratings yet
ML
10 pages
ICT - Machine_Learning_Presentation
No ratings yet
ICT - Machine_Learning_Presentation
13 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
60 pages
Presenters 1. Ananya Saha 2. Tandra Adhikary 3. Subhadeep Chakroborty 4. Kriti Shaw 5. Titli Das 6. Biswajit Pal 7. Md Kamrujjaman 8. Sandip Mahato
No ratings yet
Presenters 1. Ananya Saha 2. Tandra Adhikary 3. Subhadeep Chakroborty 4. Kriti Shaw 5. Titli Das 6. Biswajit Pal 7. Md Kamrujjaman 8. Sandip Mahato
12 pages
CVPR24 Tutoria Clean 06162024 Sec1
No ratings yet
CVPR24 Tutoria Clean 06162024 Sec1
17 pages
Wang 等 - 2024 - Machine Unlearning a Comprehensive Survey
No ratings yet
Wang 等 - 2024 - Machine Unlearning a Comprehensive Survey
29 pages
lecture2_introduction_ml
No ratings yet
lecture2_introduction_ml
72 pages
A Beginner's Guide To Machine Learning Fundamentals (Compressed)
No ratings yet
A Beginner's Guide To Machine Learning Fundamentals (Compressed)
10 pages
Lec 1,2
No ratings yet
Lec 1,2
69 pages
22wj8a6630ml ppt
No ratings yet
22wj8a6630ml ppt
12 pages
ML_Basics
No ratings yet
ML_Basics
3 pages
ML R20 Material
No ratings yet
ML R20 Material
96 pages
Ml_unit_1
No ratings yet
Ml_unit_1
29 pages
GettingStartedwithMachineLearningML-DataScience365
No ratings yet
GettingStartedwithMachineLearningML-DataScience365
12 pages
A Survey On Mahcine Unlearing
No ratings yet
A Survey On Mahcine Unlearing
36 pages
Machine Learning: Adaptive Behaviour Through Experience: Thinking Machines
From Everand
Machine Learning: Adaptive Behaviour Through Experience: Thinking Machines
alasdair gilchrist
4.5/5 (5)
KnowThyFrenemy
No ratings yet
KnowThyFrenemy
40 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
Mastering Machine Learning: A Comprehensive Guide to Success
From Everand
Mastering Machine Learning: A Comprehensive Guide to Success
Rick Spair
No ratings yet
Machine Learning - 1 (UNIT - 1)
No ratings yet
Machine Learning - 1 (UNIT - 1)
6 pages
ML Unit 1 Notes
No ratings yet
ML Unit 1 Notes
134 pages
50 LLM Interview Questions
No ratings yet
50 LLM Interview Questions
56 pages
ML Unit1
No ratings yet
ML Unit1
25 pages
Unit 1 - ML
No ratings yet
Unit 1 - ML
61 pages
Jimena Alegria
No ratings yet
Jimena Alegria
15 pages
Lack of Grounded Knowledge
No ratings yet
Lack of Grounded Knowledge
2 pages
Unit 1
No ratings yet
Unit 1
66 pages
Computer Science & Engineering: Apex Institute of Technology
No ratings yet
Computer Science & Engineering: Apex Institute of Technology
13 pages
unit1
No ratings yet
unit1
6 pages
Firoz Topic 0 Ppt
No ratings yet
Firoz Topic 0 Ppt
24 pages
ML_Module_4
No ratings yet
ML_Module_4
25 pages
Overview of Machine Learning PDF
100% (1)
Overview of Machine Learning PDF
57 pages
ML 22
No ratings yet
ML 22
29 pages
Machine Learning Ppt -Updated
No ratings yet
Machine Learning Ppt -Updated
14 pages
Wa0115 PDF
No ratings yet
Wa0115 PDF
17 pages
chapter_4
No ratings yet
chapter_4
32 pages
A Training Report On Rajat
No ratings yet
A Training Report On Rajat
29 pages
Machine Learning: Louis Fippo Fitime
No ratings yet
Machine Learning: Louis Fippo Fitime
37 pages
ML Unit-1 Notes
No ratings yet
ML Unit-1 Notes
52 pages
DIR Notes 1
No ratings yet
DIR Notes 1
39 pages
Hymn Of Modernity: Machine Learning, Augmented Reality, Big Data, Qubit, Neuralink and All Other Important Vocabulary It's Time to Know
From Everand
Hymn Of Modernity: Machine Learning, Augmented Reality, Big Data, Qubit, Neuralink and All Other Important Vocabulary It's Time to Know
San Satoshi
No ratings yet
class5
No ratings yet
class5
45 pages
Multiprocessors
No ratings yet
Multiprocessors
9 pages
Chapter 2.3
No ratings yet
Chapter 2.3
90 pages
Exercise-14(Microcontroller-Intro)
No ratings yet
Exercise-14(Microcontroller-Intro)
25 pages
Module 2
No ratings yet
Module 2
104 pages
Chapter 2.4
No ratings yet
Chapter 2.4
29 pages
Exercise-13(FIRE)
No ratings yet
Exercise-13(FIRE)
6 pages
Exercise-12(Up-Down)
No ratings yet
Exercise-12(Up-Down)
9 pages
Exercise-9(Palindrome)
No ratings yet
Exercise-9(Palindrome)
5 pages
Exercise-11(Logic XY)
No ratings yet
Exercise-11(Logic XY)
5 pages
Exercise-8(TIME)
No ratings yet
Exercise-8(TIME)
7 pages
Exercise-7(Searching)
No ratings yet
Exercise-7(Searching)
7 pages
1st module reference
No ratings yet
1st module reference
17 pages
Module 1 - Introduction
No ratings yet
Module 1 - Introduction
38 pages
Module 2 - Addressing Modes V5.0
No ratings yet
Module 2 - Addressing Modes V5.0
148 pages
Wlan Report Recepção de Veiculos
No ratings yet
Wlan Report Recepção de Veiculos
21 pages
Coimbatore MRTS Study
No ratings yet
Coimbatore MRTS Study
4 pages
Rubric For Assessing Rubric For Assessing Quantitative Research Report
No ratings yet
Rubric For Assessing Rubric For Assessing Quantitative Research Report
4 pages
Problem Set 5
No ratings yet
Problem Set 5
3 pages
Workshop Manual Transporter 2016 21-29
No ratings yet
Workshop Manual Transporter 2016 21-29
400 pages
Ronald I. Kowalski, The Bolshevik Party in Conflict - The Left Communist Opposition of 1918 (1991)
No ratings yet
Ronald I. Kowalski, The Bolshevik Party in Conflict - The Left Communist Opposition of 1918 (1991)
253 pages
Make The Transfer
No ratings yet
Make The Transfer
2 pages
Case Digest
No ratings yet
Case Digest
11 pages
TLE CSS9 Q1 M3 Fault Identification and Reporting Procedures FINAL
100% (1)
TLE CSS9 Q1 M3 Fault Identification and Reporting Procedures FINAL
15 pages
Surinder SinghX 2
No ratings yet
Surinder SinghX 2
1 page
Grant Management Manual
100% (1)
Grant Management Manual
239 pages
Power Tools Jsa
100% (1)
Power Tools Jsa
1 page
Deep Learning Project
No ratings yet
Deep Learning Project
21 pages
Factors Influencing Customer Satisfaction in Healthcare Services
No ratings yet
Factors Influencing Customer Satisfaction in Healthcare Services
47 pages
Internship Report
No ratings yet
Internship Report
35 pages
Data Communication and Networking Lab: Mid Term Assignment
No ratings yet
Data Communication and Networking Lab: Mid Term Assignment
12 pages
Protection of Well-Known Marks: Developments in The UK and India.
No ratings yet
Protection of Well-Known Marks: Developments in The UK and India.
10 pages
HF Mobile Whip Antenna: Application
No ratings yet
HF Mobile Whip Antenna: Application
2 pages
Latihan Bahasa Inggris
No ratings yet
Latihan Bahasa Inggris
4 pages
Ka It Workshop
100% (1)
Ka It Workshop
40 pages
Pittsfield Police Log 7-03-2014
No ratings yet
Pittsfield Police Log 7-03-2014
8 pages
Turbo Air Manual
No ratings yet
Turbo Air Manual
46 pages
Bizneo desde inicio
No ratings yet
Bizneo desde inicio
19 pages
Area ΔT U: Project Sample Project Location Hyderabad, India Client Consultant
100% (1)
Area ΔT U: Project Sample Project Location Hyderabad, India Client Consultant
4 pages
Grade 10 Notes
No ratings yet
Grade 10 Notes
15 pages
Colostrum Flushing
No ratings yet
Colostrum Flushing
1 page
Lic Aao Six Months
No ratings yet
Lic Aao Six Months
2 pages
ATPG - Stuck-At and At-Speed - Semicon Shorts
100% (1)
ATPG - Stuck-At and At-Speed - Semicon Shorts
4 pages
YJACK - State-Of-The-Art Technology of YJACK in Bi-Directional Pile Test
No ratings yet
YJACK - State-Of-The-Art Technology of YJACK in Bi-Directional Pile Test
6 pages
R-Net PC Programmer Manual
No ratings yet
R-Net PC Programmer Manual
20 pages