Exploring GPT-2 Attention with BertViz
Shivam Kumar (2411AI63) Saumyadweepta Paul (2411AI62)
Ankit Kumar Pandey (2411AI63) Aashish Kumar Gupta (2411CS25)
24 April 2025
Objective
● Visualize and interpret attention mechanisms in GPT-2 using BertViz.
● Understand how GPT-2 models syntactic and semantic relationships.
● Identify specialized attention heads and investigate model behavior.
Introduction to Transformers and GPT-2
● Transformers rely entirely on self-attention to process input sequences.
● GPT-2 is a decoder-only language model trained to predict the next token.
● Self-attention enables GPT-2 to model long-range dependencies efficiently.
What is BertViz?
BertViz Visualization Tool
● An open-source tool for interpreting attention in Transformer models.
● Offers multiple views: Attention-Head View, Model View, and Neuron View.
● Adapted to support both encoder (BERT) and decoder (GPT-2) models.
● Helps explore head specialization, bias, and structure.
Methodology Overview
1. Input carefully crafted sentences to GPT-2.
2. Visualize each layer's self-attention using BertViz (see the setup sketch below).
3. Analyze layer-wise and head-wise attention distributions.
4. Capture screenshots to document key attention behaviors.
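A minimal sketch of steps 1–2, assuming the Hugging Face transformers and bertviz packages, the small public gpt2 checkpoint, and a Jupyter notebook for rendering; the exact checkpoint and probe sentences used in our experiments may differ.

```python
# Minimal sketch: run a probe sentence through GPT-2 with attention outputs
# enabled and render BertViz's interactive views (notebook environment assumed).
from transformers import GPT2Tokenizer, GPT2Model
from bertviz import head_view, model_view

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_attentions=True)

sentence = "The cat that the dog chased was fast."
inputs = tokenizer(sentence, return_tensors="pt")
outputs = model(**inputs)

# outputs.attentions: one tensor per layer, shape (batch, heads, seq_len, seq_len)
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
head_view(outputs.attentions, tokens)    # per-head view for a chosen layer
model_view(outputs.attentions, tokens)   # thumbnail grid over all layers and heads
```

Running each cell in a notebook opens the corresponding interactive widget, which is where the screenshots in step 4 come from.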
Dataset and Sentence Design
Linguistic Phenomena in Input Sentences
● Coreference: “The doctor spoke to the nurse. She listened.”
● Ambiguity: “The chicken is ready to eat.”
● Subject–verb agreement: “The cat that the dog chased was fast.”
● Gender bias detection: contrast male vs. female pronouns in context.
Syntax Attention Patterns
Example: Complex Clause Interpretation
Sentence: “The cat that the dog chased was fast.”
● Observed strong backward attention from "was" to "cat" across heads.
● Some heads focused on aligning subject and verb.
● Others distributed attention across tokens, possibly for context modeling (a per-head inspection sketch follows below).
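As a rough check on the “was” → “cat” pattern, the per-head weights can also be read directly from the attention tensors. This is an illustrative sketch rather than the exact script behind the screenshots, and it assumes both words survive GPT-2's BPE tokenization as single “Ġ”-prefixed tokens.

```python
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_attentions=True)

enc = tokenizer("The cat that the dog chased was fast.", return_tensors="pt")
attentions = model(**enc).attentions
tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])

q = tokens.index("Ġwas")   # query position ("was")
k = tokens.index("Ġcat")   # key position ("cat")

for layer, att in enumerate(attentions):
    weights = att[0, :, q, k]              # attention from "was" to "cat", per head
    head = int(torch.argmax(weights))
    print(f"layer {layer:2d}: strongest head {head:2d}, weight {weights[head]:.2f}")
```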
Coreference Attention Patterns
Example: Gender Bias in Coreference
Sentence: “The doctor spoke to the nurse. She listened.”
● Certain heads linked “She” more to “nurse” than “doctor.”
● Reveals how GPT-2 encodes coreference, possibly influenced by gender
stereotypes.
● Replacing roles (e.g., “engineer” instead of “doctor”) shifts the attention pattern (compared in the sketch below).
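A hedged sketch of the role-swap comparison: antecedent_attention is a hypothetical helper (not part of BertViz) that averages, over all layers and heads, how much the pronoun attends to each candidate antecedent; it assumes each candidate word is a single GPT-2 token.

```python
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_attentions=True)

def antecedent_attention(sentence, pronoun="ĠShe", candidates=("Ġdoctor", "Ġnurse")):
    """Mean attention (over layers and heads) from the pronoun to each candidate."""
    enc = tokenizer(sentence, return_tensors="pt")
    att = torch.stack(model(**enc).attentions)   # (layers, batch, heads, seq, seq)
    toks = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    p = toks.index(pronoun)
    return {c: att[:, 0, :, p, toks.index(c)].mean().item() for c in candidates}

# Original sentence vs. a role-swapped variant.
print(antecedent_attention("The doctor spoke to the nurse. She listened."))
print(antecedent_attention("The engineer spoke to the nurse. She listened.",
                           candidates=("Ġengineer", "Ġnurse")))
```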
Ambiguity Resolution
Example: Structural Ambiguity
Sentence: “The chicken is ready to eat.”
● Model attention varies: “chicken” links to both “is” and “eat.”
● GPT-2 distributes attention across interpretations: subject vs. object.
● No clear resolution: GPT-2 appears to maintain the ambiguity unless it is disambiguated by context.
Attention Specialization
Head and Layer Behavior
● Some heads specialize in:
○ Syntactic roles (subject–verb)
○ Punctuation and clause boundaries
○ Coreference tracking
● Redundant heads exhibit diffuse or uniform attention (one way to quantify this is sketched below).
● Patterns change across layers: deeper layers show more abstract
dependencies.
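To make “diffuse or uniform attention” concrete, one simple heuristic (our illustration, not part of BertViz) is the entropy of each head's attention distribution, averaged over query positions: near-uniform heads score high, sharply focused heads score low. A sketch under the same gpt2 setup as above:

```python
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_attentions=True)

enc = tokenizer("The doctor spoke to the nurse. She listened.", return_tensors="pt")
attentions = model(**enc).attentions   # per layer: (batch, heads, seq, seq)

for layer, att in enumerate(attentions):
    probs = att[0]                                              # (heads, seq, seq)
    entropy = -(probs * (probs + 1e-9).log()).sum(-1).mean(-1)  # mean over queries
    print(f"layer {layer:2d}: head entropies",
          " ".join(f"{h:.2f}" for h in entropy.tolist()))
```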
Key Findings
● Attention heads are interpretable in some cases, revealing structure and
semantics.
● Certain heads encode biases (e.g., gendered associations).
● Not all heads contribute equally; some may be pruned without loss in performance (a pruning sketch follows below).
● BertViz aids in debugging and understanding model decisions.
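The pruning claim can be tested directly, since transformers exposes prune_heads on GPT-2 models. The layer and head indices below are placeholders for illustration, not heads we identified as redundant; a real experiment would compare perplexity before and after pruning.

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

# Placeholder indices for illustration only: drop heads 2 and 7 in layer 0
# and head 5 in layer 3, then confirm the pruned model still generates text.
model.prune_heads({0: [2, 7], 3: [5]})

enc = tokenizer("The cat that the dog chased was", return_tensors="pt")
out = model.generate(**enc, max_new_tokens=5,
                     pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(out[0]))
```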
Conclusion and Future Work
● BertViz helps demystify attention in GPT-2, showing
structure and specialization.
● Key dependencies like coreference, syntax, and bias are
traceable.
● Future directions:
○ Use neuron view to isolate responsible units.
○ Explore intervention strategies (neuron editing, head
pruning).
○ Apply similar methods to newer models (GPT-3,
GPT-4).
Thank You!