Module1_L5_GPT_variants
Generative Pre-trained Transformer (GPT)
• GPT is a family of AI models built by OpenAI.
• It stands for Generative Pre-trained Transformer:
• Generative: Generative AI is a technology capable of producing content, such as text and imagery.
• Pre-trained: Pre-trained models are saved networks that have already been taught, using a large data set,
to resolve a problem or accomplish a specific task.
• Transformer: A transformer is a deep learning architecture that transforms an input into another type of
output.
• GPT is a generative AI technology that has been previously trained to transform its input into a different
type of output.
• Initially, the GPT family consisted only of LLMs (large language models), but OpenAI has expanded it to include
two new models:
• GPT-4o: a large multimodal model (LMM)
• GPT-4o mini: a small language model (SLM)
• GPT models generate human-like responses to a prompt – initially text-based.
• But GPT-4o and GPT-4o mini can also work with image and audio inputs because they're multimodal (see the
sketch below).
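A minimal sketch of how a text-plus-image prompt could be sent to GPT-4o, assuming the OpenAI Python SDK and an API key in the environment; the prompt text and image URL are placeholders invented for this example:

```python
# Minimal sketch: sending a text + image prompt to GPT-4o with the OpenAI Python SDK.
# Assumes OPENAI_API_KEY is set; the prompt and image URL are illustrative placeholders.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",  # the large multimodal model mentioned above
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what is in this picture."},
                {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
            ],
        }
    ],
)

print(response.choices[0].message.content)  # the model's text reply
```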
• GPT-1
• is the first version of OpenAI’s language model.
• It followed Google's 2017 paper Attention Is All You Need, in which researchers introduced the transformer
architecture.
• The transformer architecture serves as the framework for Google Search, Google Translate, autocomplete, and all
large language models (LLMs), including Bard and ChatGPT.
• GPT-2
• is the second transformer-based language model by OpenAI.
• It's open-source, trained with unsupervised learning, and has over 1.5 billion parameters.
• GPT-2 was designed specifically to predict and generate the next sequence of text to follow a
given sentence.
GPT-3
• The third iteration of OpenAI's GPT model has 175 billion parameters.
• It was trained on Wikipedia entries as well as the open-source data set Common Crawl.
• It can generate computer code and shows improved performance in niche areas of content creation such as
storytelling.
GPT-4
• GPT-4 is the most recent major version of OpenAI's GPT series.
• It’s a large multimodal model (LMM), meaning it's capable of parsing image
inputs as well as text.
• It exhibits human-level performance on a variety of professional and academic benchmarks.
How GPT works
1. Pre-training (video: https://www.youtube.com/watch?v=8Lqi-F8g_ps)
• The model is trained on a large dataset consisting of text from the internet.
• During this phase, GPT learns grammar, facts about the world, and some reasoning abilities by predicting the
next word in a sentence.
• It builds a general understanding of language and context from this extensive data (a toy sketch of next-word
prediction follows below).
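To make next-word prediction concrete, here is a toy sketch that "trains" a bigram counter on a tiny invented corpus and uses it to guess the next word; real pre-training optimises a neural network over web-scale text, so the corpus and the predict_next helper below are purely illustrative:

```python
# Toy illustration of the pre-training objective: learn to predict the next word.
# Real GPT pre-training optimises a neural network over web-scale text; this
# bigram counter only sketches the "predict the next token" idea.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat . the dog sat on the rug .".split()

# Count how often each word follows each other word in the training text.
next_word_counts = defaultdict(Counter)
for current, nxt in zip(corpus, corpus[1:]):
    next_word_counts[current][nxt] += 1

def predict_next(word):
    """Return the continuation seen most often during 'training'."""
    return next_word_counts[word].most_common(1)[0][0]

print(predict_next("sat"))  # 'on' -- the word that most often followed 'sat'
```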
2. Fine-tuning
• After pre-training, the model undergoes fine-tuning on a smaller and more
focused dataset.
• This dataset usually contains examples directly related to the intended application, as in the sketch below.
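A sketch of what such a focused dataset might look like, assuming the chat-style JSONL format used by OpenAI's fine-tuning service (check the current documentation for the exact schema); the example prompts, responses, and file name are invented:

```python
# Sketch: writing a tiny fine-tuning dataset as JSONL, one training example per line.
# Each example pairs a user prompt with the assistant response we want the model to learn.
import json

examples = [
    {"messages": [
        {"role": "user", "content": "Summarise: The invoice is due on 1 March."},
        {"role": "assistant", "content": "Invoice due 1 March."},
    ]},
    {"messages": [
        {"role": "user", "content": "Summarise: The meeting moved to Friday at 10 am."},
        {"role": "assistant", "content": "Meeting moved to Friday, 10 am."},
    ]},
]

with open("finetune_data.jsonl", "w") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")  # one JSON object per line
```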
3. Tokenization
• Tokenization is the process of breaking input text into smaller pieces, known as tokens, which can be words,
subwords, or individual characters.
• These tokens are then converted into numerical representations that the model can process (see the sketch
below).
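A small sketch using the open-source tiktoken tokenizer; the sample sentence is arbitrary and the exact token IDs depend on the chosen encoding (cl100k_base here), so treat the printed values as illustrative:

```python
# Sketch: turning text into tokens and numerical IDs with the tiktoken library.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")    # encoding used by several recent OpenAI models

text = "Transformers are powerful."
token_ids = enc.encode(text)                   # text -> list of integer token IDs
pieces = [enc.decode([t]) for t in token_ids]  # each ID back to its word/subword piece

print(token_ids)  # the numerical representation the model actually processes
print(pieces)     # the word/subword pieces the text was split into
```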
4. Transformer architecture
• GPT uses the transformer architecture, which includes mechanisms like self-
attention.
• Self-attention enables the model to weigh the significance of individual words within a sentence, enhancing its
comprehension of context and of the connections among words (see the sketch below).
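A minimal numerical sketch of the scaled dot-product self-attention idea, written with NumPy; the tiny shapes, random weights, and function names are invented for illustration and leave out multi-head attention and the rest of the transformer block:

```python
# Minimal sketch of scaled dot-product self-attention on a toy sequence.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)      # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Each token's output is a weighted mix of all tokens' value vectors."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv             # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])      # how strongly each token attends to every other token
    weights = softmax(scores, axis=-1)           # attention weights sum to 1 per token
    return weights @ V

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                      # 4 tokens, 8-dimensional embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)       # (4, 8): one context-aware vector per token
```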
5. Generation
• During the generation phase, the model receives a prompt and generates a coherent and contextually relevant
continuation based on its training data.
• It predicts one token at a time, using the previously generated tokens as context.
• This process continues until the desired output length is reached.
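A toy sketch of this token-by-token loop; the hard-coded lookup table stands in for a trained GPT model, and all names here are invented for illustration:

```python
# Toy sketch of autoregressive generation: predict one token at a time, feeding
# each prediction back in as context, until no continuation is found or the
# desired output length is reached.
toy_model = {
    ("once",): "upon",
    ("once", "upon"): "a",
    ("upon", "a"): "time",
    ("a", "time"): ".",
}

def generate(prompt_tokens, max_new_tokens=4, context_size=2):
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        context = tuple(tokens[-context_size:])  # the previously generated tokens are the context
        next_token = toy_model.get(context)      # "predict" the next token from that context
        if next_token is None:
            break                                # stop when the model has no continuation
        tokens.append(next_token)                # the new token becomes part of the context
    return " ".join(tokens)

print(generate(["once"]))  # once upon a time .
```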