Deep Read

Uploaded by

Soumyashree Ghosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views22 pages

Deep Read

Uploaded by

Soumyashree Ghosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

DEEPSEEK

into the unknown……

AGENDA
Introduction
Comparison between ChatGPT and DeepSeek
Large Language Models(LLMs)
Neural Networks
U.S. and China
NVIDIA’s response to china
DeepSeek
WHAT IS DEEPSEEK?
DeepSeek Artificial Intelligence Co., Ltd. (referred to
as "DeepSeek" or "深度求索") , founded in 2023, is a
Chinese company dedicated to making AGI a reality.
AGI stands for Artificial General Intelligence, which
refers to a type of AI that possesses the ability to
understand, learn, and apply knowledge across a wide
range of tasks at a level comparable to human
intelligence.

Liang Wenfeng
WHY ARE CHATGPT AND DEEPSEEK EVEN
COMPARED?
DeepSeek and ChatGPT are both large language models (LLMs) designed for various natural
language processing tasks:
General-purpose: They Conversational abilities: Both
can be used for a wide Constantly evolving: can engage in human-like
range of tasks, including Both are under active conversations, answer
writing, translation, development, with questions, and generate text.
summarization, and code frequent updates and
generation. improvements.
5

WHAT IS A
LARGE
LANGUAGE
MODEL(LLM)??
6

LARGE LANUAGE
MODEL(LLM)
A Large Language Model (LLM) is a type of artificial intelligence (AI)
designed to understand and generate human-like text. Think of it as a
super-smart computer program that has been trained on a huge amount of
text from books, websites, and other sources
7
HERE’S A BREAKDOWN
It can read, understand, How it learns:
and generate text. For •It’s trained on massive amounts of text data, learning
example, it can answer patterns, grammar, facts, and even some reasoning skills.
questions, write essays, •It doesn’t "know" things like humans do but uses statistics
summarize articles, or and patterns to generate responses
even create stories.
Why it’s useful:
It works by predicting 1. It can help with tasks like writing,
what words or sentences learning, coding, or even just having a
conversation.
should come next based
2. It’s like having a very knowledgeable
on the input it receives.
assistant that can help with almost
any text-based task.

In short, an LLM is a powerful AI tool that uses its training on vast amounts of
text to understand and generate human-like language.
8

CoPilot ChatGPT

LLMs Google’s
Bard

Meta’s
Orca
Llama
9
Ma’am something named
neural network is ABSOLUTELY!!!!
Let me explain
a viral term nowadays,
how is it related…
are LLMs anything
related to this???

LLM
10

NEURAL NETWORKS
A neural network is a type of computer system
inspired by how the human brain works. It’s designed
to recognize patterns and solve problems by learning
from examples.

Frank Rosenblatt
What is it? How does it work?
•A neural network is like a Imagine you’re teaching a child to recognize cats:
virtual "brain" made up of •You show them pictures of cats and say, "This is a cat."
layers of tiny units •Over time, the child learns to identify cats by noticing patterns
called neurons (or nodes). like pointy ears, whiskers, and tails.
A neural network does something similar:
•These neurons work 1.Input Layer: It receives data (e.g., a picture of a cat).
together to process 2.Hidden Layers: It processes the data, looking for patterns (e.g.,
information, learn from edges, shapes, colors).
data, and make decisions 3.Output Layer: It makes a decision or prediction (e.g., "This is a
or predictions. cat").

How does it learn? Why is it useful?

•The network learns by adjusting the Neural networks are great at solving
connections between neurons. This complex problems, like:
is called training. •Recognizing faces in photos.
•During training, it compares its •Understanding spoken language
predictions to the correct answers (e.g., Siri or Alexa).
and improves over time (like •Predicting trends (e.g., stock prices
practicing to get better at a game). or weather).
12
13
DeepSeek, a Chinese AI firm, is shaking up the
industry with its low-cost, open-source large
language models, posing a challenge to U.S. tech
giants. The company recently unveiled its latest AI
models, developed for just $6 million — a fraction
of the typical cost for U.S. firms. This disruption has
unsettled majorBUT……….
tech players and impacted their
stock prices.
15

U.S. after imposing so much regulations on NVIDIA’s chips

How is this
even possible?
16

•Cooperating with regulators

•Nvidia has offered to cooperate with regulators in China and the US.
•Investigating how chips ended up in China
•Nvidia has asked its distributors to check their customers in Southeast Asia to see how its chips
ended up in China.
•Acknowledging the success of Chinese AI models
•Nvidia has praised the innovation of Chinese AI models like DeepSeek, and has said that they
showcase the potential of AI techniques.
•Explaining its commitment to providing quality products

•Nvidia has said that it works to provide the best products in every region.

Nvidia is a key player in the global AI industry and makes most of the processors used in AI. The US
and China have been involved in a trade war over technology, including chips, and both sides have
implemented measures to assert their influence.
17
THIS IS REVOLUTION!!!
But how did it manage to do so???
DeepSeek has emerged as a standout in the field of large language models (LLMs) due to
its innovative architecture, cost efficiency, and specialized capabilities. Here are the key
factors that make DeepSeek superior to many other LLMs:

Efficient Architecture and Cost-Effectiveness

•Mixture-of-Experts (MoE) System: DeepSeek uses a MoE architecture, which
activates only a subset of its 671 billion parameters (37 billion per task) rather
than the entire model. This reduces computational costs and energy
consumption while maintaining high performance .
•Low Training Costs: DeepSeek's models, such as DeepSeek-V3 and R1, were
trained for
approximately 6million,significantlylowerthanthe6million,significantlylowertha
nthe100 million+ budgets of competitors like OpenAI's GPT-4. This efficiency is
achieved through advanced data processing and training optimizations .
19
Superior Performance in Specialized Tasks
•Creative Writing and Emotional Intelligence: DeepSeek excels in creative writing,
producing emotionally resonant and coherent narratives. It outperforms models like
ChatGPT and Claude in emotional intelligence (EQ) benchmarks, making it ideal for
storytelling and character development .
•Reasoning and Problem-Solving: DeepSeek-R1, a reasoning-focused model, achieves
top scores in benchmarks like MATH-500 (97.3%) and GPQA Diamond (73.3%), rivaling
or surpassing OpenAI's o1 in logical reasoning and scientific problem-solving .

Open-Source Accessibility
•DeepSeek models are open-source under the MIT license, allowing researchers and
developers to study, modify, and build upon them. This fosters innovation and
collaboration, making advanced AI tools accessible to smaller organizations and
individuals
20

Long Context Handling

•DeepSeek supports up to 128K
tokens in context windows,
enabling it to process and maintain
coherence in lengthy texts, such as
large codebases or complex
datasets. This capability surpasses
many competitors, which typically Resource Efficiency and Environmental
handle 32K-64K tokens Impact
•By optimizing hardware usage and
reducing computational overhead,
DeepSeek minimizes energy
consumption and environmental
impact. This is a significant advantage
over resource-intensive models like
GPT-4, which have higher carbon
footprints
21

Versatility and Specialization

•While DeepSeek excels in
creative and reasoning tasks, it
also performs well in general- Community and Ecosystem Growth
purpose applications. Its •DeepSeek has a growing
specialized models, like community of users and
DeepSeek Coder, are tailored for developers, particularly in creative
coding and technical tasks, and technical fields. Its open-source
offering faster and more nature encourages contributions
accurate code generation and and customization, further
debugging enhancing its capabilities

Cost-Effective Deployment
•DeepSeek's API costs are significantly lower than competitors, with input tokens
priced at 0.14permillionandoutputtokensat0.14permillionandoutputtokensat2.19 per
million. This makes it an affordable option for businesses and researchers
THANK YOU From
• Asmi Maulik
• Amrita Debnath
• Ayushi Biswas
• Deep Das
• Manjari Aich
• Indresh Mukherjee
• Rigam Bhaduri
• Sabyasachi Mukhopadhyay
• Soumyashree Ghosh

DeepSeek-R1: AI Innovation & Impact
No ratings yet
DeepSeek-R1: AI Innovation & Impact
3 pages
Basics AI & ML
No ratings yet
Basics AI & ML
30 pages
Digital Empowerment
No ratings yet
Digital Empowerment
9 pages
AI Guide for Tech Enthusiasts
No ratings yet
AI Guide for Tech Enthusiasts
14 pages
Generative AI: Skills for Future Jobs
100% (1)
Generative AI: Skills for Future Jobs
31 pages
Lecture 1
No ratings yet
Lecture 1
37 pages
Week 6 Ai Llms Gpts
No ratings yet
Week 6 Ai Llms Gpts
17 pages
CH 5 Modern Artificial Intelligence
No ratings yet
CH 5 Modern Artificial Intelligence
5 pages
Basic AI & ML Concepts Explained - LinkedIn
No ratings yet
Basic AI & ML Concepts Explained - LinkedIn
10 pages
AI-Driven Search: A Business Leader's Guide
100% (1)
AI-Driven Search: A Business Leader's Guide
48 pages
AI Insights for Competition Lawyers
No ratings yet
AI Insights for Competition Lawyers
9 pages
DeepSeek Is A Game Changer For AI - Computerphile (English (Auto-Generated) ) (DownloadYoutubeSubtitles - Com)
No ratings yet
DeepSeek Is A Game Changer For AI - Computerphile (English (Auto-Generated) ) (DownloadYoutubeSubtitles - Com)
21 pages
AIML
No ratings yet
AIML
13 pages
Ai & LLM
No ratings yet
Ai & LLM
10 pages
The Essential Guide To Generative AI
No ratings yet
The Essential Guide To Generative AI
16 pages
Introduction To Generative AI
No ratings yet
Introduction To Generative AI
8 pages
Affan 1
No ratings yet
Affan 1
24 pages
AI: Definitions, Applications, and Ethics
No ratings yet
AI: Definitions, Applications, and Ethics
27 pages
Generative AI and ChatGPT Overview
100% (1)
Generative AI and ChatGPT Overview
27 pages
Day 3
No ratings yet
Day 3
10 pages
AI Made Easy For All
No ratings yet
AI Made Easy For All
54 pages
1 Introduction To AI 15-07-2024
No ratings yet
1 Introduction To AI 15-07-2024
63 pages
Pe 1
No ratings yet
Pe 1
5 pages
Class Notes - Removed
No ratings yet
Class Notes - Removed
29 pages
Note ss1
No ratings yet
Note ss1
22 pages
Generative AI On Amazon Web Services Ebook
100% (1)
Generative AI On Amazon Web Services Ebook
33 pages
Gen Ai - Ebook - Guvi
No ratings yet
Gen Ai - Ebook - Guvi
34 pages
Week4 LLMs EN
No ratings yet
Week4 LLMs EN
48 pages
Introduction GenAI EoAI
No ratings yet
Introduction GenAI EoAI
69 pages
Trends in Information Technology Robotics: (Frankenfield, 2020)
No ratings yet
Trends in Information Technology Robotics: (Frankenfield, 2020)
6 pages
Generative Ai and Large Language Models (LLMS) : Unit - 7
No ratings yet
Generative Ai and Large Language Models (LLMS) : Unit - 7
42 pages
Generative AI in Software Engineering
No ratings yet
Generative AI in Software Engineering
9 pages
Business Benefits of ChatGPT LLMs
No ratings yet
Business Benefits of ChatGPT LLMs
4 pages
AI Literacy in Hausa Course Presentation
No ratings yet
AI Literacy in Hausa Course Presentation
61 pages
Define AI and Explain in Detail BC
No ratings yet
Define AI and Explain in Detail BC
6 pages
Building AI Agents With LLMS, RAG, and Knowledge Graphs
100% (10)
Building AI Agents With LLMS, RAG, and Knowledge Graphs
560 pages
Presentationn
No ratings yet
Presentationn
6 pages
Artificial Intelligence & Machine Learning
No ratings yet
Artificial Intelligence & Machine Learning
14 pages
24 July, Class Notes - 01
No ratings yet
24 July, Class Notes - 01
10 pages
Generative AI
No ratings yet
Generative AI
6 pages
Unit3sem7 Generative Ai
No ratings yet
Unit3sem7 Generative Ai
41 pages
India in The Global Ai Race 1739599723668
No ratings yet
India in The Global Ai Race 1739599723668
54 pages
A Quick Guide To Artificial Intelligence
100% (3)
A Quick Guide To Artificial Intelligence
41 pages
DEEPSEEK
No ratings yet
DEEPSEEK
10 pages
29.01.2025 Editorial in Tamil.
No ratings yet
29.01.2025 Editorial in Tamil.
57 pages
AI - Shrey - Jain
No ratings yet
AI - Shrey - Jain
67 pages
AI and LLM Application Development - An Overview
No ratings yet
AI and LLM Application Development - An Overview
77 pages
LLM and Gen AI
No ratings yet
LLM and Gen AI
4 pages
IAI Sp2025 Session 16 - Improving LLMs (Continued)
No ratings yet
IAI Sp2025 Session 16 - Improving LLMs (Continued)
28 pages
GenAI Concepts: A Comprehensive Guide
No ratings yet
GenAI Concepts: A Comprehensive Guide
14 pages
03 GenAI Intro
No ratings yet
03 GenAI Intro
13 pages
Presentation On Ai
No ratings yet
Presentation On Ai
10 pages
A Beginner's Guide To Large Language Models
No ratings yet
A Beginner's Guide To Large Language Models
25 pages
Introduction of Generative AI Shoolini University
No ratings yet
Introduction of Generative AI Shoolini University
15 pages
The Future by ChatGPT
No ratings yet
The Future by ChatGPT
41 pages
OceanofPDF - Com Large Language Models Concepts - John AtkinsonAbutridy
No ratings yet
OceanofPDF - Com Large Language Models Concepts - John AtkinsonAbutridy
185 pages
Alpha-GPT Human-AI Interactive Alpha Mining For Quantitative Investment
No ratings yet
Alpha-GPT Human-AI Interactive Alpha Mining For Quantitative Investment
9 pages
1 ModelDrivenAcelerador
No ratings yet
1 ModelDrivenAcelerador
4 pages
Minicpmo
No ratings yet
Minicpmo
14 pages
1 Xy HQ TO5 Q SMP 2 IQNBT0 Ei JX RK
No ratings yet
1 Xy HQ TO5 Q SMP 2 IQNBT0 Ei JX RK
33 pages
Artificial Intelligence and Psychology: Balancing Innovation and Ethics
No ratings yet
Artificial Intelligence and Psychology: Balancing Innovation and Ethics
8 pages
Springer Nature LaTeX Template 4
No ratings yet
Springer Nature LaTeX Template 4
17 pages
Pre-Train SLR
No ratings yet
Pre-Train SLR
16 pages
Adversarial LLM Attribution Challenges
No ratings yet
Adversarial LLM Attribution Challenges
7 pages
Adversarial Attacks on LLMs
No ratings yet
Adversarial Attacks on LLMs
31 pages
Nerative AI Agents B0F9KK7N2H
100% (5)
Nerative AI Agents B0F9KK7N2H
254 pages
Qwen3 Technical Report
No ratings yet
Qwen3 Technical Report
70 pages
NSIC Summer Internship - 15 Days
No ratings yet
NSIC Summer Internship - 15 Days
2 pages
A Short History of AI
No ratings yet
A Short History of AI
12 pages
AI Meets AI - ChatGPT and Teachin
No ratings yet
AI Meets AI - ChatGPT and Teachin
29 pages
【低秩 - 自适应秩 - 理论】 From Low-Rank Gradient Subspace Stabilization to Low-Rank Weights - Observations, Theories, And Applications
No ratings yet
【低秩 - 自适应秩 - 理论】 From Low-Rank Gradient Subspace Stabilization to Low-Rank Weights - Observations, Theories, And Applications
17 pages
810 110 AITECH v1.0
No ratings yet
810 110 AITECH v1.0
2 pages
Resume Template For AI
No ratings yet
Resume Template For AI
4 pages
Quiz (Watsonx - Governance Level 2) v2 Attempt Review
No ratings yet
Quiz (Watsonx - Governance Level 2) v2 Attempt Review
18 pages
Deepak Singh Resume
No ratings yet
Deepak Singh Resume
2 pages
Sifted Rising 100 2024
No ratings yet
Sifted Rising 100 2024
41 pages
AI in Public Health: A PAHO Guide
No ratings yet
AI in Public Health: A PAHO Guide
49 pages
BIM and NLP for Construction Scheduling
No ratings yet
BIM and NLP for Construction Scheduling
9 pages
Lenovo NVIDIA GenAI Ebook FINALpdf
No ratings yet
Lenovo NVIDIA GenAI Ebook FINALpdf
18 pages
经济学人九月下
No ratings yet
经济学人九月下
128 pages
Slides Hackathon
No ratings yet
Slides Hackathon
16 pages
Dissertation Writing Services in Mumbai
100% (2)
Dissertation Writing Services in Mumbai
6 pages
CAI8
No ratings yet
CAI8
7 pages
Vector Embeddings Guide
No ratings yet
Vector Embeddings Guide
28 pages
3418499+ +Artigo+Cilamce+Modificado
No ratings yet
3418499+ +Artigo+Cilamce+Modificado
7 pages
Recent Advances in Large Langauge Model Benchmarks Against Data Contamination: From Static To Dynamic Evaluation
No ratings yet
Recent Advances in Large Langauge Model Benchmarks Against Data Contamination: From Static To Dynamic Evaluation
17 pages