Lesson 6
PRACTICAL DEEP LEARNING FOR CODERS (V2)
Why we need RNNs
“I went to Nepal in 2009” vs. “In 2009, I went to Nepal”
Variable-length sequences
Long-term dependencies
Memory
Stateful representation
Sample of text generated character-by-character by an RNN (it reads like LaTeX, but is mathematically meaningless):
\begin{proof} We may assume that $\mathcal{I}$ is an abelian sheaf on
$\mathcal{C}$. \item Given a morphism $\Delta : \mathcal{F} \to \mathcal{I}$
is an injective and let $\mathfrak q$ be an abelian sheaf on $X$. Let
$\mathcal{F}$ be a fibered complex. Let $\mathcal{F}$ be a category.
Basic NN with single hidden layer
Input: batch_size * #inputs
  -> matrix product; relu
Hidden: batch_size * #activations
  -> matrix product; softmax
Output: batch_size * #classes
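A minimal sketch of this diagram in PyTorch (the framework used in the course); the class and size names (SingleHiddenNet, n_inputs, n_hidden, n_classes) are placeholders, not names from the lesson notebook.

```python
import torch.nn as nn
import torch.nn.functional as F

class SingleHiddenNet(nn.Module):
    def __init__(self, n_inputs, n_hidden, n_classes):
        super().__init__()
        self.l_in  = nn.Linear(n_inputs, n_hidden)    # batch_size * #inputs -> batch_size * #activations
        self.l_out = nn.Linear(n_hidden, n_classes)   # batch_size * #activations -> batch_size * #classes

    def forward(self, x):
        h = F.relu(self.l_in(x))                      # matrix product; relu
        return F.log_softmax(self.l_out(h), dim=-1)   # matrix product; softmax (log form, for NLL loss)
```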
Image CNN with single dense hidden layer
NB: batch_size dimension and activation functions not shown here or in the following slides.
Input: #channels * h * w
  -> convolution, stride 2
Conv1 (hidden): #filters * (h/2) * (w/2)
  -> (flatten); matrix product
FC1 (hidden): #activations
  -> matrix product
Output: #classes
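A hedged sketch of the same diagram; the sizes are placeholders, and the activation functions (not shown on the slide) are assumed to be relu.

```python
import torch.nn as nn
import torch.nn.functional as F

class SingleDenseConvNet(nn.Module):
    def __init__(self, n_channels, h, w, n_filters, n_hidden, n_classes):
        super().__init__()
        self.conv1 = nn.Conv2d(n_channels, n_filters, kernel_size=3, stride=2, padding=1)
        self.fc1   = nn.Linear(n_filters * (h // 2) * (w // 2), n_hidden)
        self.l_out = nn.Linear(n_hidden, n_classes)

    def forward(self, x):                               # x: batch_size * #channels * h * w
        x = F.relu(self.conv1(x))                       # Conv1: #filters * (h/2) * (w/2)
        x = F.relu(self.fc1(x.view(x.size(0), -1)))     # (flatten); matrix product -> FC1
        return F.log_softmax(self.l_out(x), dim=-1)     # matrix product -> #classes
```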
Predicting char 3 using chars 1 & 2
NB: layer operations not shown; remember that arrows represent layer operations.
char 1 input: vocab size
  -> FC1 (hidden): #activations
char 2 input: vocab size, combined with FC1
  -> FC2 (hidden): #activations
  -> char 3 output: vocab size
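A sketch of the diagram, assuming the two characters arrive as one-hot vectors of width vocab_size (the lesson notebook itself uses embeddings); the input->hidden weights are shared between the two characters, as the arrows on the slide suggest.

```python
import torch.nn.functional as F
from torch import nn

class Char3Net(nn.Module):
    def __init__(self, vocab_size, n_hidden):
        super().__init__()
        self.l_in     = nn.Linear(vocab_size, n_hidden)  # input -> hidden
        self.l_hidden = nn.Linear(n_hidden, n_hidden)    # hidden -> hidden
        self.l_out    = nn.Linear(n_hidden, vocab_size)  # hidden -> output

    def forward(self, c1, c2):
        h = F.relu(self.l_in(c1))                        # FC1 from char 1
        h = F.relu(self.l_hidden(h) + self.l_in(c2))     # FC2 combines FC1 with char 2
        return F.log_softmax(self.l_out(h), dim=-1)      # char 3 output: vocab size
```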
Predicting char 4 using chars 1, 2 & 3
Arrows now label the three shared weight matrices: InputHidden, HiddenHidden, HiddenOutput.
char 1 input: vocab size -> FC1: #activations
char 2 input, combined with FC1 -> FC2: #activations
char 3 input, combined with FC2 -> FC3: #activations
FC3 -> char 4 output: vocab size
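The same idea with the three shared weight matrices made explicit in a loop; again a sketch with one-hot character inputs assumed, not the notebook's exact code.

```python
import torch
import torch.nn.functional as F
from torch import nn

class Char4Net(nn.Module):
    def __init__(self, vocab_size, n_hidden):
        super().__init__()
        self.n_hidden = n_hidden
        self.l_in     = nn.Linear(vocab_size, n_hidden)  # InputHidden
        self.l_hidden = nn.Linear(n_hidden, n_hidden)    # HiddenHidden
        self.l_out    = nn.Linear(n_hidden, vocab_size)  # HiddenOutput

    def forward(self, c1, c2, c3):
        h = c1.new_zeros(c1.size(0), self.n_hidden)      # start with an empty hidden state
        for c in (c1, c2, c3):                           # FC1, FC2, FC3 reuse the same weights
            h = torch.tanh(self.l_hidden(h) + F.relu(self.l_in(c)))
        return F.log_softmax(self.l_out(h), dim=-1)      # char 4 output: vocab size
```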
Predicting char n using chars 1 to n-1
NB: no hidden/output labels shown.
char 1 input -> hidden state (InputHidden)
Loop (repeat for chars 2 to n-1): the next char input is combined with the hidden state (InputHidden + HiddenHidden)
hidden state -> output: predicted char n (HiddenOutput)
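Once the loop runs over a sequence of arbitrary length, PyTorch's nn.RNN can do the repetition. A sketch using an embedding instead of one-hot inputs (n_fac and n_hidden are placeholder sizes), reading only the final step's output.

```python
import torch
import torch.nn.functional as F
from torch import nn

class CharNNet(nn.Module):
    def __init__(self, vocab_size, n_fac, n_hidden):
        super().__init__()
        self.e     = nn.Embedding(vocab_size, n_fac)
        self.rnn   = nn.RNN(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, cs):                               # cs: seq_len x batch_size of char indices
        h = torch.zeros(1, cs.size(1), self.rnn.hidden_size, device=cs.device)
        outp, h = self.rnn(self.e(cs), h)                # repeat the hidden step for each char
        return F.log_softmax(self.l_out(outp[-1]), dim=-1)  # predict only char n
```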
Predicting chars 2 to n using chars 1 to n-1
The hidden state is initialized to zeros.
Loop (repeat for chars 1 to n-1): each char input is combined with the hidden state, and the hidden state produces an output at every step, predicting the following char.
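The same model, but keeping every step's output so that chars 2 to n are all predicted; the hidden state is explicitly initialized to zeros as on the slide.

```python
import torch
import torch.nn.functional as F
from torch import nn

class CharSeqNet(nn.Module):
    def __init__(self, vocab_size, n_fac, n_hidden):
        super().__init__()
        self.e     = nn.Embedding(vocab_size, n_fac)
        self.rnn   = nn.RNN(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, cs):                               # cs: seq_len x batch_size
        h = torch.zeros(1, cs.size(1), self.rnn.hidden_size, device=cs.device)  # initialize to zeros
        outp, h = self.rnn(self.e(cs), h)
        return F.log_softmax(self.l_out(outp), dim=-1)   # one prediction per time step
```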
Predicting chars 2 to n using chars 1 to n-1 using stacked RNNs
Two RNN layers are stacked; each has its own hidden state initialized to zeros, and each loop repeats for chars 1 to n-1. The first layer's hidden states feed the second layer, which produces the outputs.
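Stacking amounts to a second recurrent layer fed by the first layer's hidden-state sequence; in PyTorch this is the num_layers argument, and the zero-initialized state gains one slice per layer. The sizes below are illustrative placeholders.

```python
import torch
from torch import nn

n_fac, n_hidden, batch_size = 42, 256, 64       # example sizes
rnn = nn.RNN(n_fac, n_hidden, num_layers=2)     # two stacked RNN layers
h0  = torch.zeros(2, batch_size, n_hidden)      # "initialize to zeros", once per layer
x   = torch.randn(8, batch_size, n_fac)         # a dummy 8-step input sequence
outp, hn = rnn(x, h0)                           # outp comes from the top layer only
```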
Unrolled stacked RNNs for sequences
The loops are now drawn unrolled: char 1, char 2, and char 3 inputs each pass through the shared InputHidden, HiddenHidden, and HiddenOutput weight matrices, a loss is computed on the outputs, and backprop runs back through the whole unrolled graph.
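A sketch of how that unrolled graph is trained: one loss over all time steps, then a single backward() runs backprop through the unrolled sequence (backpropagation through time). Here `model` is any of the sequence models sketched above, `opt` is an optimizer, and `cs`/`targets` are placeholder batches.

```python
import torch
from torch import nn

crit = nn.NLLLoss()                                      # pairs with the log_softmax outputs above

def train_step(model, opt, cs, targets):                 # cs, targets: seq_len x batch_size
    opt.zero_grad()
    preds = model(cs)                                    # seq_len x batch_size x vocab_size
    loss = crit(preds.reshape(-1, preds.size(-1)), targets.reshape(-1))
    loss.backward()                                      # backprop through the unrolled steps
    opt.step()
    return loss.item()
```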