
BACK PROPAGATION NETWORK
Back propagation network (BPN)
 A network associated with the back propagation learning
algorithm (BPLA).
 BPLA is one of the most important developments in neural
networks.
 BPLA is applied to multilayer feed-forward networks
consisting of processing elements with continuous,
differentiable activation functions.
 BPN is used to classify input patterns correctly.
 The gradient descent method forms the basis of the weight
update algorithm.
Back propagation network (BPN)
 The error is propagated back to the hidden units.
 Training aims to achieve a balance between the network's ability to
respond correctly to the training inputs and its ability to give
reasonable responses to inputs that are similar but not identical to
the training inputs.
 Training stages in a BPN (sketched in code below):
 Generation of the network output for the input pattern
 Calculation and back propagation of the error
 Updation of the weights.
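
A minimal sketch of one such training step for a single-hidden-layer BPN with binary sigmoid units, assuming the standard gradient-descent update rules (the function name train_step and the default learning rate are illustrative, not taken from the slides):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_step(x, t, V, W, alpha=0.25):
    """One BPN training step on a single pattern (x: input, t: target).
    V: (n_in+1, n_hidden) input-to-hidden weights, last row = bias.
    W: (n_hidden+1, n_out) hidden-to-output weights, last row = bias.
    alpha: learning rate (assumed default)."""
    # Stage 1: output generation (forward pass)
    xb = np.append(x, 1.0)                  # append 1 so the bias row is used
    z = sigmoid(xb @ V)                     # hidden-unit activations
    zb = np.append(z, 1.0)
    y = sigmoid(zb @ W)                     # network output

    # Stage 2: error calculation and back propagation
    delta_k = (t - y) * y * (1.0 - y)                # output-layer error term
    delta_j = (W[:-1] @ delta_k) * z * (1.0 - z)     # hidden-layer error term

    # Stage 3: weight updation (gradient descent step)
    W += alpha * np.outer(zb, delta_k)
    V += alpha * np.outer(xb, delta_j)
    return y
```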
Architecture
 BPN is a multilayer feed-forward neural network consisting of
 Input layer
 Hidden layer(s) and
 Output layer
 During back propagation of the error, the signals are sent in the
reverse direction.
 Input and output of a BPN may be binary or bipolar.
 The activation function can be any function that is monotonically
increasing and differentiable.
Architecture (Contd..,)
 (Figure: BPN architecture with input, hidden, and output layers – not reproduced.)
Notations
 x – input training vector
 t – target output vector
 α – learning rate
 v0j – bias on the jth hidden unit
 w0k – bias on the kth output unit
 zj – hidden unit j
 zinj – net input to hidden unit zj
 yk – output unit k
Notations (Contd..,)
 δk – error correction weight adjustment for wjk
 δj – error correction weight adjustment for vij
 Commonly used activation functions (see the formulas below):
 Binary sigmoidal function
 Bipolar sigmoidal function
 Properties required of an activation function used in a BPN:
 Continuity
 Differentiability
 Nondecreasing monotonicity
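
The formulas themselves appear on the slides only as images; the standard forms, with steepness parameter λ, are:

  Binary sigmoid:   f(x) = 1 / (1 + e^(−λx)),       f′(x) = λ f(x) [1 − f(x)]
  Bipolar sigmoid:  f(x) = 2 / (1 + e^(−λx)) − 1,   f′(x) = (λ/2) [1 + f(x)] [1 − f(x)]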
Training patterns
 Incremental (pattern-by-pattern) approach to weight updation:
 Weights are changed immediately after each single training pattern
is presented.
 Batch-mode training:
 Weights are changed only after all the training patterns have been
presented.
 Requires additional storage for each connection to accumulate the
intermediate weight changes (see the sketch after this list).
 Which mode is more effective depends on the problem.
 A trained BPN approximates the optimal Bayesian discriminant function.
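
A hedged sketch contrasting the two modes for one epoch; backprop_deltas stands for any routine that returns the weight changes for a single pattern and is illustrative, not taken from the slides:

```python
import numpy as np

def incremental_epoch(patterns, V, W, backprop_deltas, alpha=0.25):
    # Incremental mode: weights change immediately after every pattern.
    for x, t in patterns:
        dV, dW = backprop_deltas(x, t, V, W)
        V += alpha * dV
        W += alpha * dW
    return V, W

def batch_epoch(patterns, V, W, backprop_deltas, alpha=0.25):
    # Batch mode: changes are accumulated and applied once per epoch.
    # The accumulators are the extra per-connection storage the slide mentions.
    acc_V, acc_W = np.zeros_like(V), np.zeros_like(W)
    for x, t in patterns:
        dV, dW = backprop_deltas(x, t, V, W)
        acc_V += dV
        acc_W += dW
    V += alpha * acc_V
    W += alpha * acc_W
    return V, W
```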
BP learning algorithm
 The BP learning algorithm will converge to proper weights, given
enough training, only if the relation between the input and output
training patterns is deterministic and the error surface is
deterministic.
 BPN training is a special case of stochastic approximation.
 The randomness of the algorithm helps it escape local optima.
Factors affecting the BPN
 The training and convergence of a BPN depend on the choice of
various parameters, such as
 Initial weights
 Learning rate
 Updation rule
 Size and nature of the training set
 Architecture (i.e., number of layers and number of neurons per layer)


Factors affecting the BPN
 Initial weights
 Initialized to random values.
 The choice of initial weights determines how fast the network converges.
 They cannot be very large, since the sigmoidal activation functions used
here may saturate and the system may get stuck in a local optimum.
 Method 1: restrict the range in which the initial weights are chosen
(see the formula below), where oi is the number of processing elements
that feed forward to processing element i (its fan-in).
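
The range itself is shown only as an image; a commonly quoted fan-in-based choice (stated here as an assumption, not taken from the slide) is:

  −3 / √oi  ≤  wij  ≤  3 / √oi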
Factors affecting the BPN
 Method 2: Nguyen–Widrow initialization (sketched below)
 This method leads to faster convergence of the network.
 The concept is based on geometric analysis.
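
A minimal sketch of the Nguyen–Widrow scheme as it is commonly described (the slide gives no formulas, so the details below are standard practice rather than taken from the text):

```python
import numpy as np

def nguyen_widrow_init(n_inputs, n_hidden, rng=np.random.default_rng()):
    """Initialize input-to-hidden weights V (n_inputs x n_hidden) and biases v0."""
    beta = 0.7 * n_hidden ** (1.0 / n_inputs)      # scale factor
    V = rng.uniform(-0.5, 0.5, size=(n_inputs, n_hidden))
    V = beta * V / np.linalg.norm(V, axis=0)       # rescale each hidden unit's weight vector to length beta
    v0 = rng.uniform(-beta, beta, size=n_hidden)   # biases drawn from [-beta, beta]
    return V, v0
```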


Factors affecting the BPN
 Learning rate:
 Affects convergence.
 Larger values
 speed up convergence but may result in overshooting
 lead to rapid learning, but the weights may oscillate
 Smaller values have the opposite effect (slower but more stable learning).
 Typical range: 10⁻³ to 10
Factors affecting the BPN
 Momentum
 A very efficient and commonly used way to allow a larger learning
rate without oscillations is to add a momentum factor to the normal
weight updation rule (see the formula below).
 Denoted by the momentum factor (written as μ in the formula below).
 A commonly assigned value is 0.9.
 Can be used with pattern-by-pattern updating or batch-mode updating.
 With pattern-by-pattern updating, the momentum term carries over some
useful information from the previous weight change.
 Helps in faster convergence.
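
The standard momentum form of the update, writing μ for the momentum factor (a notational assumption, since the slide's own symbol is not reproduced):

  Δwjk(t+1) = α δk zj + μ Δwjk(t),   wjk(t+1) = wjk(t) + Δwjk(t+1)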


Factors affecting the BPN
 Weight updation formula (reconstructed below)
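
The formula is shown on the slide only as an image; the standard backpropagation updates it presumably corresponds to are:

  wjk(new) = wjk(old) + α δk zj,   w0k(new) = w0k(old) + α δk
  vij(new) = vij(old) + α δj xi,   v0j(new) = v0j(old) + α δj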
Factors affecting the BPN
 Generalization
 A network is said to generalize when it sensibly interpolates for new
input patterns.
 Over-fitting or over-training:
 The network learns the training set well but does not generalize well
when it has too many trainable parameters for the given amount of
training data.
 Making small changes in the input space of a pattern without changing
the output components can improve the network's ability to generalize
to a test data set.
 Smaller networks are preferred, since a network with a large number
of nodes is capable of memorizing the training set rather than
generalizing from it.
Factors affecting the BPN
 Number of training data T
 Should be sufficient and proper.

 Training data should cover the entire expected input space, and while
training, training – vector pairs should be selected randomly from the
set.

 Let us consider the input space can be linearly separable into L


disjoint regions , and T is the lower bound on the number of training
patterns .

 If proper value of T is selected such that T/L >> 1, then the network
can able to discriminate pattern classes using fine piecewise
hyperplane partitioning.
Factors affecting the BPN
 Number of hidden layer nodes
 If there is more than one hidden layer in a BPN, the calculations
performed for a single layer are repeated for the other layers and
summed up at the end.
 For a network of reasonable size, the number of hidden nodes should be
a relatively small fraction of the size of the input layer.
 Example:
 If the network does not converge to a solution, it may need more hidden
nodes.
 Conversely, if the network converges, the user may try fewer hidden
nodes and then settle on a final size based on overall system performance.
Example
 Input pattern: [0, 1]. Target output: 1. Learning rate: α.
Example (Contd..,)
 Initial weights:
 [v11 v21 v01] = [0.6  -0.1  0.3]

 [v12 v22 v02] = [-0.3  0.4  0.5]

 [w1 w2 w0] = [0.4  0.1  -0.2]

 Activation function used:


Example (Contd..,)
 Calculate the net input:
 For z1 layer:

 For z2 layer:
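
The figures are not reproduced; with the usual net-input definition zinj = v0j + Σi xi vij, the given weights and input yield:

  z_in1 = 0.3 + 0(0.6) + 1(−0.1) = 0.2
  z_in2 = 0.5 + 0(−0.3) + 1(0.4) = 0.9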
Example (Contd..,)
 Applying activation function:

 Calculate the net input to the output layer.
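
Assuming the binary sigmoid f(x) = 1/(1 + e^(−x)) (the activation shown earlier only as an image), these net inputs give:

  z1 = f(0.2) ≈ 0.5498,   z2 = f(0.9) ≈ 0.7109
  y_in = w0 + z1 w1 + z2 w2 = −0.2 + 0.5498(0.4) + 0.7109(0.1) ≈ 0.0910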


Example (Contd..,)
 Applying activation function, we get

 Compute the error using

 Now,
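
Under the same binary-sigmoid assumption, the output and the standard error term δk = (tk − yk) f′(y_ink) work out to:

  y = f(0.0910) ≈ 0.5227
  f′(y_in) = y(1 − y) ≈ 0.2495
  δk = (1 − 0.5227)(0.2495) ≈ 0.1191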
Example (Contd..,)
 Therefore,

 Change in weight between the hidden and output layer.
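
With the standard rule Δwjk = α δk zj (and Δw0k = α δk), the changes in terms of the unstated learning rate α are:

  Δw1 = α(0.1191)(0.5498) ≈ 0.0655 α
  Δw2 = α(0.1191)(0.7109) ≈ 0.0847 α
  Δw0 = α(0.1191) ≈ 0.1191 α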


Example (Contd..,)
 Calculate the error between the input and hidden layer (see the
relations below).
 Here m = 1 and j = 1 to 2.
 Therefore,
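
Using the standard relations δ_inj = Σk δk wjk and δj = δ_inj f′(z_inj), with f′(z_inj) = zj(1 − zj) for the assumed binary sigmoid:

  δ_in1 = (0.1191)(0.4) ≈ 0.0476,   δ_in2 = (0.1191)(0.1) ≈ 0.0119
  f′(z_in1) = 0.5498(1 − 0.5498) ≈ 0.2475,   f′(z_in2) = 0.7109(1 − 0.7109) ≈ 0.2055
  δ1 ≈ (0.0476)(0.2475) ≈ 0.0118,   δ2 ≈ (0.0119)(0.2055) ≈ 0.0024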
Example (Contd..,)
 Now,
Example (Contd..,)
 Now,
Example (Contd..,)
 Calculate the change in weights between the input and hidden
layer (values given below).
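
With Δvij = α δj xi and Δv0j = α δj, and x = [0, 1]:

  Δv11 = Δv12 = 0 (since x1 = 0)
  Δv21 ≈ 0.0118 α,   Δv01 ≈ 0.0118 α
  Δv22 ≈ 0.0024 α,   Δv02 ≈ 0.0024 α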
Example (Contd..,)
 The final weights are calculated as
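
The slide's final values are not reproduced. In terms of the unstated learning rate α they would be w1 = 0.4 + 0.0655α, w2 = 0.1 + 0.0847α, w0 = −0.2 + 0.1191α, v21 = −0.1 + 0.0118α, v01 = 0.3 + 0.0118α, v22 = 0.4 + 0.0024α, v02 = 0.5 + 0.0024α, with v11 and v12 unchanged. A short script reproduces the whole example under the stated assumptions (binary sigmoid; α = 0.25 chosen purely for illustration):

```python
import numpy as np

f = lambda x: 1.0 / (1.0 + np.exp(-x))     # assumed binary sigmoid activation

x, t = np.array([0.0, 1.0]), 1.0
V  = np.array([[0.6, -0.3], [-0.1, 0.4]])  # rows: x1, x2; columns: z1, z2
v0 = np.array([0.3, 0.5])                  # hidden biases v01, v02
W  = np.array([0.4, 0.1])                  # w1, w2
w0 = -0.2                                  # output bias
alpha = 0.25                               # assumed value; not given on the slide

z = f(v0 + x @ V)                          # hidden activations [0.5498, 0.7109]
y = f(w0 + z @ W)                          # network output 0.5227
delta_k = (t - y) * y * (1 - y)            # output error term 0.1191
delta_j = delta_k * W * z * (1 - z)        # hidden error terms [0.0118, 0.0024]

W_new, w0_new = W + alpha * delta_k * z, w0 + alpha * delta_k
V_new, v0_new = V + alpha * np.outer(x, delta_j), v0 + alpha * delta_j
print(W_new, w0_new)
print(V_new, v0_new)
```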
