Dynamic Programming Methods in Pairwise Alignment

The document discusses various methods for aligning biological sequences including global and local alignment. It covers the Needleman-Wunsch and Smith-Waterman algorithms which use dynamic programming to find optimal sequence alignments.

Uploaded by

Priyanshu Panda


20 BI 019

I.MSc BIOINFORMATICS
3RD YEAR 5TH SEMESTER
BJB AUTONOMOUS COLLEGE, BBSR
~ OUTLINE ~

Alignment of a Pair of Sequences

Global vs Local Alignment

Optimal Alignment

Dynamic Programming?

Steps performed by the Needleman-Wunsch and the Smith-Waterman algorithms to produce a sequence alignment.

Tools based on the algorithms: EMBOSS Needle and EMBOSS Water
Sequence Alignment
Sequence Alignment
• Comparison of two or more sequences by
searching for a series of character patterns that
are the same and in the same order in the sequences.

Sequences that ALIGN well = a possible evolutionary relationship

• Sequence alignment also refers to the process of
assessing the degree of similarity between the
sequences.
Sequence Alignment
• PAIRWISE SEQUENCE ALIGNMENT: two sequences

• MULTIPLE SEQUENCE ALIGNMENT: three or more sequences

Comparing Two Sequences
• Point mutations, easy:
ACGTCTGATACGCCGTATAGTCTATCT
ACGTCTGATTCGCCCTATCGTCTATCT
• Indels are difficult, must align sequences:

ACGTCTGATACGCCGTATAGTCTATCT
CTGATTCGCATCGTCTATCT
• Aligned with gaps:
ACGTCTGATACGCCGTATAGTCTATCT
----CTGATTCGC---ATCGTCTATCT
Causes for sequence (dis)similarity
• Mutation (substitution): a character at a certain location is
replaced by another character
• Insertion or deletion (indel): a character is added or removed, e.g.

ATCC
ATTCC
• GAP = an insertion or a deletion event
AAAGT
TA_GT
Global vs Local Alignment
GLOBAL ALIGNMENT
• Align the entire sequence up to both ends, using all sequence characters
• Sequences – quite similar and approximately the same length
LOCAL ALIGNMENT
• Align stretches of sequences with high density of matches
• Sequences – similar along some lengths but dissimilar in others
Global vs Local Alignment
GLOBAL ALIGNMENT
• Stretched over the entire sequence to find as many matching characters as possible
LOCAL ALIGNMENT
• Favours finding conserved subsequences
• Subalignments considered – over conserved regions
SCORING a sequence alignment
• Scoring scheme: set of values assigned to different events in an alignment

MATCH = identity (MAXIMUM VALUE)
MISMATCH
GAP = insertion or deletion (MINIMUM VALUE)
• GAP PENALTY: negative score assigned to indel
events in pairwise sequence alignment
• Scoring scheme is not universal.
Comparing Two Sequences
• Point mutations, easy:
ACGTCTGATACGCCGTATAGTCTATCT
ACGTCTGATTCGCCCTATCGTCTATCT
• Indel operations: edit operations

ACGTCTGATACGCCGTATAGTCTATCT
CTGATTCGCATCGTCTATCT
• Aligned with gaps:
ACGTCTGATACGCCGTATAGTCTATCT
----CTGATTCGC---ATCGTCTATCT
SCORING a sequence alignment
• Match score: +1
• Mismatch score: 0
• Gap penalty: –1
ACGTCTGATACGCCGTATAGTCTATCT
    ||||| |||   || ||||||||
----CTGATTCGC---ATCGTCTATCT

• Matches: 18 × (+1) = +18
• Mismatches: 2 × 0 = 0
• Gaps: 7 × (–1) = –7
TOTAL similarity score = +11
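The tally on this slide can be reproduced with a few lines of Python. This is a minimal sketch: the function name `score_alignment` and its parameter names are made up for illustration, and "-" is assumed to be the gap symbol. It walks the two aligned rows column by column and adds the match, mismatch or gap value.

```python
def score_alignment(row_a, row_b, match=1, mismatch=0, gap=-1):
    """Score two already-aligned rows of equal length, column by column."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    total = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":   # a gap in either row is an indel event
            total += gap
        elif a == b:               # identical characters
            total += match
        else:                      # substitution
            total += mismatch
    return total


top = "ACGTCTGATACGCCGTATAGTCTATCT"
bottom = "----CTGATTCGC---ATCGTCTATCT"
print(score_alignment(top, bottom))  # 18 matches, 2 mismatches, 7 gaps -> 11
```

Changing the three keyword values shows why the scoring scheme is not universal: the same alignment gets a different total under a different scheme.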
Optimal Alignment
• The alignment that gives the highest similarity
score.
• To assess the degree of similarity between a pair
of sequences, we need to find the optimal
alignment.

• Number of matches = MAXIMUM

• Number of mismatches and gaps = MINIMUM
Methods of Sequence Alignment
1. DOT MATRIX
◦ Simple 2-D graphs

2. DYNAMIC PROGRAMMING
◦ Algorithms for optimization

3. HEURISTIC METHODS
◦ Fast computational methods of approximation
Methods of Sequence Alignment
1. DOT MATRIX:
◦ Graphical similarity comparison
◦ Both sequences placed along the two axes of a 2-D plot
◦ A dot is placed at every point of identity
◦ Does not show or produce a precise or optimal alignment
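A dot matrix is simple enough to sketch directly. The snippet below is an illustrative sketch only; the helper names `dot_matrix` and `render` are hypothetical and the "*" and "." symbols are arbitrary choices. It places one sequence along each axis and marks every position where the two characters are identical.

```python
def dot_matrix(seq_x, seq_y):
    """Return a grid where grid[i][j] is True when seq_x[i] == seq_y[j]."""
    return [[a == b for b in seq_y] for a in seq_x]


def render(seq_x, seq_y):
    """Draw the plot as text: seq_y along the top, seq_x down the side."""
    lines = ["  " + " ".join(seq_y)]
    for a, row in zip(seq_x, dot_matrix(seq_x, seq_y)):
        lines.append(a + " " + " ".join("*" if hit else "." for hit in row))
    return "\n".join(lines)


print(render("CTGATTCGC", "CTGATACGC"))
```

Runs of "*" along a diagonal correspond to stretches of identity. The plot shows where similarity lies but, as noted above, it does not itself produce an alignment or a score.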
DYNAMIC PROGRAMMING
Dynamic Programming?
• Richard E. Bellman at RAND Corporation: research on optimal
decision-making processes in the 1950s
• 1953 – Dynamic Programming
• Large-scale system analysis and optimization
• Computer-oriented approaches for breaking problems
into sub-problems

Dynamic Programming?
• Solving a complex problem (P) by
• first breaking it into a collection of simpler subproblems (SP)
• solving each subproblem just once
• storing their solutions (s) to avoid repetitive computations.
[Figure: a PROBLEM P split into subproblems SP, each with stored solutions s]
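The "solve once, store, reuse" idea can be sketched with a memoized recursion. This is a hedged illustration, not the method from the slides: the function name `best_score` is hypothetical, and the scores (+5, -2, -6) and the two sequences are borrowed from the Needleman-Wunsch example later in this deck. Each subproblem is "the best score for aligning the first i characters of x with the first j characters of y", and `functools.lru_cache` stores every answer so it is computed only once.

```python
from functools import lru_cache


def best_score(x, y, match=5, mismatch=-2, gap=-6):
    """Top-down global alignment score with memoized subproblems."""

    @lru_cache(maxsize=None)
    def solve(i, j):
        # Subproblem: best score for aligning x[:i] with y[:j].
        if i == 0 or j == 0:
            return gap * (i + j)   # only gaps are possible here
        s = match if x[i - 1] == y[j - 1] else mismatch
        return max(solve(i - 1, j - 1) + s,   # align x[i-1] with y[j-1]
                   solve(i - 1, j) + gap,     # gap in y
                   solve(i, j - 1) + gap)     # gap in x

    score = solve(len(x), len(y))
    # currsize = number of distinct subproblems that were actually solved
    return score, solve.cache_info().currsize


score, solved = best_score("TGCTCGTA", "TTCATA")
print(score, solved)  # 11 63
```

Only (8 + 1) × (6 + 1) = 63 subproblems are solved, whereas the same recursion without the cache would revisit them an exponential number of times.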
Biological Sequence Alignment and Dynamic Programming
Dynamic Programming
• The dynamic programming approach to sequence
alignment builds each result from the best prior results
computed so far.
• It aligns two sequences by inserting gaps at
different locations, so as to maximize the score of the
alignment.

• Examples:
◦ Needleman-Wunsch (1970)
◦ Smith-Waterman (1981)
Dynamic Programming
• The score is determined by the "match award",
"mismatch penalty" and "gap penalty".
• The higher the score, the better the alignment.
• If both penalties are set to 0, it always finds an
alignment with the maximum number of matches.
• It is used to compare two DNA or protein
sequences, to predict similarity of their functions.
Needleman-Wunsch Method
• The Needleman-Wunsch algorithm (1970)
◦ performs an optimal global alignment of two sequences
◦ is applied to align protein or nucleotide sequences.
• The Needleman-Wunsch algorithm is guaranteed to
find the alignment with the maximum score under the chosen scoring scheme.
• Scores for aligned characters are specified by the
similarity (substitution) matrix S(i,j):
the similarity of characters i and j.
Needleman-Wunsch Method

3-STEP PROCESS

1. INITIALIZATION
2. MATRIX FILLING
3. TRACEBACK & ALIGNMENT
1. INITIALIZATION
Gap Penalty = -6

First column: Gap Penalty × Row Number

First row: Gap Penalty × Column Number
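The initialization rule can be sketched as follows. This is a minimal illustration; the name `init_matrix` and the list-of-lists layout are assumptions, not something taken from the slides. The matrix has one extra row and column for the empty prefix, and each border cell is the gap penalty multiplied by its row or column number.

```python
def init_matrix(len_x, len_y, gap=-6):
    """Build a (len_x + 1) x (len_y + 1) matrix with the borders initialized."""
    F = [[0] * (len_y + 1) for _ in range(len_x + 1)]
    for i in range(len_x + 1):
        F[i][0] = gap * i      # first column: gap penalty x row number
    for j in range(len_y + 1):
        F[0][j] = gap * j      # first row: gap penalty x column number
    return F


for row in init_matrix(3, 2):
    print(row)
# [0, -6, -12]
# [-6, 0, 0]
# [-12, 0, 0]
# [-18, 0, 0]
```

The inner zeros are placeholders that the matrix-filling step overwrites.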
2. MATRIX FILLING
F(i,j) = score in the cell at row i, column j

F(i,j) = max { F(i-1,j-1) + S(xi,yj), F(i-1,j) + d, F(i,j-1) + d }

S(xi,yj) = substitution score
d = gap penalty

Assigned scoring scheme:
MATCH = +5
MISMATCH = -2
GAP = -6

The matrix is filled cell by cell using the predefined scoring scheme.
The partial alignment scores are calculated at
all positions of the alignment matrix.
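Each cell takes the best of three moves: the diagonal neighbour plus the substitution score, the cell above plus the gap penalty, or the cell to the left plus the gap penalty. The helper below is a hypothetical sketch of that single-cell update (the name `fill_cell` and its argument order are assumptions), using the scoring scheme from this slide.

```python
def fill_cell(diag, up, left, is_match, match=5, mismatch=-2, gap=-6):
    """Return the score of one cell from its three already-filled neighbours."""
    s = match if is_match else mismatch
    return max(diag + s,    # align the two characters
               up + gap,    # gap in the sequence along the top
               left + gap)  # gap in the sequence down the side


# First inner cell: after initialization the neighbours are
# diagonal = 0, up = -6, left = -6.
print(fill_cell(0, -6, -6, is_match=True))   # max(0+5, -12, -12) = 5
print(fill_cell(0, -6, -6, is_match=False))  # max(0-2, -12, -12) = -2
```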
3. TRACEBACK & ALIGNMENT

Optimal Alignment:

TGCTCGTA
T__TCATA
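The three steps can be put together in a short end-to-end sketch. This is an illustrative implementation under stated assumptions, not the code behind the slides: the function name `needleman_wunsch` is made up, a single linear gap penalty is used, "-" is printed as the gap symbol, and the ungapped input for the second row is assumed to be TTCATA (the slide's second row with gaps removed).

```python
def needleman_wunsch(x, y, match=5, mismatch=-2, gap=-6):
    """Global alignment: returns (score, aligned_x, aligned_y)."""
    n, m = len(x), len(y)

    # 1. Initialization: borders hold cumulative gap penalties.
    F = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        F[i][0] = gap * i
    for j in range(1, m + 1):
        F[0][j] = gap * j

    # 2. Matrix filling: best of diagonal, up and left moves.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if x[i - 1] == y[j - 1] else mismatch
            F[i][j] = max(F[i - 1][j - 1] + s,
                          F[i - 1][j] + gap,
                          F[i][j - 1] + gap)

    # 3. Traceback from the bottom-right cell back to the top-left cell.
    ax, ay = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            s = match if x[i - 1] == y[j - 1] else mismatch
            if F[i][j] == F[i - 1][j - 1] + s:
                ax.append(x[i - 1]); ay.append(y[j - 1])
                i -= 1; j -= 1
                continue
        if i > 0 and F[i][j] == F[i - 1][j] + gap:
            ax.append(x[i - 1]); ay.append("-")   # gap in y
            i -= 1
        else:
            ax.append("-"); ay.append(y[j - 1])   # gap in x
            j -= 1
    return F[n][m], "".join(reversed(ax)), "".join(reversed(ay))


score, top, bottom = needleman_wunsch("TGCTCGTA", "TTCATA")
print(score)   # 11
print(top)     # TGCTCGTA
print(bottom)  # T--TCATA
```

With this scoring scheme the result has 5 matches, 1 mismatch and 2 gaps (25 - 2 - 12 = 11), which agrees with the optimal alignment shown on the slide.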
Dynamic Programming Tools
• GLOBAL ALIGNMENT TOOLS
1. Needle (EMBOSS)
◦ optimal global alignment – N-W algorithm
2. Stretcher (EMBOSS)
◦ modified N-W algorithm to globally align larger sequences
3. GGSEARCH2SEQ
EMBOSS – Needle (EMBL-EBI)

https://www.ebi.ac.uk/Tools/psa/emboss_needle
Smith-Waterman Method
• The Smith-Waterman algorithm (1981) determines
similar regions between two nucleotide
or protein sequences (local alignment).
• Smith-Waterman is also a dynamic programming
algorithm and is a variation of Needleman-Wunsch adapted for local alignment.
• Follows the same 3-step process as the N-W
algorithm, with 0 added as an option in the 2nd step.
1. INITIALIZATION
• The first row and column are filled as per the gap
penalty of the scoring scheme.
• The negative scores are then substituted with "0", so the first row and column are all zeros.

• Unlike the N-W method, no negative values are
allowed in the alignment scoring matrix.
2. MATRIX FILLING
F(i,j) = score in the cell at row i, column j

F(i,j) = max { 0, F(i-1,j-1) + S(xi,yj), F(i-1,j) + d, F(i,j-1) + d }

Smith-Waterman introduces '0' so that whenever the
scoring matrix value would become negative, the value
is set to ZERO.
3. TRACEBACK
• The traceback is started from the highest scoring
position in the scoring matrix.
• The path is traced back until a cell that scores zero is reached.
• As such, it has the desirable property that it is
guaranteed to find the optimal local alignment
with respect to the scoring system being used
(which includes the substitution matrix and the
gap-scoring scheme).
Smith-Waterman Method
Optimal Alignment

C D
C D
+5 +5

OPTIMAL ALIGNMENT SCORE: +10
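The local variant can be sketched the same way. This is an illustrative implementation under stated assumptions: the function name `smith_waterman` is made up, a linear gap penalty is used, and the input sequences AACDGG and TTCDKK are invented for this example (the slide shows only the aligned pair C D). Scores are floored at zero during filling, and the traceback starts at the highest-scoring cell and stops at the first zero.

```python
def smith_waterman(x, y, match=5, mismatch=-2, gap=-6):
    """Local alignment: returns (score, aligned_x, aligned_y)."""
    n, m = len(x), len(y)

    # Initialization: first row and column are all zeros.
    F = [[0] * (m + 1) for _ in range(n + 1)]

    # Matrix filling: same three moves as the global case, floored at 0.
    best, best_i, best_j = 0, 0, 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if x[i - 1] == y[j - 1] else mismatch
            F[i][j] = max(0,
                          F[i - 1][j - 1] + s,
                          F[i - 1][j] + gap,
                          F[i][j - 1] + gap)
            if F[i][j] > best:
                best, best_i, best_j = F[i][j], i, j

    # Traceback: start at the highest score, stop at the first zero cell.
    ax, ay = [], []
    i, j = best_i, best_j
    while i > 0 and j > 0 and F[i][j] > 0:
        s = match if x[i - 1] == y[j - 1] else mismatch
        if F[i][j] == F[i - 1][j - 1] + s:
            ax.append(x[i - 1]); ay.append(y[j - 1])
            i -= 1; j -= 1
        elif F[i][j] == F[i - 1][j] + gap:
            ax.append(x[i - 1]); ay.append("-")   # gap in y
            i -= 1
        else:
            ax.append("-"); ay.append(y[j - 1])   # gap in x
            j -= 1
    return best, "".join(reversed(ax)), "".join(reversed(ay))


print(smith_waterman("AACDGG", "TTCDKK"))  # (10, 'CD', 'CD')
```

Only the conserved stretch CD is reported, with score +5 +5 = +10; the dissimilar flanking characters are left out of the alignment.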
Smith-Waterman Method

• However, the Smith-Waterman algorithm is
demanding of time and memory resources.
• As a result, it has largely been replaced in
practical use by the BLAST algorithm;
although not guaranteed to find optimal
alignments, BLAST is much more efficient.
Dynamic Programming Tools
• LOCAL ALIGNMENT TOOLS
1. Water (EMBOSS)
◦ optimal local alignment – enhanced S-W algorithm
2. Matcher (EMBOSS)
◦ local alignment of larger sequences using a rigorous algorithm based on LALIGN
3. LALIGN
4. SSEARCH2SEQ
◦ optimal local alignment using S-W algorithm
EMBOSS – Water (EMBL-EBI)

https://www.ebi.ac.uk/Tools/psa/emboss_water
Dynamic Programming applications
• Sequence comparison
• Gene recognition
• RNA structure prediction and hundreds of other
problems are solved by ever new variants of DP.

• Computationally intensive
• Paved the way for fast computational heuristic
methods of approximation, e.g. FASTA and
BLAST
CONCLUSION
All of the alignment methods in use
today are related to the original
method of Needleman and Wunsch.

Dynamic Programming methods still
have absolute relevance amongst the
current fast computational approaches
for biological sequence alignment.
THANK YOU EVERYONE
