Bioinformatics I
Marineil C. Gomez
School of Chemical, Biological and Materials Engineering and Sciences
Mapua University
Pairwise Sequence Alignment
Outline
⚫ Definition of homology, orthologs, and paralogs
⚫ Global and Local Alignment
⚫ Scoring Matrices
⚫ Nucleotide Models
⚫ Protein Models
Why Align Sequences?
⚫ Is the gene/protein related to any other gene/protein?
⚫ Relatedness:
  ⚫ Sequence level = homologous
  ⚫ Common functions
For Local Alignments
By Percent Identity:
Cons:
⚫ Where does the threshold lie? (26% vs. 30%; 40% vs. 60%)
⚫ Differences in the length of proteins (20 aa vs. 150 aa)
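As a minimal sketch of why a percent-identity cutoff is hard to interpret, the helper below computes percent identity for two already-aligned sequences. The function name `percent_identity`, the gap character `-`, and the choice of denominator (all aligned columns, gaps included) are assumptions made for illustration; other conventions exist and give different values.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns holding identical residues.

    Assumption: the denominator is every column that is not gap-vs-gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences have a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Toy data (not real proteins): 6 of 20 and 45 of 150 positions match.
short_pid = percent_identity("A" * 20, "A" * 6 + "G" * 14)
long_pid = percent_identity("A" * 150, "A" * 45 + "G" * 105)
print(short_pid, long_pid)  # both are 30.0
```

Both toy alignments score 30%, yet 45 matches over 150 positions are much less likely to arise by chance than 6 matches over 20, which is why percent identity alone is a weak criterion.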
For Local Alignments
By Relative Entropy:
• The relative entropy (H) of the target and background distributions
measures the information that is available per aligned amino acid
position that, on average, distinguishes a true alignment from a chance
alignment
• For each substitution matrix with its unique target frequencies $q_{ij}$ and
background distributions $p_i p_j$, it is possible to derive the relative entropy
H as follows:
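In its standard form, assumed here to be the one intended, the relative entropy in bits is $H = \sum_{i,j} q_{ij} \log_2 \frac{q_{ij}}{p_i p_j}$. The Python sketch below evaluates this sum for a toy two-letter alphabet. The function name `relative_entropy`, the dictionary layout (target frequencies keyed by ordered residue pairs and summing to 1), and the numbers are all illustrative assumptions, not values from any published matrix.

```python
import math


def relative_entropy(q: dict, p: dict) -> float:
    """Relative entropy H in bits.

    q maps ordered residue pairs (i, j) to target frequencies q_ij.
    p maps each residue i to its background frequency p_i.
    """
    h = 0.0
    for (i, j), q_ij in q.items():
        if q_ij > 0:  # a zero-frequency pair contributes nothing to the sum
            h += q_ij * math.log2(q_ij / (p[i] * p[j]))
    return h


# Toy background: two equally common residues.
p = {"A": 0.5, "B": 0.5}

# Target frequencies that favor identical pairs (related sequences).
q_related = {("A", "A"): 0.4, ("A", "B"): 0.1,
             ("B", "A"): 0.1, ("B", "B"): 0.4}

# Target frequencies equal to chance expectation: q_ij = p_i * p_j.
q_chance = {pair: p[pair[0]] * p[pair[1]] for pair in q_related}

print(relative_entropy(q_related, p))  # positive: about 0.278 bits per position
print(relative_entropy(q_chance, p))   # 0.0: no information beyond chance
```

When the target frequencies equal the background product, H is zero, so an aligned position carries no evidence for relatedness. The further the target frequencies depart from chance, the larger H becomes and the shorter an alignment needs to be to stand out.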