2. Protein Structure Prediction

Uploaded by

indu221007

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views34 pages

2. Protein Structure Prediction

Uploaded by

indu221007

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

UNIT II – STRUCTURE PREDICTION AND

DRUG DESIGN

Dr. M Indira
Associate Professor
Department of Biotechnology
Vignan University
SYLLABUS
 Protein structure prediction;
 Introduction to comparative modeling;
 Sequence alignment;
 Constructing and evaluating a comparative model;
 Predicting protein structures by 'threading';
Molecular docking - AUTODOCK/EASYMODELLER
and HEX;
 Structure based de novo ligand design;
 Drug discovery;
 Chemoinformatics; QSAR.
PROTEIN STRUCTURE
PREDICTION
INTRODUCTION
• Proteins are an important class of
biological macromolecules
which are the polymers of amino
acids.
• Biochemists have distinguished
several levels of structural
organization of proteins. They
are:
– Primary structure
– Secondary structure
– Tertiary structure
– Quaternary structure
HOMOLOGY MODELLING
INTRODUCTION:
 Homology modeling, also known as comparative modeling of protein
is the technique which allows to construct an unknown atomic-
resolution model of the "target" protein from:
1.Its amino acid sequence and
2.An experimental 3D structure of a related homologous protein (the
"template").

 Prediction of the three dimensional structure of a given protein sequence

i.e. target protein from the amino acid sequence of a homologous
(template) protein for which an X-ray or NMR structure is available based
on an alignment to one or more known protein structures.
 If similarity between the target sequence and the template sequence
is detected, structural similarity can be assumed.
 In general, 30% sequence identity is required to generate an useful
model.
 Homologous proteins
 Portion of 2 proteins with similar amino
acids : Conserved Region
 Highly similar proteins may have same
basic
function
 Homology Modeling
 Comparative modeling of protein
 To predict protein structure based on
known 3D
shape protein as the template
UNKNOWN STRUCTURE
TEMPLA

?
TE

STRUCTURAL
MODEL
SEQUENCE
ALIGNMENT
As long as the length of two sequences and the percentage of identical residues fall
in the region marked as “safe” the two sequences are practically guaranteed to adopt
a similar structure.

SEQUENCE SIMILARITY &

STRUCTURAL
SIMILARITY
HOMOLOGY MODELLING
STRUCTURE PREDICTION BY HOMOLOGY
MODELLING
AN EXAMPLE
 To know the structure of sequence A (150 amino acids long), 1ST of all compare
sequence A to all the sequences of known structures stored in the PDB (using, for
example, BLAST), if a sequence B (300 amino acids long) containing a region of
150 amino acids that match sequence A with 50% identical residues.
 As this match (alignment) clearly falls in the safe zone(50%) , we can simply
take the known structure of sequence B (the template), cut out the fragment
corresponding to the aligned region, mutate those amino acids that differ between
sequences A and B, and finally arrive at our model for structure A. Structure A is
called the target and is of course not known at the time of modeling.

HISTORY
 The first homology modelling studies were done using wire and plastic models of
bonds and atoms as early as the 1960’s. The models were constructed by taking the
coordinates of a known protein structure and modified by hand for those amino
acids that did not match the structure.
 In 1969 David Phillips, Brown and co-workers published the first paper regarding
homology modelling. They modelled -lactalbumin based on the structure of
hen- egg white lysozyme. The sequence identity between these two proteins was
39%.
STEPS OF HOMOLOGY MODELLING
Protein
1. Template recognition and Sequence
initial alignment
2. Alignment correction Database
Sequence
3. Backbone generation alignment Searches
4. Loop modeling
5. Side chain modeling Secondary
Good
6. Model optimization Structur structure
e prediction
7. Model validation homolog
ue?
Improve
alignment
using
secondary
structure
prediction

Homology
modelling Minimisation

Three
Check dimensional
model structure
1.Template recognition and initial alignment
 Template recognition & selection involves searching the PDB for
homologous proteins with determined structures.
 The search can be performed using simple sequence alignment programs
such as BLAST or FASTA as the percentage identity between the Target
sequence and a possible template is high enough in the safe zone, to be
detected with these programs.

 To obtain a list of hits-the modeling templates and corresponding

alignments the program compares the query sequence to all the
sequences of known structures in the PDB using mainly two matrices:

1. A residue exchange matrix

2. An alignment matrix .
2. Alignment correction
 Sometimes it may be difficult to align two sequences in a region where the
percentage sequence identity is very low. One can then use other
sequences from homologous proteins to find a solution.

 For ex: To align the sequence LTLTLTLT with YAYAYAYAY which is nearly
impossible, then only a third sequence, TYTYTYTYT, that aligns easily to
both of them can solve the issue.

 2 is correct, because it leads to a small gap, compared to a huge hole

associated with alignment 1.
3.BACKBONE GENERATION
 When the alignment is correct, the backbone of the target can be created.
 The coordinates of the template-backbone are copied to the target.
 When the residues are identical, the side-chain coordinates are also
copied.

4.LOOP MODELLING
 After the sequence alignment, there are often regions created by
insertions and deletions that lead to gaps in alignment. These gaps are
modeled by loop modeling, which is less accurate. Currently, two main
techniques are used to approach the problem:
 The database searching method - this involves finding loops from
known protein structures and superimposing them onto the two stem
regions (main chains mostly) of the target protein. Some specialized
programs like FREAD and CODA can be used.
 The ab initio method - this generates many random loops and searches
for one that has reasonably low energy and φ and ψ angles in the
allowable regions in the Ramachandran plot.
The red loop is modeled with the green
residues as anchor residues. The insertion
of 2 residues results in a longer loop.
5.Side-Chain Modeling
 This is important in evaluating protein–ligand interactions at active sites
and protein–protein interactions at the contact interface.
 A side chain can be built by searching every possible conformation for
every torsion angle of the side chain to select the one that has the lowest
interaction energy with neighboring atoms.
 A rotamer library can also be used, which has all the favorable side chain
torsion angles extracted from known protein crystal structures.
6: Model Optimization
 energy minimization procedure on the entire model, by adjusting the
relative position of the atoms so that the overall conformation of the
molecule has the lowest possible energy potential. The goal is to
relieve steric collisions without altering the overall structure.
 Optimization can also be done by Molecular Dynamic Simulation which
moves the atoms toward a global minimum by applying various
stimulation conditions (heating, cooling, considering water molecules)
thus having a better chance at finding the true structure.
 Energy = Stretching Energy +Bending Energy +Torsion Energy +Non-
Bonded Interaction Energy
7.Model Validation
 Every homology model contains errors. Two main reasons are:
1. The percentage sequence identity between template and target. If it is
greater than 90%, the accuracy of the model can be compared to
crystallographically determined structures & if less than 30% large
error occurs
2. The number of errors in templates
 The final model has to be evaluated for checking the φ–ψ angles,
chirality,
bond lengths, close contacts and also the stereo chemical
properties. Modeling Programs like Modeller, SWISS MODEL,
Schrodinger, 3D- JIGSAW.
 A successful model depends on template selection, algorithm used and the
validation of the model.
Advantages
 It can find the location of alpha carbons of key residues inside the
folded protein.
 It can help to guide the mutagenesis experiments, or hypothesize
structure-
function relationships.
 The positions of conserved regions of the protein surface can help
identify putative active sites, binding pockets and ligands.

Disadvantages

 Homology models are unable to predict conformations of insertions

or deletions, or side chain positions with a high level of accuracy.
 Homology models are not useful in modeling and ligand docking
studies necessary for the drug designing and development process.
However, it may be helpful for the same, if the sequence identity with
the template is greater than 70%.
http://blast.ncbi.nlm.nih.
gov/
1

3
http://www.ebi.ac.uk/Tools/msa/clusta
lw2/

2
 CPHmodel
 EsyPred3D
 SWISS-
MODEL
http://swissmodel.expasy.
org/
 Swiss-PdbViewer
4.1
 YASARA
 Accelrys DS Viewer
5.0
PDB
YASA
 Verification of
Model
 Verify3D
 ErratPlot
 Ramachandran
Plot
RAMACHANDRAN PLOT
 In a polypeptide the main chain (N-Calpha)
and (Calpha-C bonds) relatively are free to
rotate. These rotations are represented by the
torsion angles phi (φ) and psi(ψ ), respectively.
 A Ramachandran plot (or a [φ,ψ] plot),
originally developed in 1963 by G. N.
Ramachandran, C. Ramakrishnan, and V.
Sasisekharan,is a way to visualize
backbone dihedral angles ψ against φ of
amino acid residues in protein structure.
A Ramachandran plot can be used:
 One is to show in theory which values, or
conformations, of the ψ and φ angles are
possible for an amino-acid residue in a
protein.
 second is to show the empirical distribution
of datapoints observed in a single structure
in usage for structure validation, or else in a
database of many structures.

Homology Modelling
No ratings yet
Homology Modelling
26 pages
3.7 Protein structure prediction and classification.pptx
No ratings yet
3.7 Protein structure prediction and classification.pptx
20 pages
Katalog Maskot 2024
No ratings yet
Katalog Maskot 2024
82 pages
Lecture 5 Molecular modelling
No ratings yet
Lecture 5 Molecular modelling
13 pages
Sulphater Atttack On Concrete by Mehta
No ratings yet
Sulphater Atttack On Concrete by Mehta
5 pages
MCQ on bonding
No ratings yet
MCQ on bonding
47 pages
HOMOLOGY MODELLING
No ratings yet
HOMOLOGY MODELLING
11 pages
Protein Tertiaty Structure Prediction
No ratings yet
Protein Tertiaty Structure Prediction
12 pages
Homology Modeling Tutorial
No ratings yet
Homology Modeling Tutorial
11 pages
Lecture for HND Homology Modelling
No ratings yet
Lecture for HND Homology Modelling
31 pages
3D Structure Prediction
No ratings yet
3D Structure Prediction
18 pages
AMS แคตตาล็อค (สีฟ้า New)
No ratings yet
AMS แคตตาล็อค (สีฟ้า New)
16 pages
modelling.ppt
No ratings yet
modelling.ppt
32 pages
Generation of 3D Structure of Protein (1)
No ratings yet
Generation of 3D Structure of Protein (1)
11 pages
Formulation and Evaluation of ChitosanNaClMaltodex
No ratings yet
Formulation and Evaluation of ChitosanNaClMaltodex
15 pages
Homology Modeling
No ratings yet
Homology Modeling
22 pages
Protein Modelling
No ratings yet
Protein Modelling
20 pages
Protein Modeling
No ratings yet
Protein Modeling
17 pages
MSDS Granular sulphur
No ratings yet
MSDS Granular sulphur
9 pages
Experiment-7(HOMOLOGY MODELING)
No ratings yet
Experiment-7(HOMOLOGY MODELING)
12 pages
Protein Structure Prediction
No ratings yet
Protein Structure Prediction
13 pages
3rdunitii
No ratings yet
3rdunitii
12 pages
Lecture 12 (Structural Bioinformatics) Cbdb30310921cec2c447276bb2d88a8f
No ratings yet
Lecture 12 (Structural Bioinformatics) Cbdb30310921cec2c447276bb2d88a8f
30 pages
Computation prediction protein structure
No ratings yet
Computation prediction protein structure
22 pages
Tools For Analyzing Comparative Protein Structure
No ratings yet
Tools For Analyzing Comparative Protein Structure
7 pages
17.1- The Flow of Energy _ Quizizz
No ratings yet
17.1- The Flow of Energy _ Quizizz
5 pages
Protein Structure Prediction.pptx
No ratings yet
Protein Structure Prediction.pptx
23 pages
Protein structure prediction and modeling
No ratings yet
Protein structure prediction and modeling
20 pages
Lecture 13- Protein 3 D Structure
No ratings yet
Lecture 13- Protein 3 D Structure
20 pages
Protein Side Chain Correction
No ratings yet
Protein Side Chain Correction
28 pages
Lec6-Protein Structure Prediction
No ratings yet
Lec6-Protein Structure Prediction
16 pages
Homology modeling
No ratings yet
Homology modeling
2 pages
TI _ Plastigen G
No ratings yet
TI _ Plastigen G
2 pages
12 Arrhenius Made Easy
No ratings yet
12 Arrhenius Made Easy
12 pages
Protein Structure Prediction Using Homology Modeling
No ratings yet
Protein Structure Prediction Using Homology Modeling
11 pages
Structural Bioinformatics and Protein Structure Prediction (1)
No ratings yet
Structural Bioinformatics and Protein Structure Prediction (1)
14 pages
Structural bioinformatics
No ratings yet
Structural bioinformatics
23 pages
BIF101 - II - Spring 2024
No ratings yet
BIF101 - II - Spring 2024
8 pages
Document (2) (14)
No ratings yet
Document (2) (14)
3 pages
Protein Tertiary Structures: Prediction From Amino Acid Sequences
No ratings yet
Protein Tertiary Structures: Prediction From Amino Acid Sequences
7 pages
Metallurgical Formula Cheat Sheets.Vol 2
No ratings yet
Metallurgical Formula Cheat Sheets.Vol 2
57 pages
s4-chem-1-2025
No ratings yet
s4-chem-1-2025
6 pages
Bioinformatics Notes - 17Bt54: Module - 4
No ratings yet
Bioinformatics Notes - 17Bt54: Module - 4
48 pages
Homology modeling
No ratings yet
Homology modeling
5 pages
Workshop Protein Modeling PDF
No ratings yet
Workshop Protein Modeling PDF
54 pages
Dr. Qudsia Yousafi
No ratings yet
Dr. Qudsia Yousafi
30 pages
western blotting
No ratings yet
western blotting
12 pages
Protein Modeling in Biochemistry
No ratings yet
Protein Modeling in Biochemistry
29 pages
7 HomologyModelling 12oct2020
No ratings yet
7 HomologyModelling 12oct2020
8 pages
Protein Modelling
No ratings yet
Protein Modelling
15 pages
3-D Structure of Proteins: Laws of Physics Theory of Evolution
No ratings yet
3-D Structure of Proteins: Laws of Physics Theory of Evolution
9 pages
Jeffrey 1975
No ratings yet
Jeffrey 1975
4 pages
Tertiary Structure Prediction Methods: Any Given Protein Sequence
No ratings yet
Tertiary Structure Prediction Methods: Any Given Protein Sequence
29 pages
Sanchez CurrOpinStructBiol 1997
No ratings yet
Sanchez CurrOpinStructBiol 1997
9 pages
Mineral (Garam Rich Minerals)
No ratings yet
Mineral (Garam Rich Minerals)
4 pages
Agilent FPD-Low Level Sulfur Detection in CO2
No ratings yet
Agilent FPD-Low Level Sulfur Detection in CO2
4 pages
Homology Modeling, Also Known As Comparative Modeling of
No ratings yet
Homology Modeling, Also Known As Comparative Modeling of
19 pages
Unit 3
No ratings yet
Unit 3
9 pages
Isometric Pid Pertamina
No ratings yet
Isometric Pid Pertamina
3 pages
Homology Model Prediction
No ratings yet
Homology Model Prediction
1 page
Pre-Assessment Questions
No ratings yet
Pre-Assessment Questions
18 pages
Station
No ratings yet
Station
30 pages
Protein Structure Modeling
No ratings yet
Protein Structure Modeling
21 pages
Forensics High School Complete Listdocx
No ratings yet
Forensics High School Complete Listdocx
3 pages
Daftar Pustaka
No ratings yet
Daftar Pustaka
4 pages
Protein Modelling: (Building 3D Models of Proteins)
No ratings yet
Protein Modelling: (Building 3D Models of Proteins)
19 pages
Physical Pharmacy Lab
No ratings yet
Physical Pharmacy Lab
30 pages
Homo Logy
No ratings yet
Homo Logy
8 pages
Homolgy Modeling
No ratings yet
Homolgy Modeling
19 pages
CC - Week 1 (Lec)
No ratings yet
CC - Week 1 (Lec)
7 pages
Protein Structure Determination: Goal
No ratings yet
Protein Structure Determination: Goal
8 pages
Bronsted Lowry Concept
No ratings yet
Bronsted Lowry Concept
4 pages
Genome Sequencing Projects: Increase in The Number of Protein Sequences
No ratings yet
Genome Sequencing Projects: Increase in The Number of Protein Sequences
27 pages
Protective Coating For Mildsteel Pipes&fittings
No ratings yet
Protective Coating For Mildsteel Pipes&fittings
20 pages
MEE Unit 2
No ratings yet
MEE Unit 2
23 pages
Mebeverine Prolonged-Release Capsules - British Pharmacopoeia
100% (1)
Mebeverine Prolonged-Release Capsules - British Pharmacopoeia
3 pages
Logical Modeling of Biological Systems
From Everand
Logical Modeling of Biological Systems
Luis Fariñas del Cerro
No ratings yet
Python for Chemistry: An introduction to Python algorithms, Simulations, and Programing for Chemistry (English Edition)
From Everand
Python for Chemistry: An introduction to Python algorithms, Simulations, and Programing for Chemistry (English Edition)
Dr. M. Kanagasabapathy
5/5 (1)
Protein Modelling
No ratings yet
Protein Modelling
53 pages
Bif401 Solved Final Papers 2017
No ratings yet
Bif401 Solved Final Papers 2017
8 pages
Homology Modelling
No ratings yet
Homology Modelling
29 pages
MSDS Kortho Leibinger K70031
No ratings yet
MSDS Kortho Leibinger K70031
4 pages
Protein Structure Prediction
No ratings yet
Protein Structure Prediction
17 pages
Homology Modeling: Ref: Structural Bioinformatics, P.E Bourne Molecular Modeling, Folkers
No ratings yet
Homology Modeling: Ref: Structural Bioinformatics, P.E Bourne Molecular Modeling, Folkers
16 pages
Molecular Modelling and Drug Design
From Everand
Molecular Modelling and Drug Design
K Anand Solomon
No ratings yet
SSPC Guide 24-2018 Soluble Salt Testing Frequency and Locations on New Steel Surfaces (1)
No ratings yet
SSPC Guide 24-2018 Soluble Salt Testing Frequency and Locations on New Steel Surfaces (1)
5 pages
Precipitation Titration
No ratings yet
Precipitation Titration
1 page
Fosroc Nitoflor Conductive: Constructive Solutions
No ratings yet
Fosroc Nitoflor Conductive: Constructive Solutions
2 pages
En 42
No ratings yet
En 42
1 page
Methods in Molecular Biology Volume Vol. 857
100% (1)
Methods in Molecular Biology Volume Vol. 857
432 pages

2. Protein Structure Prediction

Uploaded by

2. Protein Structure Prediction

Uploaded by

UNIT II – STRUCTURE PREDICTION AND

 Prediction of the three dimensional structure of a given protein sequence

SEQUENCE SIMILARITY &

 To obtain a list of hits-the modeling templates and corresponding

1. A residue exchange matrix

 2 is correct, because it leads to a small gap, compared to a huge hole

 Homology models are unable to predict conformations of insertions

You might also like