Computat Onal Biology: Saad Mneimneh
Computat Onal Biology: Saad Mneimneh
Lecture 2
Saad Mneimneh
http://engr.smu.edu/~saad/
Genetic mapping
• Single chromosome with n genes
d
• Problems
– Too many chromosomes
– Not all genes have phenotypes that can be observed
– We usually don’t know what we are looking for
Saad Mneimneh
http://engr.smu.edu/~saad/
RFLP: Restriction Fragment Length
Polymorphism
• A restriction enzyme cuts the DNA molecules at every occurrence of
a particular sequence, called restriction site.
Saad Mneimneh
http://engr.smu.edu/~saad/
Gel-Electrophoresis
• DNA is cut into fragments using an enzyme
• The cut DNA is put on a Gel material
• An electric current is applied on the Gel
• DNA is negatively charge
• DNA fragments will start moving towards the
positively charged side
• Smaller fragments move faster
• After some time, we have a separation of the
different fragment lengths
Saad Mneimneh
http://engr.smu.edu/~saad/
DNA Sample
• Some cells are obtained
• Liquidized DNA is
obtained from these cells
Saad Mneimneh
http://engr.smu.edu/~saad/
Restriction Enzyme
• A restriction enzyme
is used to cut the
DNA into fragments
Saad Mneimneh
http://engr.smu.edu/~saad/
Apply Enzyme
Saad Mneimneh
http://engr.smu.edu/~saad/
Water Bath
• The tube is put on a
plate floating on water
at 37oC
Saad Mneimneh
http://engr.smu.edu/~saad/
Preparing the Gel
• In the meantime, we
prepare the Gel
• Agarose powder is
the basic substance
for making the Gel
Saad Mneimneh
http://engr.smu.edu/~saad/
Preparing the Gel
Saad Mneimneh
http://engr.smu.edu/~saad/
Preparing the Gel
• The container is
heated (in a
microwave if you
want) until the powder
completely dissolves
in the water
Saad Mneimneh
http://engr.smu.edu/~saad/
Preparing the Gel
• The liquid Gel is poured into
the inner box comb like piece
Saad Mneimneh
http://engr.smu.edu/~saad/
Gel Ready
• Gel ready
• Fill the H shaped
container with water
• Remove comb
well
Saad Mneimneh
http://engr.smu.edu/~saad/
Putting DNA on the Gel
original DNA cut by
uncut DNA Hind III
• DNA samples mixed with
colored solution and UV
reactive solution
Saad Mneimneh
http://engr.smu.edu/~saad/
Run the Gel
• DNA is negatively
charged
Saad Mneimneh
http://engr.smu.edu/~saad/
DNA Fragments Move
Saad Mneimneh
http://engr.smu.edu/~saad/
Viewing
Saad Mneimneh
http://engr.smu.edu/~saad/
Viewing
• Original uncut DNA sample
makes a sharp band at the
beginning (one big fragment)
Saad Mneimneh
http://engr.smu.edu/~saad/
Hybridization
• In a hybridization experiment, we try to
verify whether a specific sequence known
as probe binds (or hybridizes) with a DNA
fragment.
Saad Mneimneh
http://engr.smu.edu/~saad/
RFLP Markers
• We apply a number of probes in turn on the gel
• Each probe is mixed with a radioactive material
• Each probe hybridizes with a portion of the original DNA
• After cutting, the probe will hybridize with the fragments belonging to that
portion
• These fragment can now be observed due to the radioactive material
• RFLP marker is defined by a probe and the set of lengths (unordered) of
fragments that hybridize with the probe.
• Use analysis of recombination to order RFLP markers on the chromosome
RFLP marker
probe 1 probe 2
smear
fragments contained
in the probe
Gel
Saad Mneimneh
http://engr.smu.edu/~saad/
First RFLP map in 1987
• Donis-Keller et al. constructed the first RFLP map of the
human genome, positioning one RFLP marker per
approximately 10 million nucleotides.
Saad Mneimneh
http://engr.smu.edu/~saad/
RFLP and Gene Finding
• Using the ordering of RFLP markers on a chromosome,
we can approximately determine the location of a gene.
– How?
– Find the difference between the RFLP markers of family
members with the disease and family members not having the
disease.
– It is likely that the RFLP marker that consistently differ is on the
gene responsible for the disease, since family members have
more or less the same genetic characteristics.
– But we still don’t know where and what the exact gene is.
Saad Mneimneh
http://engr.smu.edu/~saad/
Physical Mapping
• Genetic mapping and RFLP
– (1) do not tell the actual distance in base pairs
– (2) if genes (or markers) are very close, one cannot
resolve their order, because the observed
recombination frequencies will be zero.
Saad Mneimneh
http://engr.smu.edu/~saad/
Hybridization Mapping
• Break several copies of DNA into fragments (using different restriction
enzymes).
• Clones may overlap (cutting DNA with distinct enzymes), and we want them
to (we will see why).
• Fingerprinting the clones: Now use DNA probes, and for every clone
determine the list of probes that hybridize with the clone
• When two clones have substantial overlap, their fingerprints will be similar.
• Reconstruct the relative order of the clones using the overlap information
(this order is unknown in RFLP)
Saad Mneimneh
http://engr.smu.edu/~saad/
Hybridization Mapping
• For n clones, and m probes, the hybridization
data consists of an n x m matrix D, such that
dij=1 if clone Ci contains probe pj.
Saad Mneimneh
http://engr.smu.edu/~saad/
Illustration
No overlap
No information
Covering String:
Saad Mneimneh
http://engr.smu.edu/~saad/
Covering string
Saad Mneimneh
http://engr.smu.edu/~saad/
Shortest Covering String
probes
A B C D E F G
1 1 1 CAEBGCFDAGEBAGD
2 1 1 1 1 1
3 1 1 1 1 1 1 2
4 1 1 1 1 1 3
4
clones 5 1 1 1 1 5
6 1 1 1 1 6
7 1 1 1 1 1 7
8
8 1 1 1 1 9
9 1
A covering string: S= AC ABEG BCDEFG BCDFG CDFG ADFG ABDEG ABDG D
Saad Mneimneh
http://engr.smu.edu/~saad/
Restriction Mapping
• Before using the list of probes in a clone as a
fingerprint, biologists used the order of
restriction fragments in a clone.
Saad Mneimneh
http://engr.smu.edu/~saad/
Double Digest
• Cut the DNA fragment with enzyme A, then enzyme B,
then both
• A: {2,2,3} 2 3 2
• B: {3,4} 3 4
• A+B: {1,2,2,2} 2 1 2 2
Saad Mneimneh
http://engr.smu.edu/~saad/
Partial Digestion
• Instead of obtaining lengths of restriction fragments, the DNA is digested in
such a way that fragments are formed by every two cuts and the lengths of
all fragments are obtained.
• Partial Digest Problem. Given ∆X, reconstruct X (E=∆X). Also known as the
turnpike problem in computer science, construct the geography of the
highway from knowing the distance between every two exits.
• No polynomial time algorithm for this problem is yet known, but in practice,
efficient algorithms exist.
Saad Mneimneh
http://engr.smu.edu/~saad/