This document discusses next generation sequencing technologies. It provides details on several massively parallel sequencing platforms and describes their advantages over traditional Sanger sequencing such as higher throughput, lower costs, and ability to process millions of reads in parallel. It then outlines several applications of next generation sequencing like mutation discovery, transcriptome analysis, metagenomics, epigenetics research and discovery of non-coding RNAs.
Explains high throughput sequencing: lower cost, shorter time, parallel processing, and advantages over capillary sequencing. Describes various NGS technologies including Pyrosequencing, Illumina, and others facilitating high throughput and efficiency.
Covers applications such as mutation discovery, RNA-Seq, metagenomics, and importance in diagnostics.
Discusses techniques like ChIP-Seq for studying DNA-protein interactions and chromatin packaging effects on gene expression.
Focuses on regulatory non-coding RNAs and the challenges in their predictive modeling.
Introduction of third-generation sequencing and notable research impacts on genetics from expert publications.
High throughputsequencing
Lower Cost
Less time
Parallel Sequencing process
Sequence thousands of sequences at once
3.
Massively ParallelSignature Sequencing (Lynx
Therapeutics)
Polony Sequencing (Agencourt Biosciences)
454 Pyrosequencing (454 Life Sciences)
Illumina (Solexa) sequencing
SOLiD Sequencing (Applied Bio-systems)
Ion Semiconductor sequencing (Ion Torrent Systems Inc.)
DNA Nanoball (Complete Genomics)
Heli-oscope Single Molecule Sequencing
Single Molecule SMRT Sequencing (Pacific Biosciences)
4.
The abilityto process millions of sequence reads in
parallel rather than 96 at a time.
NGS fragment libraries do not need vector based
cloning and E. coli based amplification stages used in
capillary sequencing.
Shorter Read Lengths.
Capillary sequencing – 96 wells, NGS – 10 million wells
High throughput :
Sanger: 96 reads < 800-1000b/run
Solexa: 1.2X106 reads < 75b/run
5.
High Throughput
Adapter ligation
Requirement of relatively little input DNA
Production of shorter read lengths(more
convenient in downstream processing).
Discovery ofmutations that determine
phenotypes.
Conventional Approach – PCR amplified –
Capillary sequencing – alignment/detection.
Whole genome resequencing is faster and less
expensive using NGS.
E.g. Discovery of SNP in C. elegans required only a
single run of Illumina Sequencer. (Hiller et.al.)
16.
Massively ParallelSequencing method for
Transcriptome analysis.
mRNA (transcript) – cDNA – sequencing using Next
Generation Short Read Sequencing technology.
Reads are aligned to a reference genome and a
Transcriptome map is constructed.
Advantages :
Does not require existing genomic sequence unlike
hybridization.
Low background noise
High resolution – up to 1 bp (identification of SNP)
High throughput, low cost
17.
Even thoughcomplete genome sequence are available
for disease causing microbes, continuous evolution by
mutation and sequence exchange.
The depth of sampling of NGS helps greatly in
identification of rare VARIANTS in the clinical strain
isolates.
This is not possible in sequencing PCR products which is
commonly done in a clinical diagnostic setting, because
the low signal strength from variant nucleotides would
not be detectable on a capillary sequencer.
The cloning bias is eliminated.
Improve diagnostics, monitoring and treatments.
18.
Metagenomics –sequencing of DNA of
uncultured/unpurified microbial population followed
by bioinformatics based analysis by comparison.
Associated cost of capillary sequencing remains very
high.
Elimination of Metagenomic signatures from certain
microbial sequences that are not carried stably by
E.coli. during cloning.
Characterizations of the microbial census of the human
and mouse intestinal flora and the oral cavity
Microbiome.
19.
DNA-Protein interactions– DNA packaging into
histones
Regulatory protein Binding
Exploring Chromatin Packaging
20.
ChIP requiresan antibody specific for the DNA
binding protein.
Protein DNA cross linker is added.
Cell lysis --- DNA fragmentation – Antibody
Immunoprecipitation.
Crosslinking reversal or southern blotting or qPCR
ChIP-Seq --- simply make an adaptor ligated library of
the released immunoprecipitated fragments and
sequence them en masse.
High coverage and higher resolution.
NRSF and STAT1 transcription factors.
21.
Genomic DNApackaging into histones – availability of
genes for transcription.
ChIP-Seq to compare histone methylations at promoter
regions to check gene expression levels.
In a study, 20 histones, one histone variant (H2A.Z), RNA
Polymerase II and insulator binding protein.
Result: Changes in Chromatin state at specific promoters
reflect changes in gene expression they control.
22.
ncRNAs– regulatoryRNA molecules.
Prediction of precursor and sequences of ncRNA by
in silico methods is of limited use.
Examines the potential for secondary structure
formation, putative genomic identification and
regulatory molecules.
Identification of 21-U -RNAs in C.elegans.
23.
Third generation(Next-Next Generation) Sequencing.
Variations in sequences of human genome (about 5%
considering the allele variation) is found using NGS.
A pilot project for determination of additional Human
Genome sequences.
24.
Elaine R.Mardis (2008) the impact of next-generation
sequencing technology on genetics. Cell vol.24
No.3,133-14
Jorge S Reis-Filho (2010): Next-Generation Sequencing,
Breast Cancer Research 2010, 11(Suppl 3)
Elaine R. Mardis (2009): Next-Generation Sequencing
Methods. Annu. Rev. Genomics hum genet. 9:387-402
Some websites