Bioinformatics
Bioinformatics is the
design and
development of
computer-based
technology that
supports life science.
Therefore, my role as a
bioinformatician or
computational
biologist is to give
biological researchers
systematic assistance,
using bioinformatics tools
and systems, in solving
real-world problems
and their complexity.
What I want to do in
the future
Specifically, I’m
interested in
• Studying data management
and data integration in
Bioinformatics
– Closely focus on the large and widely
used set of data sources;
– Study integration approaches and
analytical methods needed to carry out
bioinformatics investigations;
– Plan out a data warehousing tool using
the studied approaches and methods
• New techniques in
programming
– Study complex and new programming
techniques
– Enhance known programming skills
• Further research and
development
– Continuous development of the
research study in real world
application;
– Streamlining these tools to adapt to
increasing data management and
integration problems
Things I need to prepare
Master’s degree in Bioinformatics
Training and research projects
Doctorate degree opportunities
Postdoctoral opportunities
Career opportunities in
Bioinformatics
Tracking the route to achieve my
goal
• Knowledge establishment
and mastery of subject
• Application of tools and
approaches into formal
research
• Development of new tools
and methods for arising
complexities of biological
problems
• Active participation in
bioinformatics seminars,
consortia,
discussion groups and
organizations
Summary information
Bioinformatics
The term was first coined in 1988 by Dr. Hwa Lim
“The study of the information content
and information flow in biological
systems and processes”.
“… addresses problems related to the
Pioneer of
Bioinformatics
Atlas of Protein
Sequence and Structure
DATABASE
A collection of information,
usually stored in an electronic
format that can be searched by a
computer.
A large, organized body of
persistent data, usually
associated with computerized
software designed to update,
query, and retrieve components
of the data stored within the
Need for Biological
Databases
Explosive growth in
biological data (sequences,
3D structures, 2D gel
analysis, MS analysis, ...)
Easy access to the
information; and
A method for extracting
only that information
Building of Sequence
Databases
[Figure: diagram of DNA sequence fragments with the labels Labs, GenBank, Curators, Genome Assembly, Algorithms, and RefSeq]
DDBJ/EMBL/GenBank
The International Nucleotide Sequence
Database Collaboration
DDBJ: DNA Data Bank of Japan
CIB: Center for Information Biology and DNA Data Bank
of Japan
NIG: National Institute of Genetics
EMBL: European Molecular Biology Laboratory
EBI: European Bioinformatics Institute
NCBI: National Center for Biotechnology Information
NLM: National Library of Medicine
NIH: National Institutes of Health
International Advisory Meeting
International Collaborative Meeting
DRUG DESIGNING
[Figure: drug design pipeline diagram labeled Genomic Biology, Large Molecule Targets, Assays, Small Molecules, Combinatorial Chemistry, Bioinformatics, High Throughput Screening, Cheminformatics, and Clinical]
Technology is impacting this process
Process steps: Identify disease, Isolate protein, Find drug, Preclinical testing
GENOMICS, PROTEOMICS & BIOPHARM.
Potentially producing many more targets
and “personalized” targets
HIGH THROUGHPUT SCREENING
Screening up to 100,000 compounds a
day for activity against a target protein
VIRTUAL SCREENING
Using a computer to predict activity
COMBINATORIAL CHEMISTRY
Rapidly producing vast numbers
of compounds
MOLECULAR MODELING
Computer graphics & models help improve activity
IN VITRO & IN SILICO ADME MODELS
Tissue and computer models begin to replace animal testing
Genome-enabled science: what
can we do?
IMMEDIATE
DNA
• IDENTIFY GENES, ORFS, ETC.
• TRANSCRIPTION/TRANSLATION START/STOPS
• NUCLEOTIDE COMPOSITION
• “WORDS”, CONSENSUS MOTIFS
• CODON USAGE
• GENOME STRUCTURE
GENES
• ORTHO-/PARA-/HOMOLOGS
• GENE FAMILIES
• PREDICTED CELL LOCALIZATION
• MOTIFS
• 3-D STRUCTURAL PREDICTIONS
• BIOCHEMICAL PATHWAYS
POST-GENOMIC STUDIES
LAB EXPERIMENTS
• GLOBAL GENE EXPRESSION
• GLOBAL PROTEIN ANALYSIS
• GLOBAL MUTAGENESIS (knock-outs, reporter genes)
• PROTEIN OVEREXPRESSION
• SCREENS/ASSAYS (gene function, drug design, virulence)
“IN SILICO STUDIES”
• COMPARATIVE GENOMICS
• GENE EXPRESSION HIERARCHY
FUTURE
Comprehensive understanding of:
• PHYSIOLOGY
• GENETICS
• EVOLUTION
• BIOCHEMISTRY
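A few of the immediate DNA-level analyses named on this slide (nucleotide composition, ORF identification, codon usage) can be sketched in a few lines of Python. This is a minimal illustration, not a production gene finder: it assumes a plain DNA string, scans only the forward strand, and treats an ORF as an ATG followed by the first in-frame stop codon. The function names and the `min_codons` parameter are hypothetical.

```python
from collections import Counter

STOP_CODONS = {"TAA", "TAG", "TGA"}


def gc_content(seq):
    """Nucleotide composition: fraction of G and C bases in the sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_orfs(seq, min_codons=3):
    """Return sorted (start, end) pairs of forward-strand ORFs.

    Assumption for this sketch: an ORF runs from an ATG to the first
    in-frame stop codon. `end` is exclusive and includes the stop codon.
    `min_codons` counts the codons before the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        i = frame
        while i + 3 <= len(seq):
            if seq[i:i + 3] == "ATG":
                # Walk codon by codon until an in-frame stop is found.
                for j in range(i + 3, len(seq) - 2, 3):
                    if seq[j:j + 3] in STOP_CODONS:
                        if (j - i) // 3 >= min_codons:
                            orfs.append((i, j + 3))
                        i = j  # resume scanning after this ORF
                        break
            i += 3
    return sorted(orfs)


def codon_usage(orf):
    """Count how often each codon occurs in an in-frame sequence."""
    orf = orf.upper()
    return Counter(orf[k:k + 3] for k in range(0, len(orf) - 2, 3))


if __name__ == "__main__":
    dna = "CCATGAAATTTTAGGG"  # toy sequence, not real data
    for start, end in find_orfs(dna):
        print(start, end, dna[start:end], dict(codon_usage(dna[start:end])))
    print("GC content:", gc_content(dna))
```

Real genome annotation also checks the reverse strand, alternative start codons, and statistical signals, which this sketch leaves out.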
Evolving roles of computational analysis in biology
Pre-sequencing era (before 1978-80)
Study biological function
Pre-genomic era (1980-1996)
Study biological function
Clone/sequence gene
Analyze/interpret sequence
Post-genomic era (1996-)
Sequence genome of all genes
Analyze/interpret sequences
Prioritize targets
Study biological function
INFORMATICS AND BIOLOGICAL COMPLEXITY
FUTURE CHALLENGES
Inputs to INFORMATICS:
• Genomic Biology
• Expression (RNA, protein, metabolite)
• Biochemistry
• Genetics
Protein structure and function
• Structure: 2° (Tm, signal peptide, domains); 3° (folds, interaction surface)
• Functional predictions (Gene Ontology): molecular function, cellular process, location
Networks and Pathways
• Quantitative differences: RNA, protein
• Qualitative differences: proteolysis, PTM
• Correlations with SNPs
Analysis of the “biological system”
• Temporal changes in the “molecular portrait”
• In multiple tissues/cells and physiological conditions
• Predictors of: disease status, response to therapy, toxicity
…s Revolution: What do we need to fully e…
• New technologies for follow-up studies on a large scale
(gene function, structural biology, protein-protein interactions, etc.)
• More sophisticated tools for analysis of large data sets
• New generation of biologists with a solid foundation in …
(mathematics, statistics, computer science) and an …
• Truly interdisciplinary research
What type of people does bioinformatics
require?
Companies need cross-functional
manpower at all levels: biologists with IT
skills and IT professionals with a serious
interest in biology (just one of the skills is not
enough).
What type of work is available in
Bioinformatics?
Bioinformaticians need to perform two
critical roles:
1. Develop IT tools embodying novel
algorithms and analytical techniques, and
What skills should a Bioinformatician
have?
The following are the "core requirements" of
bioinformaticians:
1. A fairly deep background in some aspect of molecular
biology (biochemistry, molecular biology or
molecular biophysics).
2. An understanding of the central dogma of molecular biology
(how and why a DNA sequence is transcribed into RNA and
translated into protein is vital).
3. Substantial experience with at least one or
two major molecular biology software packages for
sequence analysis (EMBOSS, BioSuite and GCG). The
experience of learning one of these packages makes it
much easier to learn to use other software quickly.
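The central dogma mentioned in requirement 2 can be illustrated with a short Python sketch: DNA is transcribed into mRNA, and the mRNA is translated codon by codon into protein. This is a simplified model under stated assumptions: the input is taken to be the coding (sense) strand, so transcription reduces to replacing T with U, and translation uses the standard genetic code, starts at the first base, and stops at the first stop codon. The function names are hypothetical.

```python
from itertools import product

# Standard genetic code, written in the conventional U, C, A, G codon order.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(coding_dna):
    """DNA -> mRNA, assuming the coding strand is given (T becomes U)."""
    return coding_dna.upper().replace("T", "U")


def translate(mrna):
    """mRNA -> protein (one-letter codes), stopping at the first stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    dna = "ATGGCCTAA"  # toy coding sequence: Met, Ala, stop
    mrna = transcribe(dna)
    print(mrna, "->", translate(mrna))
```

Packages such as EMBOSS provide the same operations with support for alternative genetic codes and reading frames, which this sketch omits.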
Requirement of students
Bioinformatics is a multi-disciplinary, highly
collaborative field of study which relies on
knowledge of biology, statistics, applied
mathematics, and computer science.
There are many degree programs but, as
students with a background in physics, we
should expect degree completion to take a
bit longer than usual.
Job opportunities are more than plentiful for
Contd…
Preliminary courses: genetics, biological
databases, introduction to microarrays,
statistics, …
Weekly meeting to present bio-related
papers.
To become familiar with Bioinformatics
software packages.
Wish to do Bioinformatics research in
Experts say...
“Used for the discovery of a new drug or a
new herbicide/herbicide-resistant crop
combination. Drug toxicology,
pharmacogenetics and clinical trial studies
can also benefit from this technology, which
can even be used to genetically engineer
crops and livestock that have enhanced
nutritional qualities and the ability to
produce pharmaceuticals.”
Bioinformatics applications in drug discovery
and development are expected to reduce the
annual cost of developing a new drug by 33%
and the time for drug discovery by 30%.
Continues...
• Many Indian entrepreneurs and intellectuals
have set up companies to take advantage of
the emerging opportunities in bioinformatics.
Even major IT companies like Wipro, Tata
Consultancy Services, Kshema
Technologies, Mascon, Satyam, Reliance (RelBio)
and Infosys have diversified their activities and
set up separate sections/divisions
within their organizations.
• HCL's focus is on target validation by providing
in-silico methods for Gene to Drug by using
algorithms & software tools.
Continues...
• Tata Consultancy Services (TCS) and
Council of Scientific and Industrial
Research (CSIR) have launched ‘Bio Suite’,
the country’s first comprehensive
software for bioinformatics which caters to
the needs of fields such as biology, post-
genomic drug discovery and other related
areas.
• Accelrys and IBM help customers
accelerate drug discovery and
development
Thank you