Lecture 1
By Safynaz AbdEl-Fattah Sayed
Computer Science Department
Resources
• Course Work: 20
• Midterm: 20
• Final: 60
Course Outline
- Basic Biological Concepts
- Algorithms:
- Sequence Alignment
- DNA restriction mapping
- Motif Finding
• Define the challenges posed by large-scale biological data and
think about how to solve them by developing algorithms and
bioinformatics tools.
AGENDA
• What is Computational Biology.
• Central Dogma of Life.
• DNA, Genomes.
• RNA, Transcriptomes.
• Proteins, Proteomics.
• Transcription and Translation.
• Omics
Computational Biology
• Bioinformatics is computer-aided biology. It is also called computational biology.
Biological Data + Computer Algorithms = Computational Biology
• Bioinformatics is an interdisciplinary field that develops methods and
software tools for understanding biological data (Wikipedia).
(Figure label: anti-sense / reverse / minus (-) / Crick / template strand)
Chromosome
• The nucleus of a human cell contains 23 pairs of chromosomes (46 in total).
• A chromosome is a long, continuous piece of DNA, which contains many
genes, regulatory elements and other intervening nucleotide sequences.
RNA Polymerase
RNA Processing according to Gene Cell Type
https://www.cs.cmu.edu/~ckingsf/bioinfo-lectures/hmm.pdf
RNA Processing
Other RNA products
• Ribosomal RNA (rRNA), which is a major constituent of
ribosomes.
• mRNA
Translation
• The nucleotide sequence of the mRNA is decoded to specify the amino acid sequence of a
polypeptide.
• This process occurs inside a ribosome and requires adapter molecules called tRNAs.
• The ribosome is the machine that synthesizes proteins from mRNA.
• Proteins are molecules composed of one or more polypeptides.
• A polypeptide is a polymer composed of amino acids.
• Cells build their proteins from 20 different amino acids.
• A polypeptide can be thought of as a string composed from a 20-character alphabet.
codons
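The decoding step described above can be sketched in a few lines of Python. This is a minimal illustration, not a full model of translation: it assumes the standard genetic code, starts reading at the first base, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, not taken from the lecture.

```python
from itertools import product

# Standard genetic code, listed in U, C, A, G order for each codon position.
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons (e.g. "AUG") to a one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna: str) -> str:
    """Read the mRNA three bases at a time and return the polypeptide string."""
    polypeptide = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":  # stop codon: translation ends here
            break
        polypeptide.append(amino_acid)
    return "".join(polypeptide)


print(translate("AUGGCCUAA"))  # AUG -> M, GCC -> A, UAA -> stop
```

The result is a string over the 20-character amino acid alphabet, which is the view of a polypeptide used in the bullets above.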
https://www.peptidesciences.com/information/peptides-vs-proteins/
Amino acid to complex protein
RNA vs DNA
• RNA, like DNA, is a string over a four-letter alphabet (U, C, A, G in RNA; T replaces U in DNA):
DNA                  RNA
• Double-stranded    • Single-stranded
• Stable             • Unstable
• Hard to mutate     • Easy to mutate
• Storage            • Storage and Function
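Treating both molecules as strings makes the relationship between them easy to sketch. The example below is a simplified illustration that ignores RNA processing. It assumes DNA is written with the letters A, C, G, T, and the function names are illustrative only.

```python
# Base-pairing rule for DNA: A pairs with T, C pairs with G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the opposite strand of a double-stranded DNA, read 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def transcribe(coding_strand: str) -> str:
    """The RNA copy has the same sequence as the coding strand, with U for T."""
    return coding_strand.replace("T", "U")


def transcribe_template(template_strand: str) -> str:
    """Build the RNA from the template (anti-sense) strand instead."""
    return transcribe(reverse_complement(template_strand))


coding = "ATGGCC"
template = reverse_complement(coding)  # "GGCCAT"
print(transcribe(coding))              # AUGGCC
print(transcribe_template(template))   # AUGGCC
```

Both calls print the same RNA because the template strand is the reverse complement of the coding strand.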
OMICS
• Genomics: DNA.
• Transcriptomics: RNA.
• Proteomics: Proteins.
• Metagenomics: a new research area focused on analyzing mixtures of
DNA sequences extracted from different organisms (viral, bacterial, or
eukaryotic) that live together in a symbiotic community.
• Epigenomics is the study of the complete set of epigenetic modifications on
the genetic material of a cell, known as the epigenome. The epigenome is
made up of chemical compounds and proteins that can attach to DNA and
direct such actions as turning genes on or off, controlling the production of
proteins in particular cells.
Sequencing
Prerequisite
Bioinformatics
Sample Preparation
• The human genome is about three billion bases long.
• Sequencing machines can only sequence short (200-300 bp)
fragments.
• So, it is important to break the DNA into small fragments.
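The fragmentation step can be mimicked on a string as a rough sketch. The function below simply cuts one copy of a sequence into consecutive pieces of random length. Real sample preparation shears many copies of the DNA at random positions, so real fragments overlap. The function name, the parameters, and the fixed seed are assumptions made for illustration.

```python
import random


def fragment(dna: str, min_len: int = 200, max_len: int = 300, seed: int = 0) -> list[str]:
    """Cut a DNA string into consecutive fragments of min_len..max_len bases.

    The last fragment may be shorter than min_len if the sequence runs out.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    fragments = []
    position = 0
    while position < len(dna):
        size = rng.randint(min_len, max_len)
        fragments.append(dna[position:position + size])
        position += size
    return fragments


# A toy 10,000-base "genome" (the real one is about three billion bases).
toy_genome = "ACGT" * 2500
pieces = fragment(toy_genome)
print(len(pieces), "fragments; longest is", max(len(p) for p in pieces), "bp")
```

Joining the pieces back in order recovers the original sequence here. With real sequencing data the order is unknown, which is why reassembling the fragments becomes a computational problem.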
Sample Preparation