M 1, V1

WS 1415, Modul 1, Vorlesung 1

WS 1415, Modul 1, Vorlesung 1


Fichier Détails

Cartes-fiches 34
Langue Deutsch
Catégorie Alimentation
Niveau Université
Crée / Actualisé 12.10.2014 / 15.10.2014
Lien de web
https://card2brain.ch/box/m_1_v1
Intégrer
<iframe src="https://card2brain.ch/box/m_1_v1/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

-genomics’
 

using genome sequencing + analysis tools, describes genetic makeup

-omics’
 

large-scale system-wide analysis tools

genome sequence

 defines all there is to know about an entire organism


 

Genetic predisposition

- example
 

 -folate metabolism

→ a defect in MTHFR , polymorphism

Personal genomics
eg.

23andMe

23andMe

direct-to-consumer (DTC) personal genome test

• targeted sequencing of specific genomic locations

(based on published research)
 

read

sequenziertes Fragment

Principal component analysis (PCA)
 

statistical procedure ,convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components.


 

A genomic library is

a collection of the total genomic DNA from a single organism.

(SNPs)
 

single nucleotide polymorphisms
 

Genome sequencing : Project design (1): available reference sequence?
 


 

1. De novo genome project:
 

2. Re-sequencing project
 

 Re-sequencing project
 

• map reads onto reference sequence

• identify minor changes compared to reference,

e.g. single nucleotide polymorphisms (SNPs)

• simple, fast, cheap, but selective analysis

--

-find minor changes, cheaper (selective analysis), use reference sequence

-assumption: almost identical to data base reference


 

1. De novo genome project:
 

assembly of sequence reads without reference

• identify major changes compared to known sequences

a)genomic rearrangements

(e.g. cancer genomics), not new, just changed expression

• new sequences not present in reference

(e.g. ‘genomic islands’ encoding antibiotic resistance

in bacteria), not present in reference because integrated in genome, can be identifierd by de -novo project

--


 

-find major changes to known sequences but no reference does exist

-complicated, expensive
 

Genome sequencing Project design (2): sequencing target?


 

1. Whole-genome sequencing:
 

2. Targeted genome sequencing:
 

 Whole-genome sequencing:
 

• huge technical, bioinformatic + financial effort

• especially for human genome sequencing
 

Targeted genome sequencing:
 

• regions with known phenotypic relevance e.g.

BRCA1/2 (breast cancer susceptibility)

• exome sequencing

• e.g.: 23andMe
 

Genome-wide association study (GWAS):

 globally, compare cancer mutation frequency(mind: cancer vs. background influence)

-->MOST STUDIES ON GENOMICS ARE GWAS

16S rRNA amplicon seq
 

-(microbiota analysis)

• >90% of microbiome projects

• “Who’s there?” but not: “What are they doing?”
 

Exome seq.
 

Targeted genome sequencing
(Human exome = gesamtheit der exons), coding for genes/protein sequences
(Intron/ noncoding DNA is important for regulation but not so much in sequencing, so its cheaper not to focus on everuthing
 

ITS amplicon sequencing


 

•fungal equivalent of bacterial 16S rRNA analysis

internal transcribed spacer’ (ITS)
 

Multi-locus sequence typing (MLST)

•housekeeping genes (= turned on under all conditions, used by all cells)

•higher resolution than 16S rRNA sequencing"

(e.g. to differentiate + epidemiologically characterize multiple

E. coli isolates)

---

-eg. salmonella, 7patients, same bacteria? → 7 regions, compare, are they the same?
 

Targeted genome sequencing (eg. 23andME)


 

Exome seq.(coding for genes/protein sequences)

16S rRNA amplicon seq. (microbiota analysis “Who’s there?”)

ITS amplicon sequencing (fungal equivalent of bacterial 16S rRNA analysis)
 

Multi-locus sequence typing (MLST) (housekeeping genes
, same salmonella in patients?)
 


 



 

Illumina sequencing


 

-high-throughput short read sequencing

• DNA synthesis progression by position,


•Problem: speed
 

Pacific Biosciences sequencing
 

Single-Molecule Real Time (SMRT) sequencing

 

• no need for template amplification

• fast progression, long read lengths

• detection of methylated bases possible
 

Oxford Nanopore sequencing
 

Promised results:

• fast, cheap, long reads
 

Sequencing technologies:
 

Illumina sequencing (-high-throughput short read sequencing)

Pacific Biosciences sequencing (Single-Molecule Real Time (SMRT) sequencing)

Oxford Nanopore sequencing (Promised results fast, cheap, long reads)




 

transcriptome sequencing


 

= RNA-Seq

-if genome sequence is available (reference-guided analysis)

-sequencing depth is more critical than read length
 

cDNA synthesis in Eukaryotes
 


 

-need to be sure to only deal with mRNA (small persantage), always has poly(A)tail

-take reverse transcriptase to build DNAstrang based on RNA strang
 

cDNA synthesis in Procaryotes


 

-random oligomers

-cDNA synthesis not as easy, as it does not have an poly(A)tail, random oligomers bind randomly, cannot differentiate between r,t,m,RNA
 

Epigenetics:
 

“Heritable changes in gene expression not attributable to

nucleotide sequence variation.”

(Murrell et al. [2005] Hum Mol Genet)

Epigenetics:
 

“Heritable changes in gene expression not attributable to

nucleotide sequence variation.”

(Murrell et al. [2005] Hum Mol Genet)

Transcriptome

 

• coding sequence (+ rRNA + tRNA)

actually expressed genes

• suitable for

comparative analysis of different

physiological conditions (cells or microbes)

Exome

• coding sequence

 

potentially available gene functions

(

 

 

no information about actual gene expression!)

• 0.01 x

 

reduced sequencing effort

Genome

 

• everything: coding + non-coding sequence

• including regulatory units

• difficult interpretation:

 

what is relevant?