Bio 123, UZH

Bio 123, FS 2016

Bio 123, FS 2016


Kartei Details

Karten 82
Lernende 23
Sprache Deutsch
Kategorie Biologie
Stufe Universität
Erstellt / Aktualisiert 21.03.2016 / 30.06.2021
Weblink
https://card2brain.ch/box/bio_123_uzh
Einbinden
<iframe src="https://card2brain.ch/box/bio_123_uzh/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

What is DNA sequencing?

Any method that is used to determine the specific order of the 4 nucleotides in a strand of DNA

Explain the technologie of Next Generation Sequencing (NGS)

• Highthroughput-sequencing technologies parallelize the sequencing process producing  thousands or millions of sequences at once

• Low cost production of large volumes of sequence data

Define library

A collection of DNA fragments with adapters ligated to each end

Define reads

The stretches of DNA read by the sequencer (the library consists in the end of reads)

Define Barcodes

Identifier sequence that allows pooling several samples without physical separation

(Barcoding: DNA-Barcoding (englisch DNA barcoding) ist eine taxonomische Methode zur Artenbestimmung anhand der DNA-Sequenz eines Markergens.)

Define Alignment

Deutsch: Angleichung/Anpassung

The aligning and merging of reads in order to restruct the original sequence

You can distinguish two types:

  1. Mapping= assembling reads against an existing backbone sequence
  2. De novo=  assembling short reads to full length sequences

Define Contig

E: Sequenced reads that overlap are reassembled back into longer sequences.

D: Ein Contig ist ein Satz überlappenderDNA- oder Protein-Stücke (reads), die von derselben genetischen Quelle stammen. Ein solches Contig kann dazu genutzt werden, die Original-DNA-Sequenz dieser genetischen Quelle abzuleiten.

Define scaffold

A series of linked contig which are in the right order but not necessary connected in one continuous stretch of sequence

Define Throughput

Either how many or how long reads we create

Define Coverage

The average number of reads representing a given nucleotide in the reconstructed sequence is called depth of coverage

What’s the difference between 2nd and 3rd generation sequencers?

See picture

Which sequencer is from the 2and generation? 

Features of Illumina sequencing

- High number of reads and low input

- Short read but very high throughput

- clonal amplification

- State-of-the-arts optics used for detection

- At the end of the bridge amplification, you get a cluster. This is what you actually going to sequence!

- SBS (sequencing by synthesis)

Features of Ion Torrent sequencing 

- Mid-to-High number of reads and low input

- Short reads and low throughput

- but very quick (usefull during a surgery when quick results are needed)

- beads (Kugel) are used for monoclonal amplifications

- ph-meter is used

- no modified nucleotides

Features of PacBio sequencing

- Low number of reads and high input

- SMRT (single molecule real time sequencing)

- Eavesdropping (lauschen) on the DNA polymerase as it replicates DNA 

- Only to the last phosphore ist fluorescent dye attache, the strand is natural

-> https://www.youtube.com/watch?v=v8p4ph2MAvI

Features of Oxford Nanopore sequencing

- Low numper of reads and high input

- sequencing of long fragments

- flow of ions that are transported through perforated membranes

- https://www.youtube.com/watch?v=3UHw22hBpAk

What's the difference between forward Genetics and reverse Genetics?

1.       Forward Genetics

This is the more classical approach. Here you do random mutagenesis and then screen for cells with a change in the phenotype. So you select for a phenotype, and then identify the mutated gene.

2.       Reverse Genetics

This is really the opposite, I already know which gene I want to test, and what mutant, but in the past we couldn’t really do that because we didn’t know all genes.

So here you start with a gene of unknown function (but you know exactly which gene it is), then you do (only with this gene!) an overexpression or another sort of mutation. Then you examine the resulting change in the phenotype to infer the function.

-> "targeted" mutagenesis

 

How can we do forward Genetics at a large scale?

- Grow a very large number of cells in a dish,

- Then you throw in an agent who randomly mutate some cells, but limited so you get one or even less mutation per cell! (=limited random mutagenesis)

- Next step is to select the cells with the desired phenotyp. There are two ways you can do that:

Either by testing the resistance (throw something on the cell that kill all the cells BUT the one with mutations, for example toxic compound or a virus.) or second possibility by a fluorescent reportert (you can mark either type, the normal ones or the ones with a mutation)

The last step is to identify the mutation by sequencing.

How can we do reverse Genetics at a large scale?

-First generate a large number of reagents each perturbing a specific gene. (3 important methods: cDNA, RNAi, CRISPR-CAS9)

- Then make an arrayed (geordnet) library where each reagent has a pre-defined position (which is called a "well"). Third step is to add cells to each well and systematically score for phenotypes. Last, you assign the function to each gene.

Explain the cDNA methode.

cDNA = copy DNA from mRNA

You use reverse transcriptase to get a DNA from a mRNA. Because the mRNA is already processed (e.g. Spliced) there are also no introns etc.

What's the function of microRNA and RNA interference pathways?

With miRNA and RNA interference you can find other RNA or even degrade the whole RNA in a cell. -> Knock down

Explain the CRISPR-CAS9 methode

An adaptive immune systeme of bacteria adapted to work in any cell

It allows them to make transcripts with a little seqeuence of this foreign DNA (of a virus), so when this virus come again at a second infection, the bacteria is now able to recognize and cleave it -> knock out

A single guide RNA you can design some that can targeting any gene you want

You introduce a few mutations, cell try to repair that, but then often you get a frame shift, so you’re gene is not functional (functional knock out)

Allg. Erklärung auf Deutsch: http://www.transgen.de/lexikon/1845.crispr-cas.html ->

Finden: Der CRISPR-Abschnitt erkennt mit Hilfe der darin integrierten RNA (Guide RNA) das jeweilige Ziel, eine bestimmte Sequenz in dem umzuschreibenden Gen.

Schneiden: Das an den CRISPR-Abschnitt gekoppelte Cas9-Protein schneidet den DNA-Doppelstrang genau an der gewünschten Zielsequenz. Beide Elememte - CRISPR und Cas9 - werden synthetisch hergestellt und anschließend in eine Zelle eingeführt.

Reparieren: Die zelleigenen Reparatursysteme fügen nun den durchtrennten DNA-Strang wieder zusammen. Dabei können einzelne DNA-Bausteine abgeschaltet oder wieder aktiviert werden. Möglich ist auch, kurze DNA-Sequenzen neu einzubauen

What can you do with the Zernike function?

Zernike function can be used to describe the place of the proteins

Principle of generating a network

1. Segment Cell and nucleus -> with segmentation you can quantify all kinds of properties from each cell 

2. Quantitative read-outs of phenotypes ->How does the phenotyp change after a pertubation?

3. Genes with similar loss-of-function can be connected in networks -> Functional modulses

 

Proteins have many different functions. Which?

- Catalytic (enzymes, e.g. kinases)

- Transport (hemoglobin)

- Structural (keratins)

- Hormonal (insulin)

- Immunity (antibodies)

Define proteome

The total protein complement of a genome or all proteins of the cells present at any given time.

Define proteomics

Proteomics is the study of protein properties on a large scale to obtain global, integrated view of disease processes , cellular processes and networks at the protein level. They are dynamic!

Some examples for properties of proteins are expression-level, modification, interaction, structure, location, etc.

Measuring proteins is extremely important in life sciences. Why?

Proteins are organized in networks. It’s a set of nodes and edges. Edges describe a relationship between the nodes. In any modell, you have always to define what these edges stand for (does it regulate? Or binds?).

What properties does a network have?

- Path length (Thes shortest path between two nodes)

- Network Diameter (the longest shortest path)

- Degree ki (The number of edges involving node i)

- Degree distribution P(k) (The probability (frequency) of nodes of degree k)

- Clustering Coefficient

Degree in a undirected network and directed network 

see picture

Degree distribution in different networks

see picture

What's a scale free network

A scale free network is when you have  some nodes that are very highly connected, and many which are only connected to some//the majority often have one edge, only a small fraction have a hundred edges -> power law degree distribution

(the highly connected ones are called hubs) like airports

The big advantage of them is that they are resistant to component failure. And the topology is important for robustness.

And the big disadvantage is of course its attack vulnerability. If the hubs has a problem, the whole network could collapse.  

The hubs doesn't tend to interact directly with each other, and they also tend to be "older" proteins. Hubs also seem to be more conserved than average between species. (because there is a lot of selective pressure!)

What indicates the cluster coefficient? 

The density of the network

Why is measuring a proteome quite challanging?

This has to do with the complex structure and the big variety of proteins.  (humans: 20`000) and also the huge abundance range. For example the ones that are present in one cell in 10^6copies, we find them easily. But what if there are only  <50 copies present?

How can we do proteomics? (different methodes)

  • Colorimetric
  • UV spectroscope 
  • Fluorescent-based
  • Antibodies
  • Mass spectometry

What does the Mass Spectrometer do?

Mass Spectrometer measures mass over charge ratio (m/z) of ions. 

m=Masse

z= Ladungsverhältnis

We know two different mass spectrometers. Which?

The MS and the MS/MS mode.

The MS/MS works with isolation, where you only work with one peptid at the end.

The MS first ionize the sample, then analyze and then detect it. In the end you get a data analysis.

 

What do we need to make a analysis of protein networks?

1.       Identification of system components

A.      Gel-based approaches (but very obsolete)

B.      Shotgun proteomics

You shoot but you don’t know what you shoot exactly.

C.      Targeted proteomics

You use when you’re interested in a specific protein, which you already know before the experiment. (Quantification of a specific list of proteins across many perturbations)

The advantages are its multicomplexed, it distinguishes isoforms and it’s precise and reproducible. But there is tedious assay development and difficult implemenation for a non-specialized lab

 

2.       Component abundance changes

If you want to know what is changing in a network during time, you have to follow a network over a time.

Here you use quantitative proteomics. It is done by comparing the signal intensity of peptides with the same sequence, but different isotop labels.

 

3.       Absolute quantification of a network

F.e reaction rates, stochiometry, stochastity, ...

Here you choose very informative peptides

(from which you know very accurately the concentrations!!!

So you can compare it with others)

 

4.       Modifications

 

5.       Prot-Prot interactions

 

6.       Structural networks

What's the Challenge in OMICS?

A) Data Quality

  • Completeness
  • Reproducibility
  • Measurement Accuracy
  • Biological Noise

B) Data Representation

  • Visualization
  • Patterns/Trends

C) Data Interpretation

  • False Discovery RAte
  • Signal vs. Noise
  • Multiple Testing
  • Modelling

5 Schritte für die Netzwerk-Analyse

  • Network Topology
  • Network Motifs
  • Biological Insights from Topology
  • Network Partitioning/Clustering
  • Networks as Tools for Data Representation