Proteomics

Altklausuren

Altklausuren


Fichier Détails

Cartes-fiches 45
Langue English
Catégorie Biologie
Niveau Université
Crée / Actualisé 03.04.2023 / 03.04.2023
Lien de web
https://card2brain.ch/box/20230403_proteomics
Intégrer
<iframe src="https://card2brain.ch/box/20230403_proteomics/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

What is meant by “proteomics”? Why is it more complex compared to genomics? Why is it important to study proteins in context (Give examples)? 

  • Proteomics is the large-scale study of proteins: expression, structure, functions.

  • There is one genome but different proteomes from the same genome. The proteome is the entire complement of proteins, including the modifications made to a particular set of proteins, produced by an organism or system. This will vary with time and distinct requirements, or stresses that a cell or organism undergoes

  • Contex: in order to understand biological systems at the molecular level, we must analyze proteomes quantitatively, in time and space and under different (patho-) physiological conditions; molecular constituents of biological systems do not operate in isolation, thousands of interactions e.g. mitosis 

  • Bottom-up proteomics is the digestion of proteins into peptides.

    • Explain the “in-gel digestion” workflow

    • Two advantages and two disadvantaged of this procedure

  • in gel digesion: typically performed with samples containing detergents, high amounts of salts

    1. Sample reduction and alkylation (DTT) 2. SDS PAGE (desalting) 3. Cut out protein bands 4. Wash Gel and let it dehydrate 5. Reswell in protease solution and digest 6. Extract peptides 

    + Simple to perform (scalabe to high troughput)

    + tolerant to samples with "weird" buffer ingredients (e.g. detergents, lipids, DNA)

    + Isolation of proteins of interest possible (single bands)

    - sample amount limited to gel capacity

    - limited visual sensitivity

    - lower peptide recovery

What is “ion pair reverse phase chromatography”? Which types of interactions are established between the analytes and the stationary phase? 

  • Ion pair reverse chromatography is a form of liquid chromatography: has the highest resolution

  • hydrophobic and electrostatic interactions 

  • The stationary phase is highly hydrophobic, the mobile phase starts with being hydrophilic and increases in hydrophobicity with the time. The elutes are hydrophilic in the beginning and hydrophobic in the end 

  • Interactions: direct hydrophobic interactions between nonpolar peptide side chains and the nonpolar stationary phase & electrostatic interactions between polar side chains of amino acids in a peptide and an amphiphilic ion pairing agent (e.g. triufluoro-acetic acid) that mediates interactions with the nonpolar stationary phase 

Ionization of the peptides is necessary for measurements with the mass spectrometer.

Why is MALDI a soft ionization technique? How does the technique prevent analytes from thermal degradation?

 

  • MALDI = Matrix Assisted Laser Desorption Ionization

  • The matrix traps laser energy for desorption (absorption maximum ideally at laser wavelength), protects the analytes from thermal decomposition (hot and cold matrices) and ionises analyte molecules 

  • Soft ionization technique because matrix absorbs the energy of the laser and hleps to transfer it to the analyte causing it to vaporize and ionize gently without too much fragmentation of the peptides.

  • Soft ionization from the condensed phase (crystals)

 

MALDI and ESI are both soft ionization methods because they generate gas-phase ions without causing significant fragmentation or degradation of the analyte molecules.

In MALDI, the analyte is mixed with a matrix material that absorbs laser energy and facilitates desorption and ionization of the analyte: Most of the deposited energy by the laser is taken up by the matrix, matrix gets heated, desorption of matrix and material into the vaccum, rapid cooling of evaporized material prevents thermal decomposition. The resulting ions are mainly intact and carry a single charge.

  • Measuring ionized peptides with mass analyzers.

    • Name two mass analyzers, explain how they work and how they determine the m/z ratio

    • For each: state one characteristic in what they are really good at in comparison to others (e.g. mass accuracy, resolution, m/z range, dynamic range, measurement time, quantification)

    • Which one would you choose for analysis of intact antibody. Explain in one sentence.

  • TOF: Analytes (ions) are accelerated with the same voltage (all start at the same kinetic energy) and then travel through a field-free tube (no force applied). The time it takes for an ion to travel from the ion source to the detector (their velocity vary depending on the m/z) is used to determine its m/z value. t= C * sqrt(m/z)
    • Resolution increased by reflection mode TOF and by fast detector digitizer
    • TOFs can separate ions at a rate of >10.000 spectra/second

characteristic: scan speed 100µsec, m/z range > 500000, resolution >20.000, high sensitivity

Quadrupole: Consists of two positive and two negative poles arranged in corners of a square. RF (radiofrequency) amplitude and DC (direct current) voltages are applied to create an oscillating electric field that allows only ions with a specific m/z ratio to pass through the analyzer (Lower amplitude, ions with lower m/z pass). The ions that pass through the quadrupole analyzer are detected by a detector, which records the intensity of the ion signal. The ion intensity data is then used to generate a mass spectrum.

  • Resolution depends on the number of oscillations along the path -> resolution increases with longer rods or decreasing scan speed 
  • m/z range limitations (up to 4-8000 m/z)

characteristic: dynamic range, also high accuracy, quantification

For the analysis of an intact antibody, I would choose the TOF mass analyzer because it has high resolution, which is important for distinguishing between different antibody isoforms with slight mass differences.

Quadrupole, TOF, Ion trap, Oritrap

  • Mass accuracy: 100-1000, 1-10, 100-1000, 1-2 ppm
  • resolution: 200-2.000, >20.000, 200-20.000, >100.000
  • m/z range: 4-8000, >500000, 4000, 4-8000
  • scan speed: 1-10msec, 100µsec, 10-100msec, 20-200 msec
  • dynamic range: 1:10000,1:5000, 1:1000, 1:5000
  • senstitivity: ++, +++, +++, ++
  • quantification: +++, ++, +, ++

  • Tandem Mass Spectrometry

    • Explain the workflow. How can we determine the peptide sequence

    • Spectrum matching in Proteomics. How can we determine the false discovery rate (FDR)?

  • 2 consecutive MS: MS1 one shows m/z of tryptic peptide. The mass spectrometer selects the (20) most abundant ions from MS1 for MS2 -> peptide isolation and fragmentation -> measurement of fragments m/z

  • Sequence determination through specific amino acid mass, step by step. Or database searching, whole peptide or sequence tag

  • dynamic exclusion prevents the same peptides to be analysed again and again

 

  • Through target-decoy approach: you have 2 databases, one composed of targets and other of decoys. Then search the tandem mass spectra against both databases. The PSMs to decoy sequnces must be wrong matches (false positives). The FDR is determined by FDR = FP / FP + TP = decoys / targets. Then, define a cut-off score (=how many FP you will accept): FDR(x) = decoys with score > x / targets with score > x

  • How does label free quantification (LFQ) works? Two (relevant) advantages and two disadvantages of LFQ in comparison to label-based methods.

  • No isotopic label required, no artifitial chemical modification of peptides or proteins: LFQ intensity describes the integrated MS signal (area under the peak) across the chromatographic peak. Therefore, the same peptide is detected in consecutive MS1 spectra. The chromatographic peak is reconstructed based on MS1 signal -> extracted ion chromatogram (XIC). The area under the XIC is a measure for peptide quantity. Relative quantification of the same peptide vs a reference sample or between x conditions. 

  • Pros:
    • Cost and time effective with respect to sample preparation (no need to label/ pre-process the samples)
    • Applicable to all identified/ quantified proteins – proteome-wide
    • Applicable to all organisms/ tissues/ cells – independent of biology
    • Applicable to very large numbers of sample – large scale
    • No increase in sample complexity – high sensitivity
  • Cons:
    • Const and time ineffective in terms of MS measurement time (individual sample prep and measurement, no multiplexing)
    • Error- prone: Reproducible sample preparation, chromatography and mass spectrometric performance is critical for this approach! Accumulation of experimental variation along the workflow
    • Quantitative precision and accuracy of label free analysis are often lower than those based on stable isotope labeling, particularly when workflow comprises many steps
    • Missing values across different sample increases with increasing number of samples
    • (Large-scale label-free projects are computationally intensive and large bioinformatics resources may be needed

  • DDA, Targeted proteomics, DIA

    • Three differences between DDA and Targeted proteomic

    • Can we use TMT labeling when we do DIA? Your opinion with reasons

 

  • DDA (shotgun) data is analysed without prior knowledge of which proteins are present. 

    Targeted quantification: analysis of a preselected group of proteins (hypothesis driven, need prior knowledge) 

  • DDA: Quantification of many proteins (1000) in relaively few samples. Targeted: only 10-100 proteins can be identified.

  • Targeted: More precise, accurate and reproducible quantification (higher sensitivity ad dynamic range) 

 

  • DIA = Data independent aquisition: attempt to take the best of bot shotgun and targeted: measure verything (withput prior knowledge), the analyse selected anlytes (with prior knowledge) 

    • to quantify large numbers of proteins and peptides over many samples

    • when no need of maximum selectivity and sensitivity

    • when POI change in the future

TMT: Tandem mass tags. TMT labeling is a widely used method for quantitative proteomics that allows for the simultaneous measurement of multiple samples in a single mass spectrometry experiment. It involves the chemical labeling of peptide samples with isobaric tags, each of which contains a unique mass reporter ion that can be used to quantify the relative abundance of the labeled peptides across multiple samples.

Combining TMT labeling with DIA provides a powerful approach for high-throughput quantitative proteomics. TMT labeling allows for the multiplexed analysis of multiple samples in a single DIA experiment, enabling the quantification of thousands of proteins across multiple conditions or timepoints.

Furthermore, TMT labeling can reduce the complexity of the DIA data by pre-fractionating the samples prior to analysis, which reduces the number of peptides in each DIA window and simplifies the identification and quantification of the peptides.

  • Post-translational modifications (PTMs)

    • Name 5 PTM

    • why could it be difficult to investigate ubiquitination with TMT labeling

    • Warum muss man PTMs anreichern

 

  • Phosphorylation

  • acetylation

  • hydroxylation

  • methylation

  • Ubiquitylation

  • Deamidation

  • Proteolytic cleavage

  • glycosilation

 

  • TMT reagents contain Lysin residues. Ubiquitination is a post-translational modification that involves the attachment of ubiquitin molecules to specific lysine residues on target proteins. if a lysine residue in the TMT tag is located in the same position as a ubiquitination site on the protein, it can lead to inaccurate quantification or even loss of identification of the ubiquitinated peptide

 

  • anreicherung kinasenaffinity matrices (kinobeads): unselective kinase inhibitors immobilized to sepharose beads. They bind kinases and pull them down from sample

 

  • Since PTMs usually occur at low levels, it is often necessary to enrich proteins prior to analysis to ensure adequate detection. PTM enrichment allows specific PTMs to be isolated in a sample

  • Investigation of interaction partners of proteins by using affinity purification. You are interested in a protein and want to find its interaction partners. Explain how you would set up such an experiment, name the steps. Critical steps in your experimental workflow/setup?

  • Choose appropiate affinity tag -> Incubation of affinity tag with proteins -> express and purify tagged protein ->POI added to biological sample -> bind POI -> wash to elute undesired backgroud proteins -> elute POI -> analyse in MS

  • Co-IP: (antibody - targeted)

    • Antibody against POI (bait)

    • Pulldown of AB-POI complex by protein immobilized to beads

    • Enrichment of bait protein and co-enrichment of interacting proteins

  • TAP

    • furnish gene of interest with a dual affinity tag at C or N terminus.

    • Steps: ProteinA-IgG interaction (1st affinity purification), wash, TEV cleavage, CBP-calmodulin interaction (2nd affinity purification), wash, elution by EGTA treatment

    • Critical: washing steps: wash away bait 

    • Critical: TEV-cleave: should not cleave POI 

Definition proteoform, how does complexity arise in the proteome compared to the genome? ( circa 6 min)

  • Proteoform = all different molecular forms in which the protein product of a single gene can be found

  • Complexity due to genetic variations, alternatively spliced RNA transcripts and PTMs

  • Reasons for why peak broadening occurs? how does it arise and how can it be circumvented? ( circa 6 bis 10 min) 

  • Eddy dispersion, longitudinal diffusion, restricted mass transfer 

    • Eddy: analytes can take different paths through the stationary phase, different paths have different distances

      • solution: smaller particle size, better packing

    • Longitudinal: Brownian motion causes analytes to travel in all dimensions. Analyte concentration is lower at the edges of the peak than at the center.

      • solution: higher flow rate 

    • Mass transfer: Mass transfer between stationary and mobile phase. High flow rates restricts this mass transfer.

      • solution: lower flow rate

  • Bottom up approach: steps in 1-2 sentences with used reagents if possible 
  • Name one additional step for separation on protein level

  • 1.protein extraction (lysis Buffer(HEPES/Tris; ßME/DTT for disulfide bridges; Detergents e.g. Urea) => disruption: mechanical (detergents eg SDS; chaotropic reagents eg urea) or biophysical (sonification) => Removing contaminants: disruption, digestion, precipitation, extraction

  • 2.Digestion in solution/ in gel (with proteases: Trypsin: cleaves c terminally of lysines and arginines) => reduction & alkylation (DTT/TCEP/ßME)  -> peptide separation (reverse phase chr., C18) -> MS analysis -> Database search

  • Optional:

    • separation on cell/ organelle level

    • separation on protein level

      • protein fractionation: measure different fractions of samples separately, eg chromatography SEC; PAGE

      • protein enrichment: affinity enrichment eg cofactors/ Kinobeads; Co-immunoprecipitation

    • Separation on peptide level

      • peptide fractionation: chromatography

      • peptide enrichment: antibody based

in solution digestion

typically performed with samples containing chaotrophes (e.g. urea buffers)

1. Reduction & alkylation

2. Proteolytic digestion (must denature sample but keep protease active)

3. Desalting (by solid phase extraction C18 to remove reagents)

+ simple to perform

+ relatively quick

+ independent of sample quantity

- not compatible with weird buffer ingredients

  • Name the method used for sequencing Peptide with MS/ in Proteomics ( 1min)

  • FDR  needed to be explained  (6 – 8 min)

  • You made a Search of human microbiome against human database and a second one with a larger database containing also bacterial Proteins (Ecoli). Why do you get less hits with the larger database? (circa 4 min)

  • tandem mass spectrometry

  • FDR = false discovery rate tells you how many false positives you have by target-decoy (FDR = decoy / target = FP / FP + TP

    • target database: sequence database containing all AA sequences known gor eg humans
    • decoy dataase: sequence database containing reversed or scrambled AA sequences for the same proteins
    • search tandem mass spectra against both databases -> hits to decoys must be wrong matches -> expect to get the same number of random matches in the target database = decoys/targets. 
    • find score cutoff which results in acceptable FDR typically 1%
  • Less hits because decoy database is also bigger -> more decoy hits -> higher FDR -> cutoff higher -> less proteins 

  • Difference between online and offline Chromatography (circa 6 min)

  • On-line is coupled to a MS, Detector is MS and the separation principle is based on hydrophobicity, commonly used for separation is (ion pair) reverse phase chromatography

  • Off-line is a stand alone system, detector is UV, fluorescence etc., separation principle is based on hydrophobicity, charge, size, etc., commonly used for basic reverse phase, strong anion/cation exchange etc.

  • Explain DDA, targeted and DIA in one sentence (6 min circa)

  • Which one of theese methods would you choose for biomarker / (one biomarker protein) quantification on a regular basis for patients in clinics and why? (2-4 min circa)

  • Vergleiche 3 Eigenschaften von DIA und DDA 

  • DDA/ shotgun proteomics (not the best sensitivity)
    • Discovery technique for mapping out the proteome cotent of a sample, i.e. identify as many proteins/peptides/PTMs as possible
    • Typically, shotgun data is analyzed without prior knowledge of which proteins are present 
  • Targeted proteomics (IPAD)
    • Targeted quantification: analysis of a preselected group of proteins (or peptides thereof) -> purely hypothesis-driven
    • If done right, it can deliver more sensitive and more accurate quantification data
  • Data independent acquisition (DIA)
    • Measure everything in bins (whithout prior knowledge), but analyse selected analytes (with prior knowledge)
    • No selection of individual peptides, but all peptides that are within an isolation window (e.g. 400-450 m/z)
  • For biomarker: targeted proteomics because you know what you want to quantify

Ease of data acquisition; ease of data analysis; breadth of protein and peptide detection

  • DDA: easiest; easiest; 10.000
  • DIA: easy; hardest; 10.000
  • DIA: unlimited number of targetable peptides/proteins, re-analysis at later time-point if other proteins become interesting, no missing values: con: need DDA data to build libraries for data analysis 

  • DDA: ideal technique for mapping out the proteome content of a sample, typically shotgun data gets analyzed without pior knowledge, identification of 1000s proteins and PRM

factors that determine the quality of the separation

  • Efficiency (Plate height) refers to the ability of the column to separate closely related compounds. The higher the efficiency, the better the resolution and the sharper the peaks. Efficiency is affected by the particle size and distribution of the stationary phase, column length, and flow rate of the mobile phase. Height equivalent to a theoretical plate HELP = L/N

  • Selectivity is the ability of the stationary phase to differentiate between components in the sample based on their chemical or physical properties. Describes the analyte retention by the column. A more selective stationary phase will result in better separation of closely related compounds.

  • Resolution is the degree of separation between two adjacent peaks in the chromatogram. Describes the overall quality of a separation (efficiency + selectivity). The narrower ther peaks and the larger the difference in retention times, the better the separation.

  • Peak capacity is the maximum number of analytes that can be separated in a single chromatographic run. Depends on chromatographic resolution and gradient time. Highest peak capacities are obtained with long, efficient columns and long gradient times. Higher peak capacity allows for better separation of complex mixtures. 

The ability to meassure mass accurately and precisely typically increases with increasing resolution of the mass measurement

How does chromatography work?

The general principle of chromatography involves the separation of components in a mixture by their differential partitioning between two phases: a mobile phase and a stationary phase.
 

The mobile phase is the liquid or gas that flows through the chromatographic column, carrying the sample to be separated. The stationary phase is a hydrophobic/ non polar solid or liquid material that is packed into the column, which interacts with the components in the sample to varying degrees depending on their properties. 

When the sample is introduced into the column, it is carried by the mobile phase and interacts with the stationary phase. Depending on the properties of the components, they will have different affinities for the stationary phase, causing them to move at different rates through the column. Components that have a higher affinity for the stationary phase will move more slowly and will be retained in the column longer, while components with a lower affinity for the stationary phase will move more quickly through the column and will be separated earlier.


Gradient elution is a technique used to increase the separation efficiency by changing the composition of the mobile phase during the chromatographic run. This technique involves increasing the concentration of a more polar or non-polar solvent over time, which can improve the resolution and peak capacity of the separation => adjustment of retention time

Describe the steps leading to peptide ionization using the MALDI technique. Explain how a time-of-flight mass spectrometer works (with the relevant equations) and explain why MALDI-TOF mass spectrometry is a good combination of ionization and mass analysis

  • MALDI: matrix assisted laser desorption ionisation

  • Firstly, the ionization component of MALDI-TOF MS involves the use of a laser to irradiate a sample that has been mixed with a matrix. The matrix absorbs the energy from the laser and then transfers it to the sample, causing the molecules to become ionized. The ionization process is gentle and does not cause significant fragmentation of the molecules, making it ideal for the analysis of fragile biomolecules.

    Secondly, the mass analysis component of MALDI-TOF MS involves the use of a time-of-flight mass analyzer, which separates ions based on their mass-to-charge ratio (m/z). The ions are accelerated through a flight tube and the time it takes for them to reach the detector is directly proportional to their m/z value. This results in a mass spectrum that shows the distribution of ions based on their m/z values.

    Overall, MALDI-TOF MS is a good combination of ionization and mass analysis because it is a sensitive and fast technique that allows for the analysis of a wide range of biomolecules. The gentle ionization process ensures that fragile biomolecules are not significantly fragmented, while the mass analysis component provides accurate mass measurements.

Why do analytes have to be ionised?

So that their motions in electromagnetic fields can be used to determine mass and charge

ESI

ESI: uses a high voltage to disperse liquid into highly charged droplets. Uneven fission of the droplets at the rayleigh limit creates about 20 smaller drops carrying off 20% of the mass and 15% of all charges. The solvent evaporates from the droplets, at some point there is no more droplet and no more solvent so the charge ends up on the molecule -> ions are formed and separated based on their mass-to-charge ratio (in MS). ESI is particularly useful for analyzing complex mixtures of proteins or peptides because it generates a wide range of multiply charged ions.

  • Can couple chromatography and MS
  • High efficiency
  • Can transfer very large molecules from lliquid into gas phase
  • The slower the flow rate, the better the response

Ion trap & orbitrap

  • Ion trap: use electromagnetic fields to trap and manipulate ions in a confined space. RF (AC) potential applied to a ring electrode creates a 3D quadrupolar field which traps ions in a stable oscillating trajectory. The trapped ions can be excited by applying RF and DC voltages to the electrodes, this destabilises the ion motions of ions with particular m/z values and they are ejected through the exit endcap, where they can be detected by a detector.
    • m/z range limitations apply
    • traps can store ions of interest
    • advantage of linear trap over 3D trap: ability to store more ions

characteristic: high resolution and sensitivity 

  • Orbitrap: uses an electrostatic field to trap and detect ions. Cosists of outer electrode and central spindle. The ions are injected and oscillate stably around the spindle. As the ions move, they induce a current. m/z dependent detection of ions is achieved by recording the frequency qith which ions are oscillating along the z axis (small ions are quicker)
    • Resolution depends on the number of oscillations -> very high resolution possible

characteristic: highest resolution <100000

What is the difference between accuracy and precision?

Precision: How reproducibly a measurement is obtained
Accuracy: How close a measurement is to the true value

What are hybrid mass spectrometers and why are they needed in protomics?

Hybrid mass spectrometers, aka tandem MS or MS/MS instruments, are instruments that combine two or more types of mass analyzers in a single system. Hybrid mass spectrometers are used in proteomics to improve the accuracy and sensitivity of MS analyses and to obtain more comprehensive structural information about complex molecules.

Measure complex mixtures of ions, select single (e.g. peptide) analyte for fragmentation.
e.g.: Q-TOF, triple quadrupole, ion trap-orbitrap, TOF-TOF

Q-TOF:  quadrupole for mass filter, TOF mass analyser

 

Steps tandem MS

tandem MS means consecutive stages of MS with a chemical reaction in the gas phase in between two stages

Tryptic peptides -> Ionisation -> Measurement of peptide m/z -> peptide isolation -> fragmentation -> measurement of fragment m/z

MS1: composition information 

MS2: sequence information

 

Peptide fragmentation

CID/HCDC: Collision induced dissociation and high energy collision induced dissociation

  • collide ions wirh neutral gas (He)
  • Kinetic energy is converted into internal energy which results in bond breakage
  • generates b and y ions

How does one analyse and evaluate large scale MS/MS data?

spectrum matching: high throughput peptide 'sequencing'

  • Peptide identification without manual interpretation
  • fragment ions signals represent peptide sequence <-> match measured fragment ion spectra to computed spectra of all peptides in a sequence database
  • uninterpreted tandem MS can be searched against a protein sequence database to identify a peptide (protein)
  • matchig of experimental and theoretical fragment ion lists (peptide spectrum match PSM)
  • requires a database of known protein sequences

How can we quantify proteins?

y axis: measured intensity (area)
x axis: amount (moles)
Between LLOQ (Lower Limit Of Quantification) and ULOQ signal intensity scales linear with peptide abundance (linear dynamic range)
Accurate quantitative results can only be achieved when working within the linear dynamic range of every given peptide, respectively
The linear dynamic range and LLOF and ULOQ are peptide- and mass spectrometer-dependent 

Metabolic labeling

Stable isotope labeling of peptides: A stable isotope labeled peptide is chemically identical to its native counterpart, the two peptides exhibit identical mass spectrometric responses. The MS can recognize the mass difference btw the labeled and unlabeled forms of a peptide. Isotopes do not change physicochemical properties -> relative quantification of same peptide in same MS measurement

  • Metabolic labeling: Stable isotope labeling with amino acids in cell culture SILAC:  growing cells in a culture medium containing a labeled AA with a stable isotope, (13C/15N). The labeled AA (Lys & Arg isotipically labeled) is incorporated into the newly synthesized proteins, resulting in a population of proteins that have a different mass than the unlabeled proteins -> trypsin cleavage (every peptides contains Lys/Arg) -> relative quantification, chromatography based, MS1 level
  • Pros:

    • Comparison of up to three conditions - no missing values
    • most accurate
    • Least error-prone: samples are combined at earliest point possible
    • Applicable to measuring relatively small biological changes
    • Medium dynamic range of quantification 
  • Cons:
    • Largely confined to cell culture systems and lower organisms
    • Expensive reagents
    • Time consuming: Conversion to heavy label takes time and can be difficult/impossible for some systems
    • Confined to the comparison of 2-3 conditions
    • Doubling/Tripling number of peptides in samples increases MS1 complexity, decreases protein IDs

peptide MS intensity dependent on

  • concentration in the sample
  • digestion efficiency
  • recovery during sample preparation
  • ionization of the peptide
  • matrix effect
  • specific MS detection properties

 

chemical labeling

  • Chemical labelig: addition of isotope tag to peptides. After protein extraction and digestion. Relative quantification using MS1 or MS2 spectra. 
    • Tandem Mass Tag TMT: isobaric labeling: all reagents add the same mass -> labeled peptides are isobaric. TMT reporter ions form upon fragmentation and have different masses. Quantification is done using the intensities of the TMT reporter ions recorded in the MS2 spectrum
    • Pros:
      • Multiplexing: Allows simultaneous measurement of up to 18 samples – no missing values; high throughput
      • No increase in complexity of MS1 spectra 
      • Tag improves peptide detection
      • Applicable to any biological sample
      • Robust: Quantification not so dependent on reproducible LC conditions, highly quantitative precision because all conditions are measured simultaneously
    • Cons
      • Limited signal in MS/MS can compromise quantification sensitivity –
      • Co-fragmenting peptides can compress ratios leading to underestimation of true ratios -> reduced accuracy due to ratio compression
      • Expensive reagents
      • Lower identification rate (dominance of abundant peptides limits dynamic range)
      • Accumulation of technical variance up to additional peptide labeling step 

spike in standard

  • Spike-in standarsSuper-SILAC: SILAC labeled cells are digested and spiked into tissue samples -> relative quantification of tissue peptides/proteins via heavy spike-in peptides for entire proteome
  • Pros:
    • Unlimited number of samples
    • More accurate than label-free
    • Applicable to all biological samples for which Super-SILAC standard can be generated
  • Cons:
    • Expensive reagents
    • Presence of labeled species increases MS1 complexity and decreases dynamic range
    • Missing values
    • Only ratios of ratios -> decreased quantiative accuracy/precision
    • standard needs to reflect sample composition, only proteins present in SILAC reference sample can be quantified

Which strategy is best for which type of experiment? (acquisition)

  • DDA/ shoutgun
    • Identification of 1000s of proteins and PRMs
    • Global proteome mapping
    • Quantification of many proteins in relatively few samples
  • IPAD
    • Precise, accurate and reproducible quantification over many samples
    • With SRM and PRM only 10s to 100s of proteins can be quantified
  • DIA
    • When you want to quantify a large number of proteins and peptides over many samples
    • When you do NOT need maximum selectivity and sensitivity
    • When your proteins/peptides of interest change in the future

clinical proteomics - promises and hurdles

Promises

  • To study disease biology (identify disease drivers, patient stratification, drug target discovery)
  • To identify and validate biomarkers
  • To treat patients according to individual needs (“personalized” precision medicine)

Hurdles

  • Heterogeneity within the sample (different cell types in tissue)
  • Heterogeneity between samples of same tissue (different composition at different sites)
  • Reference proteome?
  • Inter-patient variability (large cohorts)
  • High dynamic range (high sensitivity required)
  • Contaminations (e.g. blood in tissue samples, paraffin in FFPE)
  • Limited sample quality (ischemia times)
  • Limited amount of sample
  • Mass spectrometry requires expert knowledge
  • Expensive machinery and reagents
  • Often not very robust (maintenance, down-times of mass specs)

biomarker development pipeline

 

  • Discovery
  • Which biomedical problem?
  • What kind of specimen?
  • Small cohort (statisitical power?)
  • Shotgun proteomics
    • identification of significant proteins
    • biomarker candidates
  • Verification
  • What is appropriate patient group? (availability, regional bias, etc)
  • What are appropriate control groups? (healthy?, gender, ager etc)
  • Medium sized cohort (statistical power)
  • Targeted proteomics
    • quantification of previously identified biomarker candidates
  • Validation
  • Collaboration with clinical partners (preclinical trial) and industry (assay development)
  • Large cohort size (statistical power)
  • Often orthogonal assays, e.g. ELISA
    • Confirmation of clinical relevance in clinically relevant tissue and or body fluid in different cohorts in different geographical regions, etc

What to do when analysing data

1. Quality control: 

  • QC raw data: avoid high backgroud/ low signal/noise ratio, deterctor saturation
  • Data distribution: statistical tests require normally distributed data
  • PCA analysis: groups samples according to similarity of composition. Identify sample processing errors and batch effects
  • systematic errors due to technical variations can be solved by normalisation

2. normalisation

  • column (=experiment) wise: normalisation of unequal sample amounts 
  • row wise (=proteins): in replicate experiments: normalisation of batch effects 

3. statistical analysis

  • fold change: ratio between 2 conditions
  • statistical tests: Student T-test or ANOVA
  • volcano plot: p value vs fold change

data visualization and interpretation

  • hierarchical clustering
  • gene ontology: annotation according to molecular function, biological process and cellular compartment
  • StringDB: functional protein association networks

functions of PTMs

  • cellular localization
  • (de)activation
  • signal transmission
  • cell interaction and comunication
  • degradation
  • solubility
  • stability

Methods to study PTMs

  • targeted; antibody based methods: e.g. western blot, reversed phase protein arrays
  • explorative: mass spectrometry -> requires enrichment