Oral Presentation 23rd Annual Lorne Proteomics Symposium 2018

Bioinformatics aspects of DIA/SWATH with large extended libraries (#14)

Dana Pascovici 1 , Jemma Wu 1 , Xiaomin Song 1 , Thiri Zaw 1 , Vera Ignjatovic 2 , Mark Molloy 1
  1. Australian Proteome Analysis Facility, Macquarie University, NSW, Austria
  2. Hematology Research Laboratory, Murdoch Children's Research Institute, Melbourne, Australia

Protein quantitation using DIA/SWATH mass spectrometry relies on using high quality peptide MS/MS spectral libraries, however building such libraries to ensure deep proteome coverage can be time consuming and expensive.  In order to address this issue various computational approaches for merging archived or external libraries were created and evaluated, including efforts from our group [1].  Such approaches are appealing, since they promise to expand the set of proteins that can be quantitated via DIA/SWATH, and potentially at low costs, considering the in-silico nature of the process. However, when using larger publicly available reference libraries for extension, the risk of introducing computational artefacts by these approaches can increase as well, and particularly so if the datasets themselves are large.

Here we describe the ways in which SWATH quantitative datasets obtained using local libraries and larger extended libraries can differ, in the context of several proteomics datasets including a recently published large plasma proteomics experiment containing samples from neonates, young children and adults [2].  We also describe a few simple principles that can be used to evaluate the process of library extension itself, in order to ensure that the proteins are reliably detected and their quantitation is consistent and reproducible.  These steps are summarised in a recently described workflow [3].  Implicit in it is a filtering of the set of proteins quantitated via this project of extension, which can be used as needed depending on individual project goals.

  1. [1] Wu JX, Song X, Pascovici D, Zaw T, Care N, Krisp C, Molloy MP. SWATH Mass Spectrometry Performance Using Extended Peptide MS/MS Assay Libraries. Mol Cell Proteomics. 2016 Jul;
  2. [2] Bjelosevic S, Pascovici D, Ping H, Karlaftis V, Zaw T, Song X, Molloy MP, Monagle P, Ignjatovic V. Quantitative Age-specific Variability of Plasma Proteins in Healthy Neonates, Children and Adults. Mol Cell Proteomics. 2017 May;16
  3. [3] Wu JX, Pascovici D, Ignjatovic V, Song X, Krisp C, Molloy MP. Improving Protein Detection Confidence Using SWATH-Mass Spectrometry with Large Peptide Reference Libraries. Proteomics. 2017 Oct;17