{"id":621338,"date":"2023-03-24T05:49:25","date_gmt":"2023-03-24T10:49:25","guid":{"rendered":"https:\/\/news.sellorbuyhomefast.com\/index.php\/2023\/03\/24\/global-detection-of-human-variants-and-isoforms-by-deep-proteome-sequencing\/"},"modified":"2023-03-24T05:49:25","modified_gmt":"2023-03-24T10:49:25","slug":"global-detection-of-human-variants-and-isoforms-by-deep-proteome-sequencing","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2023\/03\/24\/global-detection-of-human-variants-and-isoforms-by-deep-proteome-sequencing\/","title":{"rendered":"Global detection of human variants and isoforms by deep proteome sequencing"},"content":{"rendered":"<p>Science &#038; Nature <\/p>\n<div>\n<div id=\"Sec1-section\" data-title=\"Main\">\n<h2 id=\"Sec1\">Main<\/h2>\n<div id=\"Sec1-content\">\n<p>Near-complete proteomes of simple organisms can be detected by mass spectrometry (MS) following only 1\u2009h of analysis<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\" title=\"Richards, A. L. et al. One-hour proteome analysis in yeast. Nat. Protoc. 10, 701\u2013714 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR1\" id=\"ref-link-section-d3926824e634\">1<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\" title=\"Hebert, A. S. et al. The one hour yeast proteome. Mol. Cell. Proteomics 13, 339\u2013347 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR2\" id=\"ref-link-section-d3926824e637\">2<\/a><\/sup>. For more complex organisms, it is possible to monitor over 10,000 proteins within a day (refs. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Gholami, A. M. et al. Global proteome analysis of the NCI-60 cell line panel. Cell Rep. 4, 609\u2013620 (2013).\" href=\"http:\/\/www.nature.com\/#ref-CR3\" id=\"ref-link-section-d3926824e641\">3<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kelstrup, C. D. et al. Performance evaluation of the Q Exactive HF-X for shotgun proteomics. J. Proteome Res. 17, 727\u2013738 (2018).\" href=\"http:\/\/www.nature.com\/#ref-CR4\" id=\"ref-link-section-d3926824e641_1\">4<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kim, M. S. et al. A draft map of the human proteome. Nature 509, 575\u2013581 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR5\" id=\"ref-link-section-d3926824e641_2\">5<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wilhelm, M. et al. Mass-spectrometry-based draft of the human proteome. Nature 509, 582\u2013587 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR6\" id=\"ref-link-section-d3926824e641_3\">6<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Adhikari, S. et al. A high-stringency blueprint of the human proteome. Nat. Commun. 11, 5301 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR7\" id=\"ref-link-section-d3926824e644\">7<\/a><\/sup>). Community-based maps of the human proteome, assembled using extensive data from various tissues and cell types from laboratories across the world, have provided evidence for the translation of >90% of annotated protein-coding genes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Adhikari, S. et al. A high-stringency blueprint of the human proteome. Nat. Commun. 11, 5301 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR7\" id=\"ref-link-section-d3926824e648\">7<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Wang, M. et al. Assembling the community-scale discoverable human proteome. Cell Syst. 7, 412\u2013421.e5 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR8\" id=\"ref-link-section-d3926824e651\">8<\/a><\/sup>. However, although the human genome contains approximately 20,000 protein-coding genes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\" title=\"Frankish, A. et al. GENCODE 2021. Nucleic Acids Res. 49, D916\u2013D923 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR9\" id=\"ref-link-section-d3926824e655\">9<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Harrow, J. et al. GENCODE: the reference human genome annotation for the ENCODE project. Genome Res. 22, 1760\u20131774 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR10\" id=\"ref-link-section-d3926824e658\">10<\/a><\/sup>, it is estimated that alternative splicing events, whereby precursor messenger RNA sequences are combined in different arrangements, have the potential to notably increase proteome diversity. Specifically, from RNA sequencing (RNA-seq) analysis of human organs, reports have estimated that transcripts from more than 95% of multi-exon genes undergo alternative splicing<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\" title=\"Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470\u2013476 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR11\" id=\"ref-link-section-d3926824e662\">11<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\" title=\"Pan, Q., Shai, O., Lee, L. J., Frey, B. J. &#038; Blencowe, B. J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 40, 1413\u20131415 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR12\" id=\"ref-link-section-d3926824e665\">12<\/a><\/sup>. Furthermore, recent single-cell transcriptome sequencing has revealed that true splice isoform complexity is likely greater than previously appreciated<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 13\" title=\"Joglekar, A. et al. A spatially resolved brain region- and cell type-specific isoform atlas of the postnatal mouse brain. Nat. Commun. 12, 463 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR13\" id=\"ref-link-section-d3926824e670\">13<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"00 title=\"Hardwick, S. A. et al. Single-nuclei isoform RNA sequencing unlocks barcoded exon connectivity in frozen brain tissue. Nat. Biotechnol. 40, 1082\u20131092 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR14\" id=\"ref-link-section-d3926824e673\">14<\/a><\/sup>. Other sources of proteome variation, such as single-amino acid polymorphisms (SAPs), alternative splicing and posttranslational modifications, further increase proteomic complexity<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Myers, R. M. et al. A user\u2019s guide to the encyclopedia of DNA elements (ENCODE). The ENCODE Project Consortium. PLoS Biol. 9, e1001046 (2011).\" href=\"http:\/\/www.nature.com\/#ref-CR15\" id=\"ref-link-section-d3926824e677\">15<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Altshuler, D. L. et al. A map of human genome variation from population-scale sequencing. Nature 467, 1061\u20131073 (2010).\" href=\"http:\/\/www.nature.com\/#ref-CR16\" id=\"ref-link-section-d3926824e677_1\">16<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Zubarev, R. A. The challenge of the proteome dynamic range and its implications for in-depth proteomics. Proteomics 13, 723\u2013726 (2013).\" href=\"http:\/\/www.nature.com\/#ref-CR17\" id=\"ref-link-section-d3926824e677_2\">17<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Sheynkman, G. M., Shortreed, M. R., Frey, B. L., Scalf, M. &#038; Smith, L. M. Large-scale mass spectrometric detection of variant peptides resulting from nonsynonymous nucleotide differences. J. Proteome Res. 13, 228\u2013240 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR18\" id=\"ref-link-section-d3926824e677_3\">18<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Sheynkman, G. M., Shortreed, M. R., Frey, B. L. &#038; Smith, L. M. Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-seq. Mol. Cell. Proteomics 12, 2341\u20132353 (2013).\" href=\"http:\/\/www.nature.com\/#ref-CR19\" id=\"ref-link-section-d3926824e677_4\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"11 title=\"Menon, R. et al. Distinct splice variants and pathway enrichment in the cell-line models of aggressive human breast cancer subtypes. J. Proteome Res. 13, 212\u2013227 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR20\" id=\"ref-link-section-d3926824e680\">20<\/a><\/sup>.<\/p>\n<p>Limitations in proteomic technology have not permitted the global-scale detection of protein diversity. Typically, for shotgun proteomic methods, the presence of an entire protein is determined using a small number of peptide proxies\u2014as few as two or three. Thus, sequence coverage in a proteomics experiment is generally insufficient to fully characterize all protein states present within a sample<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"22 title=\"Smith, L. M. &#038; Kelleher, N. L. Proteoform: a single term describing protein complexity. Nat. Methods 10, 186\u2013187 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR21\" id=\"ref-link-section-d3926824e687\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"33 title=\"Smith, L. M. et al. The human proteoform project: defining the human proteome. Sci. Adv. 7, eabk0734 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR22\" id=\"ref-link-section-d3926824e690\">22<\/a><\/sup>. Yet the ability to precisely monitor protein isoforms is essential to understanding biological systems. Even the current deepest proteomic datasets<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"44 title=\"Samaras, P. et al. ProteomicsDB: a multi-omics and multi-organism resource for life science research. Nucleic Acids Res. 48, D1153\u2013D1163 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR23\" id=\"ref-link-section-d3926824e694\">23<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"55 title=\"Omenn, G. S. et al. Research on the human proteome reaches a major milestone: >90% of predicted human proteins now credibly detected, according to the HUPO human proteome project. J. Proteome Res. 19, 4735\u20134746 (2020).&#8221; href=&#8221;http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR24&#8243; id=&#8221;ref-link-section-d3926824e697&#8243;>24<\/a><\/sup> do not contain enough sequence data to globally identify proteoforms. One approach to achieving proteoform-level detection is top-down MS, a strategy that measures intact protein mass before dissociation for sequence determination using tandem mass spectrometry (MS\/MS). Ensuring no loss in resolution, the top-down strategy is appealing. Practical issues with high-mass proteins, sequence coverage and detection of low-abundance species, however, limit its impact<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"66 title=\"Toby, T. K., Fornelli, L. &#038; Kelleher, N. L. Progress in top-down proteomics and the analysis of proteoforms. Annu. Rev. Anal. Chem. (Palo Alto Calif.) 9, 499\u2013519 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR25\" id=\"ref-link-section-d3926824e701\">25<\/a><\/sup>.<\/p>\n<p>Given the technical hurdles with top-down proteomics, we revisited the shotgun strategy. Shotgun proteomics preferentially relies on trypsin to catalyze hydrolysis of proteins. Trypsin cleaves C-terminal to lysine and arginine residues and produces peptides of length and charge distributions most amenable to MS\/MS. However, even with the assistance of extensive chromatographic separation, not all portions of the proteome are accessible from tryptic peptides<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"77 title=\"Meyer, J. G. et al. Expanding proteome coverage with orthogonal-specificity \u03b1-lytic proteases. Mol. Cell. Proteomics 13, 823\u2013835 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR26\" id=\"ref-link-section-d3926824e708\">26<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"88 title=\"Giansanti, P., Tsiatsiani, L., Low, T. Y. &#038; Heck, A. J. R. Six alternative proteases for mass spectrometry-based proteomics beyond trypsin. Nat. Protoc. 11, 993\u20131006 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR27\" id=\"ref-link-section-d3926824e711\">27<\/a><\/sup>; many of the peptides produced are either too short or too long to be detected using current liquid chromatography\u2013mass spectrometry (LC\u2013MS) technology. As proteoforms can differ by a small number of amino acids, extensive sequence coverage is crucial for distinguishing near-identical variants. The use of alternative enzymes in addition to trypsin during digestion can increase the amino acid coverage of individual proteins, phosphorylation sites and whole proteomes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Aebersold, R. H., Leavitt, J., Saavedra, R. A., Hood, L. E. &#038; Kent, S. B. Internal amino acid sequence analysis of proteins separated by one- or two-dimensional gel electrophoresis after in situ protease digestion on nitrocellulose. Proc. Natl Acad. Sci. USA 84, 6970\u20136974 (1987).\" href=\"http:\/\/www.nature.com\/#ref-CR28\" id=\"ref-link-section-d3926824e715\">28<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"MacCoss, M. J. et al. Shotgun identification of protein modifications from protein complexes and lens tissue. Proc. Natl Acad. Sci. USA 99, 7900\u20137905 (2002).\" href=\"http:\/\/www.nature.com\/#ref-CR29\" id=\"ref-link-section-d3926824e715_1\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Choudhary, G., Wu, S. L., Shieh, P. &#038; Hancock, W. S. Multiple enzymatic digestion for enhanced sequence coverage of proteins in complex proteomic mixtures using capillary LC with ion trap MS\/MS. J. Proteome Res. 2, 59\u201367 (2003).\" href=\"http:\/\/www.nature.com\/#ref-CR30\" id=\"ref-link-section-d3926824e715_2\">30<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Harper, R. G., Workman, S. R., Schuetzner, S., Timperman, A. T. &#038; Sutton, J. N. Low-molecular-weight human serum proteome using ultrafiltration, isoelectric focusing, and mass spectrometry. Electrophoresis 25, 1299\u20131306 (2004).\" href=\"http:\/\/www.nature.com\/#ref-CR31\" id=\"ref-link-section-d3926824e715_3\">31<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Schlosser, A., Vanselow, J. T. &#038; Kramer, A. Mapping of phosphorylation sites by a multi-protease approach with specific phosphopeptide enrichment and NanoLC-MS\/MS analysis. Anal. Chem. 77, 5243\u20135250 (2005).\" href=\"http:\/\/www.nature.com\/#ref-CR32\" id=\"ref-link-section-d3926824e715_4\">32<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Biringer, R. G. et al. Enhanced sequence coverage of proteins in human cerebrospinal fluid using multiple enzymatic digestion and linear ion trap LC-MS\/MS. Brief. Funct. Genomic. Proteomic. 5, 144\u2013153 (2006).\" href=\"http:\/\/www.nature.com\/#ref-CR33\" id=\"ref-link-section-d3926824e715_5\">33<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Elenitoba-Johnson, K. S. J. et al. Proteomic identification of oncogenic chromosomal translocation partners encoding chimeric anaplastic lymphoma kinase fusion proteins. Proc. Natl Acad. Sci. USA 103, 7402\u20137407 (2006).\" href=\"http:\/\/www.nature.com\/#ref-CR34\" id=\"ref-link-section-d3926824e715_6\">34<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wang, B., Malik, R., Nigg, E. A. &#038; K\u00f6rner, R. Evaluation of the low-specificity protease elastase for large-scale phosphoproteome analysis. Anal. Chem. 80, 9526\u20139533 (2008).\" href=\"http:\/\/www.nature.com\/#ref-CR35\" id=\"ref-link-section-d3926824e715_7\">35<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Gauci, S. et al. Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach. Anal. Chem. 81, 4493\u20134501 (2009).\" href=\"http:\/\/www.nature.com\/#ref-CR36\" id=\"ref-link-section-d3926824e715_8\">36<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Swaney, D. L., Wenger, C. D. &#038; Coon, J. J. Value of using multiple proteases for large-scale mass spectrometry-based proteomics. J. Proteome Res. 9, 1323\u20131329 (2010).\" href=\"http:\/\/www.nature.com\/#ref-CR37\" id=\"ref-link-section-d3926824e715_9\">37<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Guo, X., Trudgian, D. C., Lemoff, A., Yadavalli, S. &#038; Mirzaei, H. Confetti: a multiprotease map of the HeLa proteome for comprehensive proteomics. Mol. Cell. Proteomics 13, 1573\u20131584 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR38\" id=\"ref-link-section-d3926824e715_10\">38<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Giansanti, P. et al. An augmented multiple-protease-based human phosphopeptide atlas. Cell Rep. 11, 1834\u20131843 (2015).\" href=\"http:\/\/www.nature.com\/#ref-CR39\" id=\"ref-link-section-d3926824e715_11\">39<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Bekker-Jensen, D. B. et al. An optimized shotgun strategy for the rapid generation of comprehensive human proteomes. Cell Syst. 4, 587\u2013599 (2017).\" href=\"http:\/\/www.nature.com\/#ref-CR40\" id=\"ref-link-section-d3926824e715_12\">40<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Miller, R. M. et al. Improved protein inference from multiple protease bottom-up mass spectrometry data. J. Proteome Res. 18, 3429\u20133438 (2019).\" href=\"http:\/\/www.nature.com\/#ref-CR41\" id=\"ref-link-section-d3926824e715_13\">41<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wang, D. et al. A deep proteome and transcriptome abundance atlas of 29 healthy human tissues. Mol. Syst. Biol. 15, e8503 (2019).\" href=\"http:\/\/www.nature.com\/#ref-CR42\" id=\"ref-link-section-d3926824e715_14\">42<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Dau, T., Bartolomucci, G. &#038; Rappsilber, J. Proteomics using protease alternatives to trypsin benefits from sequential digestion with trypsin. Anal. Chem. 92, 9523\u20139527 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR43\" id=\"ref-link-section-d3926824e715_15\">43<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"99 title=\"Richards, A. L. et al. Data-independent acquisition protease-multiplexing enables increased proteome sequence coverage across multiple fragmentation modes. J. Proteome Res. 21, 1124\u20131136 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR44\" id=\"ref-link-section-d3926824e718\">44<\/a><\/sup>. However, given the considerably increased effort involved, this strategy is not amenable to routine use and to our knowledge has not been previously employed for the global-scale detection of proteoforms.<\/p>\n<p>In this study, we investigate whether the separate digestion of human proteomes expressed in six different cell lines with six different proteases, coupled with extensive liquid chromatography (LC) fractionation and state-of-the-art MS, produces sufficient sequence depth to afford a global assessment of how genomic variants and alternative splicing are incorporated into the proteome. Generated peptides were extensively fractionated before analysis on an Orbitrap Tribrid mass spectrometer, where they were dissociated using various fragmentation methods, including higher-energy collisional dissociation (HCD)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"00 title=\"Olsen, J. V. et al. Higher-energy C-trap dissociation for peptide modification analysis. Nat. Methods 4, 709\u2013712 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR45\" id=\"ref-link-section-d3926824e725\">45<\/a><\/sup>, collisionally activated dissociation (CAD)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"11 title=\"Mitchell Wells, J. &#038; McLuckey, S. A. Collision-induced dissociation (CID) of peptides and proteins. Methods Enzymol. 402, 148\u2013185 (2005).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR46\" id=\"ref-link-section-d3926824e729\">46<\/a><\/sup> and electron transfer dissociation (ETD)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"22 title=\"Coon, J. J., Shabanowitz, J., Hunt, D. F. &#038; Syka, J. E. P. Electron transfer dissociation of peptide anions. J. Am. Soc. Mass. Spectrom. 16, 880\u2013882 (2005).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR47\" id=\"ref-link-section-d3926824e733\">47<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"33 title=\"Syka, J. E., Coon, J. J., Schroeder, M. J., Shabanowitz, J. &#038; Hunt, D. F. Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. Proc. Natl. Acad. Sci. USA 101, 9528\u20139533 (2004).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR48\" id=\"ref-link-section-d3926824e736\">48<\/a><\/sup>. We collected ~20\u2009million high-resolution mass spectra and ~164\u2009million MS\/MS spectra from ~2,500 nano-scale liquid chromatography-tandem mass spectrometry (nLC\u2013MS\/MS) experiments. The combined data enabled identification of 17,717 unique proteins with an overall median sequence coverage of 79.2%. Using these data, we provide a global view of genomic and transcriptomic sequence variant expression at the protein level. From a direct comparison with quantitative RNA-seq data, we detect ~80% of SAPs and ~20% of exon\u2013exon junctions, representing both inclusion and skipping of frame-preserving alternative splicing events. However, for proteins with the highest proteomics sequence coverage, represented by genes with relatively high expression (that is, log<sub>2<\/sub> of reads per kilobase per million (RPKM) of \u22657) at the transcript level, ~64% of frame-preserving alternatively splicing events are detected and the rates of detection of constitutively spliced and alternatively spliced junctions are similar. And finally, using the extensive, overlapping peptide sequence information provided by this resource, we demonstrate the feasibility of de novo protein assembly. Data generated from the present study represent the deepest proteomics map collected to date and have been compiled into an online resource at deep-sequencing.app. These methods and resources lay the foundation for comprehensive mapping of protein diversity and are expected to catalyze future research efforts.<\/p>\n<\/div>\n<\/div>\n<div id=\"Sec2-section\" data-title=\"Results\">\n<h2 id=\"Sec2\">Results<\/h2>\n<div id=\"Sec2-content\">\n<h3 id=\"Sec3\">Deep human proteome sequencing<\/h3>\n<p>In silico tryptic digestion of the ~21,030 reviewed canonical protein sequences of the human proteome (UniProtKB\/Swiss-Prot) predicts 2.3\u2009million tryptic peptides of suitable size for MS detection (7\u201335 amino acids, up to two missed cleavages). These peptides comprise 9.9\u2009million amino acid residues of the 11.5\u2009million total\u2014that is, only 86% of the proteome. If we consider digestion of the same proteins using the six enzymes in our study (LysC, LysN, AspN, chymotrypsin, GluC and trypsin), 7.4\u2009million peptides suitable for shotgun proteomics are generated. These peptides cover 99% of the amino acids contained in the human proteome.<\/p>\n<p>To test the hypothesis that we can in such manner increase coverage of the human proteome, we selected six diverse human cell lines: hES1, an embryonic stem cell line; HeLa S3, from cervical carcinoma; HepG2, from liver carcinoma; GM12878, a blood lymphoblastoid line; K562, from chronic myeloid leukemia; and HUVEC, from umbilical vein epithelial cells (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig1\">1<\/a>). Having been included in the Encyclopedia of DNA Elements (ENCODE) project, these cell lines have a large amount of publicly available genomic and transcriptomic data<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"44 title=\"Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101\u2013108 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR49\" id=\"ref-link-section-d3926824e760\">49<\/a><\/sup>. Proteins from each cell line were separately digested with the six proteases listed above. To maximize depth, the resultant peptides were heavily fractionated (24\u201380 fractions) and analyzed using nano flow LC coupled with quadrupole-Orbitrap\u2013linear ion trap hybrid MS systems. Dissociation for MS\/MS was achieved using HCD, CAD and ETD. The resulting 2,491 raw files were simultaneously analyzed by database search to identify proteins and peptides using the Andromeda search engine<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"55 title=\"Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 10, 1794\u20131805 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR50\" id=\"ref-link-section-d3926824e764\">50<\/a><\/sup> inside MaxQuant<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"66 title=\"Tyanova, S., Temu, T. &#038; Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301\u20132319 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR51\" id=\"ref-link-section-d3926824e768\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"77 title=\"Cox, J. &#038; Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367\u20131372 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR52\" id=\"ref-link-section-d3926824e771\">52<\/a><\/sup>, and results were sequentially filtered to 1% peptide spectrum matches (PSMs) and protein-level false discovery rate (FDR) over the whole dataset.<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-1\" data-title=\"Deep proteome sequencing workflow.\">\n<figure><figcaption><b id=\"Fig1\" data-test=\"figure-caption-text\">Fig. 1: Deep proteome sequencing workflow.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/1\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig1\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_Fig1_HTML.png\" alt=\"Science &amp; Nature figure 1\" loading=\"lazy\" width=\"685\" height=\"195\"><\/picture><\/a><\/div>\n<p>Six human cell lines were grown in parallel, their proteomes were isolated and then one of the six proteases was used to digest separate aliquots of each proteome in parallel. Peptides resulting from each digestion were fractionated by high-pH RP chromatography and then analyzed separately with nLC\u2013MS\/MS using HCD, ETD and CAD. The resulting data were searched with MaxQuant<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"88 title=\"Tyanova, S., Temu, T. &#038; Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301\u20132319 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR51\" id=\"ref-link-section-d3926824e787\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"99 title=\"Cox, J. &#038; Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367\u20131372 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR52\" id=\"ref-link-section-d3926824e790\">52<\/a><\/sup> against the human proteome database, and over 17,000 proteins were identified by peptides that produce a median coverage of over 80%. The high coverage achieved is illustrated on the sequence of hemoglobin subunit gamma-1, with color coding to illustrate the number of unique peptides that cover each amino acid position.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/1\" data-track-dest=\"link:Figure1 Full size image\" aria-label=\"Reference 7\"00 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2<\/a> summarizes these data, showcasing the depth of coverage and gains achieved by the multi-enzyme approach. For each cell line, an average of 539,325 unique peptides, corresponding to ~16,000 proteins, were identified (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2a<\/a>). The highest number of identified proteins was from the hES1 cell line (17,121), followed by HeLa S3 (16,399), GM12878 (16,344), HepG2 (16,328), HUVEC (16,158) and K562 (16,054). The trypsin dataset contributed the largest number of unique peptides (396,782), followed by LysN (194,506), LysC (193,956), GluC (162,784), AspN (152,259) and chymotrypsin (114,152). Properties of detected peptides, such as a number of missed cleavages, length distribution and cleavage motif, are in high agreement with previous proteomics multi-enzyme studies (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">1<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"11 title=\"Meyer, J. G. et al. Expanding proteome coverage with orthogonal-specificity \u03b1-lytic proteases. Mol. Cell. Proteomics 13, 823\u2013835 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR26\" id=\"ref-link-section-d3926824e814\">26<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"22 title=\"Giansanti, P., Tsiatsiani, L., Low, T. Y. &#038; Heck, A. J. R. Six alternative proteases for mass spectrometry-based proteomics beyond trypsin. Nat. Protoc. 11, 993\u20131006 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR27\" id=\"ref-link-section-d3926824e817\">27<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"33 title=\"Swaney, D. L., Wenger, C. D. &#038; Coon, J. J. Value of using multiple proteases for large-scale mass spectrometry-based proteomics. J. Proteome Res. 9, 1323\u20131329 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR37\" id=\"ref-link-section-d3926824e820\">37<\/a><\/sup>. Notably, within each cell line, data from each enzyme digestion alone identified over 10,000 protein groups. Data from tryptic peptides contributed the largest number of identifications and unique sequences, totaling 17,631 proteins with 56.5% median sequence coverage. However, using all data comprising all proteases afforded a modest increase in the number of identified proteins (17,717) but considerably boosted the median sequence coverage to 79.2%. In total, we identified 12,151,708 PSMs and 1,119,510 unique peptides at FDR of 1%. Of those, 790 proteins were identified with complete sequence coverage. The average number of unique peptides per protein was 97 (median 65). However, 54 proteins were identified by only one unique peptide; only 1,122 proteins, or 6.3% of the total proteins, were identified by ten or fewer unique peptides. Median sequence coverage for the combined dataset and the contribution from subsets is shown in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2b<\/a>, and ranges from 49.7% (HUVEC; 16,158 proteins) to 63.9% (HeLa S3; 16,399 proteins). Remarkably, nearly half of all identified proteins were observed with 80\u2013100% sequence coverage (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">2a,b<\/a>). Only 936 proteins, or 5.3% of the total data, have sequence coverage below 25%.<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-2\" data-title=\"Overview of results from deep proteomics analysis.\">\n<figure><figcaption><b id=\"Fig2\" data-test=\"figure-caption-text\">Fig. 2: Overview of results from deep proteomics analysis.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/2\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig2\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_Fig2_HTML.png\" alt=\"Science &amp; Nature figure 2\" loading=\"lazy\" width=\"685\" height=\"742\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, Number of proteins detected for each of the six cell lines and cumulative as a function of peptides from the various protease digests. <b>b<\/b>, Median sequence coverage of various cell line proteomes achieved by digests with individual proteases and by combining all protease results. Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">2c<\/a> shows sequence coverage distributions separately for all combinations of cell lines, proteases and fragmentation methods. <b>c<\/b>, Venn diagram of all observed amino acids digested by trypsin versus all proteases combined excluding trypsin. <b>d<\/b>, Sequence coverage for each of the detected proteins for the tryptic peptide data (red) and combined protease digests, including trypsin (gray). <b>e<\/b>, Observed (dark gray) and theoretical (light gray) distributions of sequence coverage achieved for various combinations of proteases. The top three combinations of 2, 3, 4 or 5 proteases are displayed. <b>f<\/b>, Protein coverage comparison of transmembrane and nonmembrane proteins. For <b>e<\/b> and <b>f<\/b>, the lower whisker\/quartile and upper quartile\/whisker show the 5th, 25th, 75th and 95th percentiles, accordingly. <b>g<\/b>, Relative protein coverage of N terminus (left) and C terminus (right) transmembrane segments. Chymo., chymotrypsin.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/2\" data-track-dest=\"link:Figure2 Full size image\" aria-label=\"Reference 7\"44 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>The addition of enzymes other than trypsin provided a slight increase in the total number of proteins identified but induced a large increase in the nonredundant amino acids detected. The 17,717 detected human proteins comprise 12,006,700 amino acid residues, including those that arise from noncanonical proteins, that is, isoforms. In total, the unique peptides identified in the combined tryptic datasets from all cell lines detected approximately half of these amino acids (6,113,639). The number of covered amino acids rises to 8,291,681 when all protease data are used (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2c<\/a>). Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2d<\/a> illustrates the impact of these additional amino acids on protein sequence coverage. Next, we determined the most optimal multi-protease combinations (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2e<\/a>), noting that all top combinations included trypsin. Our total human proteome coverage is, to our knowledge, the largest to date, with 2.12\u2009million more residues (a 34.4% increase) over the 6.17\u2009million identified using exclusively tryptic peptides from the entire MassIVE data repository (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">2d<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"55 title=\"Wang, M. et al. Assembling the community-scale discoverable human proteome. Cell Syst. 7, 412\u2013421.e5 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR8\" id=\"ref-link-section-d3926824e896\">8<\/a><\/sup>. Finally, we compared the proteins identified in this study with the curated neXtProt database<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"66 title=\"Adhikari, S. et al. A high-stringency blueprint of the human proteome. Nat. Commun. 11, 5301 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR7\" id=\"ref-link-section-d3926824e901\">7<\/a><\/sup>, which categorizes proteins across five groups based on the strength of the evidence for their existence. As shown in Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">3<\/a>, most of our protein identifications (13,603 proteins) fall into the highest-confidence category (PE1), and 79 proteins now can be promoted to PE1 status from lower categories (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM3\">1<\/a>).<\/p>\n<p>Alternative proteases have previously been utilized to uncover novel portions of the proteome, including membrane proteins<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"77 title=\"Gilmore, J. M. &#038; Washburn, M. P. Advances in shotgun proteomics and the analysis of membrane proteomes. J. Proteomics 73, 2078\u20132091 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR53\" id=\"ref-link-section-d3926824e915\">53<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"88 title=\"Washburn, M. P., Wolters, D. &#038; Yates, J. R. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. Biotechnol. 19, 242\u2013247 (2001).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR54\" id=\"ref-link-section-d3926824e918\">54<\/a><\/sup>. These proteins\u2014essential to many biological processes and representing important drug discovery targets<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"99 title=\"Wu, C. C. &#038; Yates, J. R. The application of mass spectrometry to membrane proteomics. Nat. Biotechnol. 21, 262\u2013267 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR55\" id=\"ref-link-section-d3926824e922\">55<\/a><\/sup>\u2014remain under-represented in proteomics datasets due to their hydrophobic nature. This is also true of our dataset. Gene ontology cellular component pathway enrichment analysis of the proteins with sequence coverage below 25% revealed that these low-coverage proteins were primarily membrane proteins (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">2e<\/a>). Indeed, we also observe a coverage reduction for transmembrane proteins across all studied proteases (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2f<\/a>). To further explore the behavior of peptides generated from transmembrane-spanning sequences, we calculated the enzyme-specific coverage of aligned membrane-spanning regions to either the N or C terminus (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2g<\/a>). These data demonstrate that because transmembrane regions are depleted for typical protease cleavage sites, peptides suitable for detection by shotgun proteomics are less likely to be observed. This conclusion is further supported by the strong relative performance of chymotrypsin, which is atypical in cleaving at hydrophobic residues, as compared with the other proteases.<\/p>\n<h3 id=\"Sec4\">De novo protein assembly<\/h3>\n<p>Protein inference is conceptually akin to reference transcriptome assembly in short-read sequencing, where a previously assembled proteome or genome database is required to map peptide sequences or nucleic acid reads, respectively. In proteomics, however, genome assemblies for proteome database generation are either unavailable or low-quality for many organisms. Several tools are available to assemble short sequencing reads without a reference genome, such as SOAPdenovo-Trans<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"00 title=\"Xie, Y. et al. SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-seq reads. Bioinformatics 30, 1660\u20131666 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR56\" id=\"ref-link-section-d3926824e943\">56<\/a><\/sup>. However, de novo assembly of nucleic acid sequences relies on the presence of randomly overlapping sequences, which is not a common property of proteomic datasets, which typically use only a single enzyme (for example, trypsin).<\/p>\n<p>With the data from six different proteases and deep coverage presented above, we produce many peptides with partial overlap, which we hypothesized may enable de novo protein assembly. An excellent example for the de novo assembly is the proteasome subunit alpha type-6, which is represented by full sequence coverage (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">4a<\/a>). Overall, the de novo assembly produced 35,480 scaffolds, of which 16,496 (~47%) correctly match to 9,695 protein groups. Median sequence coverage from the de novo assembly was 18% compared with 79.2% for the reference assembly (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">4b,c<\/a>). Assembled scaffolds have a range of 33\u2013358 amino acids with a median length of 45 (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">4d<\/a>), and an average of two scaffolds were mapped to each protein (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">4e<\/a>). These results demonstrate the feasibility of de novo proteome assembly using overlapping peptides from multiple protease digestions of the proteome; application of proteomics-specific assembly methods may improve this result in the future<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"11 title=\"Guthals, A., Clauser, K. R. &#038; Bandeira, N. Shotgun protein sequencing with meta-contig assembly. Mol. Cell. Proteomics 11, 1084\u20131096 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR57\" id=\"ref-link-section-d3926824e962\">57<\/a><\/sup>.<\/p>\n<h3 id=\"Sec5\">Majority of hypothetical SAPs are confirmed in the proteome<\/h3>\n<p>SAPs are variations in the protein sequence which often arise from single nucleotide polymorphisms (SNPs) that result in nonsynonymous codon changes in genomic sequence. The HeLa S3 cell line used in this study contains ~4.5\u2009million SNPs when compared with the hg38 reference human genome. Of these, ~30,000 occur in coding regions, and 4,740 result in nonsynonymous codon changes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"22 title=\"Landry, J. J. M. et al. The genomic and transcriptomic landscape of a HeLa cell line. G3 (Bethesda) 3, 1213\u20131224 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR58\" id=\"ref-link-section-d3926824e974\">58<\/a><\/sup>. We assessed whether our deep proteomics data would afford the ability to determine whether these SNPs are translated into SAPs. To this end, we searched for SAPs with a MaxQuant module which is tailored for the identification of peptide evidence for the translation of genomic variations (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">5<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"33 title=\"Sinitcyn, P., Gerwien, M. &#038; Cox, J. MaxQuant module for the identification of genomic variants propagated into peptides. Methods Mol. Biol. 2456, 339\u2013347 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR59\" id=\"ref-link-section-d3926824e981\">59<\/a><\/sup>. From this analysis, we observe protein-level evidence for up to 2,179 SAPs in individual cell lines, or a total of 5,060 SAPs (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3a<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM3\">2<\/a>). To assess the quality of these SAP-containing peptide identifications, we performed a correlation analysis of all peptide spectral matches both with and without SAPs (mutated and reference peptides, respectively). Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3b<\/a> demonstrates the distribution of correlation coefficients between observed and predicted MS\/MS spectra using the machine learning-based tool DeepMass<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"44 title=\"Tiwary, S. et al. High-quality MS\/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. Nat. Methods 16, 519\u2013525 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR60\" id=\"ref-link-section-d3926824e995\">60<\/a><\/sup> for mutated and reference peptides. The baseline is drawn for peptides with multiple fragmentation spectra, which are compared with each other. The distributions for reference and mutated peptides are similar, providing increased confidence that these peptide spectral matches are legitimate.<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-3\" data-title=\"Discovery of proteins with SAPs.\">\n<figure><figcaption><b id=\"Fig3\" data-test=\"figure-caption-text\">Fig. 3: Discovery of proteins with SAPs.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/3\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig3\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_Fig3_HTML.png\" alt=\"Science &amp; Nature figure 3\" loading=\"lazy\" width=\"685\" height=\"754\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, Comparison of SAPs discovered in the ENCODE transcriptomic data (Trans) and presented proteomics data (Prot) for each of the cell lines. <b>b<\/b>, Distribution of correlation coefficients between observed and predicted by DeepMass<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"55 title=\"Tiwary, S. et al. High-quality MS\/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. Nat. Methods 16, 519\u2013525 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR60\" id=\"ref-link-section-d3926824e1016\">60<\/a><\/sup> spectra. The baseline distribution shows acquisition-to-acquisition variation by comparing observed spectra for peptides. The white circle shows the median value. The lower and upper quartiles of the box demonstrate the 25th and 75th percentiles, accordingly. The lower and upper whiskers show the 5th and 95th percentiles, accordingly. The distributions are based on 5,128,969, 442,476, 16,516 and 4,969 comparisons (from left to right). <b>c<\/b>, Clustered binary heatmap of the detected SAPs row-grouped by cell line and omics platform (transcriptomics or proteomics). Blue rectangles highlight clusters specific to each cell line, and the green rectangle SAPs that are conserved across all cell lines. <b>d<\/b>, Gene ontology (GO) enrichment of genes with SAPs detected or undetected by MS. Genes with a mixed population of SAPs were removed, and repeats collapsed. Blue dots highlight GO terms with the word \u2018membrane\u2019 mentioned in the name. <b>e<\/b>. SIFT-generated<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"66 title=\"Kumar, P., Henikoff, S. &#038; Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073\u20131081 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR61\" id=\"ref-link-section-d3926824e1030\">61<\/a><\/sup> score distribution over four categories for detected and undetected SAPs. Applying the two-sided Wilcoxon rank sum test on the raw scores results in <i>P<\/i> value of 2\u2009\u00d7\u200910<sup>\u22128<\/sup>. <b>f<\/b>, The same as <b>e<\/b>, but for the PolyPhen-2 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"77 title=\"Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248\u2013249 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR62\" id=\"ref-link-section-d3926824e1045\">62<\/a><\/sup>) tool. Applying the two-sided Wilcoxon rank sum test on the raw scores results in <i>P<\/i> value of 1.1\u2009\u00d7\u200910<sup>\u221212<\/sup>.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/3\" data-track-dest=\"link:Figure3 Full size image\" aria-label=\"Reference 8\"88 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>For all cell lines except HUVEC, we observed high overlap between the mutations detected by transcriptomics and by proteomics (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">6a<\/a>). Given HUVEC is the only primary cell line (that is, obtained directly from host tissue) in the study, this low overlap is expected as the transcriptomic and proteomic data were collected from cells originating from different donors. Therefore, we omitted HUVEC from further analysis. Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3a<\/a> shows that most nonsynonymous SNPs that appear in the transcript also appear at the protein level (median 73% over all studied cell lines). Further, the multi-enzyme data led on average to a doubling of identified SAPs compared with when only trypsin was used (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">6a<\/a>).<\/p>\n<p>Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3c<\/a> shows the presence of variants as a function of cell line and whether they are detected at the protein level. We note that there are primarily two types of SAP\u2014those that are cell line specific (highlighted within a blue rectangle) and those that are conserved across the cell lines (highlighted within a green rectangle). Enrichment analysis of the SAPs found only at the transcriptomic level (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3d<\/a>) revealed several gene ontology terms associated with membrane protein families\u2014supporting our earlier conclusions that peptides for such proteins are less amenable to MS analysis.<\/p>\n<p>To test whether some of the mutations that were undetected at the protein level, even though transcripts evidence was present, caused protein instability, we leveraged the SIFT<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"99 title=\"Kumar, P., Henikoff, S. &#038; Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073\u20131081 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR61\" id=\"ref-link-section-d3926824e1087\">61<\/a><\/sup> and PolyPhen-2 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"00 title=\"Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248\u2013249 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR62\" id=\"ref-link-section-d3926824e1091\">62<\/a><\/sup>) tools. These software tools predict how an amino acid mutation can alter protein structure and function by classifying mutations as either benign or deleterious. As depicted in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3e,f<\/a>, both algorithms predict a significant shift (<i>P<\/i> values of 2\u2009\u00d7\u200910<sup>\u22128<\/sup> and 1.1\u2009\u00d7\u200910<sup>\u221212<\/sup>, respectively from two-sided Wilcoxon rank sum test) in the fraction of deleterious mutations for the undetected SAP group. These data confirm that at least a subset of undetected SAPs likely arise from cases where the mutation induces protein instability.<\/p>\n<h3 id=\"Sec6\">Protein-level evidence for alternative splicing<\/h3>\n<p>The high proteome sequence coverage of our dataset provides an opportunity to globally detect protein isoforms arising from alternative splicing and affords a direct assessment of the degree to which this process contributes to proteomic complexity. As mentioned above, RNA-seq analyses of diverse human organs and cell lines have provided evidence that more than 95% of multi-exon genes produce alternatively spliced transcripts<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"11 title=\"Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470\u2013476 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR11\" id=\"ref-link-section-d3926824e1114\">11<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"22 title=\"Pan, Q., Shai, O., Lee, L. J., Frey, B. J. &#038; Blencowe, B. J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 40, 1413\u20131415 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR12\" id=\"ref-link-section-d3926824e1117\">12<\/a><\/sup>. However, the extent to which alternative transcripts with the potential to encode different proteins are translated has been the subject of considerable debate<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"33 title=\"Tress, M. L., Abascal, F. &#038; Valencia, A. Alternative splicing may not be the key to proteome complexity. Trends Biochem. Sci. 42, 98\u2013110 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR63\" id=\"ref-link-section-d3926824e1121\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"44 title=\"Blencowe, B. J. The relationship between alternative splicing and proteomic complexity. Trends Biochem. Sci. 42, 407\u2013408 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR64\" id=\"ref-link-section-d3926824e1124\">64<\/a><\/sup>, in large part due to the lack of MS datasets with sufficiently deep coverage. Accordingly, using the high-coverage data generated here, we assessed the proportion of alternatively spliced transcript variants that are detected in the proteome.<\/p>\n<p>To assess the extent to which it is possible to detect splicing within our dataset, we first determined the relative proportions of peptides that fall entirely within exons versus those that span exon\u2013exon junctions. Approximately 30% of identified peptide sequences span junction sequences formed by splicing of protein-coding exons (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">7a<\/a>). Notably, trypsin generates the lowest ratio of junction-spanning versus exon body peptides of all proteases used in this study (~25% versus 28\u201332%) (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">7a<\/a>). This observation confirms in silico predictions of the limited utility of trypsin alone for detection of spliced junction sequences in shotgun proteomics data<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"55 title=\"Wang, X. et al. Detection of proteome diversity resulted from alternative splicing is limited by Trypsin cleavage specificity. Mol. Cell. Proteomics 17, 422\u2013430 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR65\" id=\"ref-link-section-d3926824e1137\">65<\/a><\/sup>. In particular, peptides from trypsin and LysC digestion that fully map within exons have a clear bias which coincides with the first or last amino acids encoded by exons (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">8b<\/a>). Additionally, exon-spanning LysN peptides tend to overlap by a single amino acid at their C termini (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">8c<\/a>). These data are also consistent with a high frequency of lysine residues overlapping splice sites<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"66 title=\"Wang, X. et al. Detection of proteome diversity resulted from alternative splicing is limited by Trypsin cleavage specificity. Mol. Cell. Proteomics 17, 422\u2013430 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR65\" id=\"ref-link-section-d3926824e1148\">65<\/a><\/sup> and illustrate the importance of utilizing additional proteases (chymotrypsin, AspN, GluC and so on) when attempting to detect splice isoforms.<\/p>\n<p>Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig4\">4<\/a> illustrates our strategy for detection of translated alternative splicing events. In the example provided, alternative splicing of a cassette exon (exon 8) of the Amyloid precursor protein (<i>APP<\/i>) gene is detected by a combination of peptides spanning exons 7 and 9, the junction formed by skipping of the exon, and by peptides spanning exons 7 and 8 or exons 8 and 9, which are formed by inclusion of the exon. In total, we detect 11 unique peptides spanning these three junctions, thus confirming translation of isoforms resulting from inclusion and skipping of the exon. Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5a<\/a> depicts the major classes of alternative splicing events and the detection frequencies of these as they appear in RNA-seq data<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"77 title=\"Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101\u2013108 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR49\" id=\"ref-link-section-d3926824e1164\">49<\/a><\/sup> generated from all six cell lines analyzed in this study, and the numbers of these events detected at the proteomics level, when considering peptides mapping to one of both possible resulting isoforms (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM3\">3<\/a>). With a requirement for expression of at least one of two isoforms, we detect 4,608 of 13,450 (34.3%) alternative splicing events (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5a<\/a>). Notably, of 6,145 alternative splicing events with RNA-seq expression evidence for both alternatives, we detect 1,141 (18.6%) at the protein level, where junction-spanning peptides representing both alternative isoforms are identified.<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-4\" data-title=\"Example of proteomics data corroborating occurrence of an alternative splicing (AS) event in APP.\">\n<figure><figcaption><b id=\"Fig4\" data-test=\"figure-caption-text\">Fig. 4: Example of proteomics data corroborating occurrence of an alternative splicing (AS) event in APP.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/4\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig4\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_Fig4_HTML.png\" alt=\"Science &amp; Nature figure 4\" loading=\"lazy\" width=\"685\" height=\"602\"><\/picture><\/a><\/div>\n<p>The initial sequential order of exons undergoes transcription. Splicing processing follows, resulting in either 7\u20139 or 7\u20138\u20139 exon combinations. Since all mentioned exons are part of APP\u2019s open reading frame, they have a theoretical possibility to be present and translated into a protein sequence. The multi-enzyme shotgun MS approach described here allows detection of peptides specific to each isoform. Two of 42 total spectra, corroborating these splicing events, are shown.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/4\" data-track-dest=\"link:Figure4 Full size image\" aria-label=\"Reference 9\"88 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-5\" data-title=\"Properties of detected exon skipping AS events.\">\n<figure><figcaption><b id=\"Fig5\" data-test=\"figure-caption-text\">Fig. 5: Properties of detected exon skipping AS events.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/5\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig5\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_Fig5_HTML.png\" alt=\"Science &amp; Nature figure 5\" loading=\"lazy\" width=\"685\" height=\"780\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, Summary table of annotated, detected by transcriptomics and proteomics splicing events. AS events are further subdivided into groups with expression evidence for at least one or both alternatives. <b>b<\/b>, Proteomics detection rate of exon skipping AS events as a function of expression. Each gene is grouped by expression level as obtained from RNA-seq data. <b>c<\/b>, Proportions of detected AS events with in-frame or out-of-frame properties. For in-frame AS events, the length of included exon is divisible by 3. It is not the case for out-of-frame AS events which hence result in a frameshift. <b>d<\/b>, The same analysis as in <b>b<\/b> but performed based on frame-preserving isoform events only. <b>e<\/b>, Percentage of MS-identified splicing sites as a function of transcriptional coverage (reads per million, RPM). Three groups of splicing sites are displayed\u2014constitutive (present in all isoforms of a specific gene), exclusion and inclusion splice sites. For more information, see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">8<\/a>. <b>f<\/b>, The same as <b>e<\/b>, but by individual proteases used in this study or all combined (Total). <b>g<\/b>, Splice junction proteomic coverage achieved over all protease combinations. The top two combinations are displayed for 2\u20135 proteases. Only splice junctions with transcriptomics coverage of more than 1 RPM are included in this analysis. <b>h<\/b>, ROC curve of a binary XGBoost<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\"99 title=\"Chen, T. &#038; Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785\u2013794 (ACM, 2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR68\" id=\"ref-link-section-d3926824e1239\">68<\/a><\/sup> classifier trained to predict whether AS events are detected or not detected on the proteomics level. <b>i<\/b>, Features ranked by their importance for the XGBoost classifier. The bars and whiskers demonstrate mean and 1\u2009s.d. accordingly. The visualized values were calculated over 100 random shuffles for each parameter. <b>j<\/b>, Proteomics detection rate as a function of percent spliced-in (PSI) value defined by RNA-seq data. AUC, area under the curve.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x\/figures\/5\" data-track-dest=\"link:Figure5 Full size image\" aria-label=\"Reference 10\"00 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>Several factors inherently limit the detection of transcript isoforms at the protein level. These include (1) relatively low transcript abundance arising from reduced levels of gene expression; (2) transcript turnover due to nonsense-mediated mRNA decay (NMD), triggered by premature termination codons introduced by frame-shifting alternative splicing events<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"11 title=\"Lewis, B. P., Green, R. E. &#038; Brenner, S. E. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl Acad. Sci. USA 100, 189\u2013192 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR66\" id=\"ref-link-section-d3926824e1261\">66<\/a><\/sup> and other turnover processes; and (3) reduced levels of splicing, as measured using the metric PSI. Exemplifying these limitations, intron retention events, which often result in nuclear retention of transcripts or trigger NMD if the retained intron does not prevent transcript export<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"22 title=\"Braunschweig, U. et al. Widespread intron retention in mammals functionally tunes transcriptomes. Genome Res. 24, 1774\u20131786 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR67\" id=\"ref-link-section-d3926824e1265\">67<\/a><\/sup>, are the most rarely detected at the protein level (that is, only 9 of 105). Furthermore, the rate of detection at the proteomics level gradually increases as the corresponding transcript levels for cassette alternative exons increase (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5b<\/a>). Moreover, most of the events detected at the proteomics level derive from frame-preserving (that is, in-frame) alternative isoforms (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5c<\/a>). Considering only frame-preserving alternative splicing events in relatively abundant transcripts (that is, \u22657 log<sub>2<\/sub> RPKM), we observe 64% of alternative spliced events at the protein level (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5d<\/a>).<\/p>\n<p>To estimate the possible upper bound detection rates for alternative splicing events at the proteomics level, we compared relative detection rates for alternatively spliced and constitutively spliced junctions in the same RNA transcripts, where constitutively spliced exon\u2013exon junctions are defined as those present in all isoforms of a gene. Importantly, detection rates for constitutive and alternative exon\u2013exon junctions were comparable over a range of transcript levels, in both cases plateauing at approximately 40% of total junctions detected at the highest levels of transcript abundance (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5e<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">9a\u2013f<\/a>). Consistent with these results, the maximum detection levels require combined data from all six proteases, since each enzyme alone resulted in substantially lower detection levels (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5f<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">9g\u2013i<\/a>). Additionally, the analysis of all protease combinations shows that nonarginine and nonlysine directed proteases (GluC, AspN and Chymotrypsin) are highly complementary to trypsin in terms of splice site coverage (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5g<\/a>).<\/p>\n<p>Finally, to further evaluate factors contributing to the detection of spliced isoforms at the proteomics level, we trained a machine learning binary classifier<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"33 title=\"Chen, T. &#038; Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785\u2013794 (ACM, 2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR68\" id=\"ref-link-section-d3926824e1303\">68<\/a><\/sup>. Specifically, we classified cassette exon skipping events detected in both proteomics and transcriptomics data versus those events detected solely in the transcriptome. After training on the following properties\u2014transcript abundance, PSI value, exon length, protein coding sequence length, frame-preserving status and a minimum theoretical peptide coverage between isoforms for each studied protease\u2014we evaluated performance using sevenfold cross-validation. This classifier results in 0.83 area under the receiver operating characteristic (ROC) curve (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5h<\/a>), which is better than random performance. We next used the permutation importance<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"44 title=\"Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825\u20132830 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR69\" id=\"ref-link-section-d3926824e1310\">69<\/a><\/sup> to evaluate the importance of each property and to establish the most important ones for influencing proteomic detection of alternative splicing events. The top three most important parameters are transcript abundance, PSI and frame status (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5i<\/a>), consistent with the results in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5b\u2013d<\/a>.<\/p>\n<p>The PSI ratio reflects the percentage of the total transcript abundance that results in exon inclusion. Since the exon-included isoform contains two junctions for proteomic detection, while the excluded-exon form only contains one, in the case of equally abundant isoforms, exon-inclusion events have double the probability of detection. This situation would result in an optimal PSI for proteomic detection of 33%. This is confirmed in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5j<\/a>, where the highest proteomics detection rate for exon exclusion is close to 30%. Note that for extreme PSI values, for example, >0.9, the abundance of the spliced-in isoform is tenfold higher than the splice-out version. This phenomenon likely reduces the overall protein abundance of one isoform, adding to the challenge of its detection.<\/p>\n<\/div>\n<\/div>\n<div id=\"Sec7-section\" data-title=\"Discussion\">\n<h2 id=\"Sec7\">Discussion<\/h2>\n<div id=\"Sec7-content\">\n<p>Here we used six human cell lines, six parallel protease digestions and three MS\/MS fragmentation methods to generate over 164\u2009million tandem mass spectra from nearly 2,500 nLC\u2013MS\/MS analyses (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig1\">1<\/a>). Our analysis of the combined data identified over 1\u2009million unique peptides from 17,717 genes encoding protein sequences (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig2\">2<\/a>). The median protein sequence coverage was 79.2%, representing 8.29\u2009million unique amino acids. The use of proteases that produce sequences complementary to trypsin was particularly important for detecting 2.18\u2009million unique amino acids, increasing the average protein\u2019s sequence coverage by 19%. We conclude that while the use of multiple enzymes only modestly increases protein identification rates, this strategy substantially increases proteomic coverage. A key result from this work is that proteomic coverage gains often come from protein regions with suboptimal trypsin cleavage sites, for example, membrane-spanning domains and splice junctions. Additionally, with the coverage achieved here, we provide evidence that de novo assembly can be accomplished directly from proteomic data, although currently for a limited subset of highly expressed proteins.<\/p>\n<p>We developed informatics tools to allow global detection of nonsynonymous mutations and alternative splicing. Our analysis provides evidence that approximately 73% of nonsynonymous SNPs (that is, SAPs) are translated and present in the proteome. To our knowledge, this is the first proteogenomic study of SAP variants with such depth. This resource now provides a framework to directly study allele-specific expression and address fundamental questions of how mutations impact protein expression and stability<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"55 title=\"Cleary, S. &#038; Seoighe, C. Perspectives on allele-specific expression. Annu. Rev. Biomed. Data Sci. 4, 101\u2013122 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR70\" id=\"ref-link-section-d3926824e1344\">70<\/a><\/sup>. Furthermore, our catalog of expressed SAPs, and the appropriate enzymes and dissociation methods needed to detect them, offers the ability to globally monitor SAPs in both basic and clinical contexts. We note that the ability to detect SAPs in clinical samples could raise privacy concerns<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"66 title=\"Mann, S. P., Treit, P. V., Geyer, P. E., Omenn, G. S. &#038; Mann, M. Ethical principles, constraints, and opportunities in clinical proteomics. Mol. Cell. Proteomics 20, 100046 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR71\" id=\"ref-link-section-d3926824e1348\">71<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"77 title=\"Fierro-Monti, I., Vizcaino, J. A., Choudhary, J. S. &#038; Wright, J. C. Identifying individuals using proteomics: are we there yet? Front. Mol. Biosci. 9, 1062031 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR72\" id=\"ref-link-section-d3926824e1351\">72<\/a><\/sup>.<\/p>\n<p>Alternative splicing, which is pervasive at the transcript level, was previously largely undetected at the proteomics level due to the low degree of peptide coverage in most shotgun MS experiments<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"88 title=\"Reixachs-Sol\u00e9, M. &#038; Eyras, E. Uncovering the impacts of alternative splicing on the proteome with current omics techniques. Wiley Interdiscip. Rev. RNA 13, e1707 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR73\" id=\"ref-link-section-d3926824e1358\">73<\/a><\/sup>. The failure of proteomics to detect and monitor these events is generally accepted; indeed, it is common practice to report protein isoforms as groups<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\"99 title=\"Nesvizhskii, A. I., Keller, A., Kolker, E. &#038; Aebersold, R. A statistical model for identifying proteins by tandem mass spectrometry. Anal. Chem. 75, 4646\u20134658 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR74\" id=\"ref-link-section-d3926824e1362\">74<\/a><\/sup>. This shortcoming has limited not only our ability to differentiate protein isoforms but also our knowledge of how splicing impacts the proteome. Here we provide evidence that over half (about 64%) of the frame-preserving splicing events of relatively highly expressed genes detected by transcriptomics are indeed translated and present at the protein level (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5d<\/a>), and 22% are detected across the entire expression range (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5c<\/a>). Given the highly dynamic nature of protein expression and the challenges of detecting differentially expressed splice variants, we expect these numbers to be underestimates, as evidenced by the lack of full detection of constitutively spliced exon\u2013exon junctions even at the highest levels of peptide coverage (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5e<\/a>).<\/p>\n<p>Our study established a compendium of ~25,000 peptides which provide proteomics evidence for ~5,000 splice events. This detection was enabled using multiple proteases (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5f,g<\/a>). While trypsin digestion generates peptides of a preferred length and behavior for mass spectrometric detection, it also limits the ability to detect splice junctions. Splice site sequences are inherently biased for lysine codons such that trypsin digestion results in an under-representation of junction-spanning peptides (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">7b<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"00 title=\"Wang, X. et al. Detection of proteome diversity resulted from alternative splicing is limited by Trypsin cleavage specificity. Mol. Cell. Proteomics 17, 422\u2013430 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR65\" id=\"ref-link-section-d3926824e1384\">65<\/a><\/sup>. The use of alternative proteases, however, generates a substantial increase in peptides spanning these junctions, approximately doubling their number for MS detection (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5f<\/a>). We further confirm additional features of alternative splicing events that limit their detection at the proteomics level. These include intron retention, NMD (which may be triggered by intron retention or other frame-shifting events), and low or high PSI range alternative splicing events (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig5\">5j<\/a>), which may arise because of splicing regulation or transcript turnover. Our results are largely consistent with the findings of previous ribosome profiling studies, providing evidence that the majority of alternatively spliced junctions overlapping coding sequence in stably expressed transcripts are translated<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"11 title=\"Weatheritt, R. J., Sterne-Weiler, T. &#038; Blencowe, B. J. The ribosome-engaged landscape of alternative splicing. Nat. Struct. Mol. Biol. 23, 1117\u20131123 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR75\" id=\"ref-link-section-d3926824e1395\">75<\/a><\/sup>. The results of the present study provide direct evidence that alternative splicing is widespread at the protein level, refuting conclusions of previous studies on MS data with limited coverage generated using trypsin alone<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"22 title=\"Tress, M. L., Abascal, F. &#038; Valencia, A. Alternative splicing may not be the key to proteome complexity. Trends Biochem. Sci. 42, 98\u2013110 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR63\" id=\"ref-link-section-d3926824e1399\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"33 title=\"Blencowe, B. J. The relationship between alternative splicing and proteomic complexity. Trends Biochem. Sci. 42, 407\u2013408 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR64\" id=\"ref-link-section-d3926824e1402\">64<\/a><\/sup>.<\/p>\n<p>Owing to its scope, depth and coverage, the dataset reported in this study represents a resource to drive future work on the human proteome. To make this peptide catalog accessible, we have created an online resource\u2014deep-sequencing.app. This resource has a gene-centric design, such that one can query any gene and examine the corresponding peptides, SAPs and splicing junctions detected. Beyond providing detailed knowledge of selected genes and their proteoforms, these data could be similarly useful for MS analyses by targeted proteomics and for large-scale machine learning endeavors<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"44 title=\"Cox, J. Prediction of peptide mass spectral libraries with machine learning. Nat. Biotechnol. 41, 33\u201343 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR76\" id=\"ref-link-section-d3926824e1410\">76<\/a><\/sup>. For targeted work, our resource provides a global-scale proteomics database of mutations and splice junctions, and the specific peptides that enable their monitoring. For machine learning, it offers over 12\u2009million PSMs from the use of multiple proteases and MS\/MS dissociation methods. These data resources will thus enable new insights into unstudied portions of the proteome, potentially offering improved prediction of parameters including peptide detectability, dissociation behavior and chromatographic retention. Finally, these data are expected to facilitate prioritization of SAPs and protein isoforms for future functional studies.<\/p>\n<\/div>\n<\/div>\n<div id=\"Sec8-section\" data-title=\"Methods\">\n<h2 id=\"Sec8\">Methods<\/h2>\n<div id=\"Sec8-content\">\n<h3 id=\"Sec9\">Cell culture and lysis<\/h3>\n<p>HeLa S3 cells (CCL-22; ATCC) were grown at 37\u2009\u00b0C with 5% CO<sub>2<\/sub> in F-12K medium (ATCC) supplemented with 10% FBS and antibiotics. HUVEC cells (CC-2517; Lonza) were grown at 37\u2009\u00b0C with 5% CO<sub>2<\/sub> in Endothelial Growth Media (EGM) supplemented with EGM Complete Media (Lonza) and antibiotics. HepG2 cells (HB-8065; ATCC) were grown at 37\u2009\u00b0C with 5% CO<sub>2<\/sub> in EMEM (ATCC) supplemented with 10% FBS and antibiotics. K562 cells (CCL-243; ATCC) were grown at 37\u2009\u00b0C with 5% CO<sub>2<\/sub> in IMDM (ATCC) supplemented with 10% FBS and antibiotics. GM12878 cells (GM12878 K Order 104598; Coriell Institute for Medical Research) were supplemented with 15% FBS and RPMI-1640 medium (Sigma Aldrich). hESC-1 cells were prepared according to previously published protocols<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"55 title=\"Phanstiel, D. H. et al. Proteomic and phosphoproteomic comparison of human ES and iPS cells. Nat. Methods 8, 821\u2013827 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR77\" id=\"ref-link-section-d3926824e1434\">77<\/a><\/sup>. Cells were collected at >70% confluency through centrifugation at 300<i>g<\/i> for 5\u2009min at 4\u2009\u00b0C. The supernatant was removed, and cells were washed with PBS and centrifuged at 300<i>g<\/i> for 5\u2009min at 4\u2009\u00b0C. The resulting pellet was stored at \u221280\u2009\u00b0C. Cell pellets were resuspended in lysis buffer containing 8\u2009M urea, 50\u2009mM Tris (pH 8), 5\u2009mM CaCl<sub>2<\/sub>, 30\u2009mM NaCl, and protease (Roche) and phosphatase (Roche) inhibitor tablets. The pellet was lysed by four rounds of sonication at 4\u2009\u00b0C, alternating between 20\u2009s on and 20\u2009s off. Lysate protein concentration was measured by Bicinchoninic acid Protein Assay Kit (Thermo Pierce).<\/p>\n<h3 id=\"Sec10\">Digestion<\/h3>\n<p>Protein was reduced by addition of 5\u2009mM dithiothreitol and incubation for 45\u2009min at 55\u2009\u00b0C. The mixture was cooled to room temperature, followed by alkylation of free thiols by addition of 15\u2009mM iodoacetamide in the dark for 30\u2009min. The alkylation reaction was quenched with 5\u2009mM dithiothreitol. For tryptic digestion, a 1-mg protein aliquot was digested overnight with 20\u2009\u00b5g of trypsin (Promega) at room temperature in 1\u2009M urea. For LysC digestion, a 1-mg protein aliquot was digested overnight with 20\u2009\u00b5g of LysC (Wako) at room temperature in 4\u2009M urea. For LysN digestion, a 1-mg protein aliquot was digested for 4\u2009h with 20\u2009\u00b5g of LysN (Thermo Pierce) at 37\u2009\u00b0C in 4\u2009M urea. For GluC digestion, a 1-mg protein aliquot was digested overnight with 25\u2009\u00b5g of GluC (Roche) at room temperature in 0.5\u2009M urea. For chymotrypsin digestion, a 1-mg protein aliquot was digested overnight with 12.5\u2009\u00b5g of chymotrypsin resuspended in 0.2% formic acid (Promega) in 1\u2009M urea. For digestion with AspN, a 1-mg protein aliquot was incubated with 6\u2009\u00b5g of AspN (Roche) at room temperature overnight. Each digest was quenched by the addition of Trifluoroacetic acid and desalted on a 100-mg C<sub>18<\/sub> Sep-Pak cartridge (Waters).<\/p>\n<h3 id=\"Sec11\">Fractionation<\/h3>\n<p>High-pH reversed-phase (RP) fractionation was performed using either a Surveyor LC quaternary pump or a Dionex UltiMate 3000. Fractionation was performed at a flow rate of 1.0\u2009ml\u2009min<sup>\u22121<\/sup> using a 5-\u00b5m column packed with C18 particles (250\u2009\u00d7\u20094.6\u2009mm<sup>2<\/sup>, Phenomenex) on a Surveyor LC quaternary pump. Samples were resuspended in buffer A and separated using the following gradient: 0\u20132\u2009min, 100% buffer A, and separated by increasing buffer B over a 60-min gradient at a flow rate of 0.8\u2009ml\u2009min<sup>\u22121<\/sup> (buffer A: 20\u2009mM ammonium formate, pH 10; buffer B: 20\u2009mM ammonium formate, pH 10, in 80% ACN). Flow rate was increased to 1.5\u2009ml\u2009min<sup>\u22121<\/sup> during equilibration. Fractionation was performed at a flow rate of 0.45\u2009ml\u2009min<sup>\u22121<\/sup> using a 1.7-\u00b5m column packed with BEH particles (50\u2009\u00d7\u20091\u2009mm<sup>2<\/sup>, Waters) on a Dionex Ultimate 3000 pump (Thermo). Samples were resuspended in buffer A and separated by increasing buffer B over a 45-min gradient at a flow rate of 0.45\u2009ml\u2009min<sup>\u22121<\/sup> (buffer A: 20\u2009mM ammonium bicarbonate; buffer B: 20\u2009mM ammonium bicarbonate in 80% ACN). Trypsin-digested H1-hESC cells were first fractionated via strong cation exchange fractionation. Peptides were dissolved in 400\u2009\u03bcl of strong cation exchange buffer A (5\u2009mM KH<sub>2<\/sub>PO<sub>4<\/sub> and 30% acetonitrile (ACN); pH 2.65) and injected onto a polysulfoethylaspartamide column (9.4\u2009\u00d7\u2009200\u2009mm<sup>2<\/sup>; PolyLC) attached to a Surveyor LC quaternary pump (Thermo Electron) operating at 3\u2009ml\u2009min<sup>\u22121<\/sup>. Fractions were collected every 2\u2009min starting at 10\u2009min into the following gradient: 0\u20132\u2009min at 100% buffer A, 2\u20135\u2009min at 0\u201315% buffer B (5\u2009mM KH<sub>2<\/sub>PO<sub>4<\/sub>, 30% ACN and 350\u2009mM KCl (pH 2.65)) and 5\u201335\u2009min at 15\u2013100% buffer B. Buffer B was held at 100% for 10\u2009min. Fractions were collected from 8\u201312\u2009min, 12\u201314\u2009min, 14\u201316\u2009min and 16\u201325\u2009min. Each of these four strong cation-exchange fractions was further fractionated by high-pH RP fractionation on a Surveyor LC quaternary pump, as described above.<\/p>\n<h3 id=\"Sec12\">LC\u2013MS\/MS<\/h3>\n<p>Samples were resuspended in 0.2% formic acid and separated via RP chromatography. Peptides were injected onto an RP column prepared in-house. Approximately 35-cm lengths of 75-\u03bcm to 360-\u03bcm inner\/outer diameter bare-fused silica capillaries, each with a laser pulled electrospray tip, were packed with 1.7-\u03bcm diameter, 130-\u00c5 pore size, Bridged Ethylene Hybrid C18 particles (Waters). Columns were fitted onto either a nanoAcquity (Waters) or a Dionex (Thermo) and heated to 60\u2009\u00b0C using a home-built column heater. Mobile phase buffer A was composed of water and 0.2% formic acid. Mobile phase B was composed of 70% ACN, 0.2% formic acid and 5% dimethylsulfoxide. Each sample was separated over a 100-min gradient, including time for column re-equilibration. Flow rates were set at 300\u2013350\u2009\u00b5l\u2009min<sup>\u22121<\/sup>.<\/p>\n<p>Peptide cations were converted to gas-phase ions by electrospray ionization and analyzed on a Thermo Orbitrap Fusion or a Thermo Orbitrap Lumos (Thermo Fisher Scientific). All fractions were analyzed using HCD. Precursor scans were performed from 300 to 1,500\u2009<i>m<\/i>\/<i>z<\/i> at either 60,000 or 120,000 resolution (at 400\u2009<i>m<\/i>\/<i>z<\/i>). A 5\u2009\u00d7\u200910<sup>5<\/sup> ion count target was used on the Orbitrap Fusion; a 1\u2009\u00d7\u200910<sup>6<\/sup> ion count target was used on the Orbitrap Lumos. Precursors selected for MS\/MS were isolated at 0.7\u2009Thomson (Th) with the quadrupole, fragmented by HCD with a normalized collision energy of 30 and analyzed using turbo scan in the ion trap. For some analyses, precursors above 500\u2009<i>m<\/i>\/<i>z<\/i> were fragmented by HCD using the described conditions, while precursors below 500\u2009<i>m<\/i>\/<i>z<\/i> were fragmented by CAD with a normalized collision energy of 30. The maximum injection time for MS\/MS analysis was normally set at either 25 or 35\u2009ms, but was set higher for some analyses, with an ion count target of 10<sup>4<\/sup>. Precursors with a charge state of 2\u20138 were sampled for MS\/MS. Dynamic exclusion time was set at 15\u2009s, with a 10-p.p.m. tolerance around the selected precursor and its isotopes. Monoisotopic precursor selection was turned on. Analyses were performed in top speed mode with either 3- or 5-s cycles.<\/p>\n<p>LysC, LysN, AspN, GluC and chymotrypsin fractions were analyzed using ETD. To maximize identifications, precursor scans were performed from 200 to 800\u2009<i>m<\/i>\/<i>z<\/i> at either 60,000 or 120,000 resolution (at 400\u2009<i>m<\/i>\/<i>z<\/i>). A 5\u2009\u00d7\u200910<sup>5<\/sup> ion count target was used on the Orbitrap Fusion; a 1\u2009\u00d7\u200910<sup>6<\/sup> ion count target was used on the Orbitrap Lumos. Precursors selected for MS\/MS were isolated at 0.7\u2009Th with the quadrupole. Precursors were fragmented by ETD using custom reaction times; +3: 40\u2009ms, +4: 22\u2009ms, +5: 14\u2009ms, +6: 10\u2009ms, +2: 70\u2009ms. Electron-transfer\/higher-energy collision dissociation (EThcD) was performed on +2 precursors, at 25% supplemental activation collision energy. Precursor ions were selected for fragmentation based on charge state in the following order: +3, +4, +5, +6, +2. Fragment ions were analyzed in the ion trap. Dynamic exclusion time was set at 15\u2009s, with a 10-p.p.m. tolerance around the selected precursor and its isotopes. Monoisotopic precursor selection was turned on. Analyses were performed in top speed mode with either 3- or 5-s cycles.<\/p>\n<p>Fractionated peptides from chymotrypsin-catalyzed proteolysis were analyzed using CAD. Precursor scans were performed from 300 to 1,500\u2009<i>m<\/i>\/<i>z<\/i> at either 60,000 or 120,000 resolution (at 400\u2009<i>m<\/i>\/<i>z<\/i>). A 5\u2009\u00d7\u200910<sup>5<\/sup> ion count target was used on the Orbitrap Fusion; a 1\u2009\u00d7\u200910<sup>6<\/sup> ion count target was used on the Orbitrap Lumos. Precursors selected for MS\/MS were isolated at 0.7\u2009Th with the quadrupole, fragmented by CAD with a normalized collision energy of 30 and analyzed using turbo scan in the ion trap. The maximum injection time for MS\/MS analysis was normally set at either 25 or 35\u2009ms, but was set higher for some analyses, with an ion count target of 10<sup>4<\/sup>. Precursors with a charge state of 2\u20138 were sampled for MS\/MS. Dynamic exclusion time was set at 15\u2009s, with a 10-p.p.m. tolerance around the selected precursor and its isotopes. Monoisotopic precursor selection was turned on. Analyses were performed in top speed mode with either 3- or 5-s cycles.<\/p>\n<h3 id=\"Sec13\">Protein identification<\/h3>\n<p>The 2,491 raw files were simultaneously analyzed by database search to identify proteins and peptides using the Andromeda search engine<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"66 title=\"Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 10, 1794\u20131805 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR50\" id=\"ref-link-section-d3926824e1588\">50<\/a><\/sup> inside MaxQuant (v.1.5.7.5)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"77 title=\"Tyanova, S., Temu, T. &#038; Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301\u20132319 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR51\" id=\"ref-link-section-d3926824e1592\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"88 title=\"Cox, J. &#038; Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367\u20131372 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR52\" id=\"ref-link-section-d3926824e1595\">52<\/a><\/sup>. Searches were performed against the following protein sequence databases: UniProt canonical (release 2017_02; UP000005640_9606), UniProt isoform (UP000005640_9606_additional), Ensembl canonical (release 86; GRCh38.pep.all), Ensembl isoform (GRCh38.pep.abinitio). Searches used the default precursor mass tolerances (20\u2009p.p.m. first search and 4.5\u2009p.p.m. main search) and a product mass tolerance of 0.35\u2009Da. The in silico digest was set to specific cleavage and a maximum of two missed cleavages for all proteases, except chymotrypsin, where up to four missed cleavages were allowed. Parameters for each protease (LysC, LysN, chymotrypsin, AspN, GluC and trypsin) were set in groups. The fixed modifications specified were carbamidomethylation of cysteine residues and variable modifications were oxidation of methionine and acetylation of protein N terminus. PSMs and protein groups were both sequentially filtered to a 1% FDR over the whole dataset, resulting in detection of 12,151,708 forward PSMs (7,469 reverse PSMs; 0.06% FDR), 1,119,510 forward peptides (4,486 reverse peptides; 0.4% FDR) and 17,717 proteins (176 reverse proteins; 0.99% FDR). Note, PSMs that match only to protein groups that do not pass the protein-level FDR filtering are not present in the output tables, resulting in lower than the initially specified 1% PSM FDR (0.06%). Protein groups were filtered for \u2018Only identified by site\u2019, \u2018Reverse\u2019 and \u2018Contaminant\u2019. Gene locus information was mapped to majority protein identifications with Human Gene Nomenclature Database identifications from UniProt and Ensembl BioMart.<\/p>\n<h3 id=\"Sec14\">Protein coverage calculation<\/h3>\n<p>Sequence coverage for various subsets of runs was calculated with a custom C# application. For each row in the MaxQuant proteinGroups.txt output, all associated peptides were retrieved from peptides.txt. For each peptide, it was first determined whether it was found in this subset of runs, using the experiment-based PSM count columns in peptides.txt. If so, the sequence was searched for all occurrences in the sequence of the first major protein of the protein group, ignoring enzyme specificity. A list of unique amino acid residues observed was maintained across all peptides, and at the end the number of residues in the list was divided by the total number of residues in the major protein sequence. Whenever possible, sequence coverages obtained in this manner were compared with those computed by MaxQuant and included in proteinGroups.txt, and the agreement was excellent. The console C# code is located at <a href=\"https:\/\/github.com\/cwenger\/cwenger.github.io\/tree\/master\/MaxQuantAnalyzer\">https:\/\/github.com\/cwenger\/cwenger.github.io\/tree\/master\/MaxQuantAnalyzer.<\/a><\/p>\n<h3 id=\"Sec15\">Spectra visualization and annotation<\/h3>\n<p>All presented spectra were annotated and visualized with a web-based Interactive Peptide Spectra Annotator<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\"99 title=\"Brademan, D. R., Riley, N. M., Kwiecien, N. W. &#038; Coon, J. J. Interactive peptide spectral annotator: a versatile web-based tool for proteomic applications. Mol. Cell. Proteomics 18, S193\u2013S201 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR78\" id=\"ref-link-section-d3926824e1621\">78<\/a><\/sup>. Two spectra shown in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig4\">4<\/a> have the following Universal Spectrum Identifiers\u2014mzspec:<a href=\"http:\/\/proteomecentral.proteomexchange.org\/cgi\/GetDataset?ID=PXD024364\">PXD024364<\/a>:20160115_alr_CompleteHumanProteome_HUVEC_chymo_CAD_fr14:scan:50088:CMAVCGSAIPTTAASTPDAVDKY\/2 (left side) and mzspec:<a href=\"http:\/\/proteomecentral.proteomexchange.org\/cgi\/GetDataset?ID=PXD024364\">PXD024364<\/a>:HeLaS3_trypsin_19_140824180249:scan:34854:DPVKLPTTAASTPDAVDK\/2 (right side).<\/p>\n<h3 id=\"Sec16\">De novo proteome assembly<\/h3>\n<p>The PSMs were extracted from the evidence.txt file and filtered by \u2018Potential contaminant\u2019 and \u2018Reverse\u2019. Each PSM was reverse translated into nucleotide sequence with a nondegenerate codon table and written into a FASTA file as input to SOAPdenovo. The SOAPdenovo config file parameters were set to default except for maximal read length to 150. SOAPdenovo-Trans-31mer was run with <i>k<\/i>-mer length 23 (at least 8 amino acids) and minimum contig length 100 (at least 34 amino acids). Scaffolds from the assembly were matched back to the proteome sequences using brute force string matching.<\/p>\n<h3 id=\"Sec17\">RNA-seq data and analysis<\/h3>\n<p>The paired RNA-seq data for HeLa S3\/HUVEC\/HepG2\/K562\/GM12878\/hESC are a part of the ENCODE dataset<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"00 title=\"Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101\u2013108 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR49\" id=\"ref-link-section-d3926824e1661\">49<\/a><\/sup> and were downloaded from SRA (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/sra\/?term=SRP014320\">SRP014320<\/a>). Raw reads were filtered using trimmomatic (v.0.36) using default parameters for paired-end data. Filtered reads were mapped to the human reference genome GRCh38 (Ensemble release 91) using STAR aligner (v.2.5.3a). Further processing\u2014sorting, converting from SAM to BAM format and indexing\u2014was done using SAMtools (v.1.6).<\/p>\n<p>To compare proteomics and transcriptomics data (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#Fig3\">3b<\/a>), raw reads per gene were counted in Perseus (v.1.6.14.0)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"11 title=\"Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731\u2013740 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR79\" id=\"ref-link-section-d3926824e1678\">79<\/a><\/sup>, and rows were logarithmized with pseudocount 1 and normalized by <i>z<\/i>-scoring for each experiment independently. Intensity-based absolute quantification values from the standard proteomics search were summed for each cell line (through fractions, fragmentation methods and proteases), logarithmized, <i>z<\/i>-scored for each cell line independently and imputed by replacing missing values from the normal distribution (width\u2009=\u20090.3, down shift\u2009=\u20091.8), separated for each cell line. After joining the two tables, genes with both proteomics and transcriptomics data were used for the principal component analysis plot. Component 1 (accounting for 27.8% of the variance) was not used because it explains the difference between proteomics and transcriptomics data.<\/p>\n<h3 id=\"Sec18\">Mutation analysis\u2014transcriptomics<\/h3>\n<p>Nonsynonymous mutations were extracted from RNA-seq data of all studied cell lines using the \u2018Variation extraction\u2019 tool in MaxQuant (Tools\/Variation extraction; Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">4<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"22 title=\"Sinitcyn, P., Gerwien, M. &#038; Cox, J. MaxQuant module for the identification of genomic variants propagated into peptides. Methods Mol. Biol. 2456, 339\u2013347 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR59\" id=\"ref-link-section-d3926824e1699\">59<\/a><\/sup>. This tool reports in a fasta file all nonsynonymous mutations that pass a list of filters: total reads depth should more than or equal to 10; number of reads with mutations should be more than or equal to 5; the frequency of reads with mutations to overall depth should be more than or equal to 15%; the base quality, as well as the mapping quality, should be more than or equal to 13, which automatically filters out multi-mapped reads. The \u2018Variation extraction\u2019 tool generates, amongst many output files, a <i>protein.fa<\/i> file with all annotated \u2018protein_coding\u2019 sequences as well as information about nonsynonymous mutations in a header for each sequence.<\/p>\n<h3 id=\"Sec19\">Mutation analysis\u2014proteomics<\/h3>\n<p>To enable MaxQuant to use the specified mutations, one has to add the fasta file into the \u2018Fasta files\u2019 tab (Global Parameters\/Sequences\/Fasta files) and change the \u2018Variation mode\u2019 parameter to \u2018Read from fasta file\u2019<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"33 title=\"Sinitcyn, P., Gerwien, M. &#038; Cox, J. MaxQuant module for the identification of genomic variants propagated into peptides. Methods Mol. Biol. 2456, 339\u2013347 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR59\" id=\"ref-link-section-d3926824e1715\">59<\/a><\/sup>. In the MaxQuant output \u2018peptides.txt\u2019 file an additional column such as \u2018Mutated\u2019 and \u2018Mutation names\u2019 columns will be created. The \u2018Mutated\u2019 column reports \u2018No\u2019 if one peptide comes from the reference proteome (without mutations), \u2018Yes\u2019 if a peptide results from mutation inclusion and \u2018Mixed\u2019 if one can find peptides in the reference as well as mutated proteomes. The \u2018Mutation names\u2019 stands for a list of involved mutations.<\/p>\n<h3 id=\"Sec20\">Splicing analysis\u2014transcriptomics and proteomics<\/h3>\n<p>The analysis of alternative splicing is based on the gene graph structure, where nodes represent the beginnings and the ends of exons, and edges correspond to exon\u2013exon junctions as well as connections within an exon. Each splicing event in this graph is a local subgraph with multiple paths; however, all paths start from the same node and finish on the same downstream node. It is important to point out that one path can consist of several isoforms. The algorithm is adapted from ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"44 title=\"Sammeth, M. Complete alternative splicing events are bubbles in splicing graphs. J. Comput. Biol. 16, 1117\u20131140 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR80\" id=\"ref-link-section-d3926824e1727\">80<\/a><\/sup>. To use the same approach for proteomics, protein coordinates of peptides were converted to genome locations, taking into account the intron\u2013exon structure of genes. The modified version of the algorithm is available as a plugin for Perseus software (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM1\">6<\/a>).<\/p>\n<h3 id=\"Sec21\">Binary classification of alternative splicing events<\/h3>\n<p>The binary classification was conducted with XGBoost python package (v.1.5.0)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"55 title=\"Chen, T. &#038; Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785\u2013794 (ACM, 2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR68\" id=\"ref-link-section-d3926824e1742\">68<\/a><\/sup>. The optimum set of learning parameters has been estimated using a grid search (RandomizedSearchCV function from sklearn package) with the sevenfold cross-validation technique and area under the ROC curve as a performance metric. The selected parameters are listed as follow\u2014learning rate: 0.05; L1 regularization weight: 1.15; L2 regularization weight: 4.0; minimum child weight: 2.0; maximum depth: 3; minimum loss reduction (gamma): 2.0; subsample ratio of columns: 0.3; subsample ratio of the training instances: 0.65; scale positive weight: 4.44.<\/p>\n<h3 id=\"Sec22\">Reporting summary<\/h3>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#MOESM2\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n<\/div>\n<\/div><\/div>\n<div data-enable-entitlement-checks>\n<div id=\"data-availability-section\" data-title=\"Data availability\">\n<h2 id=\"data-availability\">Data availability<\/h2>\n<p>All raw mass spectrometry data files and MaxQuant output from the standard search have been deposited to the ProteomeXchange Consortium (<a href=\"http:\/\/proteomecentral.proteomexchange.org\">http:\/\/proteomecentral.proteomexchange.org<\/a>) via the MassIVE<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\"66 title=\"Wang, M. et al. Assembling the community-scale discoverable human proteome. Cell Syst. 7, 412\u2013421.e5 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01714-x#ref-CR8\" id=\"ref-link-section-d3926824e1853\">8<\/a><\/sup> partner repository with the dataset identifier <a href=\"http:\/\/proteomecentral.proteomexchange.org\/cgi\/GetDataset?ID=PXD024364\">PXD024364<\/a>. Profiled protein and transcript variants are compiled in the following location: <a href=\"https:\/\/deep-sequencing.app\">https:\/\/deep-sequencing.app.<\/a><\/p>\n<\/div>\n<div id=\"code-availability-section\" data-title=\"Code availability\">\n<h2 id=\"code-availability\">Code availability<\/h2>\n<\/div>\n<div id=\"MagazineFulltextArticleBodySuffix\" aria-labelledby=\"Bib1\" data-title=\"References\">\n<h2 id=\"Bib1\">References<\/h2>\n<div data-container-section=\"references\" id=\"Bib1-content\">\n<ol data-track-component=\"outbound reference\">\n<li data-counter=\"1.\">\n<p id=\"ref-CR1\">Richards, A. L. et al. One-hour proteome analysis in yeast. <i>Nat. Protoc.<\/i> <b>10<\/b>, 701\u2013714 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nprot.2015.040\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnprot.2015.040\" aria-label=\"Reference 12\"77 data-doi=\"10.1038\/nprot.2015.040\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2MXmt1Smtrc%3D\" aria-label=\"Reference 12\"88>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=25855955\" aria-label=\"Reference 12\"99>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6434932\" aria-label=\"Reference 13\"00>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 13\"11 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=One-hour%20proteome%20analysis%20in%20yeast&#038;journal=Nat.%20Protoc.&#038;doi=10.1038%2Fnprot.2015.040&#038;volume=10&#038;pages=701-714&#038;publication_year=2015&#038;author=Richards%2CAL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"2.\">\n<p id=\"ref-CR2\">Hebert, A. S. et al. The one hour yeast proteome. <i>Mol. Cell. Proteomics<\/i> <b>13<\/b>, 339\u2013347 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.M113.034769\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.M113.034769\" aria-label=\"Reference 13\"22 data-doi=\"10.1074\/mcp.M113.034769\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXitlSksg%3D%3D\" aria-label=\"Reference 13\"33>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24143002\" aria-label=\"Reference 13\"44>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 13\"55 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20one%20hour%20yeast%20proteome&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.M113.034769&#038;volume=13&#038;pages=339-347&#038;publication_year=2014&#038;author=Hebert%2CAS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"3.\">\n<p id=\"ref-CR3\">Gholami, A. M. et al. Global proteome analysis of the NCI-60 cell line panel. <i>Cell Rep.<\/i> <b>4<\/b>, 609\u2013620 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.celrep.2013.07.018\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.celrep.2013.07.018\" aria-label=\"Reference 13\"66 data-doi=\"10.1016\/j.celrep.2013.07.018\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXht1Gms7fN\" aria-label=\"Reference 13\"77>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23933261\" aria-label=\"Reference 13\"88>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 13\"99 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Global%20proteome%20analysis%20of%20the%20NCI-60%20cell%20line%20panel&#038;journal=Cell%20Rep.&#038;doi=10.1016%2Fj.celrep.2013.07.018&#038;volume=4&#038;pages=609-620&#038;publication_year=2013&#038;author=Gholami%2CAM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"4.\">\n<p id=\"ref-CR4\">Kelstrup, C. D. et al. Performance evaluation of the Q Exactive HF-X for shotgun proteomics. <i>J. Proteome Res.<\/i> <b>17<\/b>, 727\u2013738 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/acs.jproteome.7b00602\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Facs.jproteome.7b00602\" aria-label=\"Reference 2\"0000 data-doi=\"10.1021\/acs.jproteome.7b00602\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXhvVOit7vL\" aria-label=\"Reference 2\"0101>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=29183128\" aria-label=\"Reference 2\"0202>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"0303 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Performance%20evaluation%20of%20the%20Q%20Exactive%20HF-X%20for%20shotgun%20proteomics&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Facs.jproteome.7b00602&#038;volume=17&#038;pages=727-738&#038;publication_year=2018&#038;author=Kelstrup%2CCD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"5.\">\n<p id=\"ref-CR5\">Kim, M. S. et al. A draft map of the human proteome. <i>Nature<\/i> <b>509<\/b>, 575\u2013581 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature13302\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature13302\" aria-label=\"Reference 2\"0404 data-doi=\"10.1038\/nature13302\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXoslCrtrc%3D\" aria-label=\"Reference 2\"0505>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24870542\" aria-label=\"Reference 2\"0606>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4403737\" aria-label=\"Reference 2\"0707>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"0808 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20draft%20map%20of%20the%20human%20proteome&#038;journal=Nature&#038;doi=10.1038%2Fnature13302&#038;volume=509&#038;pages=575-581&#038;publication_year=2014&#038;author=Kim%2CMS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"6.\">\n<p id=\"ref-CR6\">Wilhelm, M. et al. Mass-spectrometry-based draft of the human proteome. <i>Nature<\/i> <b>509<\/b>, 582\u2013587 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature13319\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature13319\" aria-label=\"Reference 2\"0909 data-doi=\"10.1038\/nature13319\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXoslCrt7k%3D\" aria-label=\"Reference 2\"1010>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24870543\" aria-label=\"Reference 2\"1111>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"1212 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Mass-spectrometry-based%20draft%20of%20the%20human%20proteome&#038;journal=Nature&#038;doi=10.1038%2Fnature13319&#038;volume=509&#038;pages=582-587&#038;publication_year=2014&#038;author=Wilhelm%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"7.\">\n<p id=\"ref-CR7\">Adhikari, S. et al. A high-stringency blueprint of the human proteome. <i>Nat. Commun.<\/i> <b>11<\/b>, 5301 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41467-020-19045-9\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41467-020-19045-9\" aria-label=\"Reference 2\"1313 data-doi=\"10.1038\/s41467-020-19045-9\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXitFCks7bO\" aria-label=\"Reference 2\"1414>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33067450\" aria-label=\"Reference 2\"1515>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7568584\" aria-label=\"Reference 2\"1616>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"1717 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20high-stringency%20blueprint%20of%20the%20human%20proteome&#038;journal=Nat.%20Commun.&#038;doi=10.1038%2Fs41467-020-19045-9&#038;volume=11&#038;publication_year=2020&#038;author=Adhikari%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"8.\">\n<p id=\"ref-CR8\">Wang, M. et al. Assembling the community-scale discoverable human proteome. <i>Cell Syst.<\/i> <b>7<\/b>, 412\u2013421.e5 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2018.08.004\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2018.08.004\" aria-label=\"Reference 2\"1818 data-doi=\"10.1016\/j.cels.2018.08.004\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1cXitVSisrfP\" aria-label=\"Reference 2\"1919>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=30172843\" aria-label=\"Reference 2\"2020>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6279426\" aria-label=\"Reference 2\"2121>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"2222 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Assembling%20the%20community-scale%20discoverable%20human%20proteome&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2018.08.004&#038;volume=7&#038;pages=412-421.e5&#038;publication_year=2018&#038;author=Wang%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"9.\">\n<p id=\"ref-CR9\">Frankish, A. et al. GENCODE 2021. <i>Nucleic Acids Res.<\/i> <b>49<\/b>, D916\u2013D923 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/nar\/gkaa1087\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fnar%2Fgkaa1087\" aria-label=\"Reference 2\"2323 data-doi=\"10.1093\/nar\/gkaa1087\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXntlejtbY%3D\" aria-label=\"Reference 2\"2424>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33270111\" aria-label=\"Reference 2\"2525>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"2626 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=GENCODE%202021&#038;journal=Nucleic%20Acids%20Res.&#038;doi=10.1093%2Fnar%2Fgkaa1087&#038;volume=49&#038;pages=D916-D923&#038;publication_year=2021&#038;author=Frankish%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"10.\">\n<p id=\"ref-CR10\">Harrow, J. et al. GENCODE: the reference human genome annotation for the ENCODE project. <i>Genome Res.<\/i> <b>22<\/b>, 1760\u20131774 (2012).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1101\/gr.135350.111\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1101%2Fgr.135350.111\" aria-label=\"Reference 2\"2727 data-doi=\"10.1101\/gr.135350.111\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC38XhtlentLvN\" aria-label=\"Reference 2\"2828>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=22955987\" aria-label=\"Reference 2\"2929>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3431492\" aria-label=\"Reference 2\"3030>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"3131 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=GENCODE%3A%20the%20reference%20human%20genome%20annotation%20for%20the%20ENCODE%20project&#038;journal=Genome%20Res.&#038;doi=10.1101%2Fgr.135350.111&#038;volume=22&#038;pages=1760-1774&#038;publication_year=2012&#038;author=Harrow%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"11.\">\n<p id=\"ref-CR11\">Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. <i>Nature<\/i> <b>456<\/b>, 470\u2013476 (2008).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature07509\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature07509\" aria-label=\"Reference 2\"3232 data-doi=\"10.1038\/nature07509\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1cXhsVegtbfL\" aria-label=\"Reference 2\"3333>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=18978772\" aria-label=\"Reference 2\"3434>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC2593745\" aria-label=\"Reference 2\"3535>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"3636 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Alternative%20isoform%20regulation%20in%20human%20tissue%20transcriptomes&#038;journal=Nature&#038;doi=10.1038%2Fnature07509&#038;volume=456&#038;pages=470-476&#038;publication_year=2008&#038;author=Wang%2CET\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"12.\">\n<p id=\"ref-CR12\">Pan, Q., Shai, O., Lee, L. J., Frey, B. J. &#038; Blencowe, B. J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. <i>Nat. Genet.<\/i> <b>40<\/b>, 1413\u20131415 (2008).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/ng.259\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fng.259\" aria-label=\"Reference 2\"3737 data-doi=\"10.1038\/ng.259\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1cXhsVWhu7vP\" aria-label=\"Reference 2\"3838>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=18978789\" aria-label=\"Reference 2\"3939>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"4040 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Deep%20surveying%20of%20alternative%20splicing%20complexity%20in%20the%20human%20transcriptome%20by%20high-throughput%20sequencing&#038;journal=Nat.%20Genet.&#038;doi=10.1038%2Fng.259&#038;volume=40&#038;pages=1413-1415&#038;publication_year=2008&#038;author=Pan%2CQ&#038;author=Shai%2CO&#038;author=Lee%2CLJ&#038;author=Frey%2CBJ&#038;author=Blencowe%2CBJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"13.\">\n<p id=\"ref-CR13\">Joglekar, A. et al. A spatially resolved brain region- and cell type-specific isoform atlas of the postnatal mouse brain. <i>Nat. Commun.<\/i> <b>12<\/b>, 463 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41467-020-20343-5\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41467-020-20343-5\" aria-label=\"Reference 2\"4141 data-doi=\"10.1038\/s41467-020-20343-5\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhvFaqsro%3D\" aria-label=\"Reference 2\"4242>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33469025\" aria-label=\"Reference 2\"4343>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7815907\" aria-label=\"Reference 2\"4444>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"4545 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20spatially%20resolved%20brain%20region-%20and%20cell%20type-specific%20isoform%20atlas%20of%20the%20postnatal%20mouse%20brain&#038;journal=Nat.%20Commun.&#038;doi=10.1038%2Fs41467-020-20343-5&#038;volume=12&#038;publication_year=2021&#038;author=Joglekar%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"14.\">\n<p id=\"ref-CR14\">Hardwick, S. A. et al. Single-nuclei isoform RNA sequencing unlocks barcoded exon connectivity in frozen brain tissue. <i>Nat. Biotechnol.<\/i> <b>40<\/b>, 1082\u20131092 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41587-022-01231-3\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41587-022-01231-3\" aria-label=\"Reference 2\"4646 data-doi=\"10.1038\/s41587-022-01231-3\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XmtlCitLg%3D\" aria-label=\"Reference 2\"4747>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35256815\" aria-label=\"Reference 2\"4848>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC9287170\" aria-label=\"Reference 2\"4949>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5050 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Single-nuclei%20isoform%20RNA%20sequencing%20unlocks%20barcoded%20exon%20connectivity%20in%20frozen%20brain%20tissue&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fs41587-022-01231-3&#038;volume=40&#038;pages=1082-1092&#038;publication_year=2022&#038;author=Hardwick%2CSA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"15.\">\n<p id=\"ref-CR15\">Myers, R. M. et al. A user\u2019s guide to the encyclopedia of DNA elements (ENCODE). The ENCODE Project Consortium. <i>PLoS Biol.<\/i> <b>9<\/b>, e1001046 (2011).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1371\/journal.pbio.1001046\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1371%2Fjournal.pbio.1001046\" aria-label=\"Reference 2\"5151 data-doi=\"10.1371\/journal.pbio.1001046\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3MXlt12hsLw%3D\" aria-label=\"Reference 2\"5252>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5353 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20user%E2%80%99s%20guide%20to%20the%20encyclopedia%20of%20DNA%20elements%20%28ENCODE%29.%20The%20ENCODE%20Project%20Consortium&#038;journal=PLoS%20Biol.&#038;doi=10.1371%2Fjournal.pbio.1001046&#038;volume=9&#038;publication_year=2011&#038;author=Myers%2CRM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"16.\">\n<p id=\"ref-CR16\">Altshuler, D. L. et al. A map of human genome variation from population-scale sequencing. <i>Nature<\/i> <b>467<\/b>, 1061\u20131073 (2010).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature09534\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature09534\" aria-label=\"Reference 2\"5454 data-doi=\"10.1038\/nature09534\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=20981092\" aria-label=\"Reference 2\"5555>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5656 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20map%20of%20human%20genome%20variation%20from%20population-scale%20sequencing&#038;journal=Nature&#038;doi=10.1038%2Fnature09534&#038;volume=467&#038;pages=1061-1073&#038;publication_year=2010&#038;author=Altshuler%2CDL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"17.\">\n<p id=\"ref-CR17\">Zubarev, R. A. The challenge of the proteome dynamic range and its implications for in-depth proteomics. <i>Proteomics<\/i> <b>13<\/b>, 723\u2013726 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1002\/pmic.201200451\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1002%2Fpmic.201200451\" aria-label=\"Reference 2\"5757 data-doi=\"10.1002\/pmic.201200451\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXjs1Cjsr4%3D\" aria-label=\"Reference 2\"5858>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23307342\" aria-label=\"Reference 2\"5959>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"6060 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20challenge%20of%20the%20proteome%20dynamic%20range%20and%20its%20implications%20for%20in-depth%20proteomics&#038;journal=Proteomics&#038;doi=10.1002%2Fpmic.201200451&#038;volume=13&#038;pages=723-726&#038;publication_year=2013&#038;author=Zubarev%2CRA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"18.\">\n<p id=\"ref-CR18\">Sheynkman, G. M., Shortreed, M. R., Frey, B. L., Scalf, M. &#038; Smith, L. M. Large-scale mass spectrometric detection of variant peptides resulting from nonsynonymous nucleotide differences. <i>J. Proteome Res.<\/i> <b>13<\/b>, 228\u2013240 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/pr4009207\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fpr4009207\" aria-label=\"Reference 2\"6161 data-doi=\"10.1021\/pr4009207\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXhslSmtrnF\" aria-label=\"Reference 2\"6262>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24175627\" aria-label=\"Reference 2\"6363>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"6464 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Large-scale%20mass%20spectrometric%20detection%20of%20variant%20peptides%20resulting%20from%20nonsynonymous%20nucleotide%20differences&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Fpr4009207&#038;volume=13&#038;pages=228-240&#038;publication_year=2014&#038;author=Sheynkman%2CGM&#038;author=Shortreed%2CMR&#038;author=Frey%2CBL&#038;author=Scalf%2CM&#038;author=Smith%2CLM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"19.\">\n<p id=\"ref-CR19\">Sheynkman, G. M., Shortreed, M. R., Frey, B. L. &#038; Smith, L. M. Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-seq. <i>Mol. Cell. Proteomics<\/i> <b>12<\/b>, 2341\u20132353 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.O113.028142\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.O113.028142\" aria-label=\"Reference 2\"6565 data-doi=\"10.1074\/mcp.O113.028142\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXht1Shsr7K\" aria-label=\"Reference 2\"6666>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23629695\" aria-label=\"Reference 2\"6767>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3734590\" aria-label=\"Reference 2\"6868>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"6969 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Discovery%20and%20mass%20spectrometric%20analysis%20of%20novel%20splice-junction%20peptides%20using%20RNA-seq&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.O113.028142&#038;volume=12&#038;pages=2341-2353&#038;publication_year=2013&#038;author=Sheynkman%2CGM&#038;author=Shortreed%2CMR&#038;author=Frey%2CBL&#038;author=Smith%2CLM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"20.\">\n<p id=\"ref-CR20\">Menon, R. et al. Distinct splice variants and pathway enrichment in the cell-line models of aggressive human breast cancer subtypes. <i>J. Proteome Res.<\/i> <b>13<\/b>, 212\u2013227 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/pr400773v\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fpr400773v\" aria-label=\"Reference 2\"7070 data-doi=\"10.1021\/pr400773v\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXhs1Sis7bN\" aria-label=\"Reference 2\"7171>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24111759\" aria-label=\"Reference 2\"7272>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"7373 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Distinct%20splice%20variants%20and%20pathway%20enrichment%20in%20the%20cell-line%20models%20of%20aggressive%20human%20breast%20cancer%20subtypes&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Fpr400773v&#038;volume=13&#038;pages=212-227&#038;publication_year=2014&#038;author=Menon%2CR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"21.\">\n<p id=\"ref-CR21\">Smith, L. M. &#038; Kelleher, N. L. Proteoform: a single term describing protein complexity. <i>Nat. Methods<\/i> <b>10<\/b>, 186\u2013187 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth.2369\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth.2369\" aria-label=\"Reference 2\"7474 data-doi=\"10.1038\/nmeth.2369\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXjtFWitb4%3D\" aria-label=\"Reference 2\"7575>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23443629\" aria-label=\"Reference 2\"7676>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4114032\" aria-label=\"Reference 2\"7777>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"7878 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Proteoform%3A%20a%20single%20term%20describing%20protein%20complexity&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth.2369&#038;volume=10&#038;pages=186-187&#038;publication_year=2013&#038;author=Smith%2CLM&#038;author=Kelleher%2CNL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"22.\">\n<p id=\"ref-CR22\">Smith, L. M. et al. The human proteoform project: defining the human proteome. <i>Sci. Adv.<\/i> <b>7<\/b>, eabk0734 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/sciadv.abk0734\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fsciadv.abk0734\" aria-label=\"Reference 2\"7979 data-doi=\"10.1126\/sciadv.abk0734\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXis1GhsbrM\" aria-label=\"Reference 2\"8080>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34767442\" aria-label=\"Reference 2\"8181>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8589312\" aria-label=\"Reference 2\"8282>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"8383 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20human%20proteoform%20project%3A%20defining%20the%20human%20proteome&#038;journal=Sci.%20Adv.&#038;doi=10.1126%2Fsciadv.abk0734&#038;volume=7&#038;publication_year=2021&#038;author=Smith%2CLM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"23.\">\n<p id=\"ref-CR23\">Samaras, P. et al. ProteomicsDB: a multi-omics and multi-organism resource for life science research. <i>Nucleic Acids Res.<\/i> <b>48<\/b>, D1153\u2013D1163 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhs1GltrvP\" aria-label=\"Reference 2\"8484>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31665479\" aria-label=\"Reference 2\"8585>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"8686 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=ProteomicsDB%3A%20a%20multi-omics%20and%20multi-organism%20resource%20for%20life%20science%20research&#038;journal=Nucleic%20Acids%20Res.&#038;volume=48&#038;pages=D1153-D1163&#038;publication_year=2020&#038;author=Samaras%2CP\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"24.\">\n<p id=\"ref-CR24\">Omenn, G. S. et al. Research on the human proteome reaches a major milestone: >90% of predicted human proteins now credibly detected, according to the HUPO human proteome project. <i>J. Proteome Res.<\/i> <b>19<\/b>, 4735\u20134746 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/acs.jproteome.0c00485\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Facs.jproteome.0c00485\" aria-label=\"Reference 2\"8787 data-doi=\"10.1021\/acs.jproteome.0c00485\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhvVeqsr%2FP\" aria-label=\"Reference 2\"8888>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32931287\" aria-label=\"Reference 2\"8989>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7718309\" aria-label=\"Reference 2\"9090>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"9191 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Research%20on%20the%20human%20proteome%20reaches%20a%20major%20milestone%3A%20%3E90%25%20of%20predicted%20human%20proteins%20now%20credibly%20detected%2C%20according%20to%20the%20HUPO%20human%20proteome%20project&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Facs.jproteome.0c00485&#038;volume=19&#038;pages=4735-4746&#038;publication_year=2020&#038;author=Omenn%2CGS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"25.\">\n<p id=\"ref-CR25\">Toby, T. K., Fornelli, L. &#038; Kelleher, N. L. Progress in top-down proteomics and the analysis of proteoforms. <i>Annu. Rev. Anal. Chem. (Palo Alto Calif.)<\/i> <b>9<\/b>, 499\u2013519 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1146\/annurev-anchem-071015-041550\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1146%2Fannurev-anchem-071015-041550\" aria-label=\"Reference 2\"9292 data-doi=\"10.1146\/annurev-anchem-071015-041550\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XhtVGgu7%2FO\" aria-label=\"Reference 2\"9393>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27306313\" aria-label=\"Reference 2\"9494>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"9595 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Progress%20in%20top-down%20proteomics%20and%20the%20analysis%20of%20proteoforms&#038;journal=Annu.%20Rev.%20Anal.%20Chem.%20%28Palo%20Alto%20Calif.%29&#038;doi=10.1146%2Fannurev-anchem-071015-041550&#038;volume=9&#038;pages=499-519&#038;publication_year=2016&#038;author=Toby%2CTK&#038;author=Fornelli%2CL&#038;author=Kelleher%2CNL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"26.\">\n<p id=\"ref-CR26\">Meyer, J. G. et al. Expanding proteome coverage with orthogonal-specificity \u03b1-lytic proteases. <i>Mol. Cell. Proteomics<\/i> <b>13<\/b>, 823\u2013835 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.M113.034710\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.M113.034710\" aria-label=\"Reference 2\"9696 data-doi=\"10.1074\/mcp.M113.034710\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXktlWitbw%3D\" aria-label=\"Reference 2\"9797>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24425750\" aria-label=\"Reference 2\"9898>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3945911\" aria-label=\"Reference 2\"9999>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0000 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Expanding%20proteome%20coverage%20with%20orthogonal-specificity%20%CE%B1-lytic%20proteases&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.M113.034710&#038;volume=13&#038;pages=823-835&#038;publication_year=2014&#038;author=Meyer%2CJG\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"27.\">\n<p id=\"ref-CR27\">Giansanti, P., Tsiatsiani, L., Low, T. Y. &#038; Heck, A. J. R. Six alternative proteases for mass spectrometry-based proteomics beyond trypsin. <i>Nat. Protoc.<\/i> <b>11<\/b>, 993\u20131006 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nprot.2016.057\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnprot.2016.057\" aria-label=\"Reference 7\"0101 data-doi=\"10.1038\/nprot.2016.057\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XmsFaqsLs%3D\" aria-label=\"Reference 7\"0202>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27123950\" aria-label=\"Reference 7\"0303>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0404 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Six%20alternative%20proteases%20for%20mass%20spectrometry-based%20proteomics%20beyond%20trypsin&#038;journal=Nat.%20Protoc.&#038;doi=10.1038%2Fnprot.2016.057&#038;volume=11&#038;pages=993-1006&#038;publication_year=2016&#038;author=Giansanti%2CP&#038;author=Tsiatsiani%2CL&#038;author=Low%2CTY&#038;author=Heck%2CAJR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"28.\">\n<p id=\"ref-CR28\">Aebersold, R. H., Leavitt, J., Saavedra, R. A., Hood, L. E. &#038; Kent, S. B. Internal amino acid sequence analysis of proteins separated by one- or two-dimensional gel electrophoresis after in situ protease digestion on nitrocellulose. <i>Proc. Natl Acad. Sci. USA<\/i> <b>84<\/b>, 6970\u20136974 (1987).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.84.20.6970\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.84.20.6970\" aria-label=\"Reference 7\"0505 data-doi=\"10.1073\/pnas.84.20.6970\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DyaL1cXjtVWl\" aria-label=\"Reference 7\"0606>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=3313383\" aria-label=\"Reference 7\"0707>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC299210\" aria-label=\"Reference 7\"0808>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0909 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Internal%20amino%20acid%20sequence%20analysis%20of%20proteins%20separated%20by%20one-%20or%20two-dimensional%20gel%20electrophoresis%20after%20in%20situ%20protease%20digestion%20on%20nitrocellulose&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.84.20.6970&#038;volume=84&#038;pages=6970-6974&#038;publication_year=1987&#038;author=Aebersold%2CRH&#038;author=Leavitt%2CJ&#038;author=Saavedra%2CRA&#038;author=Hood%2CLE&#038;author=Kent%2CSB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"29.\">\n<p id=\"ref-CR29\">MacCoss, M. J. et al. Shotgun identification of protein modifications from protein complexes and lens tissue. <i>Proc. Natl Acad. Sci. USA<\/i> <b>99<\/b>, 7900\u20137905 (2002).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.122231399\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.122231399\" aria-label=\"Reference 7\"1010 data-doi=\"10.1073\/pnas.122231399\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD38XkvVGjsbg%3D\" aria-label=\"Reference 7\"1111>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=12060738\" aria-label=\"Reference 7\"1212>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC122992\" aria-label=\"Reference 7\"1313>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"1414 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Shotgun%20identification%20of%20protein%20modifications%20from%20protein%20complexes%20and%20lens%20tissue&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.122231399&#038;volume=99&#038;pages=7900-7905&#038;publication_year=2002&#038;author=MacCoss%2CMJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"30.\">\n<p id=\"ref-CR30\">Choudhary, G., Wu, S. L., Shieh, P. &#038; Hancock, W. S. Multiple enzymatic digestion for enhanced sequence coverage of proteins in complex proteomic mixtures using capillary LC with ion trap MS\/MS. <i>J. Proteome Res.<\/i> <b>2<\/b>, 59\u201367 (2003).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/pr025557n\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fpr025557n\" aria-label=\"Reference 7\"1515 data-doi=\"10.1021\/pr025557n\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD38XovV2ltL8%3D\" aria-label=\"Reference 7\"1616>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=12643544\" aria-label=\"Reference 7\"1717>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"1818 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Multiple%20enzymatic%20digestion%20for%20enhanced%20sequence%20coverage%20of%20proteins%20in%20complex%20proteomic%20mixtures%20using%20capillary%20LC%20with%20ion%20trap%20MS%2FMS&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Fpr025557n&#038;volume=2&#038;pages=59-67&#038;publication_year=2003&#038;author=Choudhary%2CG&#038;author=Wu%2CSL&#038;author=Shieh%2CP&#038;author=Hancock%2CWS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"31.\">\n<p id=\"ref-CR31\">Harper, R. G., Workman, S. R., Schuetzner, S., Timperman, A. T. &#038; Sutton, J. N. Low-molecular-weight human serum proteome using ultrafiltration, isoelectric focusing, and mass spectrometry. <i>Electrophoresis<\/i> <b>25<\/b>, 1299\u20131306 (2004).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1002\/elps.200405864\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1002%2Felps.200405864\" aria-label=\"Reference 7\"1919 data-doi=\"10.1002\/elps.200405864\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2cXks12ht7g%3D\" aria-label=\"Reference 7\"2020>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=15174052\" aria-label=\"Reference 7\"2121>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"2222 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Low-molecular-weight%20human%20serum%20proteome%20using%20ultrafiltration%2C%20isoelectric%20focusing%2C%20and%20mass%20spectrometry&#038;journal=Electrophoresis&#038;doi=10.1002%2Felps.200405864&#038;volume=25&#038;pages=1299-1306&#038;publication_year=2004&#038;author=Harper%2CRG&#038;author=Workman%2CSR&#038;author=Schuetzner%2CS&#038;author=Timperman%2CAT&#038;author=Sutton%2CJN\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"32.\">\n<p id=\"ref-CR32\">Schlosser, A., Vanselow, J. T. &#038; Kramer, A. Mapping of phosphorylation sites by a multi-protease approach with specific phosphopeptide enrichment and NanoLC-MS\/MS analysis. <i>Anal. Chem.<\/i> <b>77<\/b>, 5243\u20135250 (2005).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/ac050232m\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fac050232m\" aria-label=\"Reference 7\"2323 data-doi=\"10.1021\/ac050232m\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2MXlvF2isrg%3D\" aria-label=\"Reference 7\"2424>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16097765\" aria-label=\"Reference 7\"2525>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"2626 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Mapping%20of%20phosphorylation%20sites%20by%20a%20multi-protease%20approach%20with%20specific%20phosphopeptide%20enrichment%20and%20NanoLC-MS%2FMS%20analysis&#038;journal=Anal.%20Chem.&#038;doi=10.1021%2Fac050232m&#038;volume=77&#038;pages=5243-5250&#038;publication_year=2005&#038;author=Schlosser%2CA&#038;author=Vanselow%2CJT&#038;author=Kramer%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"33.\">\n<p id=\"ref-CR33\">Biringer, R. G. et al. Enhanced sequence coverage of proteins in human cerebrospinal fluid using multiple enzymatic digestion and linear ion trap LC-MS\/MS. <i>Brief. Funct. Genomic. Proteomic.<\/i> <b>5<\/b>, 144\u2013153 (2006).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/bfgp\/ell026\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fbfgp%2Fell026\" aria-label=\"Reference 7\"2727 data-doi=\"10.1093\/bfgp\/ell026\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD28XhtVSrsb7N\" aria-label=\"Reference 7\"2828>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16772279\" aria-label=\"Reference 7\"2929>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"3030 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Enhanced%20sequence%20coverage%20of%20proteins%20in%20human%20cerebrospinal%20fluid%20using%20multiple%20enzymatic%20digestion%20and%20linear%20ion%20trap%20LC-MS%2FMS&#038;journal=Brief.%20Funct.%20Genomic.%20Proteomic.&#038;doi=10.1093%2Fbfgp%2Fell026&#038;volume=5&#038;pages=144-153&#038;publication_year=2006&#038;author=Biringer%2CRG\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"34.\">\n<p id=\"ref-CR34\">Elenitoba-Johnson, K. S. J. et al. Proteomic identification of oncogenic chromosomal translocation partners encoding chimeric anaplastic lymphoma kinase fusion proteins. <i>Proc. Natl Acad. Sci. USA<\/i> <b>103<\/b>, 7402\u20137407 (2006).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.0506514103\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.0506514103\" aria-label=\"Reference 7\"3131 data-doi=\"10.1073\/pnas.0506514103\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD28XkslOrsbg%3D\" aria-label=\"Reference 7\"3232>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16651537\" aria-label=\"Reference 7\"3333>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC1464352\" aria-label=\"Reference 7\"3434>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"3535 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Proteomic%20identification%20of%20oncogenic%20chromosomal%20translocation%20partners%20encoding%20chimeric%20anaplastic%20lymphoma%20kinase%20fusion%20proteins&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.0506514103&#038;volume=103&#038;pages=7402-7407&#038;publication_year=2006&#038;author=Elenitoba-Johnson%2CKSJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"35.\">\n<p id=\"ref-CR35\">Wang, B., Malik, R., Nigg, E. A. &#038; K\u00f6rner, R. Evaluation of the low-specificity protease elastase for large-scale phosphoproteome analysis. <i>Anal. Chem.<\/i> <b>80<\/b>, 9526\u20139533 (2008).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/ac801708p\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fac801708p\" aria-label=\"Reference 7\"3636 data-doi=\"10.1021\/ac801708p\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1cXhtlOitbfN\" aria-label=\"Reference 7\"3737>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=19007248\" aria-label=\"Reference 7\"3838>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"3939 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Evaluation%20of%20the%20low-specificity%20protease%20elastase%20for%20large-scale%20phosphoproteome%20analysis&#038;journal=Anal.%20Chem.&#038;doi=10.1021%2Fac801708p&#038;volume=80&#038;pages=9526-9533&#038;publication_year=2008&#038;author=Wang%2CB&#038;author=Malik%2CR&#038;author=Nigg%2CEA&#038;author=K%C3%B6rner%2CR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"36.\">\n<p id=\"ref-CR36\">Gauci, S. et al. Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach. <i>Anal. Chem.<\/i> <b>81<\/b>, 4493\u20134501 (2009).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/ac9004309\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fac9004309\" aria-label=\"Reference 7\"4040 data-doi=\"10.1021\/ac9004309\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1MXltl2iu7c%3D\" aria-label=\"Reference 7\"4141>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=19413330\" aria-label=\"Reference 7\"4242>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"4343 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Lys-N%20and%20trypsin%20cover%20complementary%20parts%20of%20the%20phosphoproteome%20in%20a%20refined%20SCX-based%20approach&#038;journal=Anal.%20Chem.&#038;doi=10.1021%2Fac9004309&#038;volume=81&#038;pages=4493-4501&#038;publication_year=2009&#038;author=Gauci%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"37.\">\n<p id=\"ref-CR37\">Swaney, D. L., Wenger, C. D. &#038; Coon, J. J. Value of using multiple proteases for large-scale mass spectrometry-based proteomics. <i>J. Proteome Res.<\/i> <b>9<\/b>, 1323\u20131329 (2010).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/pr900863u\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fpr900863u\" aria-label=\"Reference 7\"4444 data-doi=\"10.1021\/pr900863u\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3cXht12hs7o%3D\" aria-label=\"Reference 7\"4545>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=20113005\" aria-label=\"Reference 7\"4646>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC2833215\" aria-label=\"Reference 7\"4747>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"4848 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Value%20of%20using%20multiple%20proteases%20for%20large-scale%20mass%20spectrometry-based%20proteomics&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Fpr900863u&#038;volume=9&#038;pages=1323-1329&#038;publication_year=2010&#038;author=Swaney%2CDL&#038;author=Wenger%2CCD&#038;author=Coon%2CJJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"38.\">\n<p id=\"ref-CR38\">Guo, X., Trudgian, D. C., Lemoff, A., Yadavalli, S. &#038; Mirzaei, H. Confetti: a multiprotease map of the HeLa proteome for comprehensive proteomics. <i>Mol. Cell. Proteomics<\/i> <b>13<\/b>, 1573\u20131584 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.M113.035170\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.M113.035170\" aria-label=\"Reference 7\"4949 data-doi=\"10.1074\/mcp.M113.035170\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXpsFCjt7s%3D\" aria-label=\"Reference 7\"5050>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24696503\" aria-label=\"Reference 7\"5151>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4047476\" aria-label=\"Reference 7\"5252>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"5353 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Confetti%3A%20a%20multiprotease%20map%20of%20the%20HeLa%20proteome%20for%20comprehensive%20proteomics&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.M113.035170&#038;volume=13&#038;pages=1573-1584&#038;publication_year=2014&#038;author=Guo%2CX&#038;author=Trudgian%2CDC&#038;author=Lemoff%2CA&#038;author=Yadavalli%2CS&#038;author=Mirzaei%2CH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"39.\">\n<p id=\"ref-CR39\">Giansanti, P. et al. An augmented multiple-protease-based human phosphopeptide atlas. <i>Cell Rep.<\/i> <b>11<\/b>, 1834\u20131843 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.celrep.2015.05.029\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.celrep.2015.05.029\" aria-label=\"Reference 7\"5454 data-doi=\"10.1016\/j.celrep.2015.05.029\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2MXhtVeiu7rN\" aria-label=\"Reference 7\"5555>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26074081\" aria-label=\"Reference 7\"5656>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"5757 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=An%20augmented%20multiple-protease-based%20human%20phosphopeptide%20atlas&#038;journal=Cell%20Rep.&#038;doi=10.1016%2Fj.celrep.2015.05.029&#038;volume=11&#038;pages=1834-1843&#038;publication_year=2015&#038;author=Giansanti%2CP\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"40.\">\n<p id=\"ref-CR40\">Bekker-Jensen, D. B. et al. An optimized shotgun strategy for the rapid generation of comprehensive human proteomes. <i>Cell Syst<\/i>. <b>4<\/b>, 587\u2013599 (2017).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2017.05.009\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2017.05.009\" aria-label=\"Reference 7\"5858 data-doi=\"10.1016\/j.cels.2017.05.009\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXhtFSmtr3J\" aria-label=\"Reference 7\"5959>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=28601559\" aria-label=\"Reference 7\"6060>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5493283\" aria-label=\"Reference 7\"6161>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"6262 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=An%20optimized%20shotgun%20strategy%20for%20the%20rapid%20generation%20of%20comprehensive%20human%20proteomes&#038;journal=Cell%20Syst&#038;doi=10.1016%2Fj.cels.2017.05.009&#038;volume=4&#038;pages=587-599&#038;publication_year=2017&#038;author=Bekker-Jensen%2CDB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"41.\">\n<p id=\"ref-CR41\">Miller, R. M. et al. Improved protein inference from multiple protease bottom-up mass spectrometry data. <i>J. Proteome Res.<\/i> <b>18<\/b>, 3429\u20133438 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/acs.jproteome.9b00330\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Facs.jproteome.9b00330\" aria-label=\"Reference 7\"6363 data-doi=\"10.1021\/acs.jproteome.9b00330\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXhsFWms7rP\" aria-label=\"Reference 7\"6464>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31378069\" aria-label=\"Reference 7\"6565>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6733628\" aria-label=\"Reference 7\"6666>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"6767 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Improved%20protein%20inference%20from%20multiple%20protease%20bottom-up%20mass%20spectrometry%20data&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Facs.jproteome.9b00330&#038;volume=18&#038;pages=3429-3438&#038;publication_year=2019&#038;author=Miller%2CRM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"42.\">\n<p id=\"ref-CR42\">Wang, D. et al. A deep proteome and transcriptome abundance atlas of 29 healthy human tissues. <i>Mol. Syst. Biol.<\/i> <b>15<\/b>, e8503 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.15252\/msb.20188503\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.15252%2Fmsb.20188503\" aria-label=\"Reference 7\"6868 data-doi=\"10.15252\/msb.20188503\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=30777892\" aria-label=\"Reference 7\"6969>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6379049\" aria-label=\"Reference 7\"7070>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"7171 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20deep%20proteome%20and%20transcriptome%20abundance%20atlas%20of%2029%20healthy%20human%20tissues&#038;journal=Mol.%20Syst.%20Biol.&#038;doi=10.15252%2Fmsb.20188503&#038;volume=15&#038;publication_year=2019&#038;author=Wang%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"43.\">\n<p id=\"ref-CR43\">Dau, T., Bartolomucci, G. &#038; Rappsilber, J. Proteomics using protease alternatives to trypsin benefits from sequential digestion with trypsin. <i>Anal. Chem.<\/i> <b>92<\/b>, 9523\u20139527 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/acs.analchem.0c00478\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Facs.analchem.0c00478\" aria-label=\"Reference 7\"7272 data-doi=\"10.1021\/acs.analchem.0c00478\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhtlWlsrfN\" aria-label=\"Reference 7\"7373>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32628831\" aria-label=\"Reference 7\"7474>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7377536\" aria-label=\"Reference 7\"7575>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"7676 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Proteomics%20using%20protease%20alternatives%20to%20trypsin%20benefits%20from%20sequential%20digestion%20with%20trypsin&#038;journal=Anal.%20Chem.&#038;doi=10.1021%2Facs.analchem.0c00478&#038;volume=92&#038;pages=9523-9527&#038;publication_year=2020&#038;author=Dau%2CT&#038;author=Bartolomucci%2CG&#038;author=Rappsilber%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"44.\">\n<p id=\"ref-CR44\">Richards, A. L. et al. Data-independent acquisition protease-multiplexing enables increased proteome sequence coverage across multiple fragmentation modes. <i>J. Proteome Res.<\/i> <b>21<\/b>, 1124\u20131136 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/acs.jproteome.1c00960\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Facs.jproteome.1c00960\" aria-label=\"Reference 7\"7777 data-doi=\"10.1021\/acs.jproteome.1c00960\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XlsVKrsLs%3D\" aria-label=\"Reference 7\"7878>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35234472\" aria-label=\"Reference 7\"7979>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"8080 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Data-independent%20acquisition%20protease-multiplexing%20enables%20increased%20proteome%20sequence%20coverage%20across%20multiple%20fragmentation%20modes&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Facs.jproteome.1c00960&#038;volume=21&#038;pages=1124-1136&#038;publication_year=2022&#038;author=Richards%2CAL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"45.\">\n<p id=\"ref-CR45\">Olsen, J. V. et al. Higher-energy C-trap dissociation for peptide modification analysis. <i>Nat. Methods<\/i> <b>4<\/b>, 709\u2013712 (2007).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth1060\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth1060\" aria-label=\"Reference 7\"8181 data-doi=\"10.1038\/nmeth1060\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2sXps12gsLY%3D\" aria-label=\"Reference 7\"8282>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=17721543\" aria-label=\"Reference 7\"8383>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"8484 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Higher-energy%20C-trap%20dissociation%20for%20peptide%20modification%20analysis&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth1060&#038;volume=4&#038;pages=709-712&#038;publication_year=2007&#038;author=Olsen%2CJV\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"46.\">\n<p id=\"ref-CR46\">Mitchell Wells, J. &#038; McLuckey, S. A. Collision-induced dissociation (CID) of peptides and proteins. <i>Methods Enzymol.<\/i> <b>402<\/b>, 148\u2013185 (2005).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/S0076-6879(05)02005-7\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2FS0076-6879%2805%2902005-7\" aria-label=\"Reference 7\"8585 data-doi=\"10.1016\/S0076-6879(05)02005-7\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16401509\" aria-label=\"Reference 7\"8686>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"8787 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Collision-induced%20dissociation%20%28CID%29%20of%20peptides%20and%20proteins&#038;journal=Methods%20Enzymol.&#038;doi=10.1016%2FS0076-6879%2805%2902005-7&#038;volume=402&#038;pages=148-185&#038;publication_year=2005&#038;author=Mitchell%20Wells%2CJ&#038;author=McLuckey%2CSA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"47.\">\n<p id=\"ref-CR47\">Coon, J. J., Shabanowitz, J., Hunt, D. F. &#038; Syka, J. E. P. Electron transfer dissociation of peptide anions. <i>J. Am. Soc. Mass. Spectrom.<\/i> <b>16<\/b>, 880\u2013882 (2005).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.jasms.2005.01.015\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.jasms.2005.01.015\" aria-label=\"Reference 7\"8888 data-doi=\"10.1016\/j.jasms.2005.01.015\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2MXktlChtbc%3D\" aria-label=\"Reference 7\"8989>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=15907703\" aria-label=\"Reference 7\"9090>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"9191 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Electron%20transfer%20dissociation%20of%20peptide%20anions&#038;journal=J.%20Am.%20Soc.%20Mass.%20Spectrom.&#038;doi=10.1016%2Fj.jasms.2005.01.015&#038;volume=16&#038;pages=880-882&#038;publication_year=2005&#038;author=Coon%2CJJ&#038;author=Shabanowitz%2CJ&#038;author=Hunt%2CDF&#038;author=Syka%2CJEP\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"48.\">\n<p id=\"ref-CR48\">Syka, J. E., Coon, J. J., Schroeder, M. J., Shabanowitz, J. &#038; Hunt, D. F. Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. <i>Proc. Natl. Acad. Sci. USA<\/i> <b>101<\/b>, 9528\u20139533 (2004).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.0402700101\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.0402700101\" aria-label=\"Reference 7\"9292 data-doi=\"10.1073\/pnas.0402700101\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2cXlvVahs70%3D\" aria-label=\"Reference 7\"9393>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=15210983\" aria-label=\"Reference 7\"9494>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC470779\" aria-label=\"Reference 7\"9595>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"9696 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Peptide%20and%20protein%20sequence%20analysis%20by%20electron%20transfer%20dissociation%20mass%20spectrometry&#038;journal=Proc.%20Natl.%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.0402700101&#038;volume=101&#038;pages=9528-9533&#038;publication_year=2004&#038;author=Syka%2CJE&#038;author=Coon%2CJJ&#038;author=Schroeder%2CMJ&#038;author=Shabanowitz%2CJ&#038;author=Hunt%2CDF\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"49.\">\n<p id=\"ref-CR49\">Djebali, S. et al. Landscape of transcription in human cells. <i>Nature<\/i> <b>489<\/b>, 101\u2013108 (2012).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature11233\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature11233\" aria-label=\"Reference 7\"9797 data-doi=\"10.1038\/nature11233\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC38XhtlGnt73M\" aria-label=\"Reference 7\"9898>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=22955620\" aria-label=\"Reference 7\"9999>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3684276\" aria-label=\"Reference 7\"0000>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0101 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Landscape%20of%20transcription%20in%20human%20cells&#038;journal=Nature&#038;doi=10.1038%2Fnature11233&#038;volume=489&#038;pages=101-108&#038;publication_year=2012&#038;author=Djebali%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"50.\">\n<p id=\"ref-CR50\">Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant environment. <i>J. Proteome Res.<\/i> <b>10<\/b>, 1794\u20131805 (2011).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/pr101065j\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fpr101065j\" aria-label=\"Reference 7\"0202 data-doi=\"10.1021\/pr101065j\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3MXit1Gis74%3D\" aria-label=\"Reference 7\"0303>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=21254760\" aria-label=\"Reference 7\"0404>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0505 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Andromeda%3A%20a%20peptide%20search%20engine%20integrated%20into%20the%20MaxQuant%20environment&#038;journal=J.%20Proteome%20Res.&#038;doi=10.1021%2Fpr101065j&#038;volume=10&#038;pages=1794-1805&#038;publication_year=2011&#038;author=Cox%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"51.\">\n<p id=\"ref-CR51\">Tyanova, S., Temu, T. &#038; Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. <i>Nat. Protoc.<\/i> <b>11<\/b>, 2301\u20132319 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nprot.2016.136\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnprot.2016.136\" aria-label=\"Reference 7\"0606 data-doi=\"10.1038\/nprot.2016.136\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XhslynsL7O\" aria-label=\"Reference 7\"0707>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27809316\" aria-label=\"Reference 7\"0808>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"0909 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20MaxQuant%20computational%20platform%20for%20mass%20spectrometry-based%20shotgun%20proteomics&#038;journal=Nat.%20Protoc.&#038;doi=10.1038%2Fnprot.2016.136&#038;volume=11&#038;pages=2301-2319&#038;publication_year=2016&#038;author=Tyanova%2CS&#038;author=Temu%2CT&#038;author=Cox%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"52.\">\n<p id=\"ref-CR52\">Cox, J. &#038; Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. <i>Nat. Biotechnol.<\/i> <b>26<\/b>, 1367\u20131372 (2008).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nbt.1511\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnbt.1511\" aria-label=\"Reference 7\"1010 data-doi=\"10.1038\/nbt.1511\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1cXhsVWjtLzJ\" aria-label=\"Reference 7\"1111>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=19029910\" aria-label=\"Reference 7\"1212>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"1313 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=MaxQuant%20enables%20high%20peptide%20identification%20rates%2C%20individualized%20p.p.b.-range%20mass%20accuracies%20and%20proteome-wide%20protein%20quantification&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fnbt.1511&#038;volume=26&#038;pages=1367-1372&#038;publication_year=2008&#038;author=Cox%2CJ&#038;author=Mann%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"53.\">\n<p id=\"ref-CR53\">Gilmore, J. M. &#038; Washburn, M. P. Advances in shotgun proteomics and the analysis of membrane proteomes. <i>J. Proteomics<\/i> <b>73<\/b>, 2078\u20132091 (2010).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.jprot.2010.08.005\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.jprot.2010.08.005\" aria-label=\"Reference 7\"1414 data-doi=\"10.1016\/j.jprot.2010.08.005\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3cXht1CmtbnP\" aria-label=\"Reference 7\"1515>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=20797458\" aria-label=\"Reference 7\"1616>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"1717 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Advances%20in%20shotgun%20proteomics%20and%20the%20analysis%20of%20membrane%20proteomes&#038;journal=J.%20Proteomics&#038;doi=10.1016%2Fj.jprot.2010.08.005&#038;volume=73&#038;pages=2078-2091&#038;publication_year=2010&#038;author=Gilmore%2CJM&#038;author=Washburn%2CMP\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"54.\">\n<p id=\"ref-CR54\">Washburn, M. P., Wolters, D. &#038; Yates, J. R. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. <i>Nat. Biotechnol.<\/i> <b>19<\/b>, 242\u2013247 (2001).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/85686\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2F85686\" aria-label=\"Reference 7\"1818 data-doi=\"10.1038\/85686\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD3MXhslaqtbw%3D\" aria-label=\"Reference 7\"1919>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=11231557\" aria-label=\"Reference 7\"2020>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"2121 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Large-scale%20analysis%20of%20the%20yeast%20proteome%20by%20multidimensional%20protein%20identification%20technology&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2F85686&#038;volume=19&#038;pages=242-247&#038;publication_year=2001&#038;author=Washburn%2CMP&#038;author=Wolters%2CD&#038;author=Yates%2CJR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"55.\">\n<p id=\"ref-CR55\">Wu, C. C. &#038; Yates, J. R. The application of mass spectrometry to membrane proteomics. <i>Nat. Biotechnol.<\/i> <b>21<\/b>, 262\u2013267 (2003).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nbt0303-262\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnbt0303-262\" aria-label=\"Reference 7\"2222 data-doi=\"10.1038\/nbt0303-262\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD3sXhsFajsrs%3D\" aria-label=\"Reference 7\"2323>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=12610573\" aria-label=\"Reference 7\"2424>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"2525 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20application%20of%20mass%20spectrometry%20to%20membrane%20proteomics&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fnbt0303-262&#038;volume=21&#038;pages=262-267&#038;publication_year=2003&#038;author=Wu%2CCC&#038;author=Yates%2CJR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"56.\">\n<p id=\"ref-CR56\">Xie, Y. et al. SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-seq reads. <i>Bioinformatics<\/i> <b>30<\/b>, 1660\u20131666 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/bioinformatics\/btu077\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fbioinformatics%2Fbtu077\" aria-label=\"Reference 7\"2626 data-doi=\"10.1093\/bioinformatics\/btu077\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXpvFCqsbg%3D\" aria-label=\"Reference 7\"2727>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24532719\" aria-label=\"Reference 7\"2828>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"2929 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=SOAPdenovo-Trans%3A%20de%20novo%20transcriptome%20assembly%20with%20short%20RNA-seq%20reads&#038;journal=Bioinformatics&#038;doi=10.1093%2Fbioinformatics%2Fbtu077&#038;volume=30&#038;pages=1660-1666&#038;publication_year=2014&#038;author=Xie%2CY\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"57.\">\n<p id=\"ref-CR57\">Guthals, A., Clauser, K. R. &#038; Bandeira, N. Shotgun protein sequencing with meta-contig assembly. <i>Mol. Cell. Proteomics<\/i> <b>11<\/b>, 1084\u20131096 (2012).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.M111.015768\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.M111.015768\" aria-label=\"Reference 7\"3030 data-doi=\"10.1074\/mcp.M111.015768\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC38XhsFSmtrzI\" aria-label=\"Reference 7\"3131>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=22798278\" aria-label=\"Reference 7\"3232>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3494147\" aria-label=\"Reference 7\"3333>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"3434 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Shotgun%20protein%20sequencing%20with%20meta-contig%20assembly&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.M111.015768&#038;volume=11&#038;pages=1084-1096&#038;publication_year=2012&#038;author=Guthals%2CA&#038;author=Clauser%2CKR&#038;author=Bandeira%2CN\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"58.\">\n<p id=\"ref-CR58\">Landry, J. J. M. et al. The genomic and transcriptomic landscape of a HeLa cell line. <i>G3 (Bethesda)<\/i> <b>3<\/b>, 1213\u20131224 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1534\/g3.113.005777\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1534%2Fg3.113.005777\" aria-label=\"Reference 7\"3535 data-doi=\"10.1534\/g3.113.005777\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23550136\" aria-label=\"Reference 7\"3636>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"3737 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20genomic%20and%20transcriptomic%20landscape%20of%20a%20HeLa%20cell%20line&#038;journal=G3%20%28Bethesda%29&#038;doi=10.1534%2Fg3.113.005777&#038;volume=3&#038;pages=1213-1224&#038;publication_year=2013&#038;author=Landry%2CJJM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"59.\">\n<p id=\"ref-CR59\">Sinitcyn, P., Gerwien, M. &#038; Cox, J. MaxQuant module for the identification of genomic variants propagated into peptides. <i>Methods Mol. Biol.<\/i> <b>2456<\/b>, 339\u2013347 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1007\/978-1-0716-2124-0_23\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1007%2F978-1-0716-2124-0_23\" aria-label=\"Reference 7\"3838 data-doi=\"10.1007\/978-1-0716-2124-0_23\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35612753\" aria-label=\"Reference 7\"3939>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"4040 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=MaxQuant%20module%20for%20the%20identification%20of%20genomic%20variants%20propagated%20into%20peptides&#038;journal=Methods%20Mol.%20Biol.&#038;doi=10.1007%2F978-1-0716-2124-0_23&#038;volume=2456&#038;pages=339-347&#038;publication_year=2022&#038;author=Sinitcyn%2CP&#038;author=Gerwien%2CM&#038;author=Cox%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"60.\">\n<p id=\"ref-CR60\">Tiwary, S. et al. High-quality MS\/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. <i>Nat. Methods<\/i> <b>16<\/b>, 519\u2013525 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41592-019-0427-6\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41592-019-0427-6\" aria-label=\"Reference 7\"4141 data-doi=\"10.1038\/s41592-019-0427-6\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXhtVCitLzE\" aria-label=\"Reference 7\"4242>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31133761\" aria-label=\"Reference 7\"4343>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"4444 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=High-quality%20MS%2FMS%20spectrum%20prediction%20for%20data-dependent%20and%20data-independent%20acquisition%20data%20analysis&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fs41592-019-0427-6&#038;volume=16&#038;pages=519-525&#038;publication_year=2019&#038;author=Tiwary%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"61.\">\n<p id=\"ref-CR61\">Kumar, P., Henikoff, S. &#038; Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. <i>Nat. Protoc.<\/i> <b>4<\/b>, 1073\u20131081 (2009).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nprot.2009.86\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnprot.2009.86\" aria-label=\"Reference 7\"4545 data-doi=\"10.1038\/nprot.2009.86\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1MXovVyns78%3D\" aria-label=\"Reference 7\"4646>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=19561590\" aria-label=\"Reference 7\"4747>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"4848 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Predicting%20the%20effects%20of%20coding%20non-synonymous%20variants%20on%20protein%20function%20using%20the%20SIFT%20algorithm&#038;journal=Nat.%20Protoc.&#038;doi=10.1038%2Fnprot.2009.86&#038;volume=4&#038;pages=1073-1081&#038;publication_year=2009&#038;author=Kumar%2CP&#038;author=Henikoff%2CS&#038;author=Ng%2CPC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"62.\">\n<p id=\"ref-CR62\">Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. <i>Nat. Methods<\/i> <b>7<\/b>, 248\u2013249 (2010).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth0410-248\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth0410-248\" aria-label=\"Reference 7\"4949 data-doi=\"10.1038\/nmeth0410-248\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3cXjvFKqu78%3D\" aria-label=\"Reference 7\"5050>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=20354512\" aria-label=\"Reference 7\"5151>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC2855889\" aria-label=\"Reference 7\"5252>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"5353 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20method%20and%20server%20for%20predicting%20damaging%20missense%20mutations&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth0410-248&#038;volume=7&#038;pages=248-249&#038;publication_year=2010&#038;author=Adzhubei%2CIA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"63.\">\n<p id=\"ref-CR63\">Tress, M. L., Abascal, F. &#038; Valencia, A. Alternative splicing may not be the key to proteome complexity. <i>Trends Biochem. Sci.<\/i> <b>42<\/b>, 98\u2013110 (2017).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.tibs.2016.08.008\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.tibs.2016.08.008\" aria-label=\"Reference 7\"5454 data-doi=\"10.1016\/j.tibs.2016.08.008\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28Xhs1WntL%2FJ\" aria-label=\"Reference 7\"5555>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27712956\" aria-label=\"Reference 7\"5656>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"5757 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Alternative%20splicing%20may%20not%20be%20the%20key%20to%20proteome%20complexity&#038;journal=Trends%20Biochem.%20Sci.&#038;doi=10.1016%2Fj.tibs.2016.08.008&#038;volume=42&#038;pages=98-110&#038;publication_year=2017&#038;author=Tress%2CML&#038;author=Abascal%2CF&#038;author=Valencia%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"64.\">\n<p id=\"ref-CR64\">Blencowe, B. J. The relationship between alternative splicing and proteomic complexity. <i>Trends Biochem. Sci.<\/i> <b>42<\/b>, 407\u2013408 (2017).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.tibs.2017.04.001\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.tibs.2017.04.001\" aria-label=\"Reference 7\"5858 data-doi=\"10.1016\/j.tibs.2017.04.001\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXmtFOiu7Y%3D\" aria-label=\"Reference 7\"5959>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=28483376\" aria-label=\"Reference 7\"6060>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"6161 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20relationship%20between%20alternative%20splicing%20and%20proteomic%20complexity&#038;journal=Trends%20Biochem.%20Sci.&#038;doi=10.1016%2Fj.tibs.2017.04.001&#038;volume=42&#038;pages=407-408&#038;publication_year=2017&#038;author=Blencowe%2CBJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"65.\">\n<p id=\"ref-CR65\">Wang, X. et al. Detection of proteome diversity resulted from alternative splicing is limited by Trypsin cleavage specificity. <i>Mol. Cell. Proteomics<\/i> <b>17<\/b>, 422\u2013430 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.RA117.000155\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.RA117.000155\" aria-label=\"Reference 7\"6262 data-doi=\"10.1074\/mcp.RA117.000155\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1cXjs1agsbo%3D\" aria-label=\"Reference 7\"6363>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=29222161\" aria-label=\"Reference 7\"6464>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"6565 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Detection%20of%20proteome%20diversity%20resulted%20from%20alternative%20splicing%20is%20limited%20by%20Trypsin%20cleavage%20specificity&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.RA117.000155&#038;volume=17&#038;pages=422-430&#038;publication_year=2018&#038;author=Wang%2CX\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"66.\">\n<p id=\"ref-CR66\">Lewis, B. P., Green, R. E. &#038; Brenner, S. E. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. <i>Proc. Natl Acad. Sci. USA<\/i> <b>100<\/b>, 189\u2013192 (2003).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.0136770100\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.0136770100\" aria-label=\"Reference 7\"6666 data-doi=\"10.1073\/pnas.0136770100\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD3sXktlOgtA%3D%3D\" aria-label=\"Reference 7\"6767>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=12502788\" aria-label=\"Reference 7\"6868>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"6969 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Evidence%20for%20the%20widespread%20coupling%20of%20alternative%20splicing%20and%20nonsense-mediated%20mRNA%20decay%20in%20humans&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.0136770100&#038;volume=100&#038;pages=189-192&#038;publication_year=2003&#038;author=Lewis%2CBP&#038;author=Green%2CRE&#038;author=Brenner%2CSE\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"67.\">\n<p id=\"ref-CR67\">Braunschweig, U. et al. Widespread intron retention in mammals functionally tunes transcriptomes. <i>Genome Res.<\/i> <b>24<\/b>, 1774\u20131786 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1101\/gr.177790.114\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1101%2Fgr.177790.114\" aria-label=\"Reference 7\"7070 data-doi=\"10.1101\/gr.177790.114\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXhvFWqs7bF\" aria-label=\"Reference 7\"7171>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=25258385\" aria-label=\"Reference 7\"7272>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4216919\" aria-label=\"Reference 7\"7373>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"7474 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Widespread%20intron%20retention%20in%20mammals%20functionally%20tunes%20transcriptomes&#038;journal=Genome%20Res.&#038;doi=10.1101%2Fgr.177790.114&#038;volume=24&#038;pages=1774-1786&#038;publication_year=2014&#038;author=Braunschweig%2CU\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"68.\">\n<p id=\"ref-CR68\">Chen, T. &#038; Guestrin, C. XGBoost: a scalable tree boosting system. In <i>Proc. of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining<\/i> 785\u2013794 (ACM, 2016).<\/p>\n<\/li>\n<li data-counter=\"69.\">\n<p id=\"ref-CR69\">Pedregosa, F. et al. Scikit-learn: machine learning in Python. <i>J. Mach. Learn. Res.<\/i> <b>12<\/b>, 2825\u20132830 (2011).<\/p>\n<p><a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"7575 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Scikit-learn%3A%20machine%20learning%20in%20Python&#038;journal=J.%20Mach.%20Learn.%20Res.&#038;volume=12&#038;pages=2825-2830&#038;publication_year=2011&#038;author=Pedregosa%2CF\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"70.\">\n<p id=\"ref-CR70\">Cleary, S. &#038; Seoighe, C. Perspectives on allele-specific expression. <i>Annu. Rev. Biomed. Data Sci.<\/i> <b>4<\/b>, 101\u2013122 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1146\/annurev-biodatasci-021621-122219\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1146%2Fannurev-biodatasci-021621-122219\" aria-label=\"Reference 7\"7676 data-doi=\"10.1146\/annurev-biodatasci-021621-122219\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34465174\" aria-label=\"Reference 7\"7777>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"7878 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Perspectives%20on%20allele-specific%20expression&#038;journal=Annu.%20Rev.%20Biomed.%20Data%20Sci.&#038;doi=10.1146%2Fannurev-biodatasci-021621-122219&#038;volume=4&#038;pages=101-122&#038;publication_year=2021&#038;author=Cleary%2CS&#038;author=Seoighe%2CC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"71.\">\n<p id=\"ref-CR71\">Mann, S. P., Treit, P. V., Geyer, P. E., Omenn, G. S. &#038; Mann, M. Ethical principles, constraints, and opportunities in clinical proteomics. <i>Mol. Cell. Proteomics<\/i> <b>20<\/b>, 100046 (2021).<\/p>\n<\/li>\n<li data-counter=\"72.\">\n<p id=\"ref-CR72\">Fierro-Monti, I., Vizcaino, J. A., Choudhary, J. S. &#038; Wright, J. C. Identifying individuals using proteomics: are we there yet? <i>Front. Mol. Biosci.<\/i> <b>9<\/b>, 1062031 (2022).<\/p>\n<\/li>\n<li data-counter=\"73.\">\n<p id=\"ref-CR73\">Reixachs-Sol\u00e9, M. &#038; Eyras, E. Uncovering the impacts of alternative splicing on the proteome with current omics techniques. <i>Wiley Interdiscip. Rev. RNA<\/i> <b>13<\/b>, e1707 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1002\/wrna.1707\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1002%2Fwrna.1707\" aria-label=\"Reference 7\"7979 data-doi=\"10.1002\/wrna.1707\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34979593\" aria-label=\"Reference 7\"8080>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC9542554\" aria-label=\"Reference 7\"8181>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"8282 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Uncovering%20the%20impacts%20of%20alternative%20splicing%20on%20the%20proteome%20with%20current%20omics%20techniques&#038;journal=Wiley%20Interdiscip.%20Rev.%20RNA&#038;doi=10.1002%2Fwrna.1707&#038;volume=13&#038;publication_year=2022&#038;author=Reixachs-Sol%C3%A9%2CM&#038;author=Eyras%2CE\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"74.\">\n<p id=\"ref-CR74\">Nesvizhskii, A. I., Keller, A., Kolker, E. &#038; Aebersold, R. A statistical model for identifying proteins by tandem mass spectrometry. <i>Anal. Chem.<\/i> <b>75<\/b>, 4646\u20134658 (2003).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/ac0341261\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fac0341261\" aria-label=\"Reference 7\"8383 data-doi=\"10.1021\/ac0341261\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD3sXltlynt70%3D\" aria-label=\"Reference 7\"8484>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=14632076\" aria-label=\"Reference 7\"8585>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"8686 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20statistical%20model%20for%20identifying%20proteins%20by%20tandem%20mass%20spectrometry&#038;journal=Anal.%20Chem.&#038;doi=10.1021%2Fac0341261&#038;volume=75&#038;pages=4646-4658&#038;publication_year=2003&#038;author=Nesvizhskii%2CAI&#038;author=Keller%2CA&#038;author=Kolker%2CE&#038;author=Aebersold%2CR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"75.\">\n<p id=\"ref-CR75\">Weatheritt, R. J., Sterne-Weiler, T. &#038; Blencowe, B. J. The ribosome-engaged landscape of alternative splicing. <i>Nat. Struct. Mol. Biol.<\/i> <b>23<\/b>, 1117\u20131123 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nsmb.3317\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnsmb.3317\" aria-label=\"Reference 7\"8787 data-doi=\"10.1038\/nsmb.3317\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28Xhsl2hsb%2FJ\" aria-label=\"Reference 7\"8888>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27820807\" aria-label=\"Reference 7\"8989>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5295628\" aria-label=\"Reference 7\"9090>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"9191 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20ribosome-engaged%20landscape%20of%20alternative%20splicing&#038;journal=Nat.%20Struct.%20Mol.%20Biol.&#038;doi=10.1038%2Fnsmb.3317&#038;volume=23&#038;pages=1117-1123&#038;publication_year=2016&#038;author=Weatheritt%2CRJ&#038;author=Sterne-Weiler%2CT&#038;author=Blencowe%2CBJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"76.\">\n<p id=\"ref-CR76\">Cox, J. Prediction of peptide mass spectral libraries with machine learning. <i>Nat. Biotechnol.<\/i> <b>41<\/b>, 33\u201343 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41587-022-01424-w\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41587-022-01424-w\" aria-label=\"Reference 7\"9292 data-doi=\"10.1038\/s41587-022-01424-w\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=36008611\" aria-label=\"Reference 7\"9393>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"9494 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Prediction%20of%20peptide%20mass%20spectral%20libraries%20with%20machine%20learning&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fs41587-022-01424-w&#038;volume=41&#038;pages=33-43&#038;publication_year=2022&#038;author=Cox%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"77.\">\n<p id=\"ref-CR77\">Phanstiel, D. H. et al. Proteomic and phosphoproteomic comparison of human ES and iPS cells. <i>Nat. Methods<\/i> <b>8<\/b>, 821\u2013827 (2011).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth.1699\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth.1699\" aria-label=\"Reference 7\"9595 data-doi=\"10.1038\/nmeth.1699\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3MXhtFGnur3P\" aria-label=\"Reference 7\"9696>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=21983960\" aria-label=\"Reference 7\"9797>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3432645\" aria-label=\"Reference 7\"9898>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 7\"9999 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Proteomic%20and%20phosphoproteomic%20comparison%20of%20human%20ES%20and%20iPS%20cells&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth.1699&#038;volume=8&#038;pages=821-827&#038;publication_year=2011&#038;author=Phanstiel%2CDH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"78.\">\n<p id=\"ref-CR78\">Brademan, D. R., Riley, N. M., Kwiecien, N. W. &#038; Coon, J. J. Interactive peptide spectral annotator: a versatile web-based tool for proteomic applications. <i>Mol. Cell. Proteomics<\/i> <b>18<\/b>, S193\u2013S201 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1074\/mcp.TIR118.001209\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1074%2Fmcp.TIR118.001209\" aria-label=\"Reference 8\"0000 data-doi=\"10.1074\/mcp.TIR118.001209\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXitVeiu7rO\" aria-label=\"Reference 8\"0101>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31088857\" aria-label=\"Reference 8\"0202>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6692776\" aria-label=\"Reference 8\"0303>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 8\"0404 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Interactive%20peptide%20spectral%20annotator%3A%20a%20versatile%20web-based%20tool%20for%20proteomic%20applications&#038;journal=Mol.%20Cell.%20Proteomics&#038;doi=10.1074%2Fmcp.TIR118.001209&#038;volume=18&#038;pages=S193-S201&#038;publication_year=2019&#038;author=Brademan%2CDR&#038;author=Riley%2CNM&#038;author=Kwiecien%2CNW&#038;author=Coon%2CJJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"79.\">\n<p id=\"ref-CR79\">Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. <i>Nat. Methods<\/i> <b>13<\/b>, 731\u2013740 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth.3901\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth.3901\" aria-label=\"Reference 8\"0505 data-doi=\"10.1038\/nmeth.3901\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XhtVKntbnN\" aria-label=\"Reference 8\"0606>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27348712\" aria-label=\"Reference 8\"0707>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 8\"0808 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20Perseus%20computational%20platform%20for%20comprehensive%20analysis%20of%20%28prote%29omics%20data&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth.3901&#038;volume=13&#038;pages=731-740&#038;publication_year=2016&#038;author=Tyanova%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"80.\">\n<p id=\"ref-CR80\">Sammeth, M. Complete alternative splicing events are bubbles in splicing graphs. <i>J. Comput. Biol.<\/i> <b>16<\/b>, 1117\u20131140 (2009).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1089\/cmb.2009.0108\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1089%2Fcmb.2009.0108\" aria-label=\"Reference 8\"0909 data-doi=\"10.1089\/cmb.2009.0108\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD1MXhtVWnurbK\" aria-label=\"Reference 8\"1010>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=19689216\" aria-label=\"Reference 8\"1111>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 8\"1212 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Complete%20alternative%20splicing%20events%20are%20bubbles%20in%20splicing%20graphs&#038;journal=J.%20Comput.%20Biol.&#038;doi=10.1089%2Fcmb.2009.0108&#038;volume=16&#038;pages=1117-1140&#038;publication_year=2009&#038;author=Sammeth%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<\/ol>\n<p><a data-track=\"click\" data-track-action=\"download citation references\" data-track-label=\"link\" rel=\"nofollow\" href=\"https:\/\/citation-needed.springer.com\/v2\/references\/10.1038\/s41587-023-01714-x?format=refman&#038;flavour=references\">Download references<\/a><\/p>\n<\/div>\n<\/div>\n<div id=\"Ack1-section\" data-title=\"Acknowledgements\">\n<h2 id=\"Ack1\">Acknowledgements<\/h2>\n<p>We thank the National Institutes of Health (NIH) for support of this research &#8211; National Center for Quantitative Biology of Complex Systems grant P41108538 (J.J.C.), National Institute of General Medical Sciences grant R35GM118110 (J.J.C.), and Genomic Sciences Training Program grant T32HG002760 (A.L.R.). P.S. and D.R.B. are supported by Morgridge Interdisciplinary Postdoctoral Fellowships. B.J.B. acknowledges support from the Canadian Institutes for Health Research. We also thank C. Wenger for technical support of MaxQuantAnalyzer usage. We thank the laboratory of J. Thomson for the gift of embryonic stem cells. B.J.B. holds the University of Toronto Banbury Chair in Medical Research and the Canada Research Chair in RNA Biology and Genomics. Finally, we thank A. Williams for editing the paper and S. Hwang for editing the figures.<\/p>\n<\/div>\n<div id=\"Fun-section\" data-title=\"Funding\">\n<h2 id=\"Fun\">Funding<\/h2>\n<p>Open access funding provided by Max Planck Society.<\/p>\n<\/div>\n<div id=\"author-information-section\" aria-labelledby=\"author-information\" data-title=\"Author information\">\n<h2 id=\"author-information\">Author information<\/h2>\n<div id=\"author-information-content\">\n<p><span id=\"author-notes\">Author notes<\/span><\/p>\n<ol>\n<li id=\"na1\">\n<p>These authors contributed equally: Pavel Sinitcyn, Alicia L. Richards.<\/p>\n<\/li>\n<\/ol>\n<h3 id=\"affiliations\">Authors and Affiliations<\/h3>\n<ol>\n<li id=\"Aff1\">\n<p>Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany<\/p>\n<p>Pavel Sinitcyn\u00a0&#038;\u00a0J\u00fcrgen Cox<\/p>\n<\/li>\n<li id=\"Aff2\">\n<p>Morgridge Institute for Research, Madison, WI, USA<\/p>\n<p>Pavel Sinitcyn,\u00a0Dain R. Brademan\u00a0&#038;\u00a0Joshua J. Coon<\/p>\n<\/li>\n<li id=\"Aff3\">\n<p>National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA<\/p>\n<p>Alicia L. Richards,\u00a0Harald Marx,\u00a0Evgenia Shishkova,\u00a0Jesse G. Meyer,\u00a0Alexander S. Hebert,\u00a0Michael S. Westphall\u00a0&#038;\u00a0Joshua J. Coon<\/p>\n<\/li>\n<li id=\"Aff4\">\n<p>Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA<\/p>\n<p>Alicia L. Richards\u00a0&#038;\u00a0Joshua J. Coon<\/p>\n<\/li>\n<li id=\"Aff5\">\n<p>EMBL Australia and Garvan Institute of Medical Research, Sydney, New South Wales, Australia<\/p>\n<p>Robert J. Weatheritt<\/p>\n<\/li>\n<li id=\"Aff6\">\n<p>School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia<\/p>\n<p>Robert J. Weatheritt<\/p>\n<\/li>\n<li id=\"Aff7\">\n<p>Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA<\/p>\n<p>Dain R. Brademan,\u00a0Harald Marx,\u00a0Evgenia Shishkova,\u00a0Jesse G. Meyer,\u00a0Michael S. Westphall\u00a0&#038;\u00a0Joshua J. Coon<\/p>\n<\/li>\n<li id=\"Aff8\">\n<p>Department of Microbiology and Ecosystem Science, University of Vienna, Vienna, Austria<\/p>\n<p>Harald Marx<\/p>\n<\/li>\n<li id=\"Aff9\">\n<p>The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada<\/p>\n<p>Benjamin J. Blencowe<\/p>\n<\/li>\n<li id=\"Aff10\">\n<p>Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada<\/p>\n<p>Benjamin J. Blencowe<\/p>\n<\/li>\n<\/ol>\n<h3 id=\"contributions\">Contributions<\/h3>\n<p>A.L.R., E.S., A.S.H., M.S.W. and J.J.C. conceptualized the wet laboratory experiments and mass spectrometric measurements. A.L.R. and E.S. carried out the wet laboratory experiments and mass spectrometric measurements. P.S., R.J.W., D.R.B., H.M., J.G.M. and J.C. analyzed the data. D.R.B. developed the website resource. P.S., A.L.R., E.S., B.J.B., J.C. and J.J.C. wrote the paper. B.J.B., J.C. and J.J.C. directed the project. B.J.B. and R.J.W. directed the splicing aspect of the project. All authors interpreted data and revised the paper.<\/p>\n<h3 id=\"corresponding-author\">Corresponding authors<\/h3>\n<p id=\"corresponding-author-list\">Correspondence to<br \/>\n                <a id=\"corresp-c1\" href=\"http:\/\/www.nature.com\/mailto:co*@*********pg.de\" data-original-string=\"VAeFsF585N3uPrRj+b\/H6w==7f4jRHX5mbZWWcCyWbjklxT91k4oPwGknvw2yG8heSTO8w=\" title=\"This contact has been encoded by Anti-Spam by CleanTalk. Click to decode. To finish the decoding make sure that JavaScript is enabled in your browser.\">J\u00fcrgen Cox<\/a> or <a id=\"corresp-c2\" href=\"http:\/\/www.nature.com\/mailto:co**@**sc.edu\" data-original-string=\"HnuCQ7CFdQZ9QT1diOOKrw==7f4PwwOPhacSPNnz6r6VK\/7BQ==\" title=\"This contact has been encoded by Anti-Spam by CleanTalk. Click to decode. To finish the decoding make sure that JavaScript is enabled in your browser.\">Joshua J. Coon<\/a>.<\/p>\n<\/div>\n<\/div>\n<div id=\"ethics-section\" data-title=\"Ethics declarations\">\n<h2 id=\"ethics\">Ethics declarations<\/h2>\n<div id=\"ethics-content\">\n<h3 id=\"FPar2\">Competing interests<\/h3>\n<p>J.J.C. is a consultant for Thermo Fisher Scientific, 908 Devices, and Seer. The other authors declare no competing interests.<\/p>\n<\/p><\/div>\n<\/div>\n<div id=\"peer-review-section\" data-title=\"Peer review\">\n<h2 id=\"peer-review\">Peer review<\/h2>\n<div id=\"peer-review-content\">\n<h3 id=\"FPar1\">Peer review information<\/h3>\n<p><i>Nature Biotechnology<\/i> thanks Gilbert Omenn, Eric Deutsch and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.<\/p>\n<\/p><\/div>\n<\/div>\n<div id=\"additional-information-section\" data-title=\"Additional information\">\n<h2 id=\"additional-information\">Additional information<\/h2>\n<p><b>Publisher\u2019s note<\/b> Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.<\/p>\n<\/div>\n<div id=\"Sec24-section\" data-title=\"Supplementary information\">\n<h2 id=\"Sec24\">Supplementary information<\/h2>\n<div data-test=\"supplementary-info\" id=\"Sec24-content\">\n<p data-test=\"supp-item\" id=\"MOESM2\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"https:\/\/static-content.springer.com\/esm\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_MOESM2_ESM.pdf\" data-supp-info-image>Reporting Summary<\/a><\/h3>\n<\/p>\n<div data-test=\"supp-item\" id=\"MOESM3\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"https:\/\/static-content.springer.com\/esm\/art%3A10.1038%2Fs41587-023-01714-x\/MediaObjects\/41587_2023_1714_MOESM3_ESM.xlsx\" data-supp-info-image>Supplementary Table<\/a><\/h3>\n<p>Supplementary Table 1. Summary of cross-mapping of neXtProt and detected protein accessions. Supplementary Table 2. Summary of all mutations detected in the proteomics and transcriptomics data. Supplementary Table 3. Summary of all splicing events detected in the proteomics and transcriptomics data.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"rightslink-section\" data-title=\"Rights and permissions\">\n<h2 id=\"rightslink\">Rights and permissions<\/h2>\n<div id=\"rightslink-content\">\n<p><b>Open Access<\/b>  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article\u2019s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article\u2019s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"license\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p><a data-track=\"click\" data-track-action=\"view rights and permissions\" data-track-label=\"link\" href=\"https:\/\/s100.copyright.com\/AppDispatchServlet?title=Global%20detection%20of%20human%20variants%20and%20isoforms%20by%20deep%20proteome%20sequencing&#038;author=Pavel%20Sinitcyn%20et%20al&#038;contentID=10.1038%2Fs41587-023-01714-x&#038;copyright=The%20Author%28s%29&#038;publication=1087-0156&#038;publicationDate=2023-03-23&#038;publisherName=SpringerNature&#038;orderBeanReset=true&#038;oa=CC%20BY\">Reprints and Permissions<\/a><\/p>\n<\/div>\n<\/div>\n<div id=\"article-info-section\" aria-labelledby=\"article-info\" data-title=\"About this article\">\n<h2 id=\"article-info\">About this article<\/h2>\n<div id=\"article-info-content\">\n<p><a data-crossmark=\"10.1038\/s41587-023-01714-x\" target=\"_blank\" rel=\"noopener\" href=\"https:\/\/crossmark.crossref.org\/dialog\/?doi=10.1038\/s41587-023-01714-x\" data-track=\"click\" data-track-action=\"Click Crossmark\" data-track-label=\"link\" data-test=\"crossmark\"><img loading=\"lazy\" decoding=\"async\" width=\"57\" height=\"81\" alt=\"Science &amp; Nature Verify currency and authenticity via CrossMark\" src=\"data:image\/svg+xml;base64,PHN2ZyBoZWlnaHQ9IjgxIiB3aWR0aD0iNTciIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyI+PGcgZmlsbD0ibm9uZSIgZmlsbC1ydWxlPSJldmVub2RkIj48cGF0aCBkPSJtMTcuMzUgMzUuNDUgMjEuMy0xNC4ydi0xNy4wM2gtMjEuMyIgZmlsbD0iIzk4OTg5OCIvPjxwYXRoIGQ9Im0zOC42NSAzNS40NS0yMS4zLTE0LjJ2LTE3LjAzaDIxLjMiIGZpbGw9IiM3NDc0NzQiLz48cGF0aCBkPSJtMjggLjVjLTEyLjk4IDAtMjMuNSAxMC41Mi0yMy41IDIzLjVzMTAuNTIgMjMuNSAyMy41IDIzLjUgMjMuNS0xMC41MiAyMy41LTIzLjVjMC02LjIzLTIuNDgtMTIuMjEtNi44OC0xNi42Mi00LjQxLTQuNC0xMC4zOS02Ljg4LTE2LjYyLTYuODh6bTAgNDEuMjVjLTkuOCAwLTE3Ljc1LTcuOTUtMTcuNzUtMTcuNzVzNy45NS0xNy43NSAxNy43NS0xNy43NSAxNy43NSA3Ljk1IDE3Ljc1IDE3Ljc1YzAgNC43MS0xLjg3IDkuMjItNS4yIDEyLjU1cy03Ljg0IDUuMi0xMi41NSA1LjJ6IiBmaWxsPSIjNTM1MzUzIi8+PHBhdGggZD0ibTQxIDM2Yy01LjgxIDYuMjMtMTUuMjMgNy40NS0yMi40MyAyLjktNy4yMS00LjU1LTEwLjE2LTEzLjU3LTcuMDMtMjEuNWwtNC45Mi0zLjExYy00Ljk1IDEwLjctMS4xOSAyMy40MiA4Ljc4IDI5LjcxIDkuOTcgNi4zIDIzLjA3IDQuMjIgMzAuNi00Ljg2eiIgZmlsbD0iIzljOWM5YyIvPjxwYXRoIGQ9Im0uMiA1OC40NWMwLS43NS4xMS0xLjQyLjMzLTIuMDFzLjUyLTEuMDkuOTEtMS41Yy4zOC0uNDEuODMtLjczIDEuMzQtLjk0LjUxLS4yMiAxLjA2LS4zMiAxLjY1LS4zMi41NiAwIDEuMDYuMTEgMS41MS4zNS40NC4yMy44MS41IDEuMS44MWwtLjkxIDEuMDFjLS4yNC0uMjQtLjQ5LS40Mi0uNzUtLjU2LS4yNy0uMTMtLjU4LS4yLS45My0uMi0uMzkgMC0uNzMuMDgtMS4wNS4yMy0uMzEuMTYtLjU4LjM3LS44MS42Ni0uMjMuMjgtLjQxLjYzLS41MyAxLjA0LS4xMy40MS0uMTkuODgtLjE5IDEuMzkgMCAxLjA0LjIzIDEuODYuNjggMi40Ni40NS41OSAxLjA2Ljg4IDEuODQuODguNDEgMCAuNzctLjA3IDEuMDctLjIzcy41OS0uMzkuODUtLjY4bC45MSAxYy0uMzguNDMtLjguNzYtMS4yOC45OS0uNDcuMjItMSAuMzQtMS41OC4zNC0uNTkgMC0xLjEzLS4xLTEuNjQtLjMxLS41LS4yLS45NC0uNTEtMS4zMS0uOTEtLjM4LS40LS42Ny0uOS0uODgtMS40OC0uMjItLjU5LS4zMy0xLjI2LS4zMy0yLjAyem04LjQtNS4zM2gxLjYxdjIuNTRsLS4wNSAxLjMzYy4yOS0uMjcuNjEtLjUxLjk2LS43MnMuNzYtLjMxIDEuMjQtLjMxYy43MyAwIDEuMjcuMjMgMS42MS43MS4zMy40Ny41IDEuMTQuNSAyLjAydjQuMzFoLTEuNjF2LTQuMWMwLS41Ny0uMDgtLjk3LS4yNS0xLjIxLS4xNy0uMjMtLjQ1LS4zNS0uODMtLjM1LS4zIDAtLjU2LjA4LS43OS4yMi0uMjMuMTUtLjQ5LjM2LS43OC42NHY0LjhoLTEuNjF6bTcuMzcgNi40NWMwLS41Ni4wOS0xLjA2LjI2LTEuNTEuMTgtLjQ1LjQyLS44My43MS0xLjE0LjI5LS4zLjYzLS41NCAxLjAxLS43MS4zOS0uMTcuNzgtLjI1IDEuMTgtLjI1LjQ3IDAgLjg4LjA4IDEuMjMuMjQuMzYuMTYuNjUuMzguODkuNjdzLjQyLjYzLjU0IDEuMDNjLjEyLjQxLjE4Ljg0LjE4IDEuMzIgMCAuMzItLjAyLjU3LS4wNy43NmgtNC4zNmMuMDcuNjIuMjkgMS4xLjY1IDEuNDQuMzYuMzMuODIuNSAxLjM4LjUuMjkgMCAuNTctLjA0LjgzLS4xM3MuNTEtLjIxLjc2LS4zN2wuNTUgMS4wMWMtLjMzLjIxLS42OS4zOS0xLjA5LjUzLS40MS4xNC0uODMuMjEtMS4yNi4yMS0uNDggMC0uOTItLjA4LTEuMzQtLjI1LS40MS0uMTYtLjc2LS40LTEuMDctLjctLjMxLS4zMS0uNTUtLjY5LS43Mi0xLjEzLS4xOC0uNDQtLjI2LS45NS0uMjYtMS41MnptNC42LS42MmMwLS41NS0uMTEtLjk4LS4zNC0xLjI4LS4yMy0uMzEtLjU4LS40Ny0xLjA2LS40Ny0uNDEgMC0uNzcuMTUtMS4wNy40NS0uMzEuMjktLjUuNzMtLjU4IDEuM3ptMi41LjYyYzAtLjU3LjA5LTEuMDguMjgtMS41My4xOC0uNDQuNDMtLjgyLjc1LTEuMTNzLjY5LS41NCAxLjEtLjcxYy40Mi0uMTYuODUtLjI0IDEuMzEtLjI0LjQ1IDAgLjg0LjA4IDEuMTcuMjNzLjYxLjM0Ljg1LjU3bC0uNzcgMS4wMmMtLjE5LS4xNi0uMzgtLjI4LS41Ni0uMzctLjE5LS4wOS0uMzktLjE0LS42MS0uMTQtLjU2IDAtMS4wMS4yMS0xLjM1LjYzLS4zNS40MS0uNTIuOTctLjUyIDEuNjcgMCAuNjkuMTcgMS4yNC41MSAxLjY2LjM0LjQxLjc4LjYyIDEuMzIuNjIuMjggMCAuNTQtLjA2Ljc4LS4xNy4yNC0uMTIuNDUtLjI2LjY0LS40MmwuNjcgMS4wM2MtLjMzLjI5LS42OS41MS0xLjA4LjY1LS4zOS4xNS0uNzguMjMtMS4xOC4yMy0uNDYgMC0uOS0uMDgtMS4zMS0uMjQtLjQtLjE2LS43NS0uMzktMS4wNS0uN3MtLjUzLS42OS0uNy0xLjEzYy0uMTctLjQ1LS4yNS0uOTYtLjI1LTEuNTN6bTYuOTEtNi40NWgxLjU4djYuMTdoLjA1bDIuNTQtMy4xNmgxLjc3bC0yLjM1IDIuOCAyLjU5IDQuMDdoLTEuNzVsLTEuNzctMi45OC0xLjA4IDEuMjN2MS43NWgtMS41OHptMTMuNjkgMS4yN2MtLjI1LS4xMS0uNS0uMTctLjc1LS4xNy0uNTggMC0uODcuMzktLjg3IDEuMTZ2Ljc1aDEuMzR2MS4yN2gtMS4zNHY1LjZoLTEuNjF2LTUuNmgtLjkydi0xLjJsLjkyLS4wN3YtLjcyYzAtLjM1LjA0LS42OC4xMy0uOTguMDgtLjMxLjIxLS41Ny40LS43OXMuNDItLjM5LjcxLS41MWMuMjgtLjEyLjYzLS4xOCAxLjA0LS4xOC4yNCAwIC40OC4wMi42OS4wNy4yMi4wNS40MS4xLjU3LjE3em0uNDggNS4xOGMwLS41Ny4wOS0xLjA4LjI3LTEuNTMuMTctLjQ0LjQxLS44Mi43Mi0xLjEzLjMtLjMxLjY1LS41NCAxLjA0LS43MS4zOS0uMTYuOC0uMjQgMS4yMy0uMjRzLjg0LjA4IDEuMjQuMjRjLjQuMTcuNzQuNCAxLjA0Ljcxcy41NC42OS43MiAxLjEzYy4xOS40NS4yOC45Ni4yOCAxLjUzcy0uMDkgMS4wOC0uMjggMS41M2MtLjE4LjQ0LS40Mi44Mi0uNzIgMS4xM3MtLjY0LjU0LTEuMDQuNy0uODEuMjQtMS4yNC4yNC0uODQtLjA4LTEuMjMtLjI0LS43NC0uMzktMS4wNC0uN2MtLjMxLS4zMS0uNTUtLjY5LS43Mi0xLjEzLS4xOC0uNDUtLjI3LS45Ni0uMjctMS41M3ptMS42NSAwYzAgLjY5LjE0IDEuMjQuNDMgMS42Ni4yOC40MS42OC42MiAxLjE4LjYyLjUxIDAgLjktLjIxIDEuMTktLjYyLjI5LS40Mi40NC0uOTcuNDQtMS42NiAwLS43LS4xNS0xLjI2LS40NC0xLjY3LS4yOS0uNDItLjY4LS42My0xLjE5LS42My0uNSAwLS45LjIxLTEuMTguNjMtLjI5LjQxLS40My45Ny0uNDMgMS42N3ptNi40OC0zLjQ0aDEuMzNsLjEyIDEuMjFoLjA1Yy4yNC0uNDQuNTQtLjc5Ljg4LTEuMDIuMzUtLjI0LjctLjM2IDEuMDctLjM2LjMyIDAgLjU5LjA1Ljc4LjE0bC0uMjggMS40LS4zMy0uMDljLS4xMS0uMDEtLjIzLS4wMi0uMzgtLjAyLS4yNyAwLS41Ni4xLS44Ni4zMXMtLjU1LjU4LS43NyAxLjF2NC4yaC0xLjYxem0tNDcuODcgMTVoMS42MXY0LjFjMCAuNTcuMDguOTcuMjUgMS4yLjE3LjI0LjQ0LjM1LjgxLjM1LjMgMCAuNTctLjA3LjgtLjIyLjIyLS4xNS40Ny0uMzkuNzMtLjczdi00LjdoMS42MXY2Ljg3aC0xLjMybC0uMTItMS4wMWgtLjA0Yy0uMy4zNi0uNjMuNjQtLjk4Ljg2LS4zNS4yMS0uNzYuMzItMS4yNC4zMi0uNzMgMC0xLjI3LS4yNC0xLjYxLS43MS0uMzMtLjQ3LS41LTEuMTQtLjUtMi4wMnptOS40NiA3LjQzdjIuMTZoLTEuNjF2LTkuNTloMS4zM2wuMTIuNzJoLjA1Yy4yOS0uMjQuNjEtLjQ1Ljk3LS42My4zNS0uMTcuNzItLjI2IDEuMS0uMjYuNDMgMCAuODEuMDggMS4xNS4yNC4zMy4xNy42MS40Ljg0LjcxLjI0LjMxLjQxLjY4LjUzIDEuMTEuMTMuNDIuMTkuOTEuMTkgMS40NCAwIC41OS0uMDkgMS4xMS0uMjUgMS41Ny0uMTYuNDctLjM4Ljg1LS42NSAxLjE2LS4yNy4zMi0uNTguNTYtLjk0LjczLS4zNS4xNi0uNzIuMjUtMS4xLjI1LS4zIDAtLjYtLjA3LS45LS4ycy0uNTktLjMxLS44Ny0uNTZ6bTAtMi4zYy4yNi4yMi41LjM3LjczLjQ1LjI0LjA5LjQ2LjEzLjY2LjEzLjQ2IDAgLjg0LS4yIDEuMTUtLjYuMzEtLjM5LjQ2LS45OC40Ni0xLjc3IDAtLjY5LS4xMi0xLjIyLS4zNS0xLjYxLS4yMy0uMzgtLjYxLS41Ny0xLjEzLS41Ny0uNDkgMC0uOTkuMjYtMS41Mi43N3ptNS44Ny0xLjY5YzAtLjU2LjA4LTEuMDYuMjUtMS41MS4xNi0uNDUuMzctLjgzLjY1LTEuMTQuMjctLjMuNTgtLjU0LjkzLS43MXMuNzEtLjI1IDEuMDgtLjI1Yy4zOSAwIC43My4wNyAxIC4yLjI3LjE0LjU0LjMyLjgxLjU1bC0uMDYtMS4xdi0yLjQ5aDEuNjF2OS44OGgtMS4zM2wtLjExLS43NGgtLjA2Yy0uMjUuMjUtLjU0LjQ2LS44OC42NC0uMzMuMTgtLjY5LjI3LTEuMDYuMjctLjg3IDAtMS41Ni0uMzItMi4wNy0uOTVzLS43Ni0xLjUxLS43Ni0yLjY1em0xLjY3LS4wMWMwIC43NC4xMyAxLjMxLjQgMS43LjI2LjM4LjY1LjU4IDEuMTUuNTguNTEgMCAuOTktLjI2IDEuNDQtLjc3di0zLjIxYy0uMjQtLjIxLS40OC0uMzYtLjctLjQ1LS4yMy0uMDgtLjQ2LS4xMi0uNy0uMTItLjQ1IDAtLjgyLjE5LTEuMTMuNTktLjMxLjM5LS40Ni45NS0uNDYgMS42OHptNi4zNSAxLjU5YzAtLjczLjMyLTEuMy45Ny0xLjcxLjY0LS40IDEuNjctLjY4IDMuMDgtLjg0IDAtLjE3LS4wMi0uMzQtLjA3LS41MS0uMDUtLjE2LS4xMi0uMy0uMjItLjQzcy0uMjItLjIyLS4zOC0uM2MtLjE1LS4wNi0uMzQtLjEtLjU4LS4xLS4zNCAwLS42OC4wNy0xIC4ycy0uNjMuMjktLjkzLjQ3bC0uNTktMS4wOGMuMzktLjI0LjgxLS40NSAxLjI4LS42My40Ny0uMTcuOTktLjI2IDEuNTQtLjI2Ljg2IDAgMS41MS4yNSAxLjkzLjc2cy42MyAxLjI1LjYzIDIuMjF2NC4wN2gtMS4zMmwtLjEyLS43NmgtLjA1Yy0uMy4yNy0uNjMuNDgtLjk4LjY2cy0uNzMuMjctMS4xNC4yN2MtLjYxIDAtMS4xLS4xOS0xLjQ4LS41Ni0uMzgtLjM2LS41Ny0uODUtLjU3LTEuNDZ6bTEuNTctLjEyYzAgLjMuMDkuNTMuMjcuNjcuMTkuMTQuNDIuMjEuNzEuMjEuMjggMCAuNTQtLjA3Ljc3LS4ycy40OC0uMzEuNzMtLjU2di0xLjU0Yy0uNDcuMDYtLjg2LjEzLTEuMTguMjMtLjMxLjA5LS41Ny4xOS0uNzYuMzFzLS4zMy4yNS0uNDEuNGMtLjA5LjE1LS4xMy4zMS0uMTMuNDh6bTYuMjktMy42M2gtLjk4di0xLjJsMS4wNi0uMDcuMi0xLjg4aDEuMzR2MS44OGgxLjc1djEuMjdoLTEuNzV2My4yOGMwIC44LjMyIDEuMi45NyAxLjIuMTIgMCAuMjQtLjAxLjM3LS4wNC4xMi0uMDMuMjQtLjA3LjM0LS4xMWwuMjggMS4xOWMtLjE5LjA2LS40LjEyLS42NC4xNy0uMjMuMDUtLjQ5LjA4LS43Ni4wOC0uNCAwLS43NC0uMDYtMS4wMi0uMTgtLjI3LS4xMy0uNDktLjMtLjY3LS41Mi0uMTctLjIxLS4zLS40OC0uMzctLjc4LS4wOC0uMy0uMTItLjY0LS4xMi0xLjAxem00LjM2IDIuMTdjMC0uNTYuMDktMS4wNi4yNy0xLjUxcy40MS0uODMuNzEtMS4xNGMuMjktLjMuNjMtLjU0IDEuMDEtLjcxLjM5LS4xNy43OC0uMjUgMS4xOC0uMjUuNDcgMCAuODguMDggMS4yMy4yNC4zNi4xNi42NS4zOC44OS42N3MuNDIuNjMuNTQgMS4wM2MuMTIuNDEuMTguODQuMTggMS4zMiAwIC4zMi0uMDIuNTctLjA3Ljc2aC00LjM3Yy4wOC42Mi4yOSAxLjEuNjUgMS40NC4zNi4zMy44Mi41IDEuMzguNS4zIDAgLjU4LS4wNC44NC0uMTMuMjUtLjA5LjUxLS4yMS43Ni0uMzdsLjU0IDEuMDFjLS4zMi4yMS0uNjkuMzktMS4wOS41M3MtLjgyLjIxLTEuMjYuMjFjLS40NyAwLS45Mi0uMDgtMS4zMy0uMjUtLjQxLS4xNi0uNzctLjQtMS4wOC0uNy0uMy0uMzEtLjU0LS42OS0uNzItMS4xMy0uMTctLjQ0LS4yNi0uOTUtLjI2LTEuNTJ6bTQuNjEtLjYyYzAtLjU1LS4xMS0uOTgtLjM0LTEuMjgtLjIzLS4zMS0uNTgtLjQ3LTEuMDYtLjQ3LS40MSAwLS43Ny4xNS0xLjA4LjQ1LS4zMS4yOS0uNS43My0uNTcgMS4zem0zLjAxIDIuMjNjLjMxLjI0LjYxLjQzLjkyLjU3LjMuMTMuNjMuMi45OC4yLjM4IDAgLjY1LS4wOC44My0uMjNzLjI3LS4zNS4yNy0uNmMwLS4xNC0uMDUtLjI2LS4xMy0uMzctLjA4LS4xLS4yLS4yLS4zNC0uMjgtLjE0LS4wOS0uMjktLjE2LS40Ny0uMjNsLS41My0uMjJjLS4yMy0uMDktLjQ2LS4xOC0uNjktLjMtLjIzLS4xMS0uNDQtLjI0LS42Mi0uNHMtLjMzLS4zNS0uNDUtLjU1Yy0uMTItLjIxLS4xOC0uNDYtLjE4LS43NSAwLS42MS4yMy0xLjEuNjgtMS40OS40NC0uMzggMS4wNi0uNTcgMS44My0uNTcuNDggMCAuOTEuMDggMS4yOS4yNXMuNzEuMzYuOTkuNTdsLS43NC45OGMtLjI0LS4xNy0uNDktLjMyLS43My0uNDItLjI1LS4xMS0uNTEtLjE2LS43OC0uMTYtLjM1IDAtLjYuMDctLjc2LjIxLS4xNy4xNS0uMjUuMzMtLjI1LjU0IDAgLjE0LjA0LjI2LjEyLjM2cy4xOC4xOC4zMS4yNmMuMTQuMDcuMjkuMTQuNDYuMjFsLjU0LjE5Yy4yMy4wOS40Ny4xOC43LjI5cy40NC4yNC42NC40Yy4xOS4xNi4zNC4zNS40Ni41OC4xMS4yMy4xNy41LjE3LjgyIDAgLjMtLjA2LjU4LS4xNy44My0uMTIuMjYtLjI5LjQ4LS41MS42OC0uMjMuMTktLjUxLjM0LS44NC40NS0uMzQuMTEtLjcyLjE3LTEuMTUuMTctLjQ4IDAtLjk1LS4wOS0xLjQxLS4yNy0uNDYtLjE5LS44Ni0uNDEtMS4yLS42OHoiIGZpbGw9IiM1MzUzNTMiLz48L2c+PC9zdmc+\"><\/a><\/p>\n<div>\n<h3 id=\"citeas\">Cite this article<\/h3>\n<p>Sinitcyn, P., Richards, A.L., Weatheritt, R.J. <i>et al.<\/i> Global detection of human variants and isoforms by deep proteome sequencing.<br \/>\n                    <i>Nat Biotechnol<\/i>  (2023). https:\/\/doi.org\/10.1038\/s41587-023-01714-x<\/p>\n<p><a data-test=\"citation-link\" data-track=\"click\" data-track-action=\"download article citation\" data-track-label=\"link\" data-track-external rel=\"nofollow\" href=\"https:\/\/citation-needed.springer.com\/v2\/references\/10.1038\/s41587-023-01714-x?format=refman&#038;flavour=citation\">Download citation<\/a><\/p>\n<ul data-test=\"publication-history\">\n<li>\n<p>Received<span>: <\/span><span><time datetime=\"2022-08-26\">26 August 2022<\/time><\/span><\/p>\n<\/li>\n<li>\n<p>Accepted<span>: <\/span><span><time datetime=\"2023-02-15\">15 February 2023<\/time><\/span><\/p>\n<\/li>\n<li>\n<p>Published<span>: <\/span><span><time datetime=\"2023-03-23\">23 March 2023<\/time><\/span><\/p>\n<\/li>\n<li>\n<p><abbr title=\"Digital Object Identifier\">DOI<\/abbr><span>: <\/span><span>https:\/\/doi.org\/10.1038\/s41587-023-01714-x<\/span><\/p>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div><\/div>\n<p><a href=\"https:\/\/www.nature.com\/articles\/s41587-023-01714-x\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><br \/>\n Pavel Sinitcyn<\/p>\n","protected":false},"excerpt":{"rendered":"<p>MainNear-complete proteomes of simple organisms can be detected by mass spectrometry (MS) following only 1\u2009h of analysis1,2. For more complex organisms, it is possible to monitor over 10,000 proteins within a day (refs. 3,4,5,6,7). Community-based maps of the human proteome, assembled using extensive data from various tissues and cell types from laboratories across the world<\/p>\n","protected":false},"author":1,"featured_media":621339,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[30811,1424,536],"tags":[],"class_list":{"0":"post-621338","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-detection","8":"category-global","9":"category-science-nature"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/621338","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=621338"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/621338\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/621339"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=621338"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=621338"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=621338"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}