{"id":641428,"date":"2023-04-25T15:05:38","date_gmt":"2023-04-25T20:05:38","guid":{"rendered":"https:\/\/news.sellorbuyhomefast.com\/index.php\/2023\/04\/25\/efficient-evolution-of-human-antibodies-from-general-protein-language-models\/"},"modified":"2023-04-25T15:05:38","modified_gmt":"2023-04-25T20:05:38","slug":"efficient-evolution-of-human-antibodies-from-general-protein-language-models","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2023\/04\/25\/efficient-evolution-of-human-antibodies-from-general-protein-language-models\/","title":{"rendered":"Efficient evolution of human antibodies from general protein language models"},"content":{"rendered":"<p>Science &#038; Nature <\/p>\n<div>\n<div id=\"Sec1-section\" data-title=\"Main\">\n<h2 id=\"Sec1\">Main<\/h2>\n<div id=\"Sec1-content\">\n<p>Evolution searches across an immense space of possible sequences for rare mutations that improve fitness<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\" title=\"Futuyma, D. J. Evolutionary Biology 3rd ed (Sinauer Associates, 1997).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR1\" id=\"ref-link-section-d117614227e475\">1<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\" title=\"Wright, S. The roles of mutation, inbreeding, crossbreeding and selection in evolution. Proc. of the VI International Congress of Genetics 355\u2013366 (Blackwell, 1932).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR2\" id=\"ref-link-section-d117614227e478\">2<\/a><\/sup>. In nature, this search is based on simple processes of random mutation and recombination<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\" title=\"Futuyma, D. J. Evolutionary Biology 3rd ed (Sinauer Associates, 1997).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR1\" id=\"ref-link-section-d117614227e482\">1<\/a><\/sup>, but using the same approach for directed evolution of proteins in the laboratory<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Arnold, F. H. Directed evolution: bringing new chemistry to life. Angew. Chem. Int. Ed. Engl. 57, 4143\u20134148 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR3\" id=\"ref-link-section-d117614227e486\">3<\/a><\/sup> imposes a considerable experimental burden. Artificial evolution based on random guessing or brute force search typically devotes substantial effort to interrogate weakly active or non-functional proteins, requiring high experimental throughput to identify variants with improved fitness<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\" title=\"Fowler, D. M. &#038; Fields, S. Deep mutational scanning: a new style of protein science. Nat. Methods 11, 801\u2013807 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR4\" id=\"ref-link-section-d117614227e490\">4<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\" title=\"Hunter, S. A. &#038; Cochran, J. R. Cell-binding assays for determining the affinity of protein\u2013protein interactions. Methods Enzymol. 580, 21\u201344 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR5\" id=\"ref-link-section-d117614227e493\">5<\/a><\/sup>.<\/p>\n<p>Although evolutionary fitness is determined, in part, by specific selection pressures, there are also properties that apply more generally across a protein family or are prerequisites for fitness and function across most proteins; for example, some mutations maintain or improve stability or evolvability<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\" title=\"Khersonsky, O. &#038; Tawfik, D. S. Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu. Rev. Biochem. 79, 471\u2013505 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR6\" id=\"ref-link-section-d117614227e500\">6<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Bloom, J. D., Labthavikul, S. T., Otey, C. R. &#038; Arnold, F. H. Protein stability promotes evolvability. Proc. Natl Acad. Sci. USA 103, 5869\u20135874 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR7\" id=\"ref-link-section-d117614227e503\">7<\/a><\/sup>, whereas others are structurally destabilizing<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Bloom, J. D., Labthavikul, S. T., Otey, C. R. &#038; Arnold, F. H. Protein stability promotes evolvability. Proc. Natl Acad. Sci. USA 103, 5869\u20135874 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR7\" id=\"ref-link-section-d117614227e507\">7<\/a><\/sup> or induce incompetent, misfolded states<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Markin, C. J. et al. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. Science 373, eabf8761 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR8\" id=\"ref-link-section-d117614227e511\">8<\/a><\/sup>. One approach to improving the efficiency of evolution is to ensure that mutations adhere to these general properties, which we refer to as evolutionary plausibility. Identifying plausible mutations could help guide evolution away from invalid regimes<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"00 title=\"Wittmann, B. J., Yue, Y. &#038; Arnold, F. H. Informed training set design enables efficient machine learning-assisted directed protein evolution. Cell Syst. 12, 1026\u20131045 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR9\" id=\"ref-link-section-d117614227e515\">9<\/a><\/sup>, thereby indirectly improving evolutionary efficiency without requiring any explicit knowledge of the function of interest. However, this strategy is also challenging because, first, protein sequences are governed by complex rules, and, second, even if we restrict search to evolutionarily plausible mutations, those that also improve a specific definition of fitness might still be rare beyond practical utility (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig1\">1a<\/a>). More broadly, a major open question<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"11 title=\"Hie, B. L., Yang, K. K. &#038; Kim, P. S. Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins. Cell Syst. 13, 274\u2013285 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR10\" id=\"ref-link-section-d117614227e523\">10<\/a><\/sup> is whether general evolutionary information (for example, learning patterns from sequence variation across past evolution) is sufficient to enable efficient evolution under specific selection pressures (for example, higher binding affinity to a specific antigen).<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-1\" data-title=\"Guiding evolution with protein language models.\">\n<figure><figcaption><b id=\"Fig1\" data-test=\"figure-caption-text\">Fig. 1: Guiding evolution with protein language models.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/1\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig1\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig1_HTML.png\" alt=\"Science &amp; Nature figure 1\" loading=\"lazy\" width=\"685\" height=\"206\"><\/picture><\/a><\/div>\n<p><b>a<\/b>,<b>b<\/b>, Two possible models for relating the space of mutations with high evolutionary plausibility (for example, mutations seen in antibodies) to the space with high fitness under specific selection pressures (for example, mutations that result in high binding affinity to a specific antigen). Both models assume that mutations with high fitness make up a rare subset of the full mutational space and that, in general, high-fitness mutations are also evolutionarily plausible. Under the first model (<b>a<\/b>), mutations with high fitness are rare within the subset of mutations that are evolutionarily plausible. Under the second model (<b>b<\/b>), when restricted to the regime of plausible mutations, improvements to fitness become much more common. <b>c<\/b>, Protein language models, trained on millions of natural protein sequences learn amino acid patterns that are likely to be seen in nature. We hypothesized that most mutations with high language model likelihood would also be evolutionarily plausible. Assuming that this is true, and if the second model (<b>b<\/b>) better describes nature, then a language model with no information about specific selection pressures can still efficiently guide evolution.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/1\" data-track-dest=\"link:Figure1 Full size image\" aria-label=\"Reference 2\"22 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>Here we show that evolutionary information alone can lead to improved fitness under specific selection pressures with high efficiency (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig1\">1b<\/a>). For our main experimental test case, we focused on affinity maturation of human antibodies in which our specific selection pressure is defined as stronger binding affinity to a particular antigen. In nature, a process known as somatic hypermutation evolves or \u2018matures\u2019 an antibody lineage to have higher affinity for an antigen via repeated mutagenesis<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Eisen, H. N. &#038; Siskind, G. W. Variations in affinities of antibodies during the immune response. Biochemistry 3, 996\u2013100 (1964).\" href=\"http:\/\/www.nature.com\/#ref-CR11\" id=\"ref-link-section-d117614227e570\">11<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Eisen, H. N. Affinity enhancement of antibodies: how low-affinity antibodies produced early in immune responses are followed by high-affinity antibodies later and in memory B-cell responses. Cancer Immunol. Res. 2, 381\u2013392 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR12\" id=\"ref-link-section-d117614227e570_1\">12<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"33 title=\"Victora, G. D. &#038; Nussenzweig, M. C. Germinal centers. Annu. Rev. Immunol. 40, 413\u2013442 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR13\" id=\"ref-link-section-d117614227e573\">13<\/a><\/sup>. In the laboratory, affinity maturation is a major application of directed evolution due to the therapeutic potential of antibodies with high affinity for disease targets<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"44 title=\"Wellner, A. et al. Rapid generation of potent antibodies by autonomous hypermutation in yeast. Nat. Chem. Biol. 17, 1057\u20131064 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR14\" id=\"ref-link-section-d117614227e577\">14<\/a><\/sup>.<\/p>\n<p>To select evolutionarily plausible mutations, we used algorithms known as language models (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig1\">1c<\/a>) to learn patterns that are likely to occur in natural proteins<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Bepler, T. &#038; Berger, B. Learning the protein language: evolution, structure and function. Cell Syst. 12, 654\u2013669 (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR15\" id=\"ref-link-section-d117614227e587\">15<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Bepler, T. &#038; Berger, B. Learning protein sequence embeddings using information from structure. International Conference on Learning Representations. Preprint at arXiv \n                https:\/\/doi.org\/10.48550\/arXiv.1902.08661\n                \n               (2019).\" href=\"http:\/\/www.nature.com\/#ref-CR16\" id=\"ref-link-section-d117614227e587_1\">16<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Hie, B., Zhong, E., Berger, B. &#038; Bryson, B. Learning the language of viral evolution and escape. Science 371, 284\u2013288 (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR17\" id=\"ref-link-section-d117614227e587_2\">17<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Alley, E. C., Khimulya, G., Biswas, S., AlQuraishi, M. &#038; Church, G. M. Unified rational protein engineering with sequence-based deep representation learning. Nat. Methods 16, 1315\u20131322 (2019).\" href=\"http:\/\/www.nature.com\/#ref-CR18\" id=\"ref-link-section-d117614227e587_3\">18<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl Acad. Sci. USA 118, e2016239118 (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR19\" id=\"ref-link-section-d117614227e587_4\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/#ref-CR20\" id=\"ref-link-section-d117614227e587_5\">20<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Elnaggar, A. et al. ProtTrans: towards cracking the language of life\u2019s code through self-supervised deep learning and high performance computing. IEEE Trans. Pattern Anal. Mach. Intell. 44, 7112\u20137127 (2022).\" href=\"http:\/\/www.nature.com\/#ref-CR21\" id=\"ref-link-section-d117614227e587_6\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"55 title=\"Nijkamp, E., Ruffolo, J., Weinstein, E. N., Naik, N. &#038; Madani, A. ProGen2: exploring the boundaries of protein language models. Preprint at arXiv \n                https:\/\/doi.org\/10.48550\/arXiv.2206.13517\n                \n               (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR22\" id=\"ref-link-section-d117614227e590\">22<\/a><\/sup>. Because we used general language models<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"66 title=\"Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl Acad. Sci. USA 118, e2016239118 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR19\" id=\"ref-link-section-d117614227e594\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"77 title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR20\" id=\"ref-link-section-d117614227e597\">20<\/a><\/sup>, trained on non-redundant sequence datasets that are meant to represent variation across all natural proteins<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"88 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e601\">23<\/a><\/sup>, these models can only learn more general evolutionary rules than could a model trained specifically on antibody sequences<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Olsen, T. H., Moal, I. H. &#038; Deane, C. M. AbLang: an antibody language model for completing antibody sequences. Bioinform. Adv. 2, vbac046 (2022).\" href=\"http:\/\/www.nature.com\/#ref-CR24\" id=\"ref-link-section-d117614227e605\">24<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Prihoda, D. et al. BioPhi: a platform for antibody design, humanization, and humanness evaluation based on natural antibody repertoires and deep learning. mAbs 14, 2020203 (2022).\" href=\"http:\/\/www.nature.com\/#ref-CR25\" id=\"ref-link-section-d117614227e605_1\">25<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Ruffolo, J. A., Gray, J. J. &#038; Sulam J. Deciphering antibody affinity maturation with language models and weakly supervised learning. NeurIPS Workshop on Machine Learning in Structural Biology. Preprint at arXiv \n                https:\/\/doi.org\/10.48550\/arXiv.2112.07782\n                \n               (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR26\" id=\"ref-link-section-d117614227e605_2\">26<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"99 title=\"Shuai, R. W., Ruffolo, J. A. &#038; Gray, J. J. Generative language modeling for antibody design. Preprint at bioRxiv \n                https:\/\/doi.org\/10.1101\/2021.12.13.472419\n                \n               (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR27\" id=\"ref-link-section-d117614227e608\">27<\/a><\/sup> or a model directly supervised with binding affinity<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"00 title=\"Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600\u2013612 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR28\" id=\"ref-link-section-d117614227e613\">28<\/a><\/sup>. Given a single starting sequence, we used these language models to recommend plausible amino acid substitutions that we then experimentally screened for improved fitness. To the end user, the algorithm requires only a single wild-type sequence, without any initial binding affinity data, knowledge of the antigen, task-specific supervision, evolutionary homologs or protein structure information.<\/p>\n<p>We evolved seven human immunoglobulin G (IgG) antibodies that bind to antigens from coronavirus, ebolavirus and influenza A virus. We focused on viral antigens given the importance of antibody therapeutics for viral diseases<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR29\" id=\"ref-link-section-d117614227e621\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR30\" id=\"ref-link-section-d117614227e621_1\">30<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Pinto, D. et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature 583, 290\u2013295 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR31\" id=\"ref-link-section-d117614227e621_2\">31<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"11 title=\"Hansen, J. et al. Studies in humanized mice and convalescent humans yield a SARS-CoV-2 antibody cocktail. Science 369, 1010\u20131014 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR32\" id=\"ref-link-section-d117614227e624\">32<\/a><\/sup>. We improved the affinity of all antibodies after measuring only 20 or fewer new variants of each antibody across just two rounds of evolution, which, to our knowledge, represents unprecedented efficiency for machine-learning-guided evolution<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"22 title=\"Yang, K. K., Wu, Z. &#038; Arnold, F. H. Machine-learning-guided directed evolution for protein engineering. Nat. Methods 16, 687\u2013694 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR33\" id=\"ref-link-section-d117614227e628\">33<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"33 title=\"Hie, B. L. &#038; Yang, K. K. Adaptive machine learning for protein engineering. Curr. Opin. Struct .Biol. 72, 145\u2013152 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR34\" id=\"ref-link-section-d117614227e631\">34<\/a><\/sup>. We also demonstrate that the <i>same<\/i> general protein language models that we used to affinity mature antibodies can also enrich for high-fitness substitutions to diverse proteins beyond antibodies.<\/p>\n<\/div>\n<\/div>\n<div id=\"Sec2-section\" data-title=\"Results\">\n<h2 id=\"Sec2\">Results<\/h2>\n<div id=\"Sec2-content\">\n<h3 id=\"Sec3\">Efficient affinity maturation with protein language models<\/h3>\n<p>Recent work has demonstrated that language models can predict natural evolution despite having no knowledge of specific selection pressures<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"44 title=\"Hie, B. L., Yang, K. K. &#038; Kim, P. S. Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins. Cell Syst. 13, 274\u2013285 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR10\" id=\"ref-link-section-d117614227e650\">10<\/a><\/sup>. However, this prior work only predicted the direction of evolution retrospectively when given full knowledge of the evolutionary trajectory. We hypothesized that the predictive capabilities of protein language models might enable a researcher to provide only a single, wild-type antibody sequence to the algorithm and receive a small, manageable set (~10<sup>1<\/sup>) of high-likelihood variants to experimentally measure for desirable properties. This is a very general setting that does not assume knowledge of protein structure or task-specific training data. A major question, however, is if higher evolutionary likelihood would efficiently translate to higher fitness.<\/p>\n<p>We tested our hypothesis by conducting evolutionary campaigns, guided by language model likelihood, to affinity mature seven antibodies representing diverse antigens and degrees of maturity (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">1<\/a>):<\/p>\n<ul>\n<li>\n<p>MEDI8852: a broadly neutralizing antibody (bnAb) that binds influenza A hemagglutinin (HA) across variants of both major phylogenetic groups (group 1 and group 2) and that reached phase 2 clinical trials; this antibody is highly matured, with its parent being isolated from a human, followed by substantial artificial evolution<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"55 title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR29\" id=\"ref-link-section-d117614227e668\">29<\/a><\/sup><\/p>\n<\/li>\n<li>\n<p>MEDI8852 unmutated common ancestor (UCA): the unmatured, inferred germline sequence of MEDI8852, which only neutralizes viruses with group 1 HAs<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"66 title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR29\" id=\"ref-link-section-d117614227e677\">29<\/a><\/sup><\/p>\n<\/li>\n<li>\n<p>mAb114: a patient-derived antibody that neutralizes ebolavirus by binding to its glycoprotein (GP)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"77 title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR30\" id=\"ref-link-section-d117614227e686\">30<\/a><\/sup> and has been approved for clinical use by the US Food and Drug Administration (FDA)<\/p>\n<\/li>\n<li>\n<p>mAb114 UCA: the unmatured, inferred germline sequence of mAb114 with weak binding to ebolavirus GP<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"88 title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR30\" id=\"ref-link-section-d117614227e696\">30<\/a><\/sup><\/p>\n<\/li>\n<li>\n<p>S309: a patient-derived antibody that cross-neutralizes the sarbecoviruses severe acute respiratory syndrome coronavirus 1 (SARS-CoV-1) and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by binding to the spike glycoprotein (Spike)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\"99 title=\"Pinto, D. et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature 583, 290\u2013295 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR31\" id=\"ref-link-section-d117614227e705\">31<\/a><\/sup> and is the parent antibody of sotrovimab<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"00 title=\"Alexander, E. et al. Antibody therapies for SARS-CoV-2 infection. WO2021252878A1 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR35\" id=\"ref-link-section-d117614227e709\">35<\/a><\/sup>, which had an FDA emergency use authorization (EUA) for treatment of Coronavirus Disease 2019 (COVID-19) caused by earlier variants of SARS-CoV-2 (refs. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"11 title=\"Telenti, A., Hodcroft, E. B. &#038; Robertson, D. L. The evolution and biology of SARS-CoV-2 variants. Cold Spring Harb. Perspect. Med. 12, a041390 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR36\" id=\"ref-link-section-d117614227e713\">36<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"22 title=\"Maher, M. C. et al. Predicting the mutational drivers of future SARS-CoV-2 variants of concern. Sci. Transl. Med. 14, eabk3445 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR37\" id=\"ref-link-section-d117614227e716\">37<\/a><\/sup>)<\/p>\n<\/li>\n<li>\n<p>REGN10987: a patient-derived antibody that binds early variants of SARS-CoV-2 Spike<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"33 title=\"Hansen, J. et al. Studies in humanized mice and convalescent humans yield a SARS-CoV-2 antibody cocktail. Science 369, 1010\u20131014 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR32\" id=\"ref-link-section-d117614227e727\">32<\/a><\/sup> and that had an FDA EUA for use against these variants<\/p>\n<\/li>\n<li>\n<p>C143: an unmatured, patient-derived antibody that binds the SARS-CoV-2 Wuhan-Hu-1 Spike but was isolated before extensive in vivo somatic hypermutation<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"44 title=\"Gaebler, C. et al. Evolution of antibody immunity to SARS-CoV-2. Nature 591, 639\u2013644 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR38\" id=\"ref-link-section-d117614227e737\">38<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"55 title=\"Muecksch, F. et al. Affinity maturation of SARS-CoV-2 neutralizing antibodies confers potency, breadth, and resilience to viral escape mutations. Immunity 54, 1853\u20131868 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR39\" id=\"ref-link-section-d117614227e740\">39<\/a><\/sup><\/p>\n<\/li>\n<\/ul>\n<p>We performed evolution with the ESM-1b language model and the ESM-1v ensemble of five language models (six language models in total)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"66 title=\"Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl Acad. Sci. USA 118, e2016239118 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR19\" id=\"ref-link-section-d117614227e748\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"77 title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR20\" id=\"ref-link-section-d117614227e751\">20<\/a><\/sup>. ESM-1b and ESM-1v were trained on UniRef50 and UniRef90, respectively, which are protein sequence datasets that represent variation across millions of observed natural proteins (UniRef90 contains ~98 million total sequences) and that include only a few thousand antibody-related sequences<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\"88 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e755\">23<\/a><\/sup>. These datasets are also constructed such that no two sequences have more than 50% (UniRef50) or 90% (UniRef90) sequence similarity with each other to avoid biological redundancy. Additionally, both datasets precede the discovery of the SARS-CoV-2 antibodies considered in the study as well as the evolution of all SARS-CoV-2 variants of concern. Therefore, to evolve these antibodies, the language models cannot use disease-specific biases in the training data and must, instead, learn more general evolutionary patterns.<\/p>\n<p>We used these language models to compute likelihoods of all single-residue substitutions to the antibody variable regions of either the heavy chain (VH) or the light chain (VL). We selected substitutions with higher evolutionary likelihood than wild-type across a consensus of six language models (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig5\">1<\/a>). In the first round of evolution, we measured the antigen interaction strength by biolayer interferometry (BLI) of variants that contain only a single-residue substitution from wild-type. In the second round, we measured variants containing combinations of substitutions, where we selected substitutions that corresponded to preserved or improved binding based on the results of the first round. We performed these two rounds for all seven antibodies, measuring 8\u201314 variants per antibody in round one and 1\u201311 variants per antibody in round two (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig2\">2<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">1<\/a>). Variants of the clinically relevant antibodies, which have very low or undetectable dissociation as IgGs, were screened by measuring the dissociation constant (<i>K<\/i><sub>d<\/sub>) of the monovalent fragment antigen-binding (Fab) region; variants of the unmatured antibodies were screened by measuring the apparent <i>K<\/i><sub>d<\/sub> of the bivalent IgG followed by also measuring the <i>K<\/i><sub>d<\/sub> values of the Fab fragments of the highest-avidity variants (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>).<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-2\" data-title=\"Language-model-guided affinity maturation of seven human antibodies.\">\n<figure><figcaption><b id=\"Fig2\" data-test=\"figure-caption-text\">Fig. 2: Language-model-guided affinity maturation of seven human antibodies.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/2\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig2\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig2_HTML.png\" alt=\"Science &amp; Nature figure 2\" loading=\"lazy\" width=\"685\" height=\"874\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, Strip plots visualizing the two rounds of directed evolution conducted for each antibody. Each point represents an IgG or Fab variant plotted according to the fold change in <i>K<\/i><sub>d<\/sub> from wild-type on the <i>y<\/i> axis and jitter on the <i>x<\/i> axis; a gray, dashed line is drawn at a fold change of 1, and the wild-type point is colored gray. MEDI8852 variants were screened against HA H4 Hubei; MEDI8852 UCA variants against HA H1 Solomon; mAb114 and mAb114 UCA variants against ebolavirus GP; S309 variants against Wuhan-Hu-1 S-6P; and REGN10987 and C143 variants against Beta S-6P. <b>b<\/b>, Phylogenetic trees illustrating the evolutionary trajectories from wild-type to the highest-affinity variant(s) of each antibody. Nodes are annotated with the <i>K<\/i><sub>d<\/sub> values for different antigens and the <i>T<\/i><sub>m<\/sub> of the Fab; all <i>K<\/i><sub>d<\/sub> values are for the monovalent Fab versions except those of C143, which are apparent <i>K<\/i><sub>d<\/sub> values for the bivalent IgGs. B, Beta; H1 Solo., H1 Solomon; ML variant, machine-learning-guided variant; O, Omicron; W1, Wuhan-Hu-1. <b>c<\/b>, We obtained avidity and affinity measurements via BLI of IgGs and Fabs at the indicated concentrations binding to the indicated antigen. Selected BLI traces of the highest-affinity variants for the respective antigens are plotted alongside those of the wild-type variants.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/2\" data-track-dest=\"link:Figure2 Full size image\" aria-label=\"Reference 3\"99 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>We could successfully express all but one of 122 variants across our seven evolutionary trajectories. Across all seven antibodies, we found that 71\u2013100% of the first-round Fab variants (containing a single-residue substitution) retained sub-micromolar binding to the antigen, and 14\u201371% percent of first-round variants led to improved binding affinity (defined as a 1.1-fold or higher improvement in <i>K<\/i><sub>d<\/sub> compared to wild-type) (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">1<\/a>). Most of the second-round variants (containing a combination of substitutions) also have improved binding (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">1<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>). For all antibodies except for REGN10987, we also obtained variants with at least a two-fold improvement in <i>K<\/i><sub>d<\/sub>. Thirty-six out of all 76 language-model-recommended, single-residue substitutions (and 18 out of 32 substitutions that lead to improved affinity) occur in framework regions (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">2<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>), which are generally less mutated during conventional affinity maturation compared to the complementarity-determining regions (CDRs)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"00 title=\"Eisen, H. N. Affinity enhancement of antibodies: how low-affinity antibodies produced early in immune responses are followed by high-affinity antibodies later and in memory B-cell responses. Cancer Immunol. Res. 2, 381\u2013392 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR12\" id=\"ref-link-section-d117614227e874\">12<\/a><\/sup>.<\/p>\n<p>We were able to improve the binding affinities for all clinically relevant antibodies tested, despite these antibodies being already highly evolved (starting at low nanomolar or picomolar affinity). MEDI8852 is a potent binder with a sub-picomolar Fab <i>K<\/i><sub>d<\/sub> across many HAs and picomolar or nanomolar binding to HAs from subtypes H4 and H7. Although we explicitly screened variants using an HA H4 antigen, the best design also improves binding across a broad set of HAs (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">3<\/a>), including a sevenfold improvement (from 0.21\u2009nM to 0.03\u2009nM) for HA H7 HK17 (A\/Hong Kong\/125\/2017(H7N9)). The best variant of mAb114, a clinically approved drug, achieves a 3.4-fold improvement in Fab <i>K<\/i><sub>d<\/sub> for ebolavirus GP (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">5<\/a>). For REGN10987, the highest-affinity variant has a 1.3-fold improvement against Beta-variant Spike with six stabilizing proline substitutions (S-6P)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"11 title=\"Hsieh, C.-L. et al. Structure-based design of prefusion-stabilized SARS-CoV-2 spikes. Science 369, 1501\u20131505 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR40\" id=\"ref-link-section-d117614227e899\">40<\/a><\/sup> (the antigen used in screening), and another of our designs has a 5.1-fold improvement for the Omicron BA.1 receptor-binding domain (RBD) (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">8<\/a>). For S309, we compared our designs to wild-type and to a variant with the N55Q substitution in the VH introduced after a small-scale, rational evolutionary screen<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"22 title=\"Alexander, E. et al. Antibody therapies for SARS-CoV-2 infection. WO2021252878A1 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR35\" id=\"ref-link-section-d117614227e906\">35<\/a><\/sup>; the S309 Fab with the VH N55Q substitution forms the Fab of the therapeutic antibody sotrovimab. Our best variant of S309 has higher affinity than sotrovimab, including a 1.3-fold improvement in Fab <i>K<\/i><sub>d<\/sub> compared to wild-type S309 (versus 1.1-fold for sotrovimab) for SARS-CoV-2 Wuhan-Hu-1 S-6P (the antigen used in screening); a 1.7-fold improvement (versus 1.3-fold for sotrovimab) for Beta S-6P; and a 0.93-fold change (versus 0.82-fold for sotrovimab) for Omicron RBD (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">7<\/a>).<\/p>\n<p>We were also able to improve affinities for all three unmatured antibodies, often involving much higher fold changes than when evolving the matured antibodies, indicating easier evolvability with respect to affinity. For MEDI8852 UCA, the best Fab design achieves a 2.6-fold improvement in <i>K<\/i><sub>d<\/sub> against HA H1 Solomon (A\/Solomon Islands\/3\/2006(H1N1)), the antigen used in screening. Our best designs also acquire breadth of binding to some group 2 HAs, including a 23-fold improvement for HA H4 Hubei (A\/swine\/Hubei\/06\/2009(H4N1)) and a 5.4-fold improvement for HA H7 HK17 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">4<\/a>). For mAb114 UCA, our best Fab design achieves a 160-fold improvement in <i>K<\/i><sub>d<\/sub> for ebolavirus GP (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">6<\/a>). Although the algorithm recommends amino acid substitutions to both of these UCA antibodies that are also observed in the matured antibody, other affinity-enhancing substitutions to the UCA antibodies are not found in the matured versions: excluding any substitutions or modified sites found in the matured antibody, our UCA variants achieve up to a sevenfold improvement for HA H4 Hubei (variant VH P75R\/VL G95P; Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">4<\/a>) and a 33-fold improvement for ebolavirus GP (variant VH G88E\/VL V43A; Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">6<\/a>), demonstrating that our algorithm successfully explores alternative evolutionary routes. For C143, a patient-derived antibody isolated before extensive affinity maturation<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"33 title=\"Gaebler, C. et al. Evolution of antibody immunity to SARS-CoV-2. Nature 591, 639\u2013644 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR38\" id=\"ref-link-section-d117614227e942\">38<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"44 title=\"Muecksch, F. et al. Affinity maturation of SARS-CoV-2 neutralizing antibodies confers potency, breadth, and resilience to viral escape mutations. Immunity 54, 1853\u20131868 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR39\" id=\"ref-link-section-d117614227e945\">39<\/a><\/sup>, our best design achieves a 13-fold improvement for Beta S-6P and a 3.8-fold improvement for Omicron RBD (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>). Results from our directed evolution campaigns are further summarized in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig2\">2<\/a>, Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">2<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM3\">1<\/a>.<\/p>\n<h3 id=\"Sec4\">Additional characterization of evolved antibodies<\/h3>\n<p>Although we explicitly selected for improved binders, we also tested these variants for improved stability (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). We found that Fabs for 21 out of the 31 language-model-recommended, affinity-enhancing variants that we tested had a higher melting temperature (<i>T<\/i><sub>m<\/sub>) than wild-type, and all variants maintained thermostability (<i>T<\/i><sub>m<\/sub>\u2009>\u200970\u2009\u00b0C). When evolving S309 to have higher affinity, our best design has a <i>T<\/i><sub>m<\/sub> of 72.8\u2009\u00b0C compared to 72.5\u2009\u00b0C for wild-type, whereas the VH N55Q substitution introduced in sotrovimab decreases the <i>T<\/i><sub>m<\/sub> to 69.6\u2009\u00b0C (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig2\">2<\/a>). Our evolved variants for mAb114, mAb114 UCA, REGN10987 and C143 also preserve or improve <i>T<\/i><sub>m<\/sub>; the highest change that we observed was an increase from 74.5\u2009\u00b0C to 82.5\u2009\u00b0C when evolving mAb114 UCA. Improved thermostability does not completely explain our affinity maturation results, however, as we observed somewhat decreased <i>T<\/i><sub>m<\/sub> for our affinity-matured variants of MEDI8852 and its UCA, although these Fabs are still thermostable (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig2\">2<\/a>).<\/p>\n<p>Additionally, we tested our affinity-matured designs for polyspecific binding, because binding unintended targets could lead to undesirable side effects in therapeutic settings. For each of the seven antibodies, we tested the wild-type alongside three affinity-matured variants using a polyspecificity assay that assesses non-specific binding to soluble membrane proteins (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"55 title=\"Xu, Y. et al. Addressing polyspecificity of antibodies selected from an in vitro yeast presentation system: a FACS-based, high-throughput selection and analytical tool. Protein Eng. Des. Sel. 26, 663\u2013670 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR41\" id=\"ref-link-section-d117614227e1014\">41<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"66 title=\"Makowski, E. K., Wu, L., Desai, A. A. &#038; Tessier, P. M. Highly sensitive detection of antibody nonspecific interactions using flow cytometry. mAbs 13, 1951426 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR42\" id=\"ref-link-section-d117614227e1017\">42<\/a><\/sup>. We observed no substantial changes in polyspecificity for any variants of all seven antibodies, and all tested antibodies have polyspecificity values within a therapeutically viable range (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig3\">3a<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM4\">2<\/a>).<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-3\" data-title=\"Specificity and improved neutralization potency of affinity-matured variants.\">\n<figure><figcaption><b id=\"Fig3\" data-test=\"figure-caption-text\">Fig. 3: Specificity and improved neutralization potency of affinity-matured variants.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/3\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig3\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig3_HTML.png\" alt=\"Science &amp; Nature figure 3\" loading=\"lazy\" width=\"685\" height=\"459\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, Polyspecificity of antibody wild-types and variants was quantified using an assay<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"77 title=\"Makowski, E. K., Wu, L., Desai, A. A. &#038; Tessier, P. M. Highly sensitive detection of antibody nonspecific interactions using flow cytometry. mAbs 13, 1951426 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR42\" id=\"ref-link-section-d117614227e1041\">42<\/a><\/sup> that measures non-specific binding to soluble membrane proteins via flow cytometry, where higher MFI values correspond to more non-specific binding (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). Control antibodies<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"88 title=\"Makowski, E. K., Wu, L., Desai, A. A. &#038; Tessier, P. M. Highly sensitive detection of antibody nonspecific interactions using flow cytometry. mAbs 13, 1951426 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR42\" id=\"ref-link-section-d117614227e1048\">42<\/a><\/sup> are elotuzumab (a clinical antibody with low polyspecificity), ixekizumab (a clinical antibody with high polyspecificity) and 4E10 (a research antibody with high polyspecificity beyond a therapeutically viable level)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"99 title=\"Rujas, E. et al. Structural and thermodynamic basis of epitope binding by neutralizing and nonneutralizing forms of the anti-HIV-1 antibody 4E10. J. Virol. 89, 11975\u201311989 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR62\" id=\"ref-link-section-d117614227e1052\">62<\/a><\/sup>. Bar height indicates the mean across <i>n<\/i>\u2009=\u20093 replicate wells; black dots indicate independent measurements. <b>b<\/b>, Variants of the antibody C143, obtained from our language-model-guided affinity maturation campaign, demonstrate improved neutralization activity in a pseudovirus assay. For Beta pseudovirus, out of the three higher-affinity variants that we also screened for neutralization activity, the best improvement is the 32-fold improvement of VL G53V; for D614G pseudovirus, the best improvement is the 19-fold improvement of VL T33N-G53V (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>). Also see Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig6\">2<\/a>. Points indicate the mean; error bars indicate the s.d.; <i>n<\/i>\u2009=\u20094 independent experiments. <b>c<\/b>, Fold change in <i>K<\/i><sub>d<\/sub> correlates well with fold change in IC<sub>50<\/sub> (Spearman <i>r<\/i>\u2009=\u20090.82, <i>n<\/i>\u2009=\u200915 antibody variants) across all designs tested, consistent with higher binding affinity contributing to improved viral neutralization activity. WT, wild-type.<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/3\" data-track-dest=\"link:Figure3 Full size image\" aria-label=\"Reference 5\"00 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<p>Another therapeutic consideration is immunogenicity. Although computational prediction of immunogenicity remains a challenge, especially involving recognition of discontinuous epitopes, the immunogenicity of linear peptides is better understood<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"11 title=\"Reynisson, B., Alvarez, B., Paul, S., Peters, B. &#038; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 48, W449\u2013W454 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR43\" id=\"ref-link-section-d117614227e1099\">43<\/a><\/sup>. We observed that our affinity-matured variants have no significant increase (one-sided binomial <i>P<\/i>\u2009>\u20090.05) in the number of computationally predicted peptide binders to both human leukocyte antigen (HLA) class I and class II (exact <i>P<\/i> values and sample sizes for these experiments are provided in Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM4\">2<\/a>), which underlies T-cell-mediated immunogenicity.<\/p>\n<p>We also wanted to determine if our affinity-matured variants have better viral neutralization activity. We tested affinity-enhancing variants of four antibodies using pseudovirus neutralization assays (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>) and, in all cases, observed variants with half-maximal inhibitory concentration (IC<sub>50<\/sub>) values that are significantly improved (Bonferroni-corrected, one-sided <i>t<\/i>-test <i>P<\/i>\u2009<\u20090.05, <i>n<\/i>\u2009=\u20094 independent experiments), including a 1.5-fold improvement for the best mAb114 variant against Ebola pseudovirus; a twofold improvement for the best REGN10987 variant against SARS-CoV-2 Beta pseudovirus; and a 32-fold improvement for the best C143 variant against Beta pseudovirus (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig3\">3b<\/a>, Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig6\">2<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">5<\/a>, <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">8<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>). Additionally, the affinity-matured variants of mAb114 UCA demonstrate detectable neutralization at a >100-fold lower concentration compared to wild-type (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig6\">2a<\/a>). In general, change in binding affinity corelates well with change in neutralization (Spearman <i>r<\/i>\u2009=\u20090.82, two-sided <i>t<\/i>-distribution <i>P<\/i>\u2009=\u20091.9\u2009\u00d7\u200910<sup>\u22124<\/sup>, <i>n<\/i>\u2009=\u200915 antibody variants) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig3\">3c<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig6\">2b<\/a>).<\/p>\n<h3 id=\"Sec5\">Originality of affinity-enhancing substitutions<\/h3>\n<p>Although the ability to find any improvement in affinity is itself useful for engineering applications, we were also interested in whether some of the changes recommended by our algorithm demonstrate \u2018originality\u2019. We quantified originality by computing the frequency that a given residue is observed in nature (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). Although many affinity-enhancing substitutions are indeed observed at high frequency both in the model\u2019s training data<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"22 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e1181\">23<\/a><\/sup> and in a database of antibody sequences<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"33 title=\"Swindells, M. B. et al. abYsis: integrated antibody sequence and structure\u2014management, analysis, and prediction. J. Mol. Biol. 429, 356\u2013364 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR44\" id=\"ref-link-section-d117614227e1185\">44<\/a><\/sup>, other substitutions demonstrate greater originality. For example, in the MEDI8852 UCA trajectory, the VL G95P framework substitution (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig2\">2<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">4<\/a>) involves changing a glycine observed in 99% of natural antibody sequences to a proline observed in less than 1% of natural sequences. Overall, five out of 32 affinity-enhancing substitutions (~16%) involve changing the wild-type residue to a rare or uncommon residue (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">10<\/a>) and that are also rare when considering only natural variation of antibodies derived from the same germline genes (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">11<\/a>). These results indicate that the language models learn both the \u2018easy\u2019 evolutionary rules involving high-frequency residues and more complex rules that are not captured by a multiple sequence alignment or conventional antibody evolution. Conceptually, these low-frequency, affinity-enhancing substitutions are analogous to examples in other disciplines where an artificial intelligence program occasionally makes unusual but advantageous choices (for example, unintuitive game-playing decisions<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"44 title=\"Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484\u2013489 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR45\" id=\"ref-link-section-d117614227e1202\">45<\/a><\/sup>) and likewise may be worth further study.<\/p>\n<h3 id=\"Sec6\">Comparison to other sequence-based methods<\/h3>\n<p>We also sought to compare general language models to other methods for selecting plausible mutations based on sequence information alone. To assess the contribution of epistatic information learned by the language model, we considered two site-independent models of mutational frequencies: (1) abYsis sequence annotation, which uses extensively curated antibody sequence alignments, and (2) frequencies based on sequence alignments to the UniRef90 dataset, which was used to train ESM-1v (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). To assess the impact of using language models not trained on antibody-specific sequence variation, we also compared to two antibody language models: (1) AbLang<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"55 title=\"Olsen, T. H., Moal, I. H. &#038; Deane, C. M. AbLang: an antibody language model for completing antibody sequences. Bioinform. Adv. 2, vbac046 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR24\" id=\"ref-link-section-d117614227e1217\">24<\/a><\/sup>, trained on ~10<sup>7<\/sup> sampled sequences from immune repertoire sequencing data in the Observed Antibody Space (OAS) database<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"66 title=\"Olsen, T. H., Boyles, F. &#038; Deane, C. M. Observed antibody space: a diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences. Protein Sci. 31, 141\u2013146 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR46\" id=\"ref-link-section-d117614227e1223\">46<\/a><\/sup>, and (2) Sapiens<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"77 title=\"Prihoda, D. et al. BioPhi: a platform for antibody design, humanization, and humanness evaluation based on natural antibody repertoires and deep learning. mAbs 14, 2020203 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR25\" id=\"ref-link-section-d117614227e1227\">25<\/a><\/sup>, trained on ~10<sup>8<\/sup> human antibody sequences from the OAS datasbase.<\/p>\n<p>We benchmarked these models based on their ability to suggest single-residue substitutions that improve the avidity of the three unmatured IgG antibodies for their respective antigens (MEDI8852 UCA and HA H1 Solomon, mAb114 UCA and GP and C143 and Beta S-6P). For each of the four benchmarked models, we ranked substitutions by their mutant-to-wild-type likelihood ratios and experimentally tested the same number of substitutions considered in the first round of our evolutionary campaigns (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>).<\/p>\n<p>Notably, our approach based on general protein language models consistently outperformed all baseline methods (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a>). In particular, the abYsis and UniRef90 comparisons indicate that epistatic information was critical for consistent performance across antibodies. For example, the site-independent models did not recommend high-fitness substitutions such as VL G95P in MEDI8852 UCA or VL T33N\/G53V in C143, resulting in no avidity-enhancing substitutions to C143 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a>). We also observed that language models recommend a significantly higher number of avidity-enhancing substitutions (simulation-based <i>P<\/i>\u2009=\u20090.0085; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig7\">3a<\/a>) compared to the next-best baseline, UniRef90, and that is robust to differences in sequence alignment depth (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig7\">3b<\/a>, Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). Despite having access to antibody-specific sequence variation, both the AbLang and Sapiens models also consistently underperformed the general protein language models and even underperformed the site-independent models when recommending substitutions to mAb114 UCA (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a>). Our results indicate that general protein language models go beyond site-independent reasoning to make beneficial predictions while also learning sufficient information even from unspecialized protein sequence corpuses.<\/p>\n<h3 id=\"Sec7\">Computational efficiency of our approach<\/h3>\n<p>Our computational pipeline is highly efficient at making predictions, taking less than 1\u2009s per antibody (including both VH and VL sequences) on widely available, GPU-accelerated hardware (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). To demonstrate efficiency, we made predictions over 742 therapeutically relevant antibodies from the Thera-SAbDab database<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"88 title=\"Raybould, M. I. J. et al. Thera-SAbDab: the therapeutic structural antibody database. Nucleic Acids Res. 48, D383\u2013D388 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR47\" id=\"ref-link-section-d117614227e1286\">47<\/a><\/sup> (Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM6\">4<\/a>) in ~3\u2009min, and our approach scales linearly with the number of antibodies.<\/p>\n<h3 id=\"Sec8\">Generality across diverse protein families<\/h3>\n<p>Given the success of general protein language models at guiding antibody evolution, we also tested how well the same models could acquire high-fitness variants across diverse protein families. Previous work has demonstrated that the likelihoods from general protein language models have good correlation with experimental phenotypes from high-throughput assays over ~10<sup>3<\/sup> to 10<sup>4<\/sup> variants<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\"99 title=\"Hie, B. L., Yang, K. K. &#038; Kim, P. S. Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins. Cell Syst. 13, 274\u2013285 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR10\" id=\"ref-link-section-d117614227e1305\">10<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"00 title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR20\" id=\"ref-link-section-d117614227e1308\">20<\/a><\/sup>. Previous computational simulations have also indicated that these models can help bias multi-round evolution away from large regions of a sequence landscape with zero or very low fitness<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"11 title=\"Wittmann, B. J., Yue, Y. &#038; Arnold, F. H. Informed training set design enables efficient machine learning-assisted directed protein evolution. Cell Syst. 12, 1026\u20131045 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR9\" id=\"ref-link-section-d117614227e1312\">9<\/a><\/sup>.<\/p>\n<p>Here, we observed that the same models can also guide efficient evolution when measuring only a small number (~10<sup>1<\/sup>) of variants according to diverse definitions of fitness, including antibiotic resistance, cancer drug resistance, enzyme activity or viral replication fitness<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"22 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e1321\">48<\/a><\/sup>. We used the same algorithm and language models in our affinity maturation experiments to suggest a small number (~10<sup>1<\/sup>) of changes to wild-type sequences from human, bacterial or viral organisms representing eight diverse protein families. We then used experimental measurements from high-throughput scanning mutagenesis experiments<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"33 title=\"Markin, C. J. et al. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. Science 373, eabf8761 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR8\" id=\"ref-link-section-d117614227e1327\">8<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"44 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e1330\">48<\/a><\/sup> to validate the language-model-recommended predictions (notably, these measurements were not provided to the model). As in the antibody evolution campaigns, we are interested in enriching for as many high-fitness variants as possible among the small number of language model recommendations (rather than predicting fitness across the entire mutational space, as previously done<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"55 title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR20\" id=\"ref-link-section-d117614227e1334\">20<\/a><\/sup>).<\/p>\n<p>Language-model-recommended variants were nominally enriched (one-sided hypergeometric <i>P<\/i>\u2009<\u20090.05; exact <i>P<\/i> values and sample sizes are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">13<\/a>) for high-fitness values in six out of nine of the measured datasets, and high-fitness variants made up a much larger portion of language-model-recommended variants compared to random guessing in all but one case (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4a<\/a>, Extended Data Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig8\">4<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig10\">6<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">13<\/a>). For example, whereas high ampicillin resistance is observed for just 7% of all single-residue substitutions to \u03b2-lactamase, it is observed for 40% of language-model-recommended substitutions, and the same set of language models can also help prioritize single-residue substitutions to HA that result in high viral infectivity (from 7% to 31%) and substitutions to PafA that improve enzyme kinetics (from 3% to 20%). Additionally, across all proteins, even the first round of a small-scale evolutionary campaign guided by language models would yield variants that are above or near the 99th percentile of fitness values (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig8\">4<\/a>). Compared to 47 alternative variant effect predictors, including supervised and structure-based models, our strategy ranks higher, on average, than all other methods based on the ability to recommend high-fitness variants (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig8\">4<\/a>, Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM7\">5<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>).<\/p>\n<div data-test=\"figure\" data-container-section=\"figure\" id=\"figure-4\" data-title=\"Guiding evolution without explicitly modeling fitness.\">\n<figure><figcaption><b id=\"Fig4\" data-test=\"figure-caption-text\">Fig. 4: Guiding evolution without explicitly modeling fitness.<\/b><\/figcaption><div>\n<div><a data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/4\" rel=\"nofollow\"><picture><source type=\"image\/webp\" ><img decoding=\"async\" aria-describedby=\"Fig4\" src=\"http:\/\/media.springernature.com\/lw685\/springer-static\/image\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig4_HTML.png\" alt=\"Science &amp; Nature figure 4\" loading=\"lazy\" width=\"685\" height=\"253\"><\/picture><\/a><\/div>\n<p><b>a<\/b>, The same strategy and language models that we use to affinity mature antibodies can also recommend high-fitness changes across a diversity of selection pressures and protein families, as identified experimentally using high-throughput scanning mutagenesis assays<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"66 title=\"Markin, C. J. et al. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. Science 373, eabf8761 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR8\" id=\"ref-link-section-d117614227e1390\">8<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"77 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e1393\">48<\/a><\/sup> (described in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">13<\/a>). \u2018Fraction positive\u2019 indicates the percentage of high-fitness amino acid substitutions within either the set of substitutions recommended by the language model (LM guided) or the set of all single-residue substitutions (Background). A large portion of language-model-guided substitutions have high fitness, which, in many cases, is significantly enriched compared to the background percentage; also see Extended Data Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig8\">4<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig9\">6<\/a>, and see Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">13<\/a> for the exact one-sided hypergeometric <i>P<\/i> values and sample sizes. ADRB2, adrenoreceptor beta 2; \u03b2-la., \u03b2-lactamase; Env, envelope glycoprotein; infA, translation initiation factor 1; MAPK1, mitogen-activated protein kinase 1; PafA, phosphate-irrepressible alkaline phosphatase. <b>b<\/b>, Conceptually, the prior information encoded by evolutionary plausibility is represented in this cartoon by the rainbow road, where ascending corresponds to improving fitness and descending corresponds to lowering fitness. Moving in any direction (for example, via random or brute force mutagenesis) would most likely decrease fitness or have a high chance of being a detrimental change (represented by the green ball). However, if evolutionary plausibility is an efficient prior (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig1\">1b<\/a>), then movement that is constrained to the plausible regime (for example, when guided by a language model) substantially increases the chance of improving fitness (represented by the red ball).<\/p>\n<\/div>\n<p xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\"><a data-test=\"article-link\" data-track=\"click\" data-track-label=\"button\" data-track-action=\"view figure\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/4\" data-track-dest=\"link:Figure4 Full size image\" aria-label=\"Reference 6\"88 rel=\"nofollow\"><span>Full size image<\/span><\/a><\/p>\n<\/figure>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"Sec9-section\" data-title=\"Discussion\">\n<h2 id=\"Sec9\">Discussion<\/h2>\n<div id=\"Sec9-content\">\n<p>We show that general protein language models can guide highly efficient affinity maturation based on the wild-type antibody sequence alone. Although our affinity improvements are lower than those typically observed in successful in vivo evolutionary trajectories, somatic hypermutation explores a mutational space that is larger by multiple orders of magnitude (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig11\">7<\/a>). Moreover, our affinity improvements on unmatured antibodies are within the 2.3-fold to 580-fold range previously achieved by a state-of-the-art, in vitro evolutionary system applied to unmatured, anti-RBD nanobodies (in which the computational portion of our approach, which takes seconds, is replaced with rounds of cell culture and sorting, which take weeks)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\"99 title=\"Wellner, A. et al. Rapid generation of potent antibodies by autonomous hypermutation in yeast. Nat. Chem. Biol. 17, 1057\u20131064 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR14\" id=\"ref-link-section-d117614227e1439\">14<\/a><\/sup> (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig11\">7<\/a>). In vitro, cell surface display methods also encounter physical limits that make it challenging to distinguish better binders when the wildtype binder already has high affinity (<1\u2009nM)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"00 title=\"Hunter, S. A. &#038; Cochran, J. R. Cell-binding assays for determining the affinity of protein\u2013protein interactions. Methods Enzymol. 580, 21\u201344 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR5\" id=\"ref-link-section-d117614227e1446\">5<\/a><\/sup>, which is not a limitation of our approach.<\/p>\n<p>More broadly, a critical finding of our study is that evolutionary information alone provides sufficient prior information when selecting small numbers of substitutions to test for improved fitness (Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig1\">1b<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4b<\/a>). This leads to the result that a model without any task-specific training data or knowledge of the antigen can guide antibody evolution toward higher binding affinity, with competitive performance compared to protein-specific or task-specific methods (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig9\">5<\/a>). We hypothesize that, in many settings, when mutations are constrained to follow a set of general evolutionary rules, a substantial portion (greater than 10%) is bound to improve fitness (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4b<\/a>), which has immediate and broader implications for evolution in the laboratory and in nature.<\/p>\n<h3 id=\"Sec10\">Practical implications and extensions<\/h3>\n<p>We anticipate that language models will become a key part of the antibody engineer\u2019s toolkit, particularly within preclinical development as a rapid way to identify improved variants. In addition to speed, by focusing on ~10 single-site substitutions, a higher-throughput experimental budget that would have been allocated to brute force search could, instead, be allocated to exploring combinations of mutations<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"11 title=\"Zhao, H., Giver, L., Shao, Z., Affholter, J. A. &#038; Arnold, F. H. Molecular evolution by staggered extension process (StEP) in vitro recombination. Nat. Biotechnol. 16, 258\u2013261 (1998).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR49\" id=\"ref-link-section-d117614227e1475\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"22 title=\"Yu, Y. W., Daniels, N. M., Danko, D. C. &#038; Berger, B. Entropy-scaling search of massive biological data. Cell Syst. 1, 130\u2013140 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR50\" id=\"ref-link-section-d117614227e1478\">50<\/a><\/sup> or to exploring variants of more wild-type antibodies. Language-model-guided evolution could also complement or replace random mutagenesis strategies based on, for example, an error-prone polymerase.<\/p>\n<p>To the end user, guiding evolution via pre-trained, unsupervised models is less resource intensive than collecting enough task-specific data to train a supervised model<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"33 title=\"Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600\u2013612 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR28\" id=\"ref-link-section-d117614227e1485\">28<\/a><\/sup>. Language models should also serve as a baseline for future machine learning methods using supervision or other task-specific training data. Our techniques can also be used in conjunction with supervised approaches<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"44 title=\"Wittmann, B. J., Yue, Y. &#038; Arnold, F. H. Informed training set design enables efficient machine learning-assisted directed protein evolution. Cell Syst. 12, 1026\u20131045 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR9\" id=\"ref-link-section-d117614227e1489\">9<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"55 title=\"Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600\u2013612 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR28\" id=\"ref-link-section-d117614227e1492\">28<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"66 title=\"Yang, K. K., Wu, Z. &#038; Arnold, F. H. Machine-learning-guided directed evolution for protein engineering. Nat. Methods 16, 687\u2013694 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR33\" id=\"ref-link-section-d117614227e1495\">33<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"77 title=\"Hie, B. L. &#038; Yang, K. K. Adaptive machine learning for protein engineering. Curr. Opin. Struct .Biol. 72, 145\u2013152 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR34\" id=\"ref-link-section-d117614227e1498\">34<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Biswas, S., Khimulya, G., Alley, E. C., Esvelt, K. M. &#038; Church, G. M. Low-N protein engineering with data-efficient deep learning. Nat. Methods 18, 389\u2013396 (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR51\" id=\"ref-link-section-d117614227e1501\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Hie, B., Bryson, B. D. &#038; Berger, B. Leveraging uncertainty in machine learning accelerates biological discovery and design. Cell Syst. 11, 461\u2013477 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR52\" id=\"ref-link-section-d117614227e1501_1\">52<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Dallago, C. et al. FLIP: benchmark tasks in fitness landscape inference for proteins. In Proc. of the Neural Information Processing Systems Track on Datasets and Benchmarks \n                https:\/\/datasets-benchmarks-proceedings.neurips.cc\/paper_files\/paper\/2021\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/#ref-CR53\" id=\"ref-link-section-d117614227e1501_2\">53<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"88 title=\"Bileschi, M. L. et al. Using deep learning to annotate the protein universe. Nat. Biotechnol. 40, 932\u2013937 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR54\" id=\"ref-link-section-d117614227e1504\">54<\/a><\/sup>, and supervising a model over multiple experimental rounds might ultimately lead to higher fitness. However, in many practical settings (for example, the rapid development of sotrovimab in response to the COVID-19 pandemic<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"99 title=\"Alexander, E. et al. Antibody therapies for SARS-CoV-2 infection. WO2021252878A1 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR35\" id=\"ref-link-section-d117614227e1508\">35<\/a><\/sup>), the efficiency of an unsupervised, single-round approach is preferable to a protracted, multi-round directed evolution campaign.<\/p>\n<p>A general approach not biased by traditional structural hypotheses can also be valuable because many beneficial mutations are structurally remote to functionally important sites<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"00 title=\"Shimotohno, A., Oue, S., Yano, T., Kuramitsu, S. &#038; Kagamiyama, H. Demonstration of the importance and usefulness of manipulating non-active-site residues in protein design. J. Biochem. 129, 943\u2013948 (2001).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR55\" id=\"ref-link-section-d117614227e1515\">55<\/a><\/sup>. About half of the language-model-recommended substitutions (and about half of the affinity-enhancing substitutions) fall in framework regions, which are typically not proximal to the binding interface and are, therefore, sometimes excluded from directed evolution<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"11 title=\"Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600\u2013612 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR28\" id=\"ref-link-section-d117614227e1519\">28<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"22 title=\"Shan, S. et al. Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization. Proc. Natl Acad. Sci. USA 119, e2122954119 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR56\" id=\"ref-link-section-d117614227e1522\">56<\/a><\/sup>. Although some of these framework changes may improve affinity via protein stabilization, others do not appear to increase thermostability (for example, VL G95P in MEDI8852 UCA) and may, instead, be causing affinity improvements via structural reorientation<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Dunbar, J., Fuchs, A., Shi, J. &#038; Deane, C. M. ABangle: characterising the VH\u2013VL orientation in antibodies. Protein Eng. Des. Sel. 26, 611\u2013620 (2013).\" href=\"http:\/\/www.nature.com\/#ref-CR57\" id=\"ref-link-section-d117614227e1526\">57<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Fera, D. et al. Affinity maturation in an HIV broadly neutralizing B-cell lineage through reorientation of variable domains. Proc. Natl Acad. Sci. USA 111, 10275\u201310280 (2014).\" href=\"http:\/\/www.nature.com\/#ref-CR58\" id=\"ref-link-section-d117614227e1526_1\">58<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"33 title=\"Wedemayer, G. J., Patten, P. A., Wang, L. H., Schultz, P. G. &#038; Stevens, R. C. Structural insights into the evolution of an antibody combining site. Science 276, 1665\u20131669 (1997).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR59\" id=\"ref-link-section-d117614227e1529\">59<\/a><\/sup>. Nature often takes advantage of framework mutations to improve affinity, which represent ~20\u201330% of changes in natural affinity maturation<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"44 title=\"Yeap, L.-S. et al. Sequence-intrinsic mechanisms that target AID mutational outcomes on antibody genes. Cell 163, 1124\u20131137 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR60\" id=\"ref-link-section-d117614227e1533\">60<\/a><\/sup>. In one well-known case, none of the nine residues accounting for a 30,000-fold increase in affinity is in contact with the antigen<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"55 title=\"Wedemayer, G. J., Patten, P. A., Wang, L. H., Schultz, P. G. &#038; Stevens, R. C. Structural insights into the evolution of an antibody combining site. Science 276, 1665\u20131669 (1997).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR59\" id=\"ref-link-section-d117614227e1537\">59<\/a><\/sup>, and, in another case, framework mutations make important contributions to affinity maturation and increased breadth in an HIV-1 antibody<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"66 title=\"Fera, D. et al. Affinity maturation in an HIV broadly neutralizing B-cell lineage through reorientation of variable domains. Proc. Natl Acad. Sci. USA 111, 10275\u201310280 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR58\" id=\"ref-link-section-d117614227e1542\">58<\/a><\/sup>.<\/p>\n<h3 id=\"Sec11\">Generality of fitness improvements<\/h3>\n<p>By leveraging general evolutionary rules, language models recommend more \u2018universal\u2019 changes that seem to generalize better when the definition of fitness changes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4<\/a>). We also observed that general language models outperform antibody-specific language models (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a>), which is consistent with independent in silico benchmarking<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"77 title=\"Nijkamp, E., Ruffolo, J., Weinstein, E. N., Naik, N. &#038; Madani, A. ProGen2: exploring the boundaries of protein language models. Preprint at arXiv \n                https:\/\/doi.org\/10.48550\/arXiv.2206.13517\n                \n               (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR22\" id=\"ref-link-section-d117614227e1560\">22<\/a><\/sup>. When transferring to a new, specific notion of fitness, more general evolutionary information may outweigh the particular biases encoded in antibody repertoire datasets, although further development of antibody language models could improve performance.<\/p>\n<p>Our general approach is designed to improve an existing baseline function (for example, improving the affinity of a weak binder) rather than endowing any protein with an arbitrary function (for example, converting a generic protein into a specific binder). We also note that taking advantage of this strategy for guiding evolution may be more difficult when the selection pressure is unnatural or if the wild-type sequence is already at a fitness peak. However, in many practical design tasks, natural sequences and selection pressures are already preferrable; for example, therapeutic development often prefers human antibodies due to considerations of immunogenicity.<\/p>\n<p>Beyond protein engineering, the success of our approach may also provide insight into natural evolution. The efficiency of evolutionary information alone may reflect natural mechanisms for biasing mutation rates toward higher fitness: for example, somatic hypermutation favors specific parts of an antibody gene via epigenomic and enzymatic sequence biases<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"88 title=\"Yeap, L.-S. et al. Sequence-intrinsic mechanisms that target AID mutational outcomes on antibody genes. Cell 163, 1124\u20131137 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR60\" id=\"ref-link-section-d117614227e1570\">60<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\"99 title=\"Zheng, N.-Y., Wilson, K., Jared, M. &#038; Wilson, P. C. Intricate targeting of immunoglobulin somatic hypermutation maximizes the efficiency of affinity maturation. J. Exp. Med. 201, 1467\u20131478 (2005).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR61\" id=\"ref-link-section-d117614227e1573\">61<\/a><\/sup>. If epigenomic or other mechanisms predispose mutations to have high fitness, then nature could be accelerating evolution in a manner similar to our approach.<\/p>\n<\/div>\n<\/div>\n<div id=\"Sec12-section\" data-title=\"Methods\">\n<h2 id=\"Sec12\">Methods<\/h2>\n<div id=\"Sec12-content\">\n<h3 id=\"Sec13\">Acquiring amino acid substitutions via language model consensus<\/h3>\n<p>We select amino acid substitutions recommended by a consensus of language models. We take as input a single wild-type sequence <i>x<\/i>\u2009=\u2009(<i>x<\/i><sub>1<\/sub>,\u2026,<i>x<\/i><sub><i>N<\/i><\/sub>)<span>\u2208<\/span> <span>(mathcal{X})<\/span><sup><i>N<\/i><\/sup>, where <span>(mathcal{X})<\/span> is the set of amino acids, and <i>N<\/i> is the sequence length. We also require a set of masked language models, which are pre-trained to produce conditional likelihoods <span>(pleft( {x_i^prime |{{{mathbf{x}}}}} right))<\/span>. To guide evolution based on a certain language model, we first compute the set of substitutions with higher language model likelihood than the wild-type\u2014that is, we compute the set<\/p>\n<div id=\"Equa\">\n<p><span>$${{{mathcal{M}}}}left( {p_j} right) = left{ {i in left[ N right],x_i^prime in {{{mathcal{X}}}}:frac{{p_jleft( {x_i^prime |{{{mathbf{x}}}}} right)}}{{p_jleft( {x_i|{{{mathbf{x}}}}} right)}} > alpha } right},$$<\/span><\/p>\n<\/div>\n<p>where <i>p<\/i><sub><i>j<\/i><\/sub> denotes the language model, <i>x<\/i><sub><i>i<\/i><\/sub> denotes the wild-type residue and <i>\u03b1<\/i>\u2009=\u20091. To further filter substitutions to only those with the highest likelihood, we choose substitutions based on a consensus scheme, where, for a new amino acid <span>(x_i^prime)<\/span>, we compute<\/p>\n<div id=\"Equb\">\n<p><span>$$fleft( {x_i^prime } right) = mathop {sum}limits_{j in left[ M right]} 1 left{ {left( {i,x_i^prime } right){{{mathrm{is}}}},{{{mathrm{in}}},}{{{mathcal{M}}}}left( {p_j} right)} right}$$<\/span><\/p>\n<\/div>\n<p>where 1{\u00b7} denotes the indicator function, and there are <i>M<\/i> language models. We then acquire the set of substitutions with higher likelihood than wild-type across multiple language models\u2014that is, we acquire<\/p>\n<div id=\"Equc\">\n<p><span>$${{{mathcal{A}}}} = left{ {i in left[ N right],x_i^prime in {{{mathcal{X}}}}:fleft( {x_i^prime } right) ge k} right}$$<\/span><\/p>\n<\/div>\n<p>where <i>k<\/i> is a user-supplied cutoff that controls the number of corresponding variants to measure. Although we focus on values of <i>k<\/i> that result in small values of <span>(|{{{mathcal{A}}}}|)<\/span> (around 10) that can be screened via low-throughput assays, the number of substitutions can be increased by reducing the value of <i>k<\/i> or by lowering the cutoff stringency <i>\u03b1<\/i>. Our strategy based on computing \u2018wild-type marginal\u2019 likelihoods based on the entire sequence, <span>(pleft( {x_i^prime |{{{mathbf{x}}}}} right))<\/span>, instead of the \u2018masked marginal\u2019 likelihoods in which the site of interest is masked, <span>(pleft( {x_i^prime |{{{mathbf{x}}}}_{left[ N right]backslash left{ i right}}} right))<\/span>, also increases the cutoff stringency (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig5\">1<\/a>).<\/p>\n<p>We use six large-scale masked language models\u2014namely, the ESM-1b model<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"00 title=\"Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl Acad. Sci. USA 118, e2016239118 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR19\" id=\"ref-link-section-d117614227e2338\">19<\/a><\/sup> and the five models that are ensembled together to form ESM-1v<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"11 title=\"Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. Adv. Neural. Inf. Process. Syst. 34 \n                https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\n                \n               (NeurIPS, 2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR20\" id=\"ref-link-section-d117614227e2342\">20<\/a><\/sup>\u2014both obtained from <a href=\"https:\/\/github.com\/facebookresearch\/esm\">https:\/\/github.com\/facebookresearch\/esm<\/a>. ESM-1b was trained on the 2018-03 release of UniRef50 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"22 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e2353\">23<\/a><\/sup>) consisting of ~27 million sequences, and the five models in ESM-1v were each trained on the 2020-03 release of UniRef90 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"33 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e2357\">23<\/a><\/sup>) consisting of ~98 million sequences.<\/p>\n<h3 id=\"Sec14\">Antibody sequence analysis and evolution<\/h3>\n<p>For antibodies, we performed the above steps for the VH and VL sequences separately, obtaining respective sets <span>({{{mathcal{A}}}}_{{{{mathrm{VH}}}}})<\/span> and <span>({{{mathcal{A}}}}_{{{{mathrm{VL}}}}})<\/span>. For round 1 of evolution, we set <i>\u03b1<\/i>\u2009=\u20091 and chose values of <i>k<\/i> such that <span>(|{{{mathcal{A}}}}_{{{{mathrm{VH}}}}} cup {{{mathcal{A}}}}_{{{{mathrm{VL}}}}}|)<\/span> is approximately 10, which is meant to be a reasonable number of antibody variants for one person to express and purify in parallel. We used <i>k<\/i>\u2009=\u20092 for MEDI8852 VH and VL, <i>k<\/i>\u2009=\u20092 for MEDI8852 UCA VH and VL, <i>k<\/i>\u2009=\u20094 for mAb114 VH and VL, <i>k<\/i>\u2009=\u20092 for mAb114 UCA VH and VL, <i>k<\/i>\u2009=\u20092 for S309 VH, <i>k<\/i>\u2009=\u20091 for S309 VL, <i>k<\/i>\u2009=\u20092 for REGN10987 VH and VL and <i>k<\/i>\u2009=\u20092 for C143 VH and VL. We further reduced the size of <span>(|{{{mathcal{A}}}}_{{{{mathrm{VH}}}}} cup {{{mathcal{A}}}}_{{{{mathrm{VL}}}}}|)<\/span> by requiring the substitution to have the highest likelihood at its respective site for at least one language model. Variants were first measured for binding affinity to a given antigen via BLI (more details below), and those that enhanced affinity were recombined such that the second-round variants have two or more substitutions from wild-type, which were tested during round 2 of evolution. Given the small number of affinity-enhancing substitutions found during round 1 of evolution for S309 and REGN10987, we also expanded the set of substitutions considered in round 2 to include those that preserved affinity. For MEDI8852 and MEDI8852 UCA, we tested all possible combinations in round 2; for the other antibodies, where the number of possible combinations far exceeds ~10 variants, we manually selected a set of combinations meant to prioritize inclusion of substitutions that resulted in the largest improvements in affinity during the first round.<\/p>\n<p>We used the wild-type sequences provided by the original study authors describing the respective antibodies<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR29\" id=\"ref-link-section-d117614227e2572\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR30\" id=\"ref-link-section-d117614227e2572_1\">30<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Pinto, D. et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature 583, 290\u2013295 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR31\" id=\"ref-link-section-d117614227e2572_2\">31<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"44 title=\"Hansen, J. et al. Studies in humanized mice and convalescent humans yield a SARS-CoV-2 antibody cocktail. Science 369, 1010\u20131014 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR32\" id=\"ref-link-section-d117614227e2575\">32<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"55 title=\"Gaebler, C. et al. Evolution of antibody immunity to SARS-CoV-2. Nature 591, 639\u2013644 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR38\" id=\"ref-link-section-d117614227e2578\">38<\/a><\/sup>. Wild-type VH and VL sequences are provided in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">Supplementary Information<\/a>. We used the Kabat region definition provided by the abYsis webtool version 3.4.1 (<a href=\"http:\/\/www.abysis.org\/abysis\/index.html\">http:\/\/www.abysis.org\/abysis\/index.html<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"66 title=\"Swindells, M. B. et al. abYsis: integrated antibody sequence and structure\u2014management, analysis, and prediction. J. Mol. Biol. 429, 356\u2013364 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR44\" id=\"ref-link-section-d117614227e2592\">44<\/a><\/sup> to annotate the framework regions and CDRs within the VH and VL sequences.<\/p>\n<h3 id=\"Sec15\">Antibody avidity benchmarking experiments<\/h3>\n<p>We also compared the substitutions recommended by the above strategy (based on language model consensus) to the substitutions recommended by four alternative sequence-based methods. First, we acquired substitutions to a VH or VL sequence based on site-independent mutational frequencies, where we used either the frequencies computed by the abYsis Annotation webtool<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"77 title=\"Swindells, M. B. et al. abYsis: integrated antibody sequence and structure\u2014management, analysis, and prediction. J. Mol. Biol. 429, 356\u2013364 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR44\" id=\"ref-link-section-d117614227e2604\">44<\/a><\/sup> or the frequencies obtained using all sequences in UniRef90 (the training dataset of ESM-1v)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"88 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e2608\">23<\/a><\/sup>. To compute the UniRef90 frequencies, we first performed an exhaustive search to obtain the 10,000 closest sequences by Levenshtein distance, where 10,000 is chosen to reflect the number of immunoglobulin-like sequences in UniRef90. We computed sequence similarity using the partial_ratio function from the FuzzyWuzzy Python package version 0.18.0; we then constructed a multiple sequence alignment of these 10,000 sequences using MAFFT version 7.475 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\"99 title=\"Katoh, K. &#038; Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772\u2013780 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR63\" id=\"ref-link-section-d117614227e2612\">63<\/a><\/sup>) using the VH or VL sequence as the reference; finally, using the alignment, we computed mutational frequencies for each site in the sequence. We selected the top-ranking substitutions by likelihood ratio (the mutant frequency divided by the corresponding wild-type frequency) across the VH and VL sequences, where, for each antibody, we selected the same number of substitutions considered in the first round of our evolutionary campaigns.<\/p>\n<p>We also acquired substitutions based on language models trained specifically on antibody sequences. We used the AbLang heavy chain and light chain language models (<a href=\"https:\/\/github.com\/TobiasHeOl\/AbLang\">https:\/\/github.com\/TobiasHeOl\/AbLang<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0000 title=\"Olsen, T. H., Moal, I. H. &#038; Deane, C. M. AbLang: an antibody language model for completing antibody sequences. Bioinform. Adv. 2, vbac046 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR24\" id=\"ref-link-section-d117614227e2626\">24<\/a><\/sup> and the Sapiens heavy chain and light chain language models (<a href=\"https:\/\/github.com\/Merck\/Sapiens\">https:\/\/github.com\/Merck\/Sapiens<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0101 title=\"Prihoda, D. et al. BioPhi: a platform for antibody design, humanization, and humanness evaluation based on natural antibody repertoires and deep learning. mAbs 14, 2020203 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR25\" id=\"ref-link-section-d117614227e2637\">25<\/a><\/sup> to compute the mutant-to-wild-type likelihood ratios for all single-residue substitutions to the VH or VL sequence (using the language model trained on sequences from the corresponding chain). We selected the top-ranking substitutions by likelihood ratio across the VH and VL sequences and, following our use of the general protein language models, also required the substitution to have the highest likelihood at its site. For each antibody, we selected the same number of substitutions considered in the first round of our evolutionary campaigns.<\/p>\n<p>We used these four methods (abYsis, UniRef90, AbLang and Sapiens) to select substitutions to our three unmatured antibodies (MEDI8852 UCA, mAb114 UCA and C143) and used BLI to measure IgG avidity to their respective antigens (HA H1 Solomon, GP and Beta S-6P). To purify the larger number of variants involved in these benchmarking studies, we used a medium-throughput system using a robotic liquid handler, described in more detail below. With this system, we expressed and purified antibody variants containing single-residue substitutions from wild-type recommended by the consensus of ESM language models as well as by the four baseline methods, observing similar purities and affinities when the same variants were also expressed and purified via the low-throughput system (described below) used in our evolutionary campaigns. Antibodies with a final concentration of less than 0.1\u2009mg\u2009ml<sup>\u22121<\/sup> in 200\u2009\u03bcl after the medium-throughput purification were re-expressed and purified using the low-throughput methodology.<\/p>\n<h3 id=\"Sec16\">UniRef90 robustness and statistical significance analysis<\/h3>\n<p>For the UniRef90 benchmark, we additionally assessed robustness to differences in multiple sequence alignment (MSA) construction by computing the number of known affinity-enhancing substitutions while varying the sequence alignment depth from 1,000 to 9,000 sequences at increments of 1,000 (for a total of nine alignment depth cutoffs). At each cutoff, we re-ran the procedure described above to select substitutions (constructing MSAs and calculating mutational likelihood ratios). We performed this for all three experimentally benchmarked antibodies, representing a total of 27 MSAs. Among the top-ranked substitutions for each cutoff and benchmarked antibody, we counted the number of known affinity-enhancing substitutions and provide the results in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig7\">3<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a>.<\/p>\n<p>We also used the UniRef90 benchmark to assess the statistical significance of the number of avidity-enhancing substitutions recommended by the language models. In particular, we calculated the probability of acquiring 12 or more avidity-enhancing substitutions (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">12<\/a>) by simulating different outcomes of a site-independent model based on UniRef90 alignments. To construct the null distribution, we first simulated variation in UniRef90 alignments using the nine MSAs of varying alignment depth and their corresponding recommended substitutions, described in the previous paragraph. We then simulated experimental measurement of these mutations for avidity enhancement across the three benchmarked antibodies: for each top-ranked substitution with an unknown effect on avidity, we assigned a success probability based on the observed probabilities from our experimental benchmark (2\/8 = 25% for MEDI8852 UCA; 5\/9 = 56% for mAb114 UCA; and 1\/14 = 7% for C143); for each top-ranked substitution with a known effect on avidity, we fixed its value to its experimentally determined status. We ran 500,000 simulations for each of the nine MSA cutoffs (a total of 4.5 million simulations), where each simulation returns a total number of avidity-enhancing substitutions across the three antibodies. We report the <i>P<\/i> value as the number of simulations resulting in 12 or more avidity-enhancing substitutions divided by the total number of simulations.<\/p>\n<h3 id=\"Sec17\">Antibody cloning<\/h3>\n<p>We cloned the antibody sequences into the CMV\/R plasmid backbone for expression under a CMV promoter. The heavy chain or light chain sequence was cloned between the CMV promoter and the bGH poly(A) signal sequence of the CMV\/R plasmid to facilitate improved protein expression. Variable regions were cloned into the human IgG1 backbone; REGN10987 and C143 variants were cloned with a lambda light chain, whereas variants of all other antibodies were cloned with a kappa light chain. The vector for both heavy and light chain sequences also contained the HVM06_Mouse (UniProt: <a href=\"https:\/\/www.uniprot.org\/uniprot\/P01750\">P01750<\/a>) Ig heavy chain V region 102 signal peptide (MGWSCIILFLVATATGVHS) to allow for protein secretion and purification from the supernatant. VH and VL segments were ordered as gene blocks from Integrated DNA Technologies and were cloned into linearized CMV\/R backbones with 5\u00d7 In-Fusion HD Enzyme Premix (Takara Bio); a list of oligonucleotides and gene blocks used in the study is provided as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM8\">6<\/a>.<\/p>\n<h3 id=\"Sec18\">Antigen cloning<\/h3>\n<p>HA, GP, Spike and RBD sequences were cloned into a pADD2 vector between the rBeta-globin intron and \u03b2-globin poly(A). HA constructs contain a Foldon trimerization domain. GP and Spike constructs contain a GCN4 trimerization domain. All HAs, GP, Wuhan-Hu-1 S-6P and Omicron BA.1 RBD constructs contain an AviTag. All constructs contain a C-terminal 6\u00d7His tag. We used HA sequences from the following strains: A\/New Caledonia\/20\/1999(H1N1) (H1 Caledonia), A\/Solomon Islands\/3\/2006(H1N1) (H1 Solomon), A\/Japan\/305\/1957 (H2N2) (H2 Japan), A\/Panama\/2007\/1999(H3N2) (H3 Panama), A\/Victoria\/3\/1975(H3N2) (H3 Victoria), A\/swine\/Hubei\/06\/2009(H4N1) (H4 Hubei), A\/Vietnam\/1203\/2004(H5N1) (H5 Vietnam), A\/Hong Kong\/61\/2016(H7N9) (H7 HK16) and A\/Hong Kong\/125\/2017(H7N9) (H7 HK17). We used Ebola GP ectodomain (Mayinga, Zaire, 1976, GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/11761750\">AAG40168.1<\/a>) with the mucin-like domain deleted (\u0394309\u2013489). Spike or RBD sequences were based off wild-type Wuhan-Hu-1 (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/BCN86353.1\">BCN86353.1<\/a>), Beta (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/QUT64557.1\">QUT64557.1<\/a>) or Omicron BA.1 (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/UFO69279.1\">UFO69279.1<\/a>).<\/p>\n<h3 id=\"Sec19\">DNA preparation<\/h3>\n<p>Plasmids were transformed into Stellar competent cells (Takara Bio), and transformed cells were plated and grown at 37\u2009\u00b0C overnight. Colonies were mini-prepped per the manufacturer\u2019s recommendations (GeneJET, K0502, Thermo Fisher Scientific) and sequence confirmed (Sequetech) and then maxi-prepped per the manufacturer\u2019s recommendations (NucleoBond Xtra Maxi, Macherey-Nagel). Plasmids were sterile filtered using a 0.22-\u03bcm syringe filter and stored at 4\u2009\u00b0C.<\/p>\n<h3 id=\"Sec20\">Protein expression<\/h3>\n<p>All proteins were expressed in Expi293F cells (Thermo Fisher Scientific, A14527). Proteins containing a biotinylation tag (AviTag) were also expressed in the presence of a BirA enzyme, resulting in spontaneous biotinylation during protein expression. Expi293F cells were cultured in media containing 66% FreeStyle\/33% Expi media (Thermo Fisher Scientific) and grown in TriForest polycarbonate shaking flasks at 37\u2009\u00b0C in 8% carbon dioxide. The day before transfection, cells were spun down and resuspended to a density of 3\u2009\u00d7\u200910<sup>6<\/sup> cells per milliliter in fresh media. The next day, cells were diluted and transfected at a density of approximately 3\u20134\u2009\u00d7\u200910<sup>6<\/sup> cells per milliliter. Transfection mixtures were made by adding the following components: maxi-prepped DNA, culture media and FectoPRO (Polyplus) would be added to cells to a ratio of 0.5\u2009\u03bcg: 100\u2009\u03bcl: 1.3\u2009\u03bcl: 900\u2009\u03bcl. For example, for a 100-ml transfection, 50\u2009\u03bcg of DNA would be added to 10\u2009ml of culture media, followed by the addition of 130\u2009\u03bcl of FectoPRO. For antibodies, we divided the transfection DNA equally among heavy and light chains; in the previous example, 25\u2009\u03bcg of heavy chain DNA and 25\u2009\u03bcg of light chain DNA would be added to 10\u2009ml of culture media. After mixing and a 10-min incubation, the example transfection cocktail would be added to 90\u2009ml of cells. The cells were harvested 3\u20135\u2009days after transfection by spinning the cultures at >7,000<i>g<\/i> for 15\u2009min. Supernatants were filtered using a 0.45-\u03bcm filter.<\/p>\n<h3 id=\"Sec21\">Antibody purification (low throughput)<\/h3>\n<p>We purified antibodies using a 5-ml MabSelect Sure PRISM column on the \u00c4KTA pure fast protein liquid chromatography (FPLC) instrument (Cytiva). The \u00c4KTA system was equilibrated with line A1 in 1\u00d7 PBS, line A2 in 100\u2009mM glycine pH 2.8, line B1 in 0.5\u2009M sodium hydroxide, Buffer line in 1\u00d7 PBS and Sample lines in water. The protocol washes the column with A1, followed by loading of the sample in the Sample line until air is detected in the air sensor of the sample pumps, followed by five column volume washes with A1, elution of the sample by flowing of 20\u2009ml of A2 directly into a 50-ml conical containing 2\u2009ml of 1\u2009M tris(hydroxymethyl)aminomethane (Tris) pH 8.0, followed by five column volumes of A1, B1 and A1. We concentrated the eluted samples using 50-kDa or 100-kDa cutoff centrifugal concentrators, followed by buffer exchange using a PD-10 column (Sephadex) that had been pre-equilibrated into 1\u00d7 PBS. Purified antibodies were stored at \u221220\u2009\u00b0C.<\/p>\n<h3 id=\"Sec22\">Antibody purification (medium throughput)<\/h3>\n<p>For our benchmarking experiments, we purified antibody variants with a medium-throughput system using an Agilent Bravo robotic liquid handling platform and VWorks software version 13.1.0.1366 with custom programming routines. For each antibody wild-type or variant, a 2.5-ml culture of Expi293F cells was transfected with corresponding antibody heavy and light chain plasmids as previously described. Cultures were harvested 3\u20135\u2009days after transfection by centrifugation at 4,200<i>g<\/i> for 10\u2009min, followed by collecting 2\u2009ml of supernatant. ProPlus PhyTip column tips (Biotage, PTV-92-20-07) were loaded on the Bravo 96 LT head and equilibrated by aspirating and dispensing 75\u2009\u03bcl of PBS, repeating four times. Sample binding to the tip resin was performed by aspirating and dispensing 98\u2009\u03bcl of harvested supernatant, followed by washing via aspirating and dispensing 100\u2009\u03bcl of PBS, repeating the binding and washing steps nine times (in total processing 882\u2009\u03bcl of harvest for each run). Elution was performed by aspirating 100\u2009\u03bcl of 100\u2009mM glycine pH 2.8, followed by dispensing into a well with 10\u2009\u03bcl of 1\u2009M Tris pH 8.<\/p>\n<h3 id=\"Sec23\">Antigen purification<\/h3>\n<p>All antigens were His-tagged and purified using HisPur Ni-NTA resin (Thermo Fisher Scientific, 88222). Cell supernatants were diluted with 1\/3 volume of wash buffer (20\u2009mM imidazole, 20\u2009mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES) pH 7.4, 150\u2009mM sodium chloride (NaCl) or 20\u2009mM imidazole, 1\u00d7 PBS), and the Ni-NTA resin was added to diluted cell supernatants. For all antigens except SARS-CoV-2 Spike, the samples were then incubated at 4\u2009\u00b0C while stirring overnight. SARS-CoV-2 Spike antigens were incubated at room temperature while stirring overnight. Resin\/supernatant mixtures were added to chromatography columns for gravity flow purification. The resin in the column was washed with wash buffer (20\u2009mM imidazole, 20\u2009mM HEPES pH 7.4, 150\u2009mM NaCl or 20\u2009mM imidazole, 1\u00d7 PBS), and the proteins were eluted with 250\u2009mM imidazole, 20\u2009mM HEPES pH 7.4, 150\u2009mM NaCl or 20\u2009mM imidazole, 1\u00d7 PBS. Column elutions were concentrated using centrifugal concentrators at 10-kDa, 50-kDa or 100-kDa cutoffs, followed by size-exclusion chromatography on an \u00c4KTA pure system (Cytiva). \u00c4KTA pure FPLC with a Superdex 6 Increase (S6) or Superdex 200 Increase (S200) gel filtration column was used for purification. Then, 1\u2009ml of sample was injected using a 2-ml loop and run over the S6 or S200, which had been pre-equilibrated in degassed 20\u2009mM HEPES, 150\u2009mM NaCl or 1\u00d7 PBS before use and stored at \u221220\u2009\u00b0C.<\/p>\n<h3 id=\"Sec24\">Fab production and purification<\/h3>\n<p>Next, 1\/10 volume of 1\u2009M Tris pH 8 was added to IgGs at ~2\u2009mg\u2009ml<sup>\u22121<\/sup> in 1\u00d7 PBS. Then, 2\u2009\u03bcl of a 1\u2009mg\u2009ml<sup>\u22121<\/sup> stock of Lys-C (stock stored at \u221220\u2009\u00b0C) was added for each milligram of human IgG1 and digested for 1\u2009h at 37\u2009\u00b0C with moderate rotation. Digested Fabs were purified using a 5-ml HiTrap SP HP cation exchange chromatography column on an \u00c4KTA system using 50\u2009mM sodium acetate (NaOAc) pH 5.0 with gradient NaCl elution (using 50\u2009mM NaOAc + 1\u2009M NaCl pH 5.0). Fab fractions were pooled and dialyzed against 1\u00d7 PBS and concentrated using 30-kDa concentrators. Purified Fabs were stored at \u221220\u2009\u00b0C.<\/p>\n<h3 id=\"Sec25\">BLI binding experiments<\/h3>\n<p>All reactions were run on an Octet RED96 at 30\u2009\u00b0C, and samples were run in 1\u00d7 PBS with 0.1% BSA and 0.05% Tween 20 (Octet buffer). IgGs and Fabs were assessed for binding to biotinylated antigens using streptavidin biosensors (Sartorius\/ForteBio) or to unbiotinylated, His-tagged antigens using Anti-Penta-HIS biosensors (Sartorius\/ForteBio). Antigen was loaded to a threshold of 1-nm shift. Tips were then washed and baselined in wells containing only Octet buffer. Samples were then associated in wells containing IgG or Fab at 100\u2009nM concentration unless otherwise stated (other concentrations are given in Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM3\">1<\/a>). A control well with loaded antigen but that was associated in a well containing only 200\u2009\u03bcl of Octet buffer was used as a baseline subtraction for data analysis. Association and dissociation binding curves were fit in Octet System Data Analysis Software version 9.0.0.15 using a 1:2 bivalent model for IgGs to determine apparent <i>K<\/i><sub>d<\/sub> and a 1:1 model for Fabs to determine <i>K<\/i><sub>d<\/sub>. Averages of fitted <i>K<\/i><sub>d<\/sub> values from at least two independent experiments are reported to two significant figures. Wild-type and the highest-affinity variants were also tested at multiple concentrations, and <i>K<\/i><sub>d<\/sub> values were averaged across all replicates and concentrations (Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM3\">1<\/a>). To estimate measurement error, we computed the coefficient of variation (CV; the ratio of the s.d. to the mean across replicates) for each antibody\u2212antigen <i>K<\/i><sub>d<\/sub> pair, and we report the mean CV for each antigen in Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">4<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>.<\/p>\n<h3 id=\"Sec26\">Thermal melts<\/h3>\n<p>We measured thermal melting profiles of proteins by differential scanning fluorimetry on a Prometheus NT.48 instrument. Protein samples (0.1\u2009mg\u2009ml<sup>\u22121<\/sup>) were loaded into glass capillaries and then subjected to a temperature gradient from 20\u2009\u00b0C to 95\u2009\u00b0C at a heating rate of 1\u2009\u00b0C per minute. Intrinsic fluorescence (350\u2009nm and 330\u2009nm) was recorded as a function of temperature using PR.ThermControl version 2.3.1 software. Thermal melting curves were plotted using the first derivative of the ratio (350\u2009nm\/330\u2009nm). Melting temperatures were calculated automatically by the instrument and represented peaks in the thermal melting curves.<\/p>\n<h3 id=\"Sec27\">PolySpecificity Particle assay<\/h3>\n<p>Polyspecificity reagent (PSR) was obtained as described by Xu et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0202 title=\"Xu, Y. et al. Addressing polyspecificity of antibodies selected from an in vitro yeast presentation system: a FACS-based, high-throughput selection and analytical tool. Protein Eng. Des. Sel. 26, 663\u2013670 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR41\" id=\"ref-link-section-d117614227e2850\">41<\/a><\/sup>. Soluble membrane proteins were isolated from homogenized and sonicated Expi 293F cells followed by biotinylation with Sulfo-NHC-SS-Biotin (Thermo Fisher Scientific, 21331) and stored in PBS at \u221280\u2009\u00b0C. The PolySpecificity Particle (PSP) assay was performed following Makowski et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0303 title=\"Makowski, E. K., Wu, L., Desai, A. A. &#038; Tessier, P. M. Highly sensitive detection of antibody nonspecific interactions using flow cytometry. mAbs 13, 1951426 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR42\" id=\"ref-link-section-d117614227e2854\">42<\/a><\/sup>. Protein A magnetic beads (Invitrogen, 10001D) were washed three times in PBSB (PBS with 1\u2009mg\u2009ml<sup>\u22121<\/sup> BSA) and diluted to 54\u2009\u03bcg\u2009ml<sup>\u22121<\/sup> in PBSB. Then, 30\u2009\u03bcl of the solution containing the beads was incubated with 85\u2009\u03bcl of antibodies at 15\u2009\u00b5g\u2009ml<sup>\u22121<\/sup> overnight at 4\u2009\u00b0C with rocking. The coated beads were then washed twice with PBSB using a magnetic plate stand (Invitrogen, 12027) and resuspended in PBSB. We then incubated 50\u2009\u03bcl of 0.1\u2009mg\u2009ml<sup>\u22121<\/sup> PSR with the washed beads at 4\u2009\u00b0C with rocking for 20\u2009min. Beads were then washed with PBSB and incubated with 0.001\u00d7 streptavidin-APC (BioLegend, 405207) and 0.001\u00d7 goat anti-human Fab fragment FITC (Jackson ImmunoResearch, 109-097-003) at 4\u2009\u00b0C with rocking for 15\u2009min. Beads were then washed and resuspended with PBSB. Beads were profiled via flow cytometry using a BD Accuri C6 flow cytometer. Data analysis was performed with BD CSampler Plus software version 1.0.34.1 to obtain median fluorescence intensity (MFI) values, which are reported for each antibody across three or more replicate wells. Elotuzumab (purified using the low-throughput FPLC methodology described above), ixekizumab (FPLC purified as described above) and 4E10 (HIV Reagent Program, ARP-10091) are also included in each assay as controls.<\/p>\n<h3 id=\"Sec28\">Lentivirus production<\/h3>\n<p>We produced SARS-CoV-2 Spike (D614G and Beta variants) pseudotyped lentiviral particles. Viral transfections were done in HEK293T cells (American Type Culture Collection, CRL-3216) using BioT (BioLand) transfection reagent. Six million cells were seeded in D10 media (DMEM + additives: 10% FBS, L-glutamate, penicillin, streptomycin and 10\u2009mM HEPES) in 10-cm plates 1\u2009day before transfection. A five-plasmid system was used for viral production, as described in Crawford et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0404 title=\"Crawford, K. H. D. et al. Protocol and reagents for pseudotyping lentiviral particles with SARS-CoV-2 spike protein for neutralization assays. Viruses 12, 513 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR64\" id=\"ref-link-section-d117614227e2875\">64<\/a><\/sup>. The Spike vector contained the 21-amino-acid truncated form of the SARS-CoV-2 Spike sequence from the Wuhan-Hu-1 strain of SARS-CoV-2 (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/BCN86353.1\">BCN86353.1<\/a>) or the Beta variant of concern (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/QUT64557.1\">QUT64557.1<\/a>). The other viral plasmids, used as previously described<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0505 title=\"Crawford, K. H. D. et al. Protocol and reagents for pseudotyping lentiviral particles with SARS-CoV-2 spike protein for neutralization assays. Viruses 12, 513 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR64\" id=\"ref-link-section-d117614227e2893\">64<\/a><\/sup>, are pHAGE-Luc2-IRS-ZsGreen (NR-52516), HDM-Hgpm2 (NR-52517), pRC-CMV-Rev1b (NR-52519) and HDM-tat1b (NR-52518). These plasmids were added to D10 medium in the following ratios: 10\u2009\u03bcg pHAGE-Luc2-IRS-ZsGreen, 3.4\u2009\u03bcg FL Spike, 2.2\u2009\u03bcg HDM-Hgpm2, 2.2\u2009\u03bcg HDM-Tat1b and 2.2\u2009\u03bcg pRC-CMV-Rev1b in a final volume of 1,000\u2009\u03bcl.<\/p>\n<p>Ebola GP-pseudotyped lentiviruses were produced using the same packaging (pHAGE-Luc2-IRS-ZsGreen) and helper plasmids (HDM-Hgpm2, HDM-Tat1b and pRC-CMV-Rev1b) but with the plasmid encoding full-length Ebola GP (GenBank: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/AAG40168.1\">AAG40168.1<\/a>).<\/p>\n<p>After adding plasmids to medium, we added 30\u2009\u03bcl of BioT to form transfection complexes. Transfection reactions were incubated for 10\u2009min at room temperature, and then 9\u2009ml of medium was added slowly. The resultant 10\u2009ml was added to plated HEK cells from which the medium had been removed. Culture medium was removed 24\u2009h after transfection and replaced with fresh D10 medium. Viral supernatants were harvested 72\u2009h after transfection by spinning at 300<i>g<\/i> for 5\u2009min, followed by filtering through a 0.45-\u03bcm filter. Viral stocks were aliquoted and stored at \u221280\u2009\u00b0C until further use.<\/p>\n<h3 id=\"Sec29\">Pseudovirus neutralization<\/h3>\n<p>The target cells used for infection in SARS-CoV-2 pseudovirus neutralization assays are from a HeLa cell line stably overexpressing human angiotensin-converting enzyme 2 (ACE2) as well as the protease known to process SARS-CoV-2: transmembrane serine protease 2 (TMPRSS2). Production of this cell line is described in detail by Rogers et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0606 title=\"Rogers, T. F. et al. Isolation of potent SARS-CoV-2 neutralizing antibodies and protection from disease in a small animal model. Science 369, 956\u2013963 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR65\" id=\"ref-link-section-d117614227e2922\">65<\/a><\/sup> with the addition of stable TMPRSS2 incorporation. ACE2\/TMPRSS2\/HeLa cells were plated 1\u2009day before infection at 8,000 cells per well. For Ebola pseudovirus neutralization assays, HEK293T cells were seeded in 96-well plates 1\u2009day before infection at 20,000 cells per well. Ninety-six-well, white-walled, white-bottom plates were used for neutralization assays (Thermo Fisher Scientific).<\/p>\n<p>On the day of the assay, purified IgGs in 1\u00d7 PBS were sterile filtered using a 0.22-\u03bcm filter. Dilutions of this filtered stock were made into sterile 1\u00d7 Dulbecco\u2019s PBS (DPBS) (Thermo Fisher Scientific), which was 5% by volume D10 medium. A virus mixture was made containing the virus of interest (for example, SARS-CoV-2) and D10 media (DMEM + additives: 10% FBS, L-glutamate, penicillin, streptomycin and 10\u2009mM HEPES). Virus dilutions into media were selected such that a suitable signal would be obtained in the virus-only wells. A suitable signal was selected such that the virus-only wells would achieve a luminescence of at least >5,000,000 relative light units (RLU). Then, 60\u2009\u03bcl of this virus mixture was added to each of the antibody dilutions to make a final volume of 120\u2009\u03bcl in each well. Virus-only wells were made, which contained 60\u2009\u03bcl of 1\u00d7 DPBS and 60\u2009\u03bcl of virus mixture. Cells-only wells were made, which contained 120\u2009\u03bcl of D10 media.<\/p>\n<p>The antibody\/virus mixture was left to incubate for 1\u2009h at 37\u2009\u00b0C. After incubation, the medium was removed from the cells on the plates made 1\u2009day prior. This was replaced with 100\u2009\u03bcl of antibody\/virus dilutions and incubated at 37\u2009\u00b0C for approximately 24\u2009h. Infectivity readout was performed by measuring luciferase levels. SARS-CoV-2 and Ebola pseudovirus neutralization assays were read out 48\u2009h and 72\u2009h after infection, respectively. Medium was removed from all wells, and cells were lysed by the addition of 100\u2009\u03bcl of BriteLite assay readout solution (PerkinElmer) into each well. Luminescence values were measured using an Infinite 200 PRO Microplate Reader (Tecan) using i-control version 2.0 software (Tecan). Each plate was normalized by averaging the cells-only (0% infection) and virus-only (100% infection) wells. We used the neutcurve Python package version 0.5.7 to fit the normalized datapoints and to compute the IC<sub>50<\/sub> values, which we report to two significant digits. To estimate measurement error, we computed the CV for each antibody\u2013virus IC<sub>50<\/sub> pair, and we report the mean CV for each virus in Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">5<\/a>, <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">8<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a>.<\/p>\n<h3 id=\"Sec30\">HLA binding prediction<\/h3>\n<p>As a proxy for predicting T-cell-mediated immunogenicity, we used NetMHCPan version 4.1 and NetMHCIIPan version 4.1 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0707 title=\"Reynisson, B., Alvarez, B., Paul, S., Peters, B. &#038; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 48, W449\u2013W454 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR43\" id=\"ref-link-section-d117614227e2953\">43<\/a><\/sup>) to predict peptide binders to class I and class II HLA, respectively, across a number of alleles. For the class I analysis, we applied NetMHCPan with default parameters to the VH and VL sequences of the wild-type sequences as well as the VH and VL variant sequences listed in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig3\">3a<\/a>. We considered all 9-mer peptides and predicted binding to HLA-A01:01, HLA-A02:01, HLA-A03:01, HLA-A24:02, HLA-A26:01, HLA-B07:02, HLA-B08:01, HLA-B27:05, HLA-B39:01, HLA-B40:01, HLA-B58:01 and HLA-B15:01. For each VH or VL sequence, we counted the number of peptides determined as \u2018strong binders\u2019 or \u2018weak binders\u2019 according to NetMHCPan. We then tested for a significant change in the number of binders between the evolved variant sequence and its corresponding wild-type using the binom_test function in scipy.stats. For the class II analysis, we similarly applied NetMHCIIPan with default parameters to the same set of VH and VL sequences. We considered all 15-mer peptides and predicted binding to DRB1_0101, DRB3_0101, DRB4_0101, DRB5_0101, HLA-DPA10103-DPB10101 and HLA-DQA10101-DQB10201. For each VH or VL sequence, we counted the number of peptides determined as \u2018strong binders\u2019 or \u2018weak binders\u2019 according to NetMHCIIPan. We then tested for a significant change in the number of binders between the evolved variant sequence and its corresponding wild-type using the binom_test function in scipy.stats.<\/p>\n<h3 id=\"Sec31\">Computing frequency of changes to antibody protein sequences<\/h3>\n<p>We computed the frequency of residues involved in affinity-enhancing substitutions by aligning the wild-type VH and VL sequences of our antibodies to databases of protein sequences. The first database that we considered is UniRef90, where we used the same database release used to train ESM-1v. For each antibody protein sequence, we obtained the set of 10,000 sequences in UniRef90 that are closest to the antibody by sequence similarity based on Levenshtein distance (with the farthest sequences having between 18% and 47% sequence similarity). We computed sequence similarity using the FuzzyWuzzy Python package version 0.18.0. We then used MAFFT version 7.475 to perform multiple sequence alignment among the set of sequences. We used the alignment to compute amino acid frequencies at each site in the VH or VL sequence.<\/p>\n<p>The second database that we considered is provided by the abYsis webtool, which also computes the frequency of amino acids at each position based on a multiple sequence alignment. We aligned VH and VL protein sequences using the default settings provided in the \u2018Annotate\u2019 tool, using the database of \u2018All\u2019 sequences as of 1 March 2022.<\/p>\n<p>We also considered the frequency of affinity-enhancing substitutions conditioned on the corresponding V or J gene. We obtained all sequences and corresponding gene annotations from IMGT\/LIGM-DB (the international ImMunoGeneTics information system, Laboratoire d\u2019ImmunoG\u00e9n\u00e9tique Mol\u00e9culaire database) (<a href=\"https:\/\/www.imgt.org\/ligmdb\/\">https:\/\/www.imgt.org\/ligmdb\/<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0808 title=\"Giudicelli, V. et al. IMGT\/LIGM-DB, the IMGT\u00ae comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. Nucleic Acids Res. 34, D781\u2013D784 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR66\" id=\"ref-link-section-d117614227e2981\">66<\/a><\/sup> as of 13 July 2022. For MEDI8852, MEDI8852 UCA, mAb114 and mAb114 UCA, we used the V and J gene annotations from the original publications<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"0909 title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR29\" id=\"ref-link-section-d117614227e2985\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1010 title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR30\" id=\"ref-link-section-d117614227e2988\">30<\/a><\/sup>. For S309, REGN10987 and C143, we used the V and J gene annotations in CoV-AbDab (<a href=\"http:\/\/opig.stats.ox.ac.uk\/webapps\/covabdab\/\">http:\/\/opig.stats.ox.ac.uk\/webapps\/covabdab\/<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Raybould, M. I. J., Kovaltsuk, A., Marks, C. &#038; Deane, C. M. CoV-AbDab: the coronavirus antibody database. Bioinformatics 37, 734\u2013735 (2021).\" href=\"http:\/\/www.nature.com\/#ref-CR67\" id=\"ref-link-section-d117614227e2999\">67<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Jones, E. M. et al. Structural and functional characterization of G protein\u2013coupled receptors with deep mutational scanning. eLife 9, e54895 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR68\" id=\"ref-link-section-d117614227e2999_1\">68<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Stiffler, M. A., Hekstra, D. R. &#038; Ranganathan, R. Evolvability as a function of purifying selection in TEM-1 \u03b2-lactamase. Cell 160, 882\u2013892 (2015).\" href=\"http:\/\/www.nature.com\/#ref-CR69\" id=\"ref-link-section-d117614227e2999_2\">69<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Haddox, H. K., Dingens, A. S. &#038; Bloom, J. D. Experimental estimation of the effects of all amino-acid mutations to HIV\u2019s envelope protein on viral replication in cell culture. PLoS Pathog. 12, e1006114 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR70\" id=\"ref-link-section-d117614227e2999_3\">70<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Doud, M. B. &#038; Bloom, J. D. Accurate measurement of the effects of all amino-acid mutations on influenza hemagglutinin. Viruses 8, 155 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR71\" id=\"ref-link-section-d117614227e2999_4\">71<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Lee, J. M. et al. Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc. Natl Acad. Sci. USA 115, E8276\u2013E8285 (2018).\" href=\"http:\/\/www.nature.com\/#ref-CR72\" id=\"ref-link-section-d117614227e2999_5\">72<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kelsic, E. D. et al. RNA structural determinants of optimal codons revealed by MAGE-Seq. Cell Syst. 3, 563\u2013571 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR73\" id=\"ref-link-section-d117614227e2999_6\">73<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Brenan, L. et al. Phenotypic characterization of a comprehensive set of MAPK1\/ERK2 missense mutants. Cell Rep. 17, 1171\u20131183 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR74\" id=\"ref-link-section-d117614227e2999_7\">74<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1111 title=\"Giacomelli, A. O. et al. Mutational processes shape the landscape of TP53 mutations in human cancer. Nat. Genet. 50, 1381\u20131387 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR75\" id=\"ref-link-section-d117614227e3002\">75<\/a><\/sup>. For a given substitution, we obtained all corresponding V or J protein sequences, performed a multiple sequence alignment with MAFFT version 7.475 and used the resulting alignment to compute amino acid frequencies.<\/p>\n<h3 id=\"Sec32\">Therapeutic antibody database evaluation and runtime benchmark<\/h3>\n<p>We downloaded 742 therapeutically relevant antibodies from the Thera-SAbDab database as of 26 February 2022 (<a href=\"http:\/\/opig.stats.ox.ac.uk\/webapps\/newsabdab\/therasabdab\/\">http:\/\/opig.stats.ox.ac.uk\/webapps\/newsabdab\/therasabdab\/<\/a>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1212 title=\"Raybould, M. I. J. et al. Thera-SAbDab: the therapeutic structural antibody database. Nucleic Acids Res. 48, D383\u2013D388 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR47\" id=\"ref-link-section-d117614227e3021\">47<\/a><\/sup>. For each antibody VH and VL sequence, we used the same procedure described above for computing consensus substitutions that have higher language model likelihood than wild-type. We measured the computational runtime using the time module in Python 3.8. Experiments were performed with an Advanced Micro Devices EPYC Rome 7502P 2.5-GHz CPU and an Nvidia Ampere A40 48GB GPU.<\/p>\n<h3 id=\"Sec33\">Natural protein evaluation and benchmarking based on scanning mutagenesis data<\/h3>\n<p>We evaluated the ability for the language models and algorithms used in our study to guide efficient evolution in other settings beyond antibodies. We used deep mutational scanning (DMS) datasets to validate that our approach would enable a researcher to acquire high-fitness variants. We used all DMS datasets from the benchmarking study by Livesey and Marsh<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1313 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e3033\">48<\/a><\/sup> with 90% or higher coverage of all single-residue substitutions; variants that were not measured were excluded from the analysis. We also used a scanning mutagenesis dataset generated by Markin et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1414 title=\"Markin, C. J. et al. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. Science 373, eabf8761 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR8\" id=\"ref-link-section-d117614227e3037\">8<\/a><\/sup> that measured Michaelis\u2013Menten kinetics of all single-site glycine or valine substitutions to the bacterial enzyme PafA; for this dataset, any language-model-recommended substitutions that did not involve glycine or valine substitutions were excluded from the analysis. We applied a cutoff for each dataset to binarize sequences as high-fitness or low-fitness variants (cutoffs are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">13<\/a>); we then compared enrichment of high-fitness variants among the language-model-recommended variants to the background frequency of high-fitness variants among all single-residue substitutions. For these proteins, as with our antibody experiments, we chose values of <i>k<\/i> that result in a small number (~10<sup>1<\/sup>) of acquired substitutions: we used <i>\u03b1<\/i>\u2009=\u20091 and <i>k<\/i>\u2009=\u20092 for all proteins except those where this resulted in <span>(|{{{mathcal{A}}}}|)<\/span> \u22645, in which case we set <i>k<\/i>\u2009=\u20091 (and additionally <i>\u03b1<\/i>\u2009=\u20090.5 for infA).<\/p>\n<p>To quantify the statistical significance of an enrichment, we assumed that the null distribution of the number of high-fitness, language-model-recommended variants was given by a hypergeometric distribution parameterized by the number of language-model-recommended variants <span>(|{{{mathcal{A}}}}|)<\/span>, the number of high-fitness variants among the all single-residue substitutions and the total number of single-residue substitutions considered, which we used to compute a one-sided <i>P<\/i> value. We used the hypergeometric calculator at <a href=\"https:\/\/stattrek.com\/online-calculator\/hypergeometric.aspx\">https:\/\/stattrek.com\/online-calculator\/hypergeometric.aspx<\/a>.<\/p>\n<p>To test the relationship between likelihood stringency and the fraction of high-fitness substitutions, we also performed a small-scale parameter sweep varying the cutoff values <i>\u03b1<\/i> and <i>k<\/i> and computing (1) the percentage fraction of high-fitness substitutions in <span>({{{mathcal{A}}}})<\/span>; (2) the maximum fitness value of a variant in <span>({{{mathcal{A}}}})<\/span> divided by the maximum fitness value of a variant across the full mutational scan; and (3) the maximum fitness value of a variant in <span>({{{mathcal{A}}}})<\/span> divided by the 99th percentile of the fitness values across the full mutational scan; before this normalization, the raw fitness values are also linearly scaled to take values between 0 and 1, inclusive. Normalized values, the number of acquired variants <span>(|{{{mathcal{A}}}}|)<\/span> and the parameter combinations are plotted in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig8\">4<\/a>.<\/p>\n<p>We also tested how well alternative methods for ranking substitutions would be able to suggest high-fitness variants. To enable a direct comparison to the language model consensus strategy described above, we selected the same number of substitutions and kept all other parameters fixed while only varying the method used to rank substitutions. We used the benchmarking results obtained by Livesey and Marsh<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1515 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e3220\">48<\/a><\/sup> enabling us to test 46 different methods for ranking substitutions, which use evolutionary information, biophysical properties of amino acids or protein structure information; these methods are described in greater detail in Table EV1 of ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1616 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e3224\">48<\/a><\/sup>. We also tested how well using the summed log-likelihood ratios across all ESM language models (that is, computing <span>(mathop {sum}nolimits_j {left( {log p_jleft( {x_i^prime |x} right) &#8211; log p_jleft( {x_i|{{{mathbf{x}}}}} right)} right)})<\/span> at each site <i>i<\/i> and substitution <span>(x_i^prime)<\/span>) would compare to the consensus strategy. For each DMS dataset, we computed the number of high-fitness mutations that were acquired by each of these 47 benchmark methods (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig9\">5<\/a>); we broke any ties in variant effect predictor scores by randomly selecting substitutions and computing the average number of high-fitness variants over 100 random seeds. We aggregated results across DMS datasets by ranking methods within each DMS (averaging the ranks that would have been assigned to tied values) and computed the mean rank across the eight DMS datasets (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig9\">5<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM7\">5<\/a>).<\/p>\n<h3 id=\"Sec34\">Reporting Summary<\/h3>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM2\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n<\/div>\n<\/div><\/div>\n<div data-enable-entitlement-checks>\n<div id=\"data-availability-section\" data-title=\"Data availability\">\n<h2 id=\"data-availability\">Data availability<\/h2>\n<div id=\"data-availability-content\">\n<p>Raw data for this study have been deposited to Zenodo at <a href=\"https:\/\/doi.org\/10.5281\/zenodo.6968342\">https:\/\/doi.org\/10.5281\/zenodo.6968342<\/a>. <i>K<\/i><sub>d<\/sub>, IC<sub>50<\/sub> and <i>T<\/i><sub>m<\/sub> values across replicate experiments are available as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM3\">1<\/a>. Median fluorescence intensity values for the polyspecificity experiments are available as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM4\">2<\/a>. Experimental values for our benchmarking of sequence-based methods and results from our UniRef90 parameter sweeps are available as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a>. High-likelihood amino acid substitutions for 742 therapeutic antibodies are available as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM6\">4<\/a>. Mean rank values for our deep mutational scanning benchmark experiments are available as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM7\">5<\/a>. A list of oligonucleotides used in the study is provided as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM8\">6<\/a>. We also make use of the following publicly available databases and datasets:<\/p>\n<p>\u2022 UniProt: <a href=\"https:\/\/www.uniprot.org\/\">https:\/\/www.uniprot.org\/<\/a><\/p>\n<p>\u2022 UniRef50 2018_03 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1717 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e3563\">23<\/a><\/sup>): <a href=\"https:\/\/ftp.uniprot.org\/pub\/databases\/uniprot\/previous_releases\/release-2018_03\/uniref\/\">https:\/\/ftp.uniprot.org\/pub\/databases\/uniprot\/previous_releases\/release-2018_03\/uniref\/<\/a><\/p>\n<p>\u2022 UniRef90 2020_03 (ref. <sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1818 title=\"Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282\u20131288 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR23\" id=\"ref-link-section-d117614227e3576\">23<\/a><\/sup>): <a href=\"https:\/\/ftp.uniprot.org\/pub\/databases\/uniprot\/previous_releases\/release-2020_03\/uniref\/\">https:\/\/ftp.uniprot.org\/pub\/databases\/uniprot\/previous_releases\/release-2020_03\/uniref\/<\/a><\/p>\n<p>\u2022 abYsis<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"1919 title=\"Swindells, M. B. et al. abYsis: integrated antibody sequence and structure\u2014management, analysis, and prediction. J. Mol. Biol. 429, 356\u2013364 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR44\" id=\"ref-link-section-d117614227e3590\">44<\/a><\/sup>: <a href=\"http:\/\/www.abysis.org\/abysis\/\">http:\/\/www.abysis.org\/abysis\/<\/a><\/p>\n<p>\u2022 IMGT\/LIGM-DB<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"2020 title=\"Giudicelli, V. et al. IMGT\/LIGM-DB, the IMGT\u00ae comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. Nucleic Acids Res. 34, D781\u2013D784 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR66\" id=\"ref-link-section-d117614227e3603\">66<\/a><\/sup>: <a href=\"https:\/\/www.imgt.org\/IMGTindex\/LIGM-DB.php\">https:\/\/www.imgt.org\/IMGTindex\/LIGM-DB.php<\/a><\/p>\n<p>\u2022 Thera-SAbDab<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"2121 title=\"Raybould, M. I. J. et al. Thera-SAbDab: the therapeutic structural antibody database. Nucleic Acids Res. 48, D383\u2013D388 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR47\" id=\"ref-link-section-d117614227e3616\">47<\/a><\/sup>: <a href=\"https:\/\/opig.stats.ox.ac.uk\/webapps\/newsabdab\/therasabdab\/search\/\">https:\/\/opig.stats.ox.ac.uk\/webapps\/newsabdab\/therasabdab\/search\/<\/a><\/p>\n<p>\u2022 Livesey and Marsh benchmarking dataset<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"2222 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e3629\">48<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Jones, E. M. et al. Structural and functional characterization of G protein\u2013coupled receptors with deep mutational scanning. eLife 9, e54895 (2020).\" href=\"http:\/\/www.nature.com\/#ref-CR68\" id=\"ref-link-section-d117614227e3632\">68<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Stiffler, M. A., Hekstra, D. R. &#038; Ranganathan, R. Evolvability as a function of purifying selection in TEM-1 \u03b2-lactamase. Cell 160, 882\u2013892 (2015).\" href=\"http:\/\/www.nature.com\/#ref-CR69\" id=\"ref-link-section-d117614227e3632_1\">69<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Haddox, H. K., Dingens, A. S. &#038; Bloom, J. D. Experimental estimation of the effects of all amino-acid mutations to HIV\u2019s envelope protein on viral replication in cell culture. PLoS Pathog. 12, e1006114 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR70\" id=\"ref-link-section-d117614227e3632_2\">70<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Doud, M. B. &#038; Bloom, J. D. Accurate measurement of the effects of all amino-acid mutations on influenza hemagglutinin. Viruses 8, 155 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR71\" id=\"ref-link-section-d117614227e3632_3\">71<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Lee, J. M. et al. Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc. Natl Acad. Sci. USA 115, E8276\u2013E8285 (2018).\" href=\"http:\/\/www.nature.com\/#ref-CR72\" id=\"ref-link-section-d117614227e3632_4\">72<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kelsic, E. D. et al. RNA structural determinants of optimal codons revealed by MAGE-Seq. Cell Syst. 3, 563\u2013571 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR73\" id=\"ref-link-section-d117614227e3632_5\">73<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Brenan, L. et al. Phenotypic characterization of a comprehensive set of MAPK1\/ERK2 missense mutants. Cell Rep. 17, 1171\u20131183 (2016).\" href=\"http:\/\/www.nature.com\/#ref-CR74\" id=\"ref-link-section-d117614227e3632_6\">74<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 2\"2323 title=\"Giacomelli, A. O. et al. Mutational processes shape the landscape of TP53 mutations in human cancer. Nat. Genet. 50, 1381\u20131387 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR75\" id=\"ref-link-section-d117614227e3635\">75<\/a><\/sup>.<\/p>\n<\/p><\/div>\n<\/div>\n<div id=\"code-availability-section\" data-title=\"Code availability\">\n<h2 id=\"code-availability\">Code availability<\/h2>\n<p>We provide open-source code that enables a user to easily and quickly evaluate the language models on a sequence of interest. We implement this as a simple call to a Python script with the wild-type sequence as the main argument, which is available at <a href=\"https:\/\/github.com\/brianhie\/efficient-evolution\">https:\/\/github.com\/brianhie\/efficient-evolution<\/a>. Code and scripts used in this study are available as <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM9\">Supplementary Code<\/a> and have been deposited to Zenodo at <a href=\"https:\/\/doi.org\/10.5281\/zenodo.6977562\">https:\/\/doi.org\/10.5281\/zenodo.6977562<\/a>.<\/p>\n<\/div>\n<div id=\"MagazineFulltextArticleBodySuffix\" aria-labelledby=\"Bib1\" data-title=\"References\">\n<h2 id=\"Bib1\">References<\/h2>\n<div data-container-section=\"references\" id=\"Bib1-content\">\n<ol data-track-component=\"outbound reference\">\n<li data-counter=\"1.\">\n<p id=\"ref-CR1\">Futuyma, D. J. <i>Evolutionary Biology<\/i> 3rd ed (Sinauer Associates, 1997).<\/p>\n<\/li>\n<li data-counter=\"2.\">\n<p id=\"ref-CR2\">Wright, S. The roles of mutation, inbreeding, crossbreeding and selection in evolution. <i>Proc. of the VI International Congress of Genetics<\/i> 355\u2013366 (Blackwell, 1932).<\/p>\n<\/li>\n<li data-counter=\"3.\">\n<p id=\"ref-CR3\">Arnold, F. H. Directed evolution: bringing new chemistry to life. <i>Angew. Chem. Int. Ed. Engl.<\/i> <b>57<\/b>, 4143\u20134148 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1002\/anie.201708408\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1002%2Fanie.201708408\" aria-label=\"Reference 2\"2424 data-doi=\"10.1002\/anie.201708408\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXhvVOjsrvO\" aria-label=\"Reference 2\"2525>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=29064156\" aria-label=\"Reference 2\"2626>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"2727 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Directed%20evolution%3A%20bringing%20new%20chemistry%20to%20life&#038;journal=Angew.%20Chem.%20Int.%20Ed.%20Engl.&#038;doi=10.1002%2Fanie.201708408&#038;volume=57&#038;pages=4143-4148&#038;publication_year=2018&#038;author=Arnold%2CFH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"4.\">\n<p id=\"ref-CR4\">Fowler, D. M. &#038; Fields, S. Deep mutational scanning: a new style of protein science. <i>Nat. Methods<\/i> <b>11<\/b>, 801\u2013807 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nmeth.3027\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnmeth.3027\" aria-label=\"Reference 2\"2828 data-doi=\"10.1038\/nmeth.3027\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXhslelsLvK\" aria-label=\"Reference 2\"2929>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=25075907\" aria-label=\"Reference 2\"3030>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4410700\" aria-label=\"Reference 2\"3131>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"3232 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Deep%20mutational%20scanning%3A%20a%20new%20style%20of%20protein%20science&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fnmeth.3027&#038;volume=11&#038;pages=801-807&#038;publication_year=2014&#038;author=Fowler%2CDM&#038;author=Fields%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"5.\">\n<p id=\"ref-CR5\">Hunter, S. A. &#038; Cochran, J. R. Cell-binding assays for determining the affinity of protein\u2013protein interactions. <i>Methods Enzymol.<\/i> <b>580<\/b>, 21\u201344 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/bs.mie.2016.05.002\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fbs.mie.2016.05.002\" aria-label=\"Reference 2\"3333 data-doi=\"10.1016\/bs.mie.2016.05.002\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:STN:280:DC%2BC2szntVGlsQ%3D%3D\" aria-label=\"Reference 2\"3434>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27586327\" aria-label=\"Reference 2\"3535>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6067677\" aria-label=\"Reference 2\"3636>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"3737 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Cell-binding%20assays%20for%20determining%20the%20affinity%20of%20protein%E2%80%93protein%20interactions&#038;journal=Methods%20Enzymol.&#038;doi=10.1016%2Fbs.mie.2016.05.002&#038;volume=580&#038;pages=21-44&#038;publication_year=2016&#038;author=Hunter%2CSA&#038;author=Cochran%2CJR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"6.\">\n<p id=\"ref-CR6\">Khersonsky, O. &#038; Tawfik, D. S. Enzyme promiscuity: a mechanistic and evolutionary perspective. <i>Annu. Rev. Biochem.<\/i> <b>79<\/b>, 471\u2013505 (2010).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1146\/annurev-biochem-030409-143718\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1146%2Fannurev-biochem-030409-143718\" aria-label=\"Reference 2\"3838 data-doi=\"10.1146\/annurev-biochem-030409-143718\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3cXpslShtrY%3D\" aria-label=\"Reference 2\"3939>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=20235827\" aria-label=\"Reference 2\"4040>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"4141 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Enzyme%20promiscuity%3A%20a%20mechanistic%20and%20evolutionary%20perspective&#038;journal=Annu.%20Rev.%20Biochem.&#038;doi=10.1146%2Fannurev-biochem-030409-143718&#038;volume=79&#038;pages=471-505&#038;publication_year=2010&#038;author=Khersonsky%2CO&#038;author=Tawfik%2CDS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"7.\">\n<p id=\"ref-CR7\">Bloom, J. D., Labthavikul, S. T., Otey, C. R. &#038; Arnold, F. H. Protein stability promotes evolvability. <i>Proc. Natl Acad. Sci. USA<\/i> <b>103<\/b>, 5869\u20135874 (2006).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.0510098103\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.0510098103\" aria-label=\"Reference 2\"4242 data-doi=\"10.1073\/pnas.0510098103\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD28XktFait7s%3D\" aria-label=\"Reference 2\"4343>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16581913\" aria-label=\"Reference 2\"4444>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC1458665\" aria-label=\"Reference 2\"4545>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"4646 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Protein%20stability%20promotes%20evolvability&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.0510098103&#038;volume=103&#038;pages=5869-5874&#038;publication_year=2006&#038;author=Bloom%2CJD&#038;author=Labthavikul%2CST&#038;author=Otey%2CCR&#038;author=Arnold%2CFH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"8.\">\n<p id=\"ref-CR8\">Markin, C. J. et al. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. <i>Science<\/i> <b>373<\/b>, eabf8761 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.abf8761\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.abf8761\" aria-label=\"Reference 2\"4747 data-doi=\"10.1126\/science.abf8761\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhs1Kiu7jK\" aria-label=\"Reference 2\"4848>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34437092\" aria-label=\"Reference 2\"4949>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8454890\" aria-label=\"Reference 2\"5050>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5151 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Revealing%20enzyme%20functional%20architecture%20via%20high-throughput%20microfluidic%20enzyme%20kinetics&#038;journal=Science&#038;doi=10.1126%2Fscience.abf8761&#038;volume=373&#038;publication_year=2021&#038;author=Markin%2CCJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"9.\">\n<p id=\"ref-CR9\">Wittmann, B. J., Yue, Y. &#038; Arnold, F. H. Informed training set design enables efficient machine learning-assisted directed protein evolution. <i>Cell Syst.<\/i> <b>12<\/b>, 1026\u20131045 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2021.07.008\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2021.07.008\" aria-label=\"Reference 2\"5252 data-doi=\"10.1016\/j.cels.2021.07.008\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhvVehtLrP\" aria-label=\"Reference 2\"5353>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34416172\" aria-label=\"Reference 2\"5454>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5555 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Informed%20training%20set%20design%20enables%20efficient%20machine%20learning-assisted%20directed%20protein%20evolution&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2021.07.008&#038;volume=12&#038;pages=1026-1045&#038;publication_year=2021&#038;author=Wittmann%2CBJ&#038;author=Yue%2CY&#038;author=Arnold%2CFH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"10.\">\n<p id=\"ref-CR10\">Hie, B. L., Yang, K. K. &#038; Kim, P. S. Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins. <i>Cell Syst.<\/i> <b>13<\/b>, 274\u2013285 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2022.01.003\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2022.01.003\" aria-label=\"Reference 2\"5656 data-doi=\"10.1016\/j.cels.2022.01.003\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XislKksbo%3D\" aria-label=\"Reference 2\"5757>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35120643\" aria-label=\"Reference 2\"5858>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"5959 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Evolutionary%20velocity%20with%20protein%20language%20models%20predicts%20evolutionary%20dynamics%20of%20diverse%20proteins&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2022.01.003&#038;volume=13&#038;pages=274-285&#038;publication_year=2022&#038;author=Hie%2CBL&#038;author=Yang%2CKK&#038;author=Kim%2CPS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"11.\">\n<p id=\"ref-CR11\">Eisen, H. N. &#038; Siskind, G. W. Variations in affinities of antibodies during the immune response. <i>Biochemistry<\/i> <b>3<\/b>, 996\u2013100 (1964).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1021\/bi00895a027\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1021%2Fbi00895a027\" aria-label=\"Reference 2\"6060 data-doi=\"10.1021\/bi00895a027\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DyaF2cXktlGgsLs%3D\" aria-label=\"Reference 2\"6161>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=14214095\" aria-label=\"Reference 2\"6262>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"6363 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Variations%20in%20affinities%20of%20antibodies%20during%20the%20immune%20response&#038;journal=Biochemistry&#038;doi=10.1021%2Fbi00895a027&#038;volume=3&#038;pages=996-100&#038;publication_year=1964&#038;author=Eisen%2CHN&#038;author=Siskind%2CGW\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"12.\">\n<p id=\"ref-CR12\">Eisen, H. N. Affinity enhancement of antibodies: how low-affinity antibodies produced early in immune responses are followed by high-affinity antibodies later and in memory B-cell responses. <i>Cancer Immunol. Res.<\/i> <b>2<\/b>, 381\u2013392 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1158\/2326-6066.CIR-14-0029\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1158%2F2326-6066.CIR-14-0029\" aria-label=\"Reference 2\"6464 data-doi=\"10.1158\/2326-6066.CIR-14-0029\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXns1Cgu7g%3D\" aria-label=\"Reference 2\"6565>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24795350\" aria-label=\"Reference 2\"6666>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"6767 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Affinity%20enhancement%20of%20antibodies%3A%20how%20low-affinity%20antibodies%20produced%20early%20in%20immune%20responses%20are%20followed%20by%20high-affinity%20antibodies%20later%20and%20in%20memory%20B-cell%20responses&#038;journal=Cancer%20Immunol.%20Res.&#038;doi=10.1158%2F2326-6066.CIR-14-0029&#038;volume=2&#038;pages=381-392&#038;publication_year=2014&#038;author=Eisen%2CHN\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"13.\">\n<p id=\"ref-CR13\">Victora, G. D. &#038; Nussenzweig, M. C. Germinal centers. <i>Annu. Rev. Immunol.<\/i> <b>40<\/b>, 413\u2013442 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1146\/annurev-immunol-120419-022408\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1146%2Fannurev-immunol-120419-022408\" aria-label=\"Reference 2\"6868 data-doi=\"10.1146\/annurev-immunol-120419-022408\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35113731\" aria-label=\"Reference 2\"6969>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"7070 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Germinal%20centers&#038;journal=Annu.%20Rev.%20Immunol.&#038;doi=10.1146%2Fannurev-immunol-120419-022408&#038;volume=40&#038;pages=413-442&#038;publication_year=2022&#038;author=Victora%2CGD&#038;author=Nussenzweig%2CMC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"14.\">\n<p id=\"ref-CR14\">Wellner, A. et al. Rapid generation of potent antibodies by autonomous hypermutation in yeast. <i>Nat. Chem. Biol.<\/i> <b>17<\/b>, 1057\u20131064 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41589-021-00832-4\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41589-021-00832-4\" aria-label=\"Reference 2\"7171 data-doi=\"10.1038\/s41589-021-00832-4\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhsVWhtLjL\" aria-label=\"Reference 2\"7272>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34168368\" aria-label=\"Reference 2\"7373>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8463502\" aria-label=\"Reference 2\"7474>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"7575 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Rapid%20generation%20of%20potent%20antibodies%20by%20autonomous%20hypermutation%20in%20yeast&#038;journal=Nat.%20Chem.%20Biol.&#038;doi=10.1038%2Fs41589-021-00832-4&#038;volume=17&#038;pages=1057-1064&#038;publication_year=2021&#038;author=Wellner%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"15.\">\n<p id=\"ref-CR15\">Bepler, T. &#038; Berger, B. Learning the protein language: evolution, structure and function. <i>Cell Syst.<\/i> <b>12<\/b>, 654\u2013669 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2021.05.017\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2021.05.017\" aria-label=\"Reference 2\"7676 data-doi=\"10.1016\/j.cels.2021.05.017\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhtlyqtLbO\" aria-label=\"Reference 2\"7777>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34139171\" aria-label=\"Reference 2\"7878>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8238390\" aria-label=\"Reference 2\"7979>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"8080 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Learning%20the%20protein%20language%3A%20evolution%2C%20structure%20and%20function&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2021.05.017&#038;volume=12&#038;pages=654-669&#038;publication_year=2021&#038;author=Bepler%2CT&#038;author=Berger%2CB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"16.\">\n<p id=\"ref-CR16\">Bepler, T. &#038; Berger, B. Learning protein sequence embeddings using information from structure. <i>International Conference on Learning Representations<\/i>. Preprint at <i>arXiv<\/i> <a href=\"https:\/\/doi.org\/10.48550\/arXiv.1902.08661\">https:\/\/doi.org\/10.48550\/arXiv.1902.08661<\/a> (2019).<\/p>\n<\/li>\n<li data-counter=\"17.\">\n<p id=\"ref-CR17\">Hie, B., Zhong, E., Berger, B. &#038; Bryson, B. Learning the language of viral evolution and escape. <i>Science<\/i> <b>371<\/b>, 284\u2013288 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.abd7331\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.abd7331\" aria-label=\"Reference 2\"8181 data-doi=\"10.1126\/science.abd7331\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhsVaitbs%3D\" aria-label=\"Reference 2\"8282>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33446556\" aria-label=\"Reference 2\"8383>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"8484 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Learning%20the%20language%20of%20viral%20evolution%20and%20escape&#038;journal=Science&#038;doi=10.1126%2Fscience.abd7331&#038;volume=371&#038;pages=284-288&#038;publication_year=2021&#038;author=Hie%2CB&#038;author=Zhong%2CE&#038;author=Berger%2CB&#038;author=Bryson%2CB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"18.\">\n<p id=\"ref-CR18\">Alley, E. C., Khimulya, G., Biswas, S., AlQuraishi, M. &#038; Church, G. M. Unified rational protein engineering with sequence-based deep representation learning. <i>Nat. Methods<\/i> <b>16<\/b>, 1315\u20131322 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41592-019-0598-1\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41592-019-0598-1\" aria-label=\"Reference 2\"8585 data-doi=\"10.1038\/s41592-019-0598-1\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXitVSlsbnJ\" aria-label=\"Reference 2\"8686>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31636460\" aria-label=\"Reference 2\"8787>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7067682\" aria-label=\"Reference 2\"8888>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"8989 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Unified%20rational%20protein%20engineering%20with%20sequence-based%20deep%20representation%20learning&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fs41592-019-0598-1&#038;volume=16&#038;pages=1315-1322&#038;publication_year=2019&#038;author=Alley%2CEC&#038;author=Khimulya%2CG&#038;author=Biswas%2CS&#038;author=AlQuraishi%2CM&#038;author=Church%2CGM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"19.\">\n<p id=\"ref-CR19\">Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. <i>Proc. Natl Acad. Sci. USA<\/i> <b>118<\/b>, e2016239118 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.2016239118\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.2016239118\" aria-label=\"Reference 2\"9090 data-doi=\"10.1073\/pnas.2016239118\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXovVantro%3D\" aria-label=\"Reference 2\"9191>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33876751\" aria-label=\"Reference 2\"9292>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8053943\" aria-label=\"Reference 2\"9393>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"9494 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Biological%20structure%20and%20function%20emerge%20from%20scaling%20unsupervised%20learning%20to%20250%20million%20protein%20sequences&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.2016239118&#038;volume=118&#038;publication_year=2021&#038;author=Rives%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"20.\">\n<p id=\"ref-CR20\">Meier, J. et al. Language models enable zero-shot prediction of the effects of mutations on protein function. <i>Adv. Neural. Inf. Process. Syst. 34<\/i> <a href=\"https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf\">https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2021\/file\/f51338d736f95dd42427296047067694-Paper.pdf<\/a> (NeurIPS, 2021).<\/p>\n<\/li>\n<li data-counter=\"21.\">\n<p id=\"ref-CR21\">Elnaggar, A. et al. ProtTrans: towards cracking the language of life\u2019s code through self-supervised deep learning and high performance computing. <i>IEEE Trans. Pattern Anal. Mach. Intell.<\/i> <b>44<\/b>, 7112\u20137127 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1109\/TPAMI.2021.3095381\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1109%2FTPAMI.2021.3095381\" aria-label=\"Reference 2\"9595 data-doi=\"10.1109\/TPAMI.2021.3095381\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34232869\" aria-label=\"Reference 2\"9696>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 2\"9797 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=ProtTrans%3A%20towards%20cracking%20the%20language%20of%20life%E2%80%99s%20code%20through%20self-supervised%20deep%20learning%20and%20high%20performance%20computing&#038;journal=IEEE%20Trans.%20Pattern%20Anal.%20Mach.%20Intell.&#038;doi=10.1109%2FTPAMI.2021.3095381&#038;volume=44&#038;pages=7112-7127&#038;publication_year=2022&#038;author=Elnaggar%2CA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"22.\">\n<p id=\"ref-CR22\">Nijkamp, E., Ruffolo, J., Weinstein, E. N., Naik, N. &#038; Madani, A. ProGen2: exploring the boundaries of protein language models. Preprint at <i>arXiv<\/i> <a href=\"https:\/\/doi.org\/10.48550\/arXiv.2206.13517\">https:\/\/doi.org\/10.48550\/arXiv.2206.13517<\/a> (2022).<\/p>\n<\/li>\n<li data-counter=\"23.\">\n<p id=\"ref-CR23\">Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. &#038; Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. <i>Bioinformatics<\/i> <b>23<\/b>, 1282\u20131288 (2007).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/bioinformatics\/btm098\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fbioinformatics%2Fbtm098\" aria-label=\"Reference 2\"9898 data-doi=\"10.1093\/bioinformatics\/btm098\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2sXntVOjurw%3D\" aria-label=\"Reference 2\"9999>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=17379688\" aria-label=\"Reference 1\"0000>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"0101 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=UniRef%3A%20comprehensive%20and%20non-redundant%20UniProt%20reference%20clusters&#038;journal=Bioinformatics&#038;doi=10.1093%2Fbioinformatics%2Fbtm098&#038;volume=23&#038;pages=1282-1288&#038;publication_year=2007&#038;author=Suzek%2CBE&#038;author=Huang%2CH&#038;author=McGarvey%2CP&#038;author=Mazumder%2CR&#038;author=Wu%2CCH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"24.\">\n<p id=\"ref-CR24\">Olsen, T. H., Moal, I. H. &#038; Deane, C. M. AbLang: an antibody language model for completing antibody sequences. <i>Bioinform. Adv.<\/i> <b>2<\/b>, vbac046 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/bioadv\/vbac046\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fbioadv%2Fvbac046\" aria-label=\"Reference 1\"0202 data-doi=\"10.1093\/bioadv\/vbac046\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=36699403\" aria-label=\"Reference 1\"0303>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC9710568\" aria-label=\"Reference 1\"0404>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"0505 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=AbLang%3A%20an%20antibody%20language%20model%20for%20completing%20antibody%20sequences&#038;journal=Bioinform.%20Adv.&#038;doi=10.1093%2Fbioadv%2Fvbac046&#038;volume=2&#038;publication_year=2022&#038;author=Olsen%2CTH&#038;author=Moal%2CIH&#038;author=Deane%2CCM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"25.\">\n<p id=\"ref-CR25\">Prihoda, D. et al. BioPhi: a platform for antibody design, humanization, and humanness evaluation based on natural antibody repertoires and deep learning. <i>mAbs<\/i> <b>14<\/b>, 2020203 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1080\/19420862.2021.2020203\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1080%2F19420862.2021.2020203\" aria-label=\"Reference 1\"0606 data-doi=\"10.1080\/19420862.2021.2020203\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35133949\" aria-label=\"Reference 1\"0707>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8837241\" aria-label=\"Reference 1\"0808>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"0909 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=BioPhi%3A%20a%20platform%20for%20antibody%20design%2C%20humanization%2C%20and%20humanness%20evaluation%20based%20on%20natural%20antibody%20repertoires%20and%20deep%20learning&#038;journal=mAbs&#038;doi=10.1080%2F19420862.2021.2020203&#038;volume=14&#038;publication_year=2022&#038;author=Prihoda%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"26.\">\n<p id=\"ref-CR26\">Ruffolo, J. A., Gray, J. J. &#038; Sulam J. Deciphering antibody affinity maturation with language models and weakly supervised learning. <i>NeurIPS Workshop on Machine Learning in Structural Biology<\/i>. Preprint at <i>arXiv<\/i> <a href=\"https:\/\/doi.org\/10.48550\/arXiv.2112.07782\">https:\/\/doi.org\/10.48550\/arXiv.2112.07782<\/a> (2021).<\/p>\n<\/li>\n<li data-counter=\"27.\">\n<p id=\"ref-CR27\">Shuai, R. W., Ruffolo, J. A. &#038; Gray, J. J. Generative language modeling for antibody design. Preprint at <i>bioRxiv<\/i> <a href=\"https:\/\/doi.org\/10.1101\/2021.12.13.472419\">https:\/\/doi.org\/10.1101\/2021.12.13.472419<\/a> (2021).<\/p>\n<\/li>\n<li data-counter=\"28.\">\n<p id=\"ref-CR28\">Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. <i>Nat. Biomed. Eng.<\/i> <b>5<\/b>, 600\u2013612 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41551-021-00699-9\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41551-021-00699-9\" aria-label=\"Reference 1\"1010 data-doi=\"10.1038\/s41551-021-00699-9\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhsVWisbjN\" aria-label=\"Reference 1\"1111>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33859386\" aria-label=\"Reference 1\"1212>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"1313 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Optimization%20of%20therapeutic%20antibodies%20by%20predicting%20antigen%20specificity%20from%20antibody%20sequence%20via%20deep%20learning&#038;journal=Nat.%20Biomed.%20Eng.&#038;doi=10.1038%2Fs41551-021-00699-9&#038;volume=5&#038;pages=600-612&#038;publication_year=2021&#038;author=Mason%2CDM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"29.\">\n<p id=\"ref-CR29\">Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. <i>Cell<\/i> <b>166<\/b>, 596\u2013608 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cell.2016.05.073\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cell.2016.05.073\" aria-label=\"Reference 1\"1414 data-doi=\"10.1016\/j.cell.2016.05.073\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28Xht1eju7nL\" aria-label=\"Reference 1\"1515>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27453466\" aria-label=\"Reference 1\"1616>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4967455\" aria-label=\"Reference 1\"1717>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"1818 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Structure%20and%20function%20analysis%20of%20an%20antibody%20recognizing%20all%20influenza%20A%20subtypes&#038;journal=Cell&#038;doi=10.1016%2Fj.cell.2016.05.073&#038;volume=166&#038;pages=596-608&#038;publication_year=2016&#038;author=Kallewaard%2CNL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"30.\">\n<p id=\"ref-CR30\">Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. <i>Science<\/i> <b>351<\/b>, 1339\u20131342 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.aad5224\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.aad5224\" aria-label=\"Reference 1\"1919 data-doi=\"10.1126\/science.aad5224\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XktFarsLk%3D\" aria-label=\"Reference 1\"2020>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26917593\" aria-label=\"Reference 1\"2121>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"2222 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Protective%20monotherapy%20against%20lethal%20Ebola%20virus%20infection%20by%20a%20potently%20neutralizing%20antibody&#038;journal=Science&#038;doi=10.1126%2Fscience.aad5224&#038;volume=351&#038;pages=1339-1342&#038;publication_year=2016&#038;author=Corti%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"31.\">\n<p id=\"ref-CR31\">Pinto, D. et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. <i>Nature<\/i> <b>583<\/b>, 290\u2013295 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41586-020-2349-y\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41586-020-2349-y\" aria-label=\"Reference 1\"2323 data-doi=\"10.1038\/s41586-020-2349-y\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXht1Cmu7bI\" aria-label=\"Reference 1\"2424>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32422645\" aria-label=\"Reference 1\"2525>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"2626 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Cross-neutralization%20of%20SARS-CoV-2%20by%20a%20human%20monoclonal%20SARS-CoV%20antibody&#038;journal=Nature&#038;doi=10.1038%2Fs41586-020-2349-y&#038;volume=583&#038;pages=290-295&#038;publication_year=2020&#038;author=Pinto%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"32.\">\n<p id=\"ref-CR32\">Hansen, J. et al. Studies in humanized mice and convalescent humans yield a SARS-CoV-2 antibody cocktail. <i>Science<\/i> <b>369<\/b>, 1010\u20131014 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.abd0827\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.abd0827\" aria-label=\"Reference 1\"2727 data-doi=\"10.1126\/science.abd0827\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhs1Grsb3N\" aria-label=\"Reference 1\"2828>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32540901\" aria-label=\"Reference 1\"2929>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7299284\" aria-label=\"Reference 1\"3030>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"3131 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Studies%20in%20humanized%20mice%20and%20convalescent%20humans%20yield%20a%20SARS-CoV-2%20antibody%20cocktail&#038;journal=Science&#038;doi=10.1126%2Fscience.abd0827&#038;volume=369&#038;pages=1010-1014&#038;publication_year=2020&#038;author=Hansen%2CJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"33.\">\n<p id=\"ref-CR33\">Yang, K. K., Wu, Z. &#038; Arnold, F. H. Machine-learning-guided directed evolution for protein engineering. <i>Nat. Methods<\/i> <b>16<\/b>, 687\u2013694 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41592-019-0496-6\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41592-019-0496-6\" aria-label=\"Reference 1\"3232 data-doi=\"10.1038\/s41592-019-0496-6\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXhtlOlsb7K\" aria-label=\"Reference 1\"3333>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31308553\" aria-label=\"Reference 1\"3434>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"3535 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Machine-learning-guided%20directed%20evolution%20for%20protein%20engineering&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fs41592-019-0496-6&#038;volume=16&#038;pages=687-694&#038;publication_year=2019&#038;author=Yang%2CKK&#038;author=Wu%2CZ&#038;author=Arnold%2CFH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"34.\">\n<p id=\"ref-CR34\">Hie, B. L. &#038; Yang, K. K. Adaptive machine learning for protein engineering. <i>Curr. Opin. Struct .Biol.<\/i> <b>72<\/b>, 145\u2013152 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.sbi.2021.11.002\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.sbi.2021.11.002\" aria-label=\"Reference 1\"3636 data-doi=\"10.1016\/j.sbi.2021.11.002\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXis1KlsLfN\" aria-label=\"Reference 1\"3737>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34896756\" aria-label=\"Reference 1\"3838>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"3939 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Adaptive%20machine%20learning%20for%20protein%20engineering&#038;journal=Curr.%20Opin.%20Struct%20.Biol.&#038;doi=10.1016%2Fj.sbi.2021.11.002&#038;volume=72&#038;pages=145-152&#038;publication_year=2022&#038;author=Hie%2CBL&#038;author=Yang%2CKK\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"35.\">\n<p id=\"ref-CR35\">Alexander, E. et al. Antibody therapies for SARS-CoV-2 infection. WO2021252878A1 (2021).<\/p>\n<\/li>\n<li data-counter=\"36.\">\n<p id=\"ref-CR36\">Telenti, A., Hodcroft, E. B. &#038; Robertson, D. L. The evolution and biology of SARS-CoV-2 variants. <i>Cold Spring Harb. Perspect. Med.<\/i> <b>12<\/b>, a041390 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1101\/cshperspect.a041390\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1101%2Fcshperspect.a041390\" aria-label=\"Reference 1\"4040 data-doi=\"10.1101\/cshperspect.a041390\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XitlWqtrfO\" aria-label=\"Reference 1\"4141>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35444005\" aria-label=\"Reference 1\"4242>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"4343 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=The%20evolution%20and%20biology%20of%20SARS-CoV-2%20variants&#038;journal=Cold%20Spring%20Harb.%20Perspect.%20Med.&#038;doi=10.1101%2Fcshperspect.a041390&#038;volume=12&#038;publication_year=2022&#038;author=Telenti%2CA&#038;author=Hodcroft%2CEB&#038;author=Robertson%2CDL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"37.\">\n<p id=\"ref-CR37\">Maher, M. C. et al. Predicting the mutational drivers of future SARS-CoV-2 variants of concern. <i>Sci. Transl. Med.<\/i> <b>14<\/b>, eabk3445 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/scitranslmed.abk3445\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscitranslmed.abk3445\" aria-label=\"Reference 1\"4444 data-doi=\"10.1126\/scitranslmed.abk3445\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38Xmt1CjtLc%3D\" aria-label=\"Reference 1\"4545>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35014856\" aria-label=\"Reference 1\"4646>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"4747 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Predicting%20the%20mutational%20drivers%20of%20future%20SARS-CoV-2%20variants%20of%20concern&#038;journal=Sci.%20Transl.%20Med.&#038;doi=10.1126%2Fscitranslmed.abk3445&#038;volume=14&#038;publication_year=2022&#038;author=Maher%2CMC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"38.\">\n<p id=\"ref-CR38\">Gaebler, C. et al. Evolution of antibody immunity to SARS-CoV-2. <i>Nature<\/i> <b>591<\/b>, 639\u2013644 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41586-021-03207-w\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41586-021-03207-w\" aria-label=\"Reference 1\"4848 data-doi=\"10.1038\/s41586-021-03207-w\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXjslKgtbY%3D\" aria-label=\"Reference 1\"4949>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33461210\" aria-label=\"Reference 1\"5050>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8221082\" aria-label=\"Reference 1\"5151>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"5252 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Evolution%20of%20antibody%20immunity%20to%20SARS-CoV-2&#038;journal=Nature&#038;doi=10.1038%2Fs41586-021-03207-w&#038;volume=591&#038;pages=639-644&#038;publication_year=2021&#038;author=Gaebler%2CC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"39.\">\n<p id=\"ref-CR39\">Muecksch, F. et al. Affinity maturation of SARS-CoV-2 neutralizing antibodies confers potency, breadth, and resilience to viral escape mutations. <i>Immunity<\/i> <b>54<\/b>, 1853\u20131868 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.immuni.2021.07.008\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.immuni.2021.07.008\" aria-label=\"Reference 1\"5353 data-doi=\"10.1016\/j.immuni.2021.07.008\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhs1yiu7zI\" aria-label=\"Reference 1\"5454>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34331873\" aria-label=\"Reference 1\"5555>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8323339\" aria-label=\"Reference 1\"5656>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"5757 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Affinity%20maturation%20of%20SARS-CoV-2%20neutralizing%20antibodies%20confers%20potency%2C%20breadth%2C%20and%20resilience%20to%20viral%20escape%20mutations&#038;journal=Immunity&#038;doi=10.1016%2Fj.immuni.2021.07.008&#038;volume=54&#038;pages=1853-1868&#038;publication_year=2021&#038;author=Muecksch%2CF\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"40.\">\n<p id=\"ref-CR40\">Hsieh, C.-L. et al. Structure-based design of prefusion-stabilized SARS-CoV-2 spikes. <i>Science<\/i> <b>369<\/b>, 1501\u20131505 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.abd0826\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.abd0826\" aria-label=\"Reference 1\"5858 data-doi=\"10.1126\/science.abd0826\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhvVOns7zM\" aria-label=\"Reference 1\"5959>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32703906\" aria-label=\"Reference 1\"6060>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"6161 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Structure-based%20design%20of%20prefusion-stabilized%20SARS-CoV-2%20spikes&#038;journal=Science&#038;doi=10.1126%2Fscience.abd0826&#038;volume=369&#038;pages=1501-1505&#038;publication_year=2020&#038;author=Hsieh%2CC-L\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"41.\">\n<p id=\"ref-CR41\">Xu, Y. et al. Addressing polyspecificity of antibodies selected from an in vitro yeast presentation system: a FACS-based, high-throughput selection and analytical tool. <i>Protein Eng. Des. Sel.<\/i> <b>26<\/b>, 663\u2013670 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/protein\/gzt047\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fprotein%2Fgzt047\" aria-label=\"Reference 1\"6262 data-doi=\"10.1093\/protein\/gzt047\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXhsFertbjL\" aria-label=\"Reference 1\"6363>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24046438\" aria-label=\"Reference 1\"6464>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"6565 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Addressing%20polyspecificity%20of%20antibodies%20selected%20from%20an%20in%20vitro%20yeast%20presentation%20system%3A%20a%20FACS-based%2C%20high-throughput%20selection%20and%20analytical%20tool&#038;journal=Protein%20Eng.%20Des.%20Sel.&#038;doi=10.1093%2Fprotein%2Fgzt047&#038;volume=26&#038;pages=663-670&#038;publication_year=2013&#038;author=Xu%2CY\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"42.\">\n<p id=\"ref-CR42\">Makowski, E. K., Wu, L., Desai, A. A. &#038; Tessier, P. M. Highly sensitive detection of antibody nonspecific interactions using flow cytometry. <i>mAbs<\/i> <b>13<\/b>, 1951426 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1080\/19420862.2021.1951426\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1080%2F19420862.2021.1951426\" aria-label=\"Reference 1\"6666 data-doi=\"10.1080\/19420862.2021.1951426\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34313552\" aria-label=\"Reference 1\"6767>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8317921\" aria-label=\"Reference 1\"6868>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"6969 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Highly%20sensitive%20detection%20of%20antibody%20nonspecific%20interactions%20using%20flow%20cytometry&#038;journal=mAbs&#038;doi=10.1080%2F19420862.2021.1951426&#038;volume=13&#038;publication_year=2021&#038;author=Makowski%2CEK&#038;author=Wu%2CL&#038;author=Desai%2CAA&#038;author=Tessier%2CPM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"43.\">\n<p id=\"ref-CR43\">Reynisson, B., Alvarez, B., Paul, S., Peters, B. &#038; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. <i>Nucleic Acids Res.<\/i> <b>48<\/b>, W449\u2013W454 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/nar\/gkaa379\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fnar%2Fgkaa379\" aria-label=\"Reference 1\"7070 data-doi=\"10.1093\/nar\/gkaa379\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXis1entr7E\" aria-label=\"Reference 1\"7171>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32406916\" aria-label=\"Reference 1\"7272>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7319546\" aria-label=\"Reference 1\"7373>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"7474 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=NetMHCpan-4.1%20and%20NetMHCIIpan-4.0%3A%20improved%20predictions%20of%20MHC%20antigen%20presentation%20by%20concurrent%20motif%20deconvolution%20and%20integration%20of%20MS%20MHC%20eluted%20ligand%20data&#038;journal=Nucleic%20Acids%20Res.&#038;doi=10.1093%2Fnar%2Fgkaa379&#038;volume=48&#038;pages=W449-W454&#038;publication_year=2020&#038;author=Reynisson%2CB&#038;author=Alvarez%2CB&#038;author=Paul%2CS&#038;author=Peters%2CB&#038;author=Nielsen%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"44.\">\n<p id=\"ref-CR44\">Swindells, M. B. et al. abYsis: integrated antibody sequence and structure\u2014management, analysis, and prediction. <i>J. Mol. Biol.<\/i> <b>429<\/b>, 356\u2013364 (2017).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.jmb.2016.08.019\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.jmb.2016.08.019\" aria-label=\"Reference 1\"7575 data-doi=\"10.1016\/j.jmb.2016.08.019\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XhsVWksL3J\" aria-label=\"Reference 1\"7676>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27561707\" aria-label=\"Reference 1\"7777>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"7878 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=abYsis%3A%20integrated%20antibody%20sequence%20and%20structure%E2%80%94management%2C%20analysis%2C%20and%20prediction&#038;journal=J.%20Mol.%20Biol.&#038;doi=10.1016%2Fj.jmb.2016.08.019&#038;volume=429&#038;pages=356-364&#038;publication_year=2017&#038;author=Swindells%2CMB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"45.\">\n<p id=\"ref-CR45\">Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. <i>Nature<\/i> <b>529<\/b>, 484\u2013489 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nature16961\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnature16961\" aria-label=\"Reference 1\"7979 data-doi=\"10.1038\/nature16961\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28Xhs12is7w%3D\" aria-label=\"Reference 1\"8080>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26819042\" aria-label=\"Reference 1\"8181>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"8282 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Mastering%20the%20game%20of%20Go%20with%20deep%20neural%20networks%20and%20tree%20search&#038;journal=Nature&#038;doi=10.1038%2Fnature16961&#038;volume=529&#038;pages=484-489&#038;publication_year=2016&#038;author=Silver%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"46.\">\n<p id=\"ref-CR46\">Olsen, T. H., Boyles, F. &#038; Deane, C. M. Observed antibody space: a diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences. <i>Protein Sci.<\/i> <b>31<\/b>, 141\u2013146 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1002\/pro.4205\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1002%2Fpro.4205\" aria-label=\"Reference 1\"8383 data-doi=\"10.1002\/pro.4205\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXitlOltb3E\" aria-label=\"Reference 1\"8484>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=34655133\" aria-label=\"Reference 1\"8585>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"8686 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Observed%20antibody%20space%3A%20a%20diverse%20database%20of%20cleaned%2C%20annotated%2C%20and%20translated%20unpaired%20and%20paired%20antibody%20sequences&#038;journal=Protein%20Sci.&#038;doi=10.1002%2Fpro.4205&#038;volume=31&#038;pages=141-146&#038;publication_year=2022&#038;author=Olsen%2CTH&#038;author=Boyles%2CF&#038;author=Deane%2CCM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"47.\">\n<p id=\"ref-CR47\">Raybould, M. I. J. et al. Thera-SAbDab: the therapeutic structural antibody database. <i>Nucleic Acids Res.<\/i> <b>48<\/b>, D383\u2013D388 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/nar\/gkz827\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fnar%2Fgkz827\" aria-label=\"Reference 1\"8787 data-doi=\"10.1093\/nar\/gkz827\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhslWltbzE\" aria-label=\"Reference 1\"8888>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31555805\" aria-label=\"Reference 1\"8989>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"9090 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Thera-SAbDab%3A%20the%20therapeutic%20structural%20antibody%20database&#038;journal=Nucleic%20Acids%20Res.&#038;doi=10.1093%2Fnar%2Fgkz827&#038;volume=48&#038;pages=D383-D388&#038;publication_year=2020&#038;author=Raybould%2CMIJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"48.\">\n<p id=\"ref-CR48\">Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. <i>Mol. Syst. Biol.<\/i> <b>16<\/b>, e9380 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.15252\/msb.20199380\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.15252%2Fmsb.20199380\" aria-label=\"Reference 1\"9191 data-doi=\"10.15252\/msb.20199380\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhsFWqurvN\" aria-label=\"Reference 1\"9292>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32627955\" aria-label=\"Reference 1\"9393>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7336272\" aria-label=\"Reference 1\"9494>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"9595 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Using%20deep%20mutational%20scanning%20to%20benchmark%20variant%20effect%20predictors%20and%20identify%20disease%20mutations&#038;journal=Mol.%20Syst.%20Biol.&#038;doi=10.15252%2Fmsb.20199380&#038;volume=16&#038;publication_year=2020&#038;author=Livesey%2CBJ&#038;author=Marsh%2CJA\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"49.\">\n<p id=\"ref-CR49\">Zhao, H., Giver, L., Shao, Z., Affholter, J. A. &#038; Arnold, F. H. Molecular evolution by staggered extension process (StEP) in vitro recombination. <i>Nat. Biotechnol.<\/i> <b>16<\/b>, 258\u2013261 (1998).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/nbt0398-258\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fnbt0398-258\" aria-label=\"Reference 1\"9696 data-doi=\"10.1038\/nbt0398-258\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DyaK1cXhvVWlu7g%3D\" aria-label=\"Reference 1\"9797>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=9528005\" aria-label=\"Reference 1\"9898>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 1\"9999 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Molecular%20evolution%20by%20staggered%20extension%20process%20%28StEP%29%20in%20vitro%20recombination&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fnbt0398-258&#038;volume=16&#038;pages=258-261&#038;publication_year=1998&#038;author=Zhao%2CH&#038;author=Giver%2CL&#038;author=Shao%2CZ&#038;author=Affholter%2CJA&#038;author=Arnold%2CFH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"50.\">\n<p id=\"ref-CR50\">Yu, Y. W., Daniels, N. M., Danko, D. C. &#038; Berger, B. Entropy-scaling search of massive biological data. <i>Cell Syst.<\/i> <b>1<\/b>, 130\u2013140 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2015.08.004\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2015.08.004\" aria-label=\"Reference 3\"0000 data-doi=\"10.1016\/j.cels.2015.08.004\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXhtFalur0%3D\" aria-label=\"Reference 3\"0101>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26436140\" aria-label=\"Reference 3\"0202>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4591002\" aria-label=\"Reference 3\"0303>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"0404 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Entropy-scaling%20search%20of%20massive%20biological%20data&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2015.08.004&#038;volume=1&#038;pages=130-140&#038;publication_year=2015&#038;author=Yu%2CYW&#038;author=Daniels%2CNM&#038;author=Danko%2CDC&#038;author=Berger%2CB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"51.\">\n<p id=\"ref-CR51\">Biswas, S., Khimulya, G., Alley, E. C., Esvelt, K. M. &#038; Church, G. M. Low-N protein engineering with data-efficient deep learning. <i>Nat. Methods<\/i> <b>18<\/b>, 389\u2013396 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41592-021-01100-y\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41592-021-01100-y\" aria-label=\"Reference 3\"0505 data-doi=\"10.1038\/s41592-021-01100-y\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXosVCltrg%3D\" aria-label=\"Reference 3\"0606>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33828272\" aria-label=\"Reference 3\"0707>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"0808 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Low-N%20protein%20engineering%20with%20data-efficient%20deep%20learning&#038;journal=Nat.%20Methods&#038;doi=10.1038%2Fs41592-021-01100-y&#038;volume=18&#038;pages=389-396&#038;publication_year=2021&#038;author=Biswas%2CS&#038;author=Khimulya%2CG&#038;author=Alley%2CEC&#038;author=Esvelt%2CKM&#038;author=Church%2CGM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"52.\">\n<p id=\"ref-CR52\">Hie, B., Bryson, B. D. &#038; Berger, B. Leveraging uncertainty in machine learning accelerates biological discovery and design. <i>Cell Syst.<\/i> <b>11<\/b>, 461\u2013477 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2020.09.007\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2020.09.007\" aria-label=\"Reference 3\"0909 data-doi=\"10.1016\/j.cels.2020.09.007\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXisVaqu7nM\" aria-label=\"Reference 3\"1010>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33065027\" aria-label=\"Reference 3\"1111>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"1212 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Leveraging%20uncertainty%20in%20machine%20learning%20accelerates%20biological%20discovery%20and%20design&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2020.09.007&#038;volume=11&#038;pages=461-477&#038;publication_year=2020&#038;author=Hie%2CB&#038;author=Bryson%2CBD&#038;author=Berger%2CB\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"53.\">\n<p id=\"ref-CR53\">Dallago, C. et al. FLIP: benchmark tasks in fitness landscape inference for proteins. In <i>Proc. of the Neural Information Processing Systems Track on Datasets and Benchmarks<\/i> <a href=\"https:\/\/datasets-benchmarks-proceedings.neurips.cc\/paper_files\/paper\/2021\">https:\/\/datasets-benchmarks-proceedings.neurips.cc\/paper_files\/paper\/2021<\/a> (NeurIPS, 2021).<\/p>\n<\/li>\n<li data-counter=\"54.\">\n<p id=\"ref-CR54\">Bileschi, M. L. et al. Using deep learning to annotate the protein universe. <i>Nat. Biotechnol.<\/i> <b>40<\/b>, 932\u2013937 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41587-021-01179-w\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41587-021-01179-w\" aria-label=\"Reference 3\"1313 data-doi=\"10.1038\/s41587-021-01179-w\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XksFCnsLk%3D\" aria-label=\"Reference 3\"1414>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35190689\" aria-label=\"Reference 3\"1515>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"1616 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Using%20deep%20learning%20to%20annotate%20the%20protein%20universe&#038;journal=Nat.%20Biotechnol.&#038;doi=10.1038%2Fs41587-021-01179-w&#038;volume=40&#038;pages=932-937&#038;publication_year=2022&#038;author=Bileschi%2CML\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"55.\">\n<p id=\"ref-CR55\">Shimotohno, A., Oue, S., Yano, T., Kuramitsu, S. &#038; Kagamiyama, H. Demonstration of the importance and usefulness of manipulating non-active-site residues in protein design. <i>J. Biochem.<\/i> <b>129<\/b>, 943\u2013948 (2001).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/oxfordjournals.jbchem.a002941\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Foxfordjournals.jbchem.a002941\" aria-label=\"Reference 3\"1717 data-doi=\"10.1093\/oxfordjournals.jbchem.a002941\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD3MXls1SgsLk%3D\" aria-label=\"Reference 3\"1818>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=11388910\" aria-label=\"Reference 3\"1919>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"2020 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Demonstration%20of%20the%20importance%20and%20usefulness%20of%20manipulating%20non-active-site%20residues%20in%20protein%20design&#038;journal=J.%20Biochem.&#038;doi=10.1093%2Foxfordjournals.jbchem.a002941&#038;volume=129&#038;pages=943-948&#038;publication_year=2001&#038;author=Shimotohno%2CA&#038;author=Oue%2CS&#038;author=Yano%2CT&#038;author=Kuramitsu%2CS&#038;author=Kagamiyama%2CH\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"56.\">\n<p id=\"ref-CR56\">Shan, S. et al. Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization. <i>Proc. Natl Acad. Sci. USA<\/i> <b>119<\/b>, e2122954119 (2022).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.2122954119\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.2122954119\" aria-label=\"Reference 3\"2121 data-doi=\"10.1073\/pnas.2122954119\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB38XotVSls7c%3D\" aria-label=\"Reference 3\"2222>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=35238654\" aria-label=\"Reference 3\"2323>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC8931377\" aria-label=\"Reference 3\"2424>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"2525 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Deep%20learning%20guided%20optimization%20of%20human%20antibody%20against%20SARS-CoV-2%20variants%20with%20broad%20neutralization&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.2122954119&#038;volume=119&#038;publication_year=2022&#038;author=Shan%2CS\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"57.\">\n<p id=\"ref-CR57\">Dunbar, J., Fuchs, A., Shi, J. &#038; Deane, C. M. ABangle: characterising the VH\u2013VL orientation in antibodies. <i>Protein Eng. Des. Sel.<\/i> <b>26<\/b>, 611\u2013620 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/protein\/gzt020\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fprotein%2Fgzt020\" aria-label=\"Reference 3\"2626 data-doi=\"10.1093\/protein\/gzt020\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXhsFertbnM\" aria-label=\"Reference 3\"2727>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23708320\" aria-label=\"Reference 3\"2828>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"2929 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=ABangle%3A%20characterising%20the%20VH%E2%80%93VL%20orientation%20in%20antibodies&#038;journal=Protein%20Eng.%20Des.%20Sel.&#038;doi=10.1093%2Fprotein%2Fgzt020&#038;volume=26&#038;pages=611-620&#038;publication_year=2013&#038;author=Dunbar%2CJ&#038;author=Fuchs%2CA&#038;author=Shi%2CJ&#038;author=Deane%2CCM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"58.\">\n<p id=\"ref-CR58\">Fera, D. et al. Affinity maturation in an HIV broadly neutralizing B-cell lineage through reorientation of variable domains. <i>Proc. Natl Acad. Sci. USA<\/i> <b>111<\/b>, 10275\u201310280 (2014).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.1409954111\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.1409954111\" aria-label=\"Reference 3\"3030 data-doi=\"10.1073\/pnas.1409954111\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2cXhtVOit7bO\" aria-label=\"Reference 3\"3131>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=24982157\" aria-label=\"Reference 3\"3232>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4104889\" aria-label=\"Reference 3\"3333>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"3434 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Affinity%20maturation%20in%20an%20HIV%20broadly%20neutralizing%20B-cell%20lineage%20through%20reorientation%20of%20variable%20domains&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.1409954111&#038;volume=111&#038;pages=10275-10280&#038;publication_year=2014&#038;author=Fera%2CD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"59.\">\n<p id=\"ref-CR59\">Wedemayer, G. J., Patten, P. A., Wang, L. H., Schultz, P. G. &#038; Stevens, R. C. Structural insights into the evolution of an antibody combining site. <i>Science<\/i> <b>276<\/b>, 1665\u20131669 (1997).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.276.5319.1665\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.276.5319.1665\" aria-label=\"Reference 3\"3535 data-doi=\"10.1126\/science.276.5319.1665\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DyaK2sXjvV2gtr4%3D\" aria-label=\"Reference 3\"3636>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=9180069\" aria-label=\"Reference 3\"3737>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"3838 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Structural%20insights%20into%20the%20evolution%20of%20an%20antibody%20combining%20site&#038;journal=Science&#038;doi=10.1126%2Fscience.276.5319.1665&#038;volume=276&#038;pages=1665-1669&#038;publication_year=1997&#038;author=Wedemayer%2CGJ&#038;author=Patten%2CPA&#038;author=Wang%2CLH&#038;author=Schultz%2CPG&#038;author=Stevens%2CRC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"60.\">\n<p id=\"ref-CR60\">Yeap, L.-S. et al. Sequence-intrinsic mechanisms that target AID mutational outcomes on antibody genes. <i>Cell<\/i> <b>163<\/b>, 1124\u20131137 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cell.2015.10.042\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cell.2015.10.042\" aria-label=\"Reference 3\"3939 data-doi=\"10.1016\/j.cell.2015.10.042\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2MXhvVejsL%2FN\" aria-label=\"Reference 3\"4040>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26582132\" aria-label=\"Reference 3\"4141>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4751889\" aria-label=\"Reference 3\"4242>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"4343 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Sequence-intrinsic%20mechanisms%20that%20target%20AID%20mutational%20outcomes%20on%20antibody%20genes&#038;journal=Cell&#038;doi=10.1016%2Fj.cell.2015.10.042&#038;volume=163&#038;pages=1124-1137&#038;publication_year=2015&#038;author=Yeap%2CL-S\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"61.\">\n<p id=\"ref-CR61\">Zheng, N.-Y., Wilson, K., Jared, M. &#038; Wilson, P. C. Intricate targeting of immunoglobulin somatic hypermutation maximizes the efficiency of affinity maturation. <i>J. Exp. Med.<\/i> <b>201<\/b>, 1467\u20131478 (2005).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1084\/jem.20042483\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1084%2Fjem.20042483\" aria-label=\"Reference 3\"4444 data-doi=\"10.1084\/jem.20042483\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD2MXktVCmur8%3D\" aria-label=\"Reference 3\"4545>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=15867095\" aria-label=\"Reference 3\"4646>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC2213188\" aria-label=\"Reference 3\"4747>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"4848 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Intricate%20targeting%20of%20immunoglobulin%20somatic%20hypermutation%20maximizes%20the%20efficiency%20of%20affinity%20maturation&#038;journal=J.%20Exp.%20Med.&#038;doi=10.1084%2Fjem.20042483&#038;volume=201&#038;pages=1467-1478&#038;publication_year=2005&#038;author=Zheng%2CN-Y&#038;author=Wilson%2CK&#038;author=Jared%2CM&#038;author=Wilson%2CPC\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"62.\">\n<p id=\"ref-CR62\">Rujas, E. et al. Structural and thermodynamic basis of epitope binding by neutralizing and nonneutralizing forms of the anti-HIV-1 antibody 4E10. <i>J. Virol.<\/i> <b>89<\/b>, 11975\u201311989 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1128\/JVI.01793-15\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1128%2FJVI.01793-15\" aria-label=\"Reference 3\"4949 data-doi=\"10.1128\/JVI.01793-15\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XjsFyisbs%3D\" aria-label=\"Reference 3\"5050>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26378169\" aria-label=\"Reference 3\"5151>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4645341\" aria-label=\"Reference 3\"5252>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"5353 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Structural%20and%20thermodynamic%20basis%20of%20epitope%20binding%20by%20neutralizing%20and%20nonneutralizing%20forms%20of%20the%20anti-HIV-1%20antibody%204E10&#038;journal=J.%20Virol.&#038;doi=10.1128%2FJVI.01793-15&#038;volume=89&#038;pages=11975-11989&#038;publication_year=2015&#038;author=Rujas%2CE\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"63.\">\n<p id=\"ref-CR63\">Katoh, K. &#038; Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. <i>Mol. Biol. Evol.<\/i> <b>30<\/b>, 772\u2013780 (2013).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/molbev\/mst010\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fmolbev%2Fmst010\" aria-label=\"Reference 3\"5454 data-doi=\"10.1093\/molbev\/mst010\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC3sXksFWisLc%3D\" aria-label=\"Reference 3\"5555>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=23329690\" aria-label=\"Reference 3\"5656>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3603318\" aria-label=\"Reference 3\"5757>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"5858 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=MAFFT%20multiple%20sequence%20alignment%20software%20version%207%3A%20improvements%20in%20performance%20and%20usability&#038;journal=Mol.%20Biol.%20Evol.&#038;doi=10.1093%2Fmolbev%2Fmst010&#038;volume=30&#038;pages=772-780&#038;publication_year=2013&#038;author=Katoh%2CK&#038;author=Standley%2CDM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"64.\">\n<p id=\"ref-CR64\">Crawford, K. H. D. et al. Protocol and reagents for pseudotyping lentiviral particles with SARS-CoV-2 spike protein for neutralization assays. <i>Viruses<\/i> <b>12<\/b>, 513 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.3390\/v12050513\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.3390%2Fv12050513\" aria-label=\"Reference 3\"5959 data-doi=\"10.3390\/v12050513\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhtF2gtLnF\" aria-label=\"Reference 3\"6060>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32384820\" aria-label=\"Reference 3\"6161>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7291041\" aria-label=\"Reference 3\"6262>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"6363 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Protocol%20and%20reagents%20for%20pseudotyping%20lentiviral%20particles%20with%20SARS-CoV-2%20spike%20protein%20for%20neutralization%20assays&#038;journal=Viruses&#038;doi=10.3390%2Fv12050513&#038;volume=12&#038;publication_year=2020&#038;author=Crawford%2CKHD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"65.\">\n<p id=\"ref-CR65\">Rogers, T. F. et al. Isolation of potent SARS-CoV-2 neutralizing antibodies and protection from disease in a small animal model. <i>Science<\/i> <b>369<\/b>, 956\u2013963 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.abc7520\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.abc7520\" aria-label=\"Reference 3\"6464 data-doi=\"10.1126\/science.abc7520\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3cXhs1GrsLjF\" aria-label=\"Reference 3\"6565>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32540903\" aria-label=\"Reference 3\"6666>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7299280\" aria-label=\"Reference 3\"6767>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"6868 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Isolation%20of%20potent%20SARS-CoV-2%20neutralizing%20antibodies%20and%20protection%20from%20disease%20in%20a%20small%20animal%20model&#038;journal=Science&#038;doi=10.1126%2Fscience.abc7520&#038;volume=369&#038;pages=956-963&#038;publication_year=2020&#038;author=Rogers%2CTF\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"66.\">\n<p id=\"ref-CR66\">Giudicelli, V. et al. IMGT\/LIGM-DB, the IMGT<sup>\u00ae<\/sup> comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. <i>Nucleic Acids Res.<\/i> <b>34<\/b>, D781\u2013D784 (2006).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/nar\/gkj088\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fnar%2Fgkj088\" aria-label=\"Reference 3\"6969 data-doi=\"10.1093\/nar\/gkj088\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BD28XisFyjsA%3D%3D\" aria-label=\"Reference 3\"7070>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=16381979\" aria-label=\"Reference 3\"7171>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"7272 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=IMGT%2FLIGM-DB%2C%20the%20IMGT%C2%AE%20comprehensive%20database%20of%20immunoglobulin%20and%20T%20cell%20receptor%20nucleotide%20sequences&#038;journal=Nucleic%20Acids%20Res.&#038;doi=10.1093%2Fnar%2Fgkj088&#038;volume=34&#038;pages=D781-D784&#038;publication_year=2006&#038;author=Giudicelli%2CV\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"67.\">\n<p id=\"ref-CR67\">Raybould, M. I. J., Kovaltsuk, A., Marks, C. &#038; Deane, C. M. CoV-AbDab: the coronavirus antibody database. <i>Bioinformatics<\/i> <b>37<\/b>, 734\u2013735 (2021).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1093\/bioinformatics\/btaa739\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1093%2Fbioinformatics%2Fbtaa739\" aria-label=\"Reference 3\"7373 data-doi=\"10.1093\/bioinformatics\/btaa739\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXhvFGitr7P\" aria-label=\"Reference 3\"7474>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=32805021\" aria-label=\"Reference 3\"7575>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"7676 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=CoV-AbDab%3A%20the%20coronavirus%20antibody%20database&#038;journal=Bioinformatics&#038;doi=10.1093%2Fbioinformatics%2Fbtaa739&#038;volume=37&#038;pages=734-735&#038;publication_year=2021&#038;author=Raybould%2CMIJ&#038;author=Kovaltsuk%2CA&#038;author=Marks%2CC&#038;author=Deane%2CCM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"68.\">\n<p id=\"ref-CR68\">Jones, E. M. et al. Structural and functional characterization of G protein\u2013coupled receptors with deep mutational scanning. <i>eLife<\/i> <b>9<\/b>, e54895 (2020).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.7554\/eLife.54895\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.7554%2FeLife.54895\" aria-label=\"Reference 3\"7777 data-doi=\"10.7554\/eLife.54895\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BB3MXjsFSgtrw%3D\" aria-label=\"Reference 3\"7878>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=33084570\" aria-label=\"Reference 3\"7979>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC7707821\" aria-label=\"Reference 3\"8080>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"8181 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Structural%20and%20functional%20characterization%20of%20G%20protein%E2%80%93coupled%20receptors%20with%20deep%20mutational%20scanning&#038;journal=eLife&#038;doi=10.7554%2FeLife.54895&#038;volume=9&#038;publication_year=2020&#038;author=Jones%2CEM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"69.\">\n<p id=\"ref-CR69\">Stiffler, M. A., Hekstra, D. R. &#038; Ranganathan, R. Evolvability as a function of purifying selection in TEM-1 \u03b2-lactamase. <i>Cell<\/i> <b>160<\/b>, 882\u2013892 (2015).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cell.2015.01.035\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cell.2015.01.035\" aria-label=\"Reference 3\"8282 data-doi=\"10.1016\/j.cell.2015.01.035\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2MXjs1Oqt7k%3D\" aria-label=\"Reference 3\"8383>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=25723163\" aria-label=\"Reference 3\"8484>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"8585 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Evolvability%20as%20a%20function%20of%20purifying%20selection%20in%20TEM-1%20%CE%B2-lactamase&#038;journal=Cell&#038;doi=10.1016%2Fj.cell.2015.01.035&#038;volume=160&#038;pages=882-892&#038;publication_year=2015&#038;author=Stiffler%2CMA&#038;author=Hekstra%2CDR&#038;author=Ranganathan%2CR\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"70.\">\n<p id=\"ref-CR70\">Haddox, H. K., Dingens, A. S. &#038; Bloom, J. D. Experimental estimation of the effects of all amino-acid mutations to HIV\u2019s envelope protein on viral replication in cell culture. <i>PLoS Pathog.<\/i> <b>12<\/b>, e1006114 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1371\/journal.ppat.1006114\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1371%2Fjournal.ppat.1006114\" aria-label=\"Reference 3\"8686 data-doi=\"10.1371\/journal.ppat.1006114\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27959955\" aria-label=\"Reference 3\"8787>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5189966\" aria-label=\"Reference 3\"8888>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"8989 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Experimental%20estimation%20of%20the%20effects%20of%20all%20amino-acid%20mutations%20to%20HIV%E2%80%99s%20envelope%20protein%20on%20viral%20replication%20in%20cell%20culture&#038;journal=PLoS%20Pathog.&#038;doi=10.1371%2Fjournal.ppat.1006114&#038;volume=12&#038;publication_year=2016&#038;author=Haddox%2CHK&#038;author=Dingens%2CAS&#038;author=Bloom%2CJD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"71.\">\n<p id=\"ref-CR71\">Doud, M. B. &#038; Bloom, J. D. Accurate measurement of the effects of all amino-acid mutations on influenza hemagglutinin. <i>Viruses<\/i> <b>8<\/b>, 155 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.3390\/v8060155\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.3390%2Fv8060155\" aria-label=\"Reference 3\"9090 data-doi=\"10.3390\/v8060155\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27271655\" aria-label=\"Reference 3\"9191>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4926175\" aria-label=\"Reference 3\"9292>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"9393 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Accurate%20measurement%20of%20the%20effects%20of%20all%20amino-acid%20mutations%20on%20influenza%20hemagglutinin&#038;journal=Viruses&#038;doi=10.3390%2Fv8060155&#038;volume=8&#038;publication_year=2016&#038;author=Doud%2CMB&#038;author=Bloom%2CJD\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"72.\">\n<p id=\"ref-CR72\">Lee, J. M. et al. Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. <i>Proc. Natl Acad. Sci. USA<\/i> <b>115<\/b>, E8276\u2013E8285 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1073\/pnas.1806133115\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1073%2Fpnas.1806133115\" aria-label=\"Reference 3\"9494 data-doi=\"10.1073\/pnas.1806133115\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1cXitVCkurzJ\" aria-label=\"Reference 3\"9595>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=30104379\" aria-label=\"Reference 3\"9696>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6126756\" aria-label=\"Reference 3\"9797>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 3\"9898 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Deep%20mutational%20scanning%20of%20hemagglutinin%20helps%20predict%20evolutionary%20fates%20of%20human%20H3N2%20influenza%20variants&#038;journal=Proc.%20Natl%20Acad.%20Sci.%20USA&#038;doi=10.1073%2Fpnas.1806133115&#038;volume=115&#038;pages=E8276-E8285&#038;publication_year=2018&#038;author=Lee%2CJM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"73.\">\n<p id=\"ref-CR73\">Kelsic, E. D. et al. RNA structural determinants of optimal codons revealed by MAGE-Seq. <i>Cell Syst.<\/i> <b>3<\/b>, 563\u2013571 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.cels.2016.11.004\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.cels.2016.11.004\" aria-label=\"Reference 3\"9999 data-doi=\"10.1016\/j.cels.2016.11.004\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC2sXhtFamu70%3D\" aria-label=\"Reference 4\"0000>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=28009265\" aria-label=\"Reference 4\"0101>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5234859\" aria-label=\"Reference 4\"0202>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 4\"0303 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=RNA%20structural%20determinants%20of%20optimal%20codons%20revealed%20by%20MAGE-Seq&#038;journal=Cell%20Syst.&#038;doi=10.1016%2Fj.cels.2016.11.004&#038;volume=3&#038;pages=563-571&#038;publication_year=2016&#038;author=Kelsic%2CED\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"74.\">\n<p id=\"ref-CR74\">Brenan, L. et al. Phenotypic characterization of a comprehensive set of <i>MAPK1<\/i>\/<i>ERK2<\/i> missense mutants. <i>Cell Rep.<\/i> <b>17<\/b>, 1171\u20131183 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1016\/j.celrep.2016.09.061\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1016%2Fj.celrep.2016.09.061\" aria-label=\"Reference 4\"0404 data-doi=\"10.1016\/j.celrep.2016.09.061\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28Xhs1CltL7L\" aria-label=\"Reference 4\"0505>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=27760319\" aria-label=\"Reference 4\"0606>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5120861\" aria-label=\"Reference 4\"0707>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 4\"0808 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Phenotypic%20characterization%20of%20a%20comprehensive%20set%20of%20MAPK1%2FERK2%20missense%20mutants&#038;journal=Cell%20Rep.&#038;doi=10.1016%2Fj.celrep.2016.09.061&#038;volume=17&#038;pages=1171-1183&#038;publication_year=2016&#038;author=Brenan%2CL\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"75.\">\n<p id=\"ref-CR75\">Giacomelli, A. O. et al. Mutational processes shape the landscape of <i>TP53<\/i> mutations in human cancer. <i>Nat. Genet.<\/i> <b>50<\/b>, 1381\u20131387 (2018).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1038\/s41588-018-0204-y\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1038%2Fs41588-018-0204-y\" aria-label=\"Reference 4\"0909 data-doi=\"10.1038\/s41588-018-0204-y\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1cXhslCjt77I\" aria-label=\"Reference 4\"1010>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=30224644\" aria-label=\"Reference 4\"1111>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6168352\" aria-label=\"Reference 4\"1212>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 4\"1313 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Mutational%20processes%20shape%20the%20landscape%20of%20TP53%20mutations%20in%20human%20cancer&#038;journal=Nat.%20Genet.&#038;doi=10.1038%2Fs41588-018-0204-y&#038;volume=50&#038;pages=1381-1387&#038;publication_year=2018&#038;author=Giacomelli%2CAO\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"76.\">\n<p id=\"ref-CR76\">Thomas, M. J., Klein, U., Lygeros, J. &#038; Rodr\u00edguez Mart\u00ednez, M. A probabilistic model of the germinal center reaction. <i>Front. Immunol.<\/i> <b>10<\/b>, 689 (2019).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.3389\/fimmu.2019.00689\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.3389%2Ffimmu.2019.00689\" aria-label=\"Reference 4\"1414 data-doi=\"10.3389\/fimmu.2019.00689\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC1MXhtlyrs7nM\" aria-label=\"Reference 4\"1515>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=31001283\" aria-label=\"Reference 4\"1616>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC6456718\" aria-label=\"Reference 4\"1717>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 4\"1818 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=A%20probabilistic%20model%20of%20the%20germinal%20center%20reaction&#038;journal=Front.%20Immunol.&#038;doi=10.3389%2Ffimmu.2019.00689&#038;volume=10&#038;publication_year=2019&#038;author=Thomas%2CMJ&#038;author=Klein%2CU&#038;author=Lygeros%2CJ&#038;author=Rodr%C3%ADguez%20Mart%C3%ADnez%2CM\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<li data-counter=\"77.\">\n<p id=\"ref-CR77\">Tas, J. M. J. et al. Visualizing antibody affinity maturation in germinal centers. <i>Science<\/i> <b>351<\/b>, 1048\u20131054 (2016).<\/p>\n<p><a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"10.1126\/science.aad3439\" data-track-action=\"article reference\" href=\"https:\/\/doi.org\/10.1126%2Fscience.aad3439\" aria-label=\"Reference 4\"1919 data-doi=\"10.1126\/science.aad3439\">Article<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"cas reference\" href=\"http:\/\/www.nature.com\/articles\/cas-redirect\/1:CAS:528:DC%2BC28XjsVagurs%3D\" aria-label=\"Reference 4\"2020>CAS<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?cmd=Retrieve&#038;db=PubMed&#038;dopt=Abstract&#038;list_uids=26912368\" aria-label=\"Reference 4\"2121>PubMed<\/a>\u00a0<br \/>\n    <a data-track=\"click\" rel=\"nofollow noopener\" data-track-label=\"link\" data-track-action=\"pubmed central reference\" href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4938154\" aria-label=\"Reference 4\"2222>PubMed Central<\/a>\u00a0<br \/>\n    <a data-track=\"click\" data-track-action=\"google scholar reference\" data-track-label=\"link\" rel=\"nofollow noopener\" aria-label=\"Reference 4\"2323 href=\"http:\/\/scholar.google.com\/scholar_lookup?&#038;title=Visualizing%20antibody%20affinity%20maturation%20in%20germinal%20centers&#038;journal=Science&#038;doi=10.1126%2Fscience.aad3439&#038;volume=351&#038;pages=1048-1054&#038;publication_year=2016&#038;author=Tas%2CJMJ\"><br \/>\n                    Google Scholar<\/a>\u00a0\n                <\/p>\n<\/li>\n<\/ol>\n<p><a data-track=\"click\" data-track-action=\"download citation references\" data-track-label=\"link\" rel=\"nofollow\" href=\"https:\/\/citation-needed.springer.com\/v2\/references\/10.1038\/s41587-023-01763-2?format=refman&#038;flavour=references\">Download references<\/a><\/p>\n<\/div>\n<\/div>\n<div id=\"Ack1-section\" data-title=\"Acknowledgements\">\n<h2 id=\"Ack1\">Acknowledgements<\/h2>\n<p>We thank B. Bell, B. Clifton, R. Costello, A. Hugenmatter, O. Leddy, D. Maurer and A. Narayan for helpful discussions. We thank L. Lahey for contributing polyspecificity reagent. We thank M. Filsinger Interrante, S. Kim and other members of the Peter Kim laboratory for useful comments on the manuscript. B.L.H. acknowledges the support of the Stanford Science Fellows program. D.X. acknowledges the postdoctoral fellowship from the Stanford Maternal and Child Health Research Institute. S.T. is supported by National Institutes of Health (NIH) National Institute of Child Health and Human Development grant K99HD104924 and a Damon Runyon Cancer Research Foundation fellowship (DRG-2301-17). This work was supported by the Virginia &#038; D. K. Ludwig Fund for Cancer Research (P.S.K.), the Chan Zuckerberg Biohub (P.S.K.) and the NIH (DP1AI158125; P.S.K.). A previous version of this article appeared on bioRxiv (<a href=\"https:\/\/doi.org\/10.1101\/2022.04.10.487811\">https:\/\/doi.org\/10.1101\/2022.04.10.487811<\/a>).<\/p>\n<\/div>\n<div id=\"author-information-section\" aria-labelledby=\"author-information\" data-title=\"Author information\">\n<h2 id=\"author-information\">Author information<\/h2>\n<div id=\"author-information-content\">\n<h3 id=\"affiliations\">Authors and Affiliations<\/h3>\n<ol>\n<li id=\"Aff1\">\n<p>Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA<\/p>\n<p>Brian L. Hie,\u00a0Duo Xu,\u00a0Theodora U. J. Bruun,\u00a0Shaogeng Tang\u00a0&#038;\u00a0Peter S. Kim<\/p>\n<\/li>\n<li id=\"Aff2\">\n<p>Sarafan ChEM-H, Stanford University, Stanford, CA, USA<\/p>\n<p>Brian L. Hie,\u00a0Varun R. Shanker,\u00a0Duo Xu,\u00a0Theodora U. J. Bruun,\u00a0Payton A. Weidenbacher,\u00a0Shaogeng Tang\u00a0&#038;\u00a0Peter S. Kim<\/p>\n<\/li>\n<li id=\"Aff3\">\n<p>Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford, CA, USA<\/p>\n<p>Varun R. Shanker\u00a0&#038;\u00a0Theodora U. J. Bruun<\/p>\n<\/li>\n<li id=\"Aff4\">\n<p>Department of Chemistry, Stanford University, Stanford, CA, USA<\/p>\n<p>Payton A. Weidenbacher<\/p>\n<\/li>\n<li id=\"Aff5\">\n<p>Chan Zuckerberg Biohub, San Francisco, CA, USA<\/p>\n<p>Wesley Wu,\u00a0John E. Pak\u00a0&#038;\u00a0Peter S. Kim<\/p>\n<\/li>\n<\/ol>\n<h3 id=\"contributions\">Contributions<\/h3>\n<p>Conceptualization, investigation and interpretation: B.L.H. and P.S.K. Computational experiments and software development: B.L.H. Antibody cloning, expression and purification: B.L.H., V.R.S., W.W. and J.E.P. Antigen cloning, expression and purification: B.L.H., V.R.S., D.X., T.U.J.B., P.A.W. and S.T. Binding assays: B.L.H and V.R.S. Thermal melts: B.L.H. and V.R.S. Polyspecificity assay: B.L.H. Lentivirus production and pseudovirus neutralization: D.X. Writing (initial draft): B.L.H. Writing (final draft): all authors.<\/p>\n<h3 id=\"corresponding-author\">Corresponding authors<\/h3>\n<p id=\"corresponding-author-list\">Correspondence to<br \/>\n                <a id=\"corresp-c1\" href=\"http:\/\/www.nature.com\/mailto:br******@******rd.edu\" data-original-string=\"HfJ6vQY9HzQzNWJkrm462w==7f4ayVTqOgm70NT\/p85VqBb4Xu\/n5gR7tfNL2ZtTL2+mlg=\" title=\"This contact has been encoded by Anti-Spam by CleanTalk. Click to decode. To finish the decoding make sure that JavaScript is enabled in your browser.\">Brian L. Hie<\/a> or <a id=\"corresp-c2\" href=\"http:\/\/www.nature.com\/mailto:ki******@******rd.edu\" data-original-string=\"q8SELg2mHMav5ngG6Seguw==7f4rN9Z8JFR1TU0W0LsJzOX4HQAYHiGPFkJb+9DCdNGNYk=\" title=\"This contact has been encoded by Anti-Spam by CleanTalk. Click to decode. To finish the decoding make sure that JavaScript is enabled in your browser.\">Peter S. Kim<\/a>.<\/p>\n<\/div>\n<\/div>\n<div id=\"ethics-section\" data-title=\"Ethics declarations\">\n<h2 id=\"ethics\">Ethics declarations<\/h2>\n<div id=\"ethics-content\">\n<h3 id=\"FPar4\">Competing interests<\/h3>\n<p>B.L.H., V.R.S. and P.S.K. are named as inventors on a provisional patent application applied for by Stanford University and the Chan Zuckerberg Biohub related to this study. B.L.H. performs research for Meta Platforms, Inc. The remaining authors declare no competing interests.<\/p>\n<\/p><\/div>\n<\/div>\n<div id=\"peer-review-section\" data-title=\"Peer review\">\n<h2 id=\"peer-review\">Peer review<\/h2>\n<div id=\"peer-review-content\">\n<h3 id=\"FPar3\">Peer review information<\/h3>\n<p><i>Nature Biotechnology<\/i> thanks the anonymous reviewers for their contribution to the peer review of this work.<\/p>\n<\/p><\/div>\n<\/div>\n<div id=\"additional-information-section\" data-title=\"Additional information\">\n<h2 id=\"additional-information\">Additional information<\/h2>\n<p><b>Publisher\u2019s note<\/b> Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.<\/p>\n<\/div>\n<div id=\"Sec36-section\" data-title=\"Extended data\">\n<h2 id=\"Sec36\">Extended data<\/h2>\n<div data-test=\"supplementary-info\" id=\"Sec36-content\">\n<div data-test=\"supp-item\" id=\"Fig5\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/5\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig5_ESM.jpg\">Extended Data Fig. 1 ESM masked versus wildtype marginals.<\/a><\/h3>\n<p>(<b>a<\/b>) Representative scatter plots showing all possible single-site substitutions to an antibody sequence plotted according to their log-likelihood ratios to wildtype, where likelihoods are computed based on either masked marginals (<i>y-<\/i>axis) or wildtype marginals (<i>x<\/i>-axis). A red dashed line is plotted where masked and wildtype marginal values are equal. The wildtype marginal log-likelihoods are consistently lower overall, effectively serving to make the <i>\u03b1<\/i> parameter more stringent, while (<b>b<\/b>) the rank-based correlation between masked marginals and wildtype marginals is close to 1 in all cases.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig6\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/6\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig6_ESM.jpg\">Extended Data Fig. 2 Pseudovirus neutralization of affinity-matured variants.<\/a><\/h3>\n<p>(<b>a<\/b>) Neutralization curves for wildtype antibodies (gray) and variants obtained by our language-model-guided affinity maturation campaigns. Also see Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">5<\/a>, <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">8<\/a>, and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM1\">9<\/a> for corresponding IC<sub>50<\/sub> values. Points indicate the mean; error bars indicate the standard deviation; <i>n<\/i>\u2009=\u20094 independent assays. (<b>b<\/b>) Fold-improvement in <i>k<\/i><sub>on<\/sub> has low correlation with fold-change in IC<sub>50<\/sub> (Spearman <i>r<\/i>\u2009=\u20090.12), while fold-improvement in <i>k<\/i><sub>off<\/sub> has high correlation with fold-change in IC<sub>50<\/sub> (Spearman <i>r<\/i>\u2009=\u20090.79); compare to Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig3\">3c<\/a>. Correlations involve <i>n<\/i>\u2009=\u200915 antibody variants. We define a higher <i>k<\/i><sub>on<\/sub> and a lower <i>k<\/i><sub>off<\/sub> as improved, so we divide the mutant value by the wildtype value to calculate fold-improvement in <i>k<\/i><sub>on<\/sub> and vice-versa to calculate fold-improvement in <i>k<\/i><sub>off<\/sub>.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig7\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/7\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig7_ESM.jpg\">Extended Data Fig. 3 UniRef90 significance and robustness analysis.<\/a><\/h3>\n<p>(<b>a<\/b>) A histogram of the null distribution generated by simulating how many avidity-enhancing substitutions would be recommended from a site-independent model based on UniRef90 alignments. Results are for <i>n<\/i>\u2009=\u20094.5 million simulations as described in Methods. Based on this null distribution and given that the language models recommended 12 avidity-enhancing substitutions, we estimate <i>P<\/i>\u2009=\u20090.0085. (<b>b<\/b>) The number of known avidity-enhancing substitutions recommended by a UniRef90 site-independent model at varying alignment depths, where our benchmark analyses are performed using an alignment depth of 10,000. The red line indicates the number of avidity-enhancing substitutions found by the language models. The combined number of known avidity-enhancing substitutions is provided in the stacked bar plot on the left and are separated by the antibody in the three right panels. The substitutions corresponding to each alignment depth and antibody are provided in Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM5\">3<\/a>.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig8\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/8\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig8_ESM.jpg\">Extended Data Fig. 4 Relationship between likelihood stringency and fitness efficiency.<\/a><\/h3>\n<p>To obtain the set <span>({{{mathcal{A}}}})<\/span> of language-model-recommended variants, we varied two parameters controlling the stringency of acquired variants (where more stringent corresponds to fewer variants): <i>\u03b1<\/i> is a cutoff controlling the likelihood ratio of the mutant probability to the wildtype probability, and <i>k<\/i> is a cutoff controlling the number of consensus language models (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). (<b>a<\/b>) At varying cutoffs, we computed the percentage fraction of variants in <span>({{{mathcal{A}}}})<\/span> that correspond to high-fitness variants, using scanning mutagenesis data for validation. When <i>\u03b1<\/i> = 0 and <i>k<\/i> = 1, this value is equivalent to the percentage of high-fitness variants in the full scanning mutagenesis dataset (a black dashed line is also drawn at this value for each protein). In all cases except for P53, we observe that increasing the likelihood stringency generally improves the efficiency at which high-fitness variants are acquired. In Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4<\/a>, we report values for <i>\u03b1<\/i> = 1, <i>k<\/i> = 2, except for when these cutoffs result in <span>(left| {{{mathcal{A}}}} right|)<\/span> < 5 (infA, MAPK1, and PafA), in which case we report <i>\u03b1<\/i> = 1, <i>k<\/i> = 1. (<b>b, c<\/b>) Given a set of acquired variants <span>({{{mathcal{A}}}})<\/span> at varying cutoffs, we also computed how much the maximum fitness represented in <span>({{{mathcal{A}}}})<\/span> compares either to the maximum possible fitness value obtained across the full mutational scan (<b>b<\/b>) or to the 99<sup>th<\/sup> percentile of fitness values across the full mutational scan (<b>c<\/b>). To compare across proteins, we plotted the maximum acquired fitness value normalized by the maximum possible fitness (<b>b<\/b>) or by the 99<sup>th<\/sup> percentile with a threshold at 1 (<b>c<\/b>). At even at the most stringent cutoffs, the best acquired variant of most proteins has at least 50% of the fitness value of the maximum fitness peak. Additionally, at the most stringent cutoffs, the best acquired variant of all proteins is above or close to the 99<sup>th<\/sup> percentile of fitness values. (<b>d<\/b>) We plotted the number of acquired variants <span>(left| {{{mathcal{A}}}} right|)<\/span>, which is the denominator of the values plotted in (<b>a<\/b>). A gray horizontal dashed line is also plotted at 100.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig9\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/9\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig9_ESM.jpg\">Extended Data Fig. 5 Benchmarking enrichment of high-fitness variants.<\/a><\/h3>\n<p>(<b>a, b<\/b>) Variant effect prediction methods were ranked by the number of high-fitness variants acquired, controlling for the sample size <i>N<\/i> of total acquired variants used in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Fig4\">4<\/a>, and ordered by the mean rank across eight proteins (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#Sec12\">Methods<\/a>). Our consensus voting strategy (\u2018ESM vote\u2019) ranks higher on average than all other methods based on its ability to acquire high-fitness variants. Methods profiled by Livesey and Marsh<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2424 title=\"Livesey, B. J. &#038; Marsh, J. A. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol. Syst. Biol. 16, e9380 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR48\" id=\"ref-link-section-d117614227e4083\">48<\/a><\/sup> are in black text; ESM-based strategies profiled in this study are in red text. The full list of mean ranks is provided as Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM7\">5<\/a>. ESM vote: the consensus strategy for acquiring substitutions used to select variants for experimental measurement in our antibody experiments. ESM summed: acquiring substitutions based on summed language model likelihood across the six language models used in this study. (<b>b<\/b>) Strip plot illustrating the number of high-fitness variants (vertical axis) among the top-<i>N<\/i> acquired substitutions to each protein (horizontal axis), where each point represents a different method for acquiring substitutions. These values are used to calculate the mean rank in (<b>a<\/b>). The expected number of variants that would be acquired via random guessing is plotted as a horizontal dashed line for each protein. (<b>c<\/b>, <b>d<\/b>) A similar analysis as in (<b>a<\/b>, <b>b<\/b>) but comparing the consensus voting strategy to each component of the ESM ensemble individually. Ensembling the recommendations across language models more consistently acquires high-fitness variants than when only using a single language model.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig10\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/10\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig10_ESM.jpg\">Extended Data Fig. 6 Scatter plots of DMS fitness data and ESM-ranked variants.<\/a><\/h3>\n<p>Variants of each protein (with a single-site substitution from wildtype) are plotted as blue circles according to the experimentally-determined fitness value on the <i>y<\/i>-axis and the summed log-likelihood across the six ESM models considered in our analysis. The variants acquired by the ESM consensus voting scheme are plotted as red circles. The cutoff above which we define a high-fitness variant is plotted as a gray dashed line. The marginal distribution of experimental fitness values is also plotted as a histogram along the <i>y<\/i>-axis.<\/p>\n<\/div>\n<div data-test=\"supp-item\" id=\"Fig11\">\n<h3><a data-track=\"click\" data-track-action=\"view supplementary info\" data-track-label=\"link\" data-test=\"supp-info-link\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2\/figures\/11\" data-supp-info-image=\"\/\/media.springernature.com\/lw685\/springer-static\/esm\/art%3A10.1038%2Fs41587-023-01763-2\/MediaObjects\/41587_2023_1763_Fig11_ESM.jpg\">Extended Data Fig. 7 Comparison of affinity fold improvements versus experimental scale.<\/a><\/h3>\n<p>Points indicate the results of affinity maturation beginning with an unmatured starting point (indicated by circles) or with a matured starting point (indicated by plus signs). The horizontal axis indicates the experimental scale in terms of variants tested or the experimental library size. The vertical axis indicates the fold improvement obtained by affinity maturation. Results from this study are plotted in black. While there is substantial uncertainty about the size of the mutational space explored by in-vivo somatic hypermutation (to include the unproductive B cell clones), we estimate a scale between 10<sup>3<\/sup> to 10<sup>6<\/sup> based on the number of B cells contained within a germinal center (about 10<sup>3<\/sup> to 10<sup>4<\/sup>)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2525 title=\"Thomas, M. J., Klein, U., Lygeros, J. &#038; Rodr\u00edguez Mart\u00ednez, M. A probabilistic model of the germinal center reaction. Front. Immunol. 10, 689 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR76\" id=\"ref-link-section-d117614227e4175\">76<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2626 title=\"Tas, J. M. J. et al. Visualizing antibody affinity maturation in germinal centers. Science 351, 1048\u20131054 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR77\" id=\"ref-link-section-d117614227e4178\">77<\/a><\/sup>, the mutation rate of somatic hypermutation (about 1 mutation per kb per division)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2727 title=\"Victora, G. D. &#038; Nussenzweig, M. C. Germinal centers. Annu. Rev. Immunol. 40, 413\u2013442 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR13\" id=\"ref-link-section-d117614227e4183\">13<\/a><\/sup>, the doubling time of B cells (about 10\u2009hours)<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2828 title=\"Thomas, M. J., Klein, U., Lygeros, J. &#038; Rodr\u00edguez Mart\u00ednez, M. A probabilistic model of the germinal center reaction. Front. Immunol. 10, 689 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR76\" id=\"ref-link-section-d117614227e4187\">76<\/a><\/sup>, and a timescale of a few weeks<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"2929 title=\"Victora, G. D. &#038; Nussenzweig, M. C. Germinal centers. Annu. Rev. Immunol. 40, 413\u2013442 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR13\" id=\"ref-link-section-d117614227e4191\">13<\/a><\/sup>. The results of natural affinity maturation of the unmatured antibodies in this study<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"3030 title=\"Kallewaard, N. L. et al. Structure and function analysis of an antibody recognizing all influenza A subtypes. Cell 166, 596\u2013608 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR29\" id=\"ref-link-section-d117614227e4195\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"3131 title=\"Corti, D. et al. Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody. Science 351, 1339\u20131342 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR30\" id=\"ref-link-section-d117614227e4198\">30<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"3232 title=\"Gaebler, C. et al. Evolution of antibody immunity to SARS-CoV-2. Nature 591, 639\u2013644 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR38\" id=\"ref-link-section-d117614227e4201\">38<\/a><\/sup>, are plotted as blue dots (Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#MOESM3\">1<\/a>). We also plot the results of recent studies reporting advances in antibody engineering technologies, including Mason et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"3333 title=\"Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600\u2013612 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR28\" id=\"ref-link-section-d117614227e4208\">28<\/a><\/sup> who achieve a 3-fold improvement in the binding of trastuzumab to human epidermal growth factor receptor 2 (HER2) using a library of ~39\u2009K variants and Wellner et al.<sup><a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\"3434 title=\"Wellner, A. et al. Rapid generation of potent antibodies by autonomous hypermutation in yeast. Nat. Chem. Biol. 17, 1057\u20131064 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41587-023-01763-2#ref-CR14\" id=\"ref-link-section-d117614227e4213\">14<\/a><\/sup> who achieve between a 2.3- and 580-fold improvement in the binding of unmatured nanobodies to SARS-CoV-2 RBD (picked out of a na\u00efve library) using a continuously evolving yeast system involving 10<sup>6<\/sup> to 10<sup>7<\/sup> sorted cells over four or more rounds of selection.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"Sec37-section\" data-title=\"Supplementary information\">\n<h2 id=\"Sec37\">Supplementary information<\/h2>\n<\/div>\n<div id=\"rightslink-section\" data-title=\"Rights and permissions\">\n<h2 id=\"rightslink\">Rights and permissions<\/h2>\n<div id=\"rightslink-content\">\n<p><b>Open Access<\/b>  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article\u2019s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article\u2019s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"license\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p><a data-track=\"click\" data-track-action=\"view rights and permissions\" data-track-label=\"link\" href=\"https:\/\/s100.copyright.com\/AppDispatchServlet?title=Efficient%20evolution%20of%20human%20antibodies%20from%20general%20protein%20language%20models&#038;author=Brian%20L.%20Hie%20et%20al&#038;contentID=10.1038%2Fs41587-023-01763-2&#038;copyright=The%20Author%28s%29&#038;publication=1087-0156&#038;publicationDate=2023-04-24&#038;publisherName=SpringerNature&#038;orderBeanReset=true&#038;oa=CC%20BY\">Reprints and Permissions<\/a><\/p>\n<\/div>\n<\/div>\n<div id=\"article-info-section\" aria-labelledby=\"article-info\" data-title=\"About this article\">\n<h2 id=\"article-info\">About this article<\/h2>\n<div id=\"article-info-content\">\n<p><a data-crossmark=\"10.1038\/s41587-023-01763-2\" target=\"_blank\" rel=\"noopener\" href=\"https:\/\/crossmark.crossref.org\/dialog\/?doi=10.1038\/s41587-023-01763-2\" data-track=\"click\" data-track-action=\"Click Crossmark\" data-track-label=\"link\" data-test=\"crossmark\"><img loading=\"lazy\" decoding=\"async\" width=\"57\" height=\"81\" alt=\"Science &amp; Nature Verify currency and authenticity via CrossMark\" src=\"data:image\/svg+xml;base64,PHN2ZyBoZWlnaHQ9IjgxIiB3aWR0aD0iNTciIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyI+PGcgZmlsbD0ibm9uZSIgZmlsbC1ydWxlPSJldmVub2RkIj48cGF0aCBkPSJtMTcuMzUgMzUuNDUgMjEuMy0xNC4ydi0xNy4wM2gtMjEuMyIgZmlsbD0iIzk4OTg5OCIvPjxwYXRoIGQ9Im0zOC42NSAzNS40NS0yMS4zLTE0LjJ2LTE3LjAzaDIxLjMiIGZpbGw9IiM3NDc0NzQiLz48cGF0aCBkPSJtMjggLjVjLTEyLjk4IDAtMjMuNSAxMC41Mi0yMy41IDIzLjVzMTAuNTIgMjMuNSAyMy41IDIzLjUgMjMuNS0xMC41MiAyMy41LTIzLjVjMC02LjIzLTIuNDgtMTIuMjEtNi44OC0xNi42Mi00LjQxLTQuNC0xMC4zOS02Ljg4LTE2LjYyLTYuODh6bTAgNDEuMjVjLTkuOCAwLTE3Ljc1LTcuOTUtMTcuNzUtMTcuNzVzNy45NS0xNy43NSAxNy43NS0xNy43NSAxNy43NSA3Ljk1IDE3Ljc1IDE3Ljc1YzAgNC43MS0xLjg3IDkuMjItNS4yIDEyLjU1cy03Ljg0IDUuMi0xMi41NSA1LjJ6IiBmaWxsPSIjNTM1MzUzIi8+PHBhdGggZD0ibTQxIDM2Yy01LjgxIDYuMjMtMTUuMjMgNy40NS0yMi40MyAyLjktNy4yMS00LjU1LTEwLjE2LTEzLjU3LTcuMDMtMjEuNWwtNC45Mi0zLjExYy00Ljk1IDEwLjctMS4xOSAyMy40MiA4Ljc4IDI5LjcxIDkuOTcgNi4zIDIzLjA3IDQuMjIgMzAuNi00Ljg2eiIgZmlsbD0iIzljOWM5YyIvPjxwYXRoIGQ9Im0uMiA1OC40NWMwLS43NS4xMS0xLjQyLjMzLTIuMDFzLjUyLTEuMDkuOTEtMS41Yy4zOC0uNDEuODMtLjczIDEuMzQtLjk0LjUxLS4yMiAxLjA2LS4zMiAxLjY1LS4zMi41NiAwIDEuMDYuMTEgMS41MS4zNS40NC4yMy44MS41IDEuMS44MWwtLjkxIDEuMDFjLS4yNC0uMjQtLjQ5LS40Mi0uNzUtLjU2LS4yNy0uMTMtLjU4LS4yLS45My0uMi0uMzkgMC0uNzMuMDgtMS4wNS4yMy0uMzEuMTYtLjU4LjM3LS44MS42Ni0uMjMuMjgtLjQxLjYzLS41MyAxLjA0LS4xMy40MS0uMTkuODgtLjE5IDEuMzkgMCAxLjA0LjIzIDEuODYuNjggMi40Ni40NS41OSAxLjA2Ljg4IDEuODQuODguNDEgMCAuNzctLjA3IDEuMDctLjIzcy41OS0uMzkuODUtLjY4bC45MSAxYy0uMzguNDMtLjguNzYtMS4yOC45OS0uNDcuMjItMSAuMzQtMS41OC4zNC0uNTkgMC0xLjEzLS4xLTEuNjQtLjMxLS41LS4yLS45NC0uNTEtMS4zMS0uOTEtLjM4LS40LS42Ny0uOS0uODgtMS40OC0uMjItLjU5LS4zMy0xLjI2LS4zMy0yLjAyem04LjQtNS4zM2gxLjYxdjIuNTRsLS4wNSAxLjMzYy4yOS0uMjcuNjEtLjUxLjk2LS43MnMuNzYtLjMxIDEuMjQtLjMxYy43MyAwIDEuMjcuMjMgMS42MS43MS4zMy40Ny41IDEuMTQuNSAyLjAydjQuMzFoLTEuNjF2LTQuMWMwLS41Ny0uMDgtLjk3LS4yNS0xLjIxLS4xNy0uMjMtLjQ1LS4zNS0uODMtLjM1LS4zIDAtLjU2LjA4LS43OS4yMi0uMjMuMTUtLjQ5LjM2LS43OC42NHY0LjhoLTEuNjF6bTcuMzcgNi40NWMwLS41Ni4wOS0xLjA2LjI2LTEuNTEuMTgtLjQ1LjQyLS44My43MS0xLjE0LjI5LS4zLjYzLS41NCAxLjAxLS43MS4zOS0uMTcuNzgtLjI1IDEuMTgtLjI1LjQ3IDAgLjg4LjA4IDEuMjMuMjQuMzYuMTYuNjUuMzguODkuNjdzLjQyLjYzLjU0IDEuMDNjLjEyLjQxLjE4Ljg0LjE4IDEuMzIgMCAuMzItLjAyLjU3LS4wNy43NmgtNC4zNmMuMDcuNjIuMjkgMS4xLjY1IDEuNDQuMzYuMzMuODIuNSAxLjM4LjUuMjkgMCAuNTctLjA0LjgzLS4xM3MuNTEtLjIxLjc2LS4zN2wuNTUgMS4wMWMtLjMzLjIxLS42OS4zOS0xLjA5LjUzLS40MS4xNC0uODMuMjEtMS4yNi4yMS0uNDggMC0uOTItLjA4LTEuMzQtLjI1LS40MS0uMTYtLjc2LS40LTEuMDctLjctLjMxLS4zMS0uNTUtLjY5LS43Mi0xLjEzLS4xOC0uNDQtLjI2LS45NS0uMjYtMS41MnptNC42LS42MmMwLS41NS0uMTEtLjk4LS4zNC0xLjI4LS4yMy0uMzEtLjU4LS40Ny0xLjA2LS40Ny0uNDEgMC0uNzcuMTUtMS4wNy40NS0uMzEuMjktLjUuNzMtLjU4IDEuM3ptMi41LjYyYzAtLjU3LjA5LTEuMDguMjgtMS41My4xOC0uNDQuNDMtLjgyLjc1LTEuMTNzLjY5LS41NCAxLjEtLjcxYy40Mi0uMTYuODUtLjI0IDEuMzEtLjI0LjQ1IDAgLjg0LjA4IDEuMTcuMjNzLjYxLjM0Ljg1LjU3bC0uNzcgMS4wMmMtLjE5LS4xNi0uMzgtLjI4LS41Ni0uMzctLjE5LS4wOS0uMzktLjE0LS42MS0uMTQtLjU2IDAtMS4wMS4yMS0xLjM1LjYzLS4zNS40MS0uNTIuOTctLjUyIDEuNjcgMCAuNjkuMTcgMS4yNC41MSAxLjY2LjM0LjQxLjc4LjYyIDEuMzIuNjIuMjggMCAuNTQtLjA2Ljc4LS4xNy4yNC0uMTIuNDUtLjI2LjY0LS40MmwuNjcgMS4wM2MtLjMzLjI5LS42OS41MS0xLjA4LjY1LS4zOS4xNS0uNzguMjMtMS4xOC4yMy0uNDYgMC0uOS0uMDgtMS4zMS0uMjQtLjQtLjE2LS43NS0uMzktMS4wNS0uN3MtLjUzLS42OS0uNy0xLjEzYy0uMTctLjQ1LS4yNS0uOTYtLjI1LTEuNTN6bTYuOTEtNi40NWgxLjU4djYuMTdoLjA1bDIuNTQtMy4xNmgxLjc3bC0yLjM1IDIuOCAyLjU5IDQuMDdoLTEuNzVsLTEuNzctMi45OC0xLjA4IDEuMjN2MS43NWgtMS41OHptMTMuNjkgMS4yN2MtLjI1LS4xMS0uNS0uMTctLjc1LS4xNy0uNTggMC0uODcuMzktLjg3IDEuMTZ2Ljc1aDEuMzR2MS4yN2gtMS4zNHY1LjZoLTEuNjF2LTUuNmgtLjkydi0xLjJsLjkyLS4wN3YtLjcyYzAtLjM1LjA0LS42OC4xMy0uOTguMDgtLjMxLjIxLS41Ny40LS43OXMuNDItLjM5LjcxLS41MWMuMjgtLjEyLjYzLS4xOCAxLjA0LS4xOC4yNCAwIC40OC4wMi42OS4wNy4yMi4wNS40MS4xLjU3LjE3em0uNDggNS4xOGMwLS41Ny4wOS0xLjA4LjI3LTEuNTMuMTctLjQ0LjQxLS44Mi43Mi0xLjEzLjMtLjMxLjY1LS41NCAxLjA0LS43MS4zOS0uMTYuOC0uMjQgMS4yMy0uMjRzLjg0LjA4IDEuMjQuMjRjLjQuMTcuNzQuNCAxLjA0Ljcxcy41NC42OS43MiAxLjEzYy4xOS40NS4yOC45Ni4yOCAxLjUzcy0uMDkgMS4wOC0uMjggMS41M2MtLjE4LjQ0LS40Mi44Mi0uNzIgMS4xM3MtLjY0LjU0LTEuMDQuNy0uODEuMjQtMS4yNC4yNC0uODQtLjA4LTEuMjMtLjI0LS43NC0uMzktMS4wNC0uN2MtLjMxLS4zMS0uNTUtLjY5LS43Mi0xLjEzLS4xOC0uNDUtLjI3LS45Ni0uMjctMS41M3ptMS42NSAwYzAgLjY5LjE0IDEuMjQuNDMgMS42Ni4yOC40MS42OC42MiAxLjE4LjYyLjUxIDAgLjktLjIxIDEuMTktLjYyLjI5LS40Mi40NC0uOTcuNDQtMS42NiAwLS43LS4xNS0xLjI2LS40NC0xLjY3LS4yOS0uNDItLjY4LS42My0xLjE5LS42My0uNSAwLS45LjIxLTEuMTguNjMtLjI5LjQxLS40My45Ny0uNDMgMS42N3ptNi40OC0zLjQ0aDEuMzNsLjEyIDEuMjFoLjA1Yy4yNC0uNDQuNTQtLjc5Ljg4LTEuMDIuMzUtLjI0LjctLjM2IDEuMDctLjM2LjMyIDAgLjU5LjA1Ljc4LjE0bC0uMjggMS40LS4zMy0uMDljLS4xMS0uMDEtLjIzLS4wMi0uMzgtLjAyLS4yNyAwLS41Ni4xLS44Ni4zMXMtLjU1LjU4LS43NyAxLjF2NC4yaC0xLjYxem0tNDcuODcgMTVoMS42MXY0LjFjMCAuNTcuMDguOTcuMjUgMS4yLjE3LjI0LjQ0LjM1LjgxLjM1LjMgMCAuNTctLjA3LjgtLjIyLjIyLS4xNS40Ny0uMzkuNzMtLjczdi00LjdoMS42MXY2Ljg3aC0xLjMybC0uMTItMS4wMWgtLjA0Yy0uMy4zNi0uNjMuNjQtLjk4Ljg2LS4zNS4yMS0uNzYuMzItMS4yNC4zMi0uNzMgMC0xLjI3LS4yNC0xLjYxLS43MS0uMzMtLjQ3LS41LTEuMTQtLjUtMi4wMnptOS40NiA3LjQzdjIuMTZoLTEuNjF2LTkuNTloMS4zM2wuMTIuNzJoLjA1Yy4yOS0uMjQuNjEtLjQ1Ljk3LS42My4zNS0uMTcuNzItLjI2IDEuMS0uMjYuNDMgMCAuODEuMDggMS4xNS4yNC4zMy4xNy42MS40Ljg0LjcxLjI0LjMxLjQxLjY4LjUzIDEuMTEuMTMuNDIuMTkuOTEuMTkgMS40NCAwIC41OS0uMDkgMS4xMS0uMjUgMS41Ny0uMTYuNDctLjM4Ljg1LS42NSAxLjE2LS4yNy4zMi0uNTguNTYtLjk0LjczLS4zNS4xNi0uNzIuMjUtMS4xLjI1LS4zIDAtLjYtLjA3LS45LS4ycy0uNTktLjMxLS44Ny0uNTZ6bTAtMi4zYy4yNi4yMi41LjM3LjczLjQ1LjI0LjA5LjQ2LjEzLjY2LjEzLjQ2IDAgLjg0LS4yIDEuMTUtLjYuMzEtLjM5LjQ2LS45OC40Ni0xLjc3IDAtLjY5LS4xMi0xLjIyLS4zNS0xLjYxLS4yMy0uMzgtLjYxLS41Ny0xLjEzLS41Ny0uNDkgMC0uOTkuMjYtMS41Mi43N3ptNS44Ny0xLjY5YzAtLjU2LjA4LTEuMDYuMjUtMS41MS4xNi0uNDUuMzctLjgzLjY1LTEuMTQuMjctLjMuNTgtLjU0LjkzLS43MXMuNzEtLjI1IDEuMDgtLjI1Yy4zOSAwIC43My4wNyAxIC4yLjI3LjE0LjU0LjMyLjgxLjU1bC0uMDYtMS4xdi0yLjQ5aDEuNjF2OS44OGgtMS4zM2wtLjExLS43NGgtLjA2Yy0uMjUuMjUtLjU0LjQ2LS44OC42NC0uMzMuMTgtLjY5LjI3LTEuMDYuMjctLjg3IDAtMS41Ni0uMzItMi4wNy0uOTVzLS43Ni0xLjUxLS43Ni0yLjY1em0xLjY3LS4wMWMwIC43NC4xMyAxLjMxLjQgMS43LjI2LjM4LjY1LjU4IDEuMTUuNTguNTEgMCAuOTktLjI2IDEuNDQtLjc3di0zLjIxYy0uMjQtLjIxLS40OC0uMzYtLjctLjQ1LS4yMy0uMDgtLjQ2LS4xMi0uNy0uMTItLjQ1IDAtLjgyLjE5LTEuMTMuNTktLjMxLjM5LS40Ni45NS0uNDYgMS42OHptNi4zNSAxLjU5YzAtLjczLjMyLTEuMy45Ny0xLjcxLjY0LS40IDEuNjctLjY4IDMuMDgtLjg0IDAtLjE3LS4wMi0uMzQtLjA3LS41MS0uMDUtLjE2LS4xMi0uMy0uMjItLjQzcy0uMjItLjIyLS4zOC0uM2MtLjE1LS4wNi0uMzQtLjEtLjU4LS4xLS4zNCAwLS42OC4wNy0xIC4ycy0uNjMuMjktLjkzLjQ3bC0uNTktMS4wOGMuMzktLjI0LjgxLS40NSAxLjI4LS42My40Ny0uMTcuOTktLjI2IDEuNTQtLjI2Ljg2IDAgMS41MS4yNSAxLjkzLjc2cy42MyAxLjI1LjYzIDIuMjF2NC4wN2gtMS4zMmwtLjEyLS43NmgtLjA1Yy0uMy4yNy0uNjMuNDgtLjk4LjY2cy0uNzMuMjctMS4xNC4yN2MtLjYxIDAtMS4xLS4xOS0xLjQ4LS41Ni0uMzgtLjM2LS41Ny0uODUtLjU3LTEuNDZ6bTEuNTctLjEyYzAgLjMuMDkuNTMuMjcuNjcuMTkuMTQuNDIuMjEuNzEuMjEuMjggMCAuNTQtLjA3Ljc3LS4ycy40OC0uMzEuNzMtLjU2di0xLjU0Yy0uNDcuMDYtLjg2LjEzLTEuMTguMjMtLjMxLjA5LS41Ny4xOS0uNzYuMzFzLS4zMy4yNS0uNDEuNGMtLjA5LjE1LS4xMy4zMS0uMTMuNDh6bTYuMjktMy42M2gtLjk4di0xLjJsMS4wNi0uMDcuMi0xLjg4aDEuMzR2MS44OGgxLjc1djEuMjdoLTEuNzV2My4yOGMwIC44LjMyIDEuMi45NyAxLjIuMTIgMCAuMjQtLjAxLjM3LS4wNC4xMi0uMDMuMjQtLjA3LjM0LS4xMWwuMjggMS4xOWMtLjE5LjA2LS40LjEyLS42NC4xNy0uMjMuMDUtLjQ5LjA4LS43Ni4wOC0uNCAwLS43NC0uMDYtMS4wMi0uMTgtLjI3LS4xMy0uNDktLjMtLjY3LS41Mi0uMTctLjIxLS4zLS40OC0uMzctLjc4LS4wOC0uMy0uMTItLjY0LS4xMi0xLjAxem00LjM2IDIuMTdjMC0uNTYuMDktMS4wNi4yNy0xLjUxcy40MS0uODMuNzEtMS4xNGMuMjktLjMuNjMtLjU0IDEuMDEtLjcxLjM5LS4xNy43OC0uMjUgMS4xOC0uMjUuNDcgMCAuODguMDggMS4yMy4yNC4zNi4xNi42NS4zOC44OS42N3MuNDIuNjMuNTQgMS4wM2MuMTIuNDEuMTguODQuMTggMS4zMiAwIC4zMi0uMDIuNTctLjA3Ljc2aC00LjM3Yy4wOC42Mi4yOSAxLjEuNjUgMS40NC4zNi4zMy44Mi41IDEuMzguNS4zIDAgLjU4LS4wNC44NC0uMTMuMjUtLjA5LjUxLS4yMS43Ni0uMzdsLjU0IDEuMDFjLS4zMi4yMS0uNjkuMzktMS4wOS41M3MtLjgyLjIxLTEuMjYuMjFjLS40NyAwLS45Mi0uMDgtMS4zMy0uMjUtLjQxLS4xNi0uNzctLjQtMS4wOC0uNy0uMy0uMzEtLjU0LS42OS0uNzItMS4xMy0uMTctLjQ0LS4yNi0uOTUtLjI2LTEuNTJ6bTQuNjEtLjYyYzAtLjU1LS4xMS0uOTgtLjM0LTEuMjgtLjIzLS4zMS0uNTgtLjQ3LTEuMDYtLjQ3LS40MSAwLS43Ny4xNS0xLjA4LjQ1LS4zMS4yOS0uNS43My0uNTcgMS4zem0zLjAxIDIuMjNjLjMxLjI0LjYxLjQzLjkyLjU3LjMuMTMuNjMuMi45OC4yLjM4IDAgLjY1LS4wOC44My0uMjNzLjI3LS4zNS4yNy0uNmMwLS4xNC0uMDUtLjI2LS4xMy0uMzctLjA4LS4xLS4yLS4yLS4zNC0uMjgtLjE0LS4wOS0uMjktLjE2LS40Ny0uMjNsLS41My0uMjJjLS4yMy0uMDktLjQ2LS4xOC0uNjktLjMtLjIzLS4xMS0uNDQtLjI0LS42Mi0uNHMtLjMzLS4zNS0uNDUtLjU1Yy0uMTItLjIxLS4xOC0uNDYtLjE4LS43NSAwLS42MS4yMy0xLjEuNjgtMS40OS40NC0uMzggMS4wNi0uNTcgMS44My0uNTcuNDggMCAuOTEuMDggMS4yOS4yNXMuNzEuMzYuOTkuNTdsLS43NC45OGMtLjI0LS4xNy0uNDktLjMyLS43My0uNDItLjI1LS4xMS0uNTEtLjE2LS43OC0uMTYtLjM1IDAtLjYuMDctLjc2LjIxLS4xNy4xNS0uMjUuMzMtLjI1LjU0IDAgLjE0LjA0LjI2LjEyLjM2cy4xOC4xOC4zMS4yNmMuMTQuMDcuMjkuMTQuNDYuMjFsLjU0LjE5Yy4yMy4wOS40Ny4xOC43LjI5cy40NC4yNC42NC40Yy4xOS4xNi4zNC4zNS40Ni41OC4xMS4yMy4xNy41LjE3LjgyIDAgLjMtLjA2LjU4LS4xNy44My0uMTIuMjYtLjI5LjQ4LS41MS42OC0uMjMuMTktLjUxLjM0LS44NC40NS0uMzQuMTEtLjcyLjE3LTEuMTUuMTctLjQ4IDAtLjk1LS4wOS0xLjQxLS4yNy0uNDYtLjE5LS44Ni0uNDEtMS4yLS42OHoiIGZpbGw9IiM1MzUzNTMiLz48L2c+PC9zdmc+\"><\/a><\/p>\n<div>\n<h3 id=\"citeas\">Cite this article<\/h3>\n<p>Hie, B.L., Shanker, V.R., Xu, D. <i>et al.<\/i> Efficient evolution of human antibodies from general protein language models.<br \/>\n                    <i>Nat Biotechnol<\/i>  (2023). https:\/\/doi.org\/10.1038\/s41587-023-01763-2<\/p>\n<p><a data-test=\"citation-link\" data-track=\"click\" data-track-action=\"download article citation\" data-track-label=\"link\" data-track-external rel=\"nofollow\" href=\"https:\/\/citation-needed.springer.com\/v2\/references\/10.1038\/s41587-023-01763-2?format=refman&#038;flavour=citation\">Download citation<\/a><\/p>\n<ul data-test=\"publication-history\">\n<li>\n<p>Received<span>: <\/span><span><time datetime=\"2022-11-23\">23 November 2022<\/time><\/span><\/p>\n<\/li>\n<li>\n<p>Accepted<span>: <\/span><span><time datetime=\"2023-03-28\">28 March 2023<\/time><\/span><\/p>\n<\/li>\n<li>\n<p>Published<span>: <\/span><span><time datetime=\"2023-04-24\">24 April 2023<\/time><\/span><\/p>\n<\/li>\n<li>\n<p><abbr title=\"Digital Object Identifier\">DOI<\/abbr><span>: <\/span><span>https:\/\/doi.org\/10.1038\/s41587-023-01763-2<\/span><\/p>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div><\/div>\n<p><a href=\"https:\/\/www.nature.com\/articles\/s41587-023-01763-2\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><br \/>\n Brian L. Hie<\/p>\n","protected":false},"excerpt":{"rendered":"<p>MainEvolution searches across an immense space of possible sequences for rare mutations that improve fitness1,2. In nature, this search is based on simple processes of random mutation and recombination1, but using the same approach for directed evolution of proteins in the laboratory3 imposes a considerable experimental burden. Artificial evolution based on random guessing or brute<\/p>\n","protected":false},"author":1,"featured_media":641429,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25304,23857,536],"tags":[],"class_list":["post-641428","post","type-post","status-publish","format-standard","has-post-thumbnail","category-efficient","category-evolution","category-science-nature"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/641428","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=641428"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/641428\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/641429"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=641428"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=641428"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=641428"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}