6,318
Views
10
CrossRef citations to date
0
Altmetric
Research Paper

Challenges in antibody structure prediction

, , , , , & ORCID Icon show all
Article: 2175319 | Received 10 Nov 2022, Accepted 27 Jan 2023, Published online: 12 Feb 2023

ABSTRACT

Advances in structural biology and the exponential increase in the amount of high-quality experimental structural data available in the Protein Data Bank has motivated numerous studies to tackle the grand challenge of predicting protein structures. In 2020 AlphaFold2 revolutionized the field using a combination of artificial intelligence and the evolutionary information contained in multiple sequence alignments. Antibodies are one of the most important classes of biotherapeutic proteins. Accurate structure models are a prerequisite to advance biophysical property predictions and consequently antibody design. Specialized tools used to predict antibody structures based on different principles have profited from current advances in protein structure prediction based on artificial intelligence. Here, we emphasize the importance of reliable protein structure models and highlight the enormous advances in the field, but we also aim to increase awareness that protein structure models, and in particular antibody models, may suffer from structural inaccuracies, namely incorrect cis-amide bonds, wrong stereochemistry or clashes. We show that these inaccuracies affect biophysical property predictions such as surface hydrophobicity. Thus, we stress the importance of carefully reviewing protein structure models before investing further computing power and setting up experiments. To facilitate the assessment of model quality, we provide a tool “TopModel” to validate structure models.

Breakthroughs in protein/antibody structure prediction

Predicting the three-dimensional (3D) structure of a protein based solely on the amino-acid sequence is one of the grand challenges in the field of protein structure prediction.Citation1 Accurate prediction of the 3D structure of a protein is critical to understand its function, as the shape of the protein determines its properties and ultimately its function. To determine the state-of-the-art methods in protein structure prediction, the biennial community-based benchmarking experiment “Critical Assessment of methods in protein Structure Prediction (CASP)” was established.Citation2–4 In CASP14 (2020), DeepMind showcased AlphaFold2, a program based on artificial intelligence (AI) that directly processes multiple sequence alignments.Citation5 Comparable accuracies in predicting protein structures can also be achieved with other methods including RoseTTAFold,Citation6 and specialized tools for antibodies which incorporate the recent advances.Citation7–9 Those tools are highly accurate based on global measures, often with root mean square deviations (RMSDs) to the crystal structure of less than 1 Å. However, there are often higher inaccuracies in specific parts of the protein that should be carefully reviewed.Citation10,Citation11 Post-translational modifications are omitted, but can sometimes be added afterwards.Citation12 Furthermore, the accuracy for multimers, such as antibodies, is still lower.Citation13 Additional challenges can arise for antibodies since VDJ recombination events do not follow the classical pathway of evolution.Citation14

Antibodies are crucial components of the adaptive immune response.Citation15 Genetic recombination and somatic hypermutation events enable the adaptive immune system to produce a vast number of antibodies against a variety of pathogens.Citation14 To understand and optimize antigen recognition and to enable rational design of antibodies, accurate structure models are essential.Citation16 Despite these recent advances, accurate structure prediction of antibodies remains challenging and still needs to be extensively validated. In particular, the flexible loops involved in recognizing the antigen pose a major challenge.Citation17,Citation18 In comparison to other protein superfamilies, the fold of antibodies is generally highly conserved.Citation19–21 In particular, the framework of the antigen-binding fragment (Fab) is structurally almost identical for all antibodies.Citation22,Citation23 However, the area hardest to predict accurately is the six hypervariable loops that can form, together with several framework residues, the antigen-binding site, engaging with the respective epitope. These loops are also known as the complementarity-determining region (CDR) and provide the sequence and structure diversity essential to recognize a wide range of antigens. Five of the six loops tend to adopt canonical cluster folds based on their length and sequence composition. However, the third CDR loop of the heavy chain, the CDR-H3 loop, is the most diverse in length, sequence and structure and therefore is the most challenging loop to predict accurately.

Various methodologies have been developed to improve and advance antibody structure prediction, by incorporating different homology model strategies or using the canonical cluster model,Citation24–28 but recently various antibody-specific deep learning methods such as ABlooper, DeepAb and IgFold have significantly improved the CDR loop modeling accuracy.Citation7–9,Citation29 The predicted structure models achieve similar or better quality than methods that are able to predict all types of protein structures (including AlphaFold2).Citation13,Citation30 Additionally, the CDR-H3 loop conformation is also strongly influenced by the relative interdomain orientation, as it is located in the center, directly in the interface between the heavy and the light chains. It has been shown that, in addition to the CDR loops, the relative interdomain orientation plays a crucial role in defining the shape of the antigen binding site.Citation31–33 Therefore, accurate predictions of the relative VH-VL interdomain orientations are vital to characterize the topology of the antigen binding site.Citation24 To measure the relative VH-VL interdomain orientation, various approaches have been used, the most widely known being ABangle.Citation32 All these improvements have enabled predictions of a vast number of antibody structures at a high level of accuracy, which can then further be used as input structures for virtual screening or to inform rational design of antibodies.Citation16,Citation17

Possible inaccuracies in antibody structure models

Here, we investigated a dataset consisting of 137 antibody sequences published by Jain et al.Citation34 and used the available antibody structure prediction tools ABlooper,Citation7 IgFold,Citation8 DeepAb,Citation9 Immunebuilder,Citation29and the MOE Antibody ModelerCitation35 to generate structure models for further biophysical characterizations.Citation36,Citation37 Careful inspection of the generated models revealed inconsistencies such as cis-amide bonds in the CDR loops, D-amino acids and severe clashes. In total, this resulted in up to 300 D-amino acids and up to 240 cis-amide bonds for the 137 antibody models. We found cis-amide bonds and clashes independent of the applied antibody modeling tools. The only tools that did not introduce any D-amino acids are DeepAb and Immunebuilder. The recently available Immunebuilder tool performs physical plausibility checks, i.e., checks for steric clashes, cis-amide bonds, or bonds with nonphysical lengths. These structural inaccuracies affect the results of structure-based biophysical property predictions.Citation36,Citation37 In addition to the Jain et al. dataset,Citation34 we predicted the structure of the CIS43 antibody, where the experimental X-ray structure (PDB accession code: 7SG5)Citation38 was released after the structure prediction tools were published, to have an experimental reference structure outside of the used training set. shows an overlay of all obtained structure models and reveals an overall high structural similarity, reflected in low overall RMSD values (~1 Å). However, substantial structural variability can be observed in the CDR-H3 loop (RMSD values >2 Å). This result points out one particular challenge in predicting antibody structures: the high variability of the antibody CDR loops cannot be captured or represented by a single static structure.Citation39–41 While there can be properties and metrics that are not too much affected by these issues, metrics that rely on accurate CDR-H3 structures will be strongly distorted. This includes antibody-antigen docking, since the CDR-H3 loop is a central part of the binding interface, as well as structure-based hydrophobicity calculations, since the binding interface frequently contains more hydrophobic amino acids than the rest of the antibody surface.Citation16,Citation17,Citation36 Molecular dynamics (MD) simulations might correctly capture the ensemble in solution if the starting structure is sufficiently close, but current MD approaches cannot sample cis-trans isomerization or transitions from D- to L-amino acids, leading to ensembles that are potentially worse than the starting structure in terms of RMSD to the crystal structure (SI Figure S1 and S2). Additionally, SI Figure S2 shows the relative interdomain orientations of the CIS43 antibody obtained by MD simulations with the orientations of the models (originating from different tools) projected as vertical lines. For this example, the models differ in their interdomain orientation up to 5° from the available X-ray structure. However, all of the orientations found in models preexist in the MD ensemble distribution.

Figure 1. A) Comparison of the available X-ray structure of the CIS43 antibody (PDB code: 7SG5) with the structure models generated with different antibody prediction tools, namely ABlooper, MOE, DeepAb, IgFold and ImmuneBuilder. B) Structural overlay of the obtained Fv models, showing the high variability in the CDR-H3 loop. C) Cα-RMSD matrix of the X-ray structure and the respective models for the whole Fv. D) Cα-RMSD matrix of the X-ray structure and the respective models for the CDR-H3 loop.

Figure 1. A) Comparison of the available X-ray structure of the CIS43 antibody (PDB code: 7SG5) with the structure models generated with different antibody prediction tools, namely ABlooper, MOE, DeepAb, IgFold and ImmuneBuilder. B) Structural overlay of the obtained Fv models, showing the high variability in the CDR-H3 loop. C) Cα-RMSD matrix of the X-ray structure and the respective models for the whole Fv. D) Cα-RMSD matrix of the X-ray structure and the respective models for the CDR-H3 loop.

To show the effect of starting structures with cis-amide bonds and D-amino acids in the CDR loops, we compared the surface hydrophobicity of structure models for the CIS43 antibody variant with the X-ray structure (). The surface hydrophobicity was assigned using the hydrophobicity scale by Wimley and White.Citation42 We found differences in the surface hydrophobicity, which is expected as hydrophobicity is potentially a strongly conformation-dependent property, since small sidechain rearrangements may expose otherwise buried hydrophobic groups. While small inaccuracies in the atomic positions can be fixed by MD simulations, the correct sidechain packing is often impossible when D-amino acids or cis-amide bonds are present, leading to almost irreparable errors in the biophysical property estimation. The same is true for antibody-antigen docking, where an accurate representation of the surface is required to find the correct interactions with the antigen.

Figure 2. Surface hydrophobicity mapped on the X-ray structure and two antibody models. Surface hydrophobicity was assigned by the Whimley and White hydrophobicity scale. Hydrophobic areas are colored in yellow, while hydrophilic parts are depicted in blue.

Surface representation of the respective antibody Fv models, color-coded based on their hydrophobicity. In yellow areas with high surface hydrophobicity are depicted, while in blue areas with low surface hydrophobicity are illustrated.
Figure 2. Surface hydrophobicity mapped on the X-ray structure and two antibody models. Surface hydrophobicity was assigned by the Whimley and White hydrophobicity scale. Hydrophobic areas are colored in yellow, while hydrophilic parts are depicted in blue.

shows examples of cis-amide bonds and D-amino acids in the CDR-H3 loop of CIS43.

Figure 3. Examples of structural inaccuracies observed in some of the models, namely cis-amide bonds, D-amino acids and Van der Waals clashes.

Stick representation of the observed structural inaccuracies, i.e., clashes, D-amino acids, cis-amide bonds (showing the different angle for the amide bond). For the D-amino acids we added dashed lines to indicate information about the stereochemistry.
Figure 3. Examples of structural inaccuracies observed in some of the models, namely cis-amide bonds, D-amino acids and Van der Waals clashes.

Additionally, in one of the obtained models we found a missing proline sidechain at the tip of the CDR-H3 loop. Rebuilding the proline results in severe clashes, as shown in the right panel of , and the only way to avoid these clashes would be to build a D-Proline instead.

To facilitate the identification of these issues, we present a tool (available on Github: https://github.com/liedllab/TopModel), called “TopModel”, that quickly checks the structure for cis-amide bonds, D-amino acids, and clashes. With this tool, structure models can rapidly be checked to assess the quality and accuracy of the models before performing further analysis. At the same time, it offers the possibility to directly visualize these issues in PyMOLCitation43. shows the output obtained by “TopModel”. Residues colored in magenta (D-amino acids) and red (cis-amide bonds) represent issues that should be fixed, while cis-Prolines are colored green, as they occasionally occur in native protein structures. Van der Waals clashes are colored in yellow. For the clashes we also provide an optional score, to quantify the quality of the model, that takes the number of clashes and the length of the protein into account, to quantify the quality of the model. Non-planar amide bonds are depicted in cyan. As these issues can also be found for other protein structure models apart from antibodies, we recommend checking every model with “TopModel” and stress the importance of validating the obtained structures to ensure the most accurate results/predictions as possible.

Figure 4. Pymol visualization of the structural inaccuracies, as output of the “TopModel” tool.

Cartoon representation of the Fv, showing the structural inaccuracies as sticks. The sticks are color-coded according to the detected issue (purple – D-amino acids, red – cis-amide bonds, green – cis-proline, cyan – non-planar amide bond).
Figure 4. Pymol visualization of the structural inaccuracies, as output of the “TopModel” tool.

Discussion

While we want to highlight the enormous advances in the protein structure prediction field, at the same time, we must also emphasize the importance of critically reviewing structure models, especially antibody models, before basing conclusions, further experiments, or MD simulations on potentially erroneous models.Citation44 In particular, we stress the importance of not limiting the characterization of the antigen binding site to a single-static structure model, as there is already high structural variability between the different models (). This high variability suggests that even considering ensembles in solution might more accurately reflect the properties and functions of the antibody (SI Figure S1-S3).Citation18 The tremendous development in protein structure prediction enables fast protein structure predictions that approach the accuracy of experimental structures.Citation30 These breakthroughs have been achieved by combining artificial intelligence with an effective exploitation of the available structural information and incorporation of evolutionary related sequences through multiple sequence alignments (MSAs).Citation30 AlphaFold, RoseTTAFold, ESMFold and other methods have revolutionized protein structure prediction.Citation6,Citation30,Citation45–47 Various deep learning (based) approaches have also been shown to improve antibody structure prediction and outperform all previously available antibody structure prediction methods.Citation7–9,Citation48 Accurately predicting the structure of antibodies is central to understand their function, to elucidate antibody-antigen binding and inform rational antibody design.

However, the most challenging part in antibody structures prediction is concentrated in the six CDR loops, as they reveal the highest variability in both sequence and structure.Citation15,Citation26 In particular, the CDR-H3 loop reveals the highest diversity and variability, which impedes the accurate prediction of its structure.Citation40,Citation49 This is in line with our findings, as the comparison of different antibody prediction methods reveals the most diverging results for the CDR-H3 loop. shows an overlay of all the obtained models, highlighting the high conformational variability of the CDR-H3 loop. To account for this high diversity of the CDR loops one single static structure might not be sufficient and therefore the CDR loops should rather be characterized as ensembles in solution.Citation18,Citation40 Another critical aspect in antibody structure prediction is the relative interdomain orientation (VH-VL).Citation32 The orientation of the two variable domains, VH and VL, strongly influences the shape of the antigen binding site and consequently plays a role in antigen recognition.Citation31,Citation50 A comparison of the different structure models of the CIS43 antibody reveals differences in the interdomain orientations of up to 5°, which can be captured already with short MD simulations (SI Figure S3).Citation50,Citation51 Thus, these findings suggest, that already short MD simulations can be sufficient to optimize the interdomain orientation.Citation50

This is especially important, as various biophysical properties of antibodies, such as hydrophobicity, are conformation dependent and already small sidechain rearrangements reveal distinct surface properties.Citation36,Citation37 MD simulations provide such ensembles in solution, increasing the probability that conformations determining biophysical properties are captured.Citation36 In agreement with these observations, we find that these conformational differences between the models can result in changes in the surface hydrophobicity.

In addition to the high divergence in CDR-H3 loop conformations, we found various cis-amide bonds, D-amino acids, and clashes in the obtained models. Such modeling artifacts are non-natural and can strongly influence biophysical property predictions, resulting in misleading conclusions (SI Figure S1).Citation44 Thus, to address these pitfalls, we provide a tool that quickly inspects protein structure models and identifies issues and flaws in the protein structures, namely the python package “TopModel”. As accurate antibody structures are a prerequisite to reliably understand antibody function and characterize biophysical properties, we strongly suggest an additional validation of the respective structure models to increase the quality of the respective predictions.

Methods

The tool “TopModel” (version 1.0) inspects and highlights issues in a structure model. “TopModel” checks the chirality, amide bond stereochemistry and Van der Waals clashes for every residue in the structure model. The structure models are parsed and analyzed using biopython.Citation52 To calculate the chirality a triangle is defined based on the priority of the atom chains around the chiral center. The direction of a normal vector to this plane, calculated using the cross product, is determined by the priority of the atom chains. By calculating the dot product of this normal vector and a vector from the chiral center to the plane, the orientation of the three atom chains with respect to the center can be determined. Based on the sign of the resulting scalar, the chirality can be assigned. This approach allows us to calculate the chirality even if no hydrogens are included in the structure model. The amide bond is inspected by calculating the dihedral angle of the protein backbone. Cis-amide bonds to proline are labeled separately as they naturally occur more frequently than in other amino acids. Dihedral angles that could neither be assigned cis nor trans are labeled as non-planar. The Van der Waals clashes are quickly computed using a k-d treeCitation53 and Van der Waals radii data gathered using the python package Mendeleev.Citation54 For the clashes an optional score can be displayed, which takes the number of clashes and the length of the protein into account. The chiralities and amide bond orientations are not included in the score. To quickly assess the structural implications of the issues found by “TopModel”, the analyzed structure can be opened in PyMOLCitation43 with the issues highlighted and labeled as shown in .

Abbreviations

Author contributions

M.L.F.Q., J.K., F.W. performed research, analyzed data, and drafted the manuscript. A.M.F. performed research and analyzed data. P.K.Q. analyzed data and contributed in writing the manuscript. C.M.D. and K.R.L. supervised the research.

Supplemental material

Supplemental Material

Download MS Word (812.6 KB)

Acknowledgments

The computational results presented her have been achieved (in part) using the Vienna Scientific Cluster (VSC). We acknowledge CHRONOS for awarding us access to Piz Daint at CSCS, Switzerland.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/19420862.2023.2175319

Additional information

Funding

This work was supported by the Austrian Science Fund (FWF) [P34518]. This work was supported by the Austrian Academy of sciences APART-MINT postdoctoral fellowship to M.L.F.Q.

References

  • Al-Lazikani B, Jung J, Xiang Z, Honig B. Protein structure prediction. Curr Opin Chem Biol. 2001;5:51–6.
  • Simons KT, Bonneau R, Ruczinski I, Baker D. Ab initio protein structure prediction of CASP III targets using ROSETTA. Proteins Struc Func Bioinfo. 1999;37:171–76.
  • Moult J, Pedersen JT, Judson R, Fidelis K. A large-scale experiment to assess protein structure prediction methods. Proteins Struc Func Bioinfo. 1995;23:ii–iv.
  • Vreven T, Moal IH, Vangone A, Pierce BG, Kastritis PL, Torchala M, Chaleil R, Jiménez-García B, Bates PA, Fernandez-Recio J, et al. Updates to the integrated protein-protein interaction benchmarks: docking benchmark version 5 and affinity benchmark version 2. J Mol Biol. 2015;427:3031–41.
  • Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, et al. Applying and improving AlphaFold at CASP14. Proteins Struc Func Bioinfo. 2021;89:1711–21.
  • Baek M, DiMaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, Wang J, Cong Q, Kinch LN, Schaeffer RD, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373:871–76.
  • Abanades B, Georges G, Bujotzek A, Deane CM. ABlooper: fast accurate antibody CDR loop structure prediction with accuracy estimation. Bioinformatics. 2022;38:1877–80.
  • Ruffolo JA, Chu L-S, Mahajan SP, Gray JJ. Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies. bioRxiv. 2022
  • Ruffolo JA, Sulam J, Gray JJ. Antibody structure prediction using interpretable deep learning. Patterns. 2022;3:100406.
  • Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr. 1993;26:283–91.
  • Ramachandran GN, Ramakrishnan C, Sasisekharan V. Stereochemistry of polypeptide chain configurations. J Mol Biol. 1963;7:95–99.
  • Bagdonas H, Fogarty CA, Fadda E, Agirre J. The case for post-predictional modifications in the alphafold protein structure database. Nat Struct Mol Biol. 2021;28:869–70.
  • Evans R, O’Neill M, Pritzel A, Antropova N, Senior A, Green T, Žídek A, Bates R, Blackwell S, Yim J, et al. Protein complex prediction with ALPHAFOLD-multimer. bioRxiv. 2022;1–25.
  • Market E, Papavasiliou FN. V(D)J recombination and the evolution of the adaptive immune system. PLoS Biol. 2003;1:e16.
  • Chiu ML, Goulet DR, Teplyakov A, Gilliland GL. Antibody structure and function: the basis for engineering therapeutics. Antibodies (Basel). 2019;8:55.
  • Guest JD, Vreven T, Zhou J, Moal I, Jeliazkov JR, Gray JJ, Weng Z, Pierce BG. An expanded benchmark for antibody-antigen docking and affinity prediction reveals insights into antibody recognition determinants. Structure. 2021;29:606–21.
  • Fernández-Quintero ML, Vangone A, Loeffler JR, Seidler CA, Georges G, Liedl KR. Paratope states in solution improve structure prediction and docking. Structure. 2022;30:430–40.
  • Fernández-Quintero ML, Pomarici ND, Math BA, Kroell KB, Waibl F, Bujotzek A, Georges G, Liedl KR. Antibodies exhibit multiple paratope states influencing VH–VL domain orientations. Commun Bio. 2020;3:589.
  • Wang J-H. The sequence signature of an Ig-fold. Protein Cell. 2013;4:569–72.
  • Youkharibache P. Topological and structural plasticity of the single IG fold and the double IG fold present in CD19. Biomolecules. 2021;11:1290.
  • Lesk AM, Chothia C. Evolution of proteins formed by β-sheets: II The Core of the Immunoglobulin Domains. J Mol Bio. 1982;160:325–42.
  • Tramontano A, Chothia C, Lesk AM. Framework residue 71 is a major determinant of the position and conformation of the second hypervariable region in the VH domains of immunoglobulins. J Mol Biol. 1990;215:175–82.
  • Honegger A, Malebranche AD, Röthlisberger D, Plückthun A. The influence of the framework core residues on the biophysical properties of immunoglobulin heavy chain variable domains. Protein Eng Design Selection. 2009;22:121–34.
  • Almagro JC, Teplyakov A, Luo J, Sweet RW, Kodangattil S, Hernandez-Guzman F, Gilliland GL. Second antibody modeling assessment (AMA-II). Proteins Struc Func Bioinfo. 2014;82:1553–62.
  • Almagro JC, Beavers MP, Hernandez-Guzman F, Maier J, Shaulsky J, Butenhof K, Labute P, Thorsteinson N, Kelly K, Teplyakov A, et al. Antibody modeling assessment. Proteins Struc Func Bioinfo. 2011;79:3050–66.
  • Chothia C, Lesk AM. Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol. 1987;196:901–17.
  • North B, Lehmann A, Dunbrack JRL. A New Clustering of Antibody CDR Loop Conformations. J Mol Biol. 2011;406:228–56.
  • Fasnacht M, Butenhof K, Goupil-Lamy A, Hernandez-Guzman F, Huang H, Yan L. Automated antibody structure prediction using Accelrys tools: results and best practices. Proteins. 2014;82:1583–98.
  • Abanades B, Wong WK, Boyles F, Georges G, Bujotzek A, Deane CM. ImmuneBuilder: deep-Learning models for predicting the structures of immune proteins. bioRxiv. 2022.
  • Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–89.
  • Bujotzek A, Dunbar J, Lipsmeier F, Schäfer W, Antes I, Deane CM, Georges G. Prediction of VH–VL domain orientation for antibody variable domain modeling. Proteins Struc Func Bioinfo. 2015;83:681–95.
  • Dunbar J, Fuchs A, Shi J, Deane CM. ABangle: characterising the VH–VL orientation in antibodies. Protein Eng Design Selection. 2013;26:611–20.
  • Bujotzek A, Lipsmeier F, Harris SF, Benz J, Kuglstatter A, Georges G. VH-VL orientation prediction for antibody humanization candidate selection: a case study. mAbs. 2016;8:288–305.
  • Jain T, Sun T, Durand S, Hall A, Houston NR, Nett JH, Sharkey B, Bobrowicz B, Caffry I, Yu Y, et al. Biophysical properties of the clinical-stage antibody landscape. Proc Natl Acad Sci USA. 2017;114:944.
  • Chemical Computing Group ULC. Molecular operating environment (MOE). Montreal, QC, Canada, H3A 2R7: Chemical Computing Group ULC, S.S.W., Suite #910. 2020.
  • Waibl F, Fernández-Quintero ML, Wedl FS, Kettenberger H, Georges G, Liedl KR. Comparison of hydrophobicity scales for predicting biophysical properties of antibodies. Front Mol Biosci. 2022;9:960194.
  • Waibl F, Fernández-Quintero ML, Kamenik AS, Kraml J, Hofer F, Kettenberger H, Georges G, Liedl KR. Conformational ensembles of antibodies determine their hydrophobicity. Biophys J. 2021;120:143–57.
  • Banach BB, Tripathi P, Da Silva Pereira L, Gorman J, Nguyen TD, Dillon M, Fahad AS, Kiyuka PK, Madan B, Wolfe JR, et al. Highly protective antimalarial antibodies via precision library generation and yeast display screening. J Exp Med. 2022;219:e20220323.
  • Fernández-Quintero ML, Math BF, Loeffler JR, Liedl KR. Transitions of CDR-L3 loop canonical cluster conformations on the micro-to-millisecond timescale. Front Immunol. 2019;10:2652.
  • Fernández-Quintero ML, Kraml J, Georges G, KR L. CDR-H3 loop ensemble in solution – conformational selection upon antibody binding. mAbs. 2019;11:1077–88.
  • Fernández-Quintero ML, Georges G, Varga JM, Liedl KR. Ensembles in solution as a new paradigm for antibody structure prediction and design. mAbs. 2021;13:1923122.
  • Wimley WC, White SH. Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat Struct Biol. 1996;3:842–48.
  • Schrodinger. The PyMOL molecular graphics system, version 1.8. 2015.
  • Schreiner E, Trabuco LG, Freddolino PL, Schulten K. Stereochemical errors and their implications for molecular dynamics simulations. BMC Bioinform. 2011;12:190.
  • Jisna VA, Jayaraj PB. Protein structure prediction: conventional and deep learning perspectives. Protein J. 2021;40:522–44.
  • Bongirwar V, Mokhade AS. Different methods, techniques and their limitations in protein structure prediction: a review. Prog Biophys Mol Biol. 2022;173:72–82.
  • Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W., Santos Costa dos A, Fazel-Zarandi M, Sercu T, Candido S, et al. Language Models of Protein Sequences at the Scale of Evolution Enable Accurate Structure Prediction. bioRxiv. 2022;2022:20.500902.
  • Leem J, Dunbar J, Georges G, Shi J, Deane CM. ABodyBuilder: automated antibody structure prediction with data–driven accuracy estimation. mAbs. 2016;8:1259–68.
  • Regep C, Georges G, Shi J, Popovic B, Deane CM. The H3 loop of antibodies shows unique structural characteristics. Proteins. 2017;85:1311–18.
  • Fernández-Quintero ML, Hoerschinger VJ, Lamp LM, Bujotzek A, Georges G, KR L. VH-VL interdomain dynamics observed by computer simulations and NMR. Proteins Struc Func Bioinfo. 2020;88:830–39.
  • Fernández-Quintero ML, Kroell KB, Heiss MC, Loeffler JR, Quoika PK, Waibl F, Bujotzek A, Moessner E, Georges G, Liedl KR. Surprisingly fast interface and elbow angle dynamics of antigen-binding fragments. Front Mol Biosci. 2020;7:339.
  • Hamelryck T, Manderick B. PDB file parser and structure class implemented in Python. Bioinformatics. 2003;19:2308–10.
  • Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Meth. 2020;17:261–72.
  • Mentel Ł. mendeleev - A python package with properties of chemical elements, ions, isotopes and methods to manipulate and visualize periodic table. 2014. Available at: https://github.com/lmmentel/mendeleev.