4,084
Views
0
CrossRef citations to date
0
Altmetric
Review

Applications and challenges in designing VHH-based bispecific antibodies: leveraging machine learning solutions

Article: 2341443 | Received 22 Sep 2023, Accepted 05 Apr 2024, Published online: 26 Apr 2024

Figures & data

Figure 1. Schematic representation of a conventional antibody vs a camelid-derived heavy chain-only antibody. (A) Conventional antibodies are composed of two heterodimeric chains, two identical heavy chain fragments consisting of the constant heavy domains CH1-CH3 and the antigen binding variable heavy domain (highlighted in pink) and two identical light chains composed of a constant light-chain domain and a variable light chain domain (highlighted in blue). (B) HCAb antibodies are composed of two identical heavy chain-only fragments that lack a CH1 domain and only have a single variable heavy domain (VHH). (C) Ribbon representation of the crystal structure of a VHH domain (PDB: 1I3V) showing the complementary-determining loops, CDR1 in green, CDR2 in magenta and CDR3 in orange. Framework regions are shown in gray. 3D structure created using Pymol.

Representation of human and llama antibodies using oval shapes for domains connected together by lines into a Y-shape. One of the llama oval shapes has a cartoon associated showing the structure of the domain in three dimensions.
Figure 1. Schematic representation of a conventional antibody vs a camelid-derived heavy chain-only antibody. (A) Conventional antibodies are composed of two heterodimeric chains, two identical heavy chain fragments consisting of the constant heavy domains CH1-CH3 and the antigen binding variable heavy domain (highlighted in pink) and two identical light chains composed of a constant light-chain domain and a variable light chain domain (highlighted in blue). (B) HCAb antibodies are composed of two identical heavy chain-only fragments that lack a CH1 domain and only have a single variable heavy domain (VHH). (C) Ribbon representation of the crystal structure of a VHH domain (PDB: 1I3V) showing the complementary-determining loops, CDR1 in green, CDR2 in magenta and CDR3 in orange. Framework regions are shown in gray. 3D structure created using Pymol.

Figure 2. Structural representation of framework 2 residues in human VH and camelid VHHs. Ribbon representation of the crystal structure of (A) human VH (PDB: 7OBF) and (B) camelid-derived VHH (PDB: 1JTT) highlighting the framework 2 positions for each domain. Position 37: Human Valine, Camelid Phenylalanine (red), Position 44: Human Glycine, Camelid Glutamic acid (blue), position 45: Human Leucine, Camelid Arginine (yellow) and position 47: Human Tryptophan, Camelid Glycine (cyan); Kabat numbering. The interaction position of the human variable light chain with the human variable heavy chain is represented by the dotted line. The complementary-determining loops, CDR1 in green, CDR2 in magenta and CDR3 in orange. Framework regions are shown in gray. 3D structure created using Pymol.

A cartoon representation of human and llama heavy chain domains with arrows and lines representing the structure of each domain. Spheres coloured in blue, red, yellow, and cyan represent amino acids in three-dimensional space.
Figure 2. Structural representation of framework 2 residues in human VH and camelid VHHs. Ribbon representation of the crystal structure of (A) human VH (PDB: 7OBF) and (B) camelid-derived VHH (PDB: 1JTT) highlighting the framework 2 positions for each domain. Position 37: Human Valine, Camelid Phenylalanine (red), Position 44: Human Glycine, Camelid Glutamic acid (blue), position 45: Human Leucine, Camelid Arginine (yellow) and position 47: Human Tryptophan, Camelid Glycine (cyan); Kabat numbering. The interaction position of the human variable light chain with the human variable heavy chain is represented by the dotted line. The complementary-determining loops, CDR1 in green, CDR2 in magenta and CDR3 in orange. Framework regions are shown in gray. 3D structure created using Pymol.

Figure 3. Overview of VHH antibody discovery methods. VHH antibodies are typically sourced from either alpacas or llamas after immunization with an antigen of choice. An alternative source can be a naive library from llamas or alpacas, or a synthetic library design using a combination of sequence and structural information. Constructed libraries can be interrogated by phage display and combined with a machine learning approach to identify key attributes for affinity, developability and function.

Figure 3. Overview of VHH antibody discovery methods. VHH antibodies are typically sourced from either alpacas or llamas after immunization with an antigen of choice. An alternative source can be a naive library from llamas or alpacas, or a synthetic library design using a combination of sequence and structural information. Constructed libraries can be interrogated by phage display and combined with a machine learning approach to identify key attributes for affinity, developability and function.

Figure 4. Targeting tumor cells co-expressing two different receptors at the cell surface (left) not co-expressed on normal cells with a multispecific VHH, thereby minimizing antibody engagement against healthy cells.

A two section image showing how an antibody can bind to a tumor cell and T-cell at the same time. Gray shapes with two small lines projected outwards represent the cell membrane. Embedded in this membrane are colored rectangular boxes connected by thick lines. These structures represent the cell surface receptors responsible for tumor growth. On the left is an antibody represented by oval shapes connected by a wavy line, and the antibody is touching the t-cell and tumor cell at the same time. On the right, there is only one tumor cell with fewer receptors, and the antibody does not bind to the cell due to the low level of receptors.
Figure 4. Targeting tumor cells co-expressing two different receptors at the cell surface (left) not co-expressed on normal cells with a multispecific VHH, thereby minimizing antibody engagement against healthy cells.

Figure 5. Topological representation of examples of clinically relevant VHH-based therapeutic bispecifics.

A two section picture representing different types of antibody formats. On the left-hand side are five images one above the other. These images contain colored oval shapes representing llama or human antibody domains connected by wavy lines. On the right hand side of the image are three more colored oval shapes representing llama or human antibody domains connected by wavy lines. Below are two images that represent antibodies using oval shapes for domains connected together by lines into a Y-shape.
Figure 5. Topological representation of examples of clinically relevant VHH-based therapeutic bispecifics.

Table 1. Preclinical and clinical stage VHH-based bispecific molecules. All data correct as of 1st September 2023.

Figure 6. Overview of machine learning tools for antibodies and VHHs. (A) Tools for structural prediction, encompassing the ImmuneBuilder suite, IgFold, and Nanonet. (B) Language model-based sequence optimization with AbLang, AntiBERTy, IgLM, Sapiens and NanoBERT, alongside Bayesian Optimization techniques like AntBO and LaMBO2. Additionally, humanization tools include Hu-Mab, PLAN and CUMAb.

A four section picture with curved arrows and lines curving into a barrel-like structure representing the three-dimensional structure of a llama heavy chain antibody domain. To the right of this image is a gray rectangular box with letters connected to the gray box by straight lines representing amino acids. To the right of this image is a three-dimensional graph represented by three gray surfaces at right-angles to each other. In the center of this graph is a blue mountain-like structure with yellow peaks. This image represents how Bayesian optimization works. To the right of this graph is a cartoon image of a llama, with an arrow pointing to a cartoon image of a male human figure. This image represents the humanization process.
Figure 6. Overview of machine learning tools for antibodies and VHHs. (A) Tools for structural prediction, encompassing the ImmuneBuilder suite, IgFold, and Nanonet. (B) Language model-based sequence optimization with AbLang, AntiBERTy, IgLM, Sapiens and NanoBERT, alongside Bayesian Optimization techniques like AntBO and LaMBO2. Additionally, humanization tools include Hu-Mab, PLAN and CUMAb.

Table 2. Summary of language models applicable to antibodies and VHHs.