Table 3.

Perplexity comparison between the protein language model (LM) ESM-2 (Lin et al. 2023), the antibody-specific LMs AntiBERTy (Ruffolo et al. 2021) and AbLang-1 (Olsen et al. 2022b), and our new selection of antibody-specific LMs (see Section 2.4).a

Germline residues
Nongermline residues
Heavy
Light
Heavy
Light
FWRCDR1/2FWRCDR1/2FWRCDR1/2CDR3FWRCDR1/2CDR3
ESM-21.914.122.546.1132.0324.3620.8523.2019.3724.29
AntiBERTy1.051.101.171.2829.6421.5118.4440.1421.7516.95
AbLang-11.031.081.071.1625.8017.7314.4752.1425.7216.75
Ab-Unpaired1.021.071.011.0526.8118.9514.4237.6019.3717.25
Ab-Paired1.021.061.021.0527.2418.7014.2338.9519.2516.98
Ab-FL1.101.171.091.1610.3311.1812.6910.8210.2411.04
Ab-ModMask1.111.181.091.1710.2611.1313.1810.7810.1911.42
Ab-FT1.111.181.101.1810.8811.9113.6711.2510.6312.29
AbLang-21.101.171.091.169.9211.1312.4710.099.5410.77
Germline residues
Nongermline residues
Heavy
Light
Heavy
Light
FWRCDR1/2FWRCDR1/2FWRCDR1/2CDR3FWRCDR1/2CDR3
ESM-21.914.122.546.1132.0324.3620.8523.2019.3724.29
AntiBERTy1.051.101.171.2829.6421.5118.4440.1421.7516.95
AbLang-11.031.081.071.1625.8017.7314.4752.1425.7216.75
Ab-Unpaired1.021.071.011.0526.8118.9514.4237.6019.3717.25
Ab-Paired1.021.061.021.0527.2418.7014.2338.9519.2516.98
Ab-FL1.101.171.091.1610.3311.1812.6910.8210.2411.04
Ab-ModMask1.111.181.091.1710.2611.1313.1810.7810.1911.42
Ab-FT1.111.181.101.1810.8811.9113.6711.2510.6312.29
AbLang-21.101.171.091.169.9211.1312.4710.099.5410.77
a

While most of the models are near perfect at predicting masked germline residues, predictions for nongermline (NGL) residues show significantly higher perplexities. For ESM-2, AntiBERTy, AbLang-1, Ab-Unpaired, and Ab-Paired NGL perplexities are close to or worse than a random prediction. The largest improvement for NGL prediction came from switching to focal loss. Scaling up the model also improved performance, e.g. as seen by AbLang-2’s performances compared to Ab-FT. The best perplexity for each region is shown in bold.

Table 3.

Perplexity comparison between the protein language model (LM) ESM-2 (Lin et al. 2023), the antibody-specific LMs AntiBERTy (Ruffolo et al. 2021) and AbLang-1 (Olsen et al. 2022b), and our new selection of antibody-specific LMs (see Section 2.4).a

Germline residues
Nongermline residues
Heavy
Light
Heavy
Light
FWRCDR1/2FWRCDR1/2FWRCDR1/2CDR3FWRCDR1/2CDR3
ESM-21.914.122.546.1132.0324.3620.8523.2019.3724.29
AntiBERTy1.051.101.171.2829.6421.5118.4440.1421.7516.95
AbLang-11.031.081.071.1625.8017.7314.4752.1425.7216.75
Ab-Unpaired1.021.071.011.0526.8118.9514.4237.6019.3717.25
Ab-Paired1.021.061.021.0527.2418.7014.2338.9519.2516.98
Ab-FL1.101.171.091.1610.3311.1812.6910.8210.2411.04
Ab-ModMask1.111.181.091.1710.2611.1313.1810.7810.1911.42
Ab-FT1.111.181.101.1810.8811.9113.6711.2510.6312.29
AbLang-21.101.171.091.169.9211.1312.4710.099.5410.77
Germline residues
Nongermline residues
Heavy
Light
Heavy
Light
FWRCDR1/2FWRCDR1/2FWRCDR1/2CDR3FWRCDR1/2CDR3
ESM-21.914.122.546.1132.0324.3620.8523.2019.3724.29
AntiBERTy1.051.101.171.2829.6421.5118.4440.1421.7516.95
AbLang-11.031.081.071.1625.8017.7314.4752.1425.7216.75
Ab-Unpaired1.021.071.011.0526.8118.9514.4237.6019.3717.25
Ab-Paired1.021.061.021.0527.2418.7014.2338.9519.2516.98
Ab-FL1.101.171.091.1610.3311.1812.6910.8210.2411.04
Ab-ModMask1.111.181.091.1710.2611.1313.1810.7810.1911.42
Ab-FT1.111.181.101.1810.8811.9113.6711.2510.6312.29
AbLang-21.101.171.091.169.9211.1312.4710.099.5410.77
a

While most of the models are near perfect at predicting masked germline residues, predictions for nongermline (NGL) residues show significantly higher perplexities. For ESM-2, AntiBERTy, AbLang-1, Ab-Unpaired, and Ab-Paired NGL perplexities are close to or worse than a random prediction. The largest improvement for NGL prediction came from switching to focal loss. Scaling up the model also improved performance, e.g. as seen by AbLang-2’s performances compared to Ab-FT. The best perplexity for each region is shown in bold.

Close
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close

This PDF is available to Subscribers Only

View Article Abstract & Purchase Options

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Close