CryptoBench: cryptic protein–ligand binding sites dataset and benchmark

Abstract

Motivation

Structure-based methods for detecting protein–ligand binding sites play a crucial role in various domains, from fundamental research to biomedical applications. However, current prediction methodologies often rely on holo (ligand-bound) protein conformations for training and evaluation, overlooking the significance of the apo (ligand-free) states. This oversight is particularly problematic in the case of cryptic binding sites (CBSs) where holo-based assessment yields unrealistic performance expectations.

Results

To advance the development in this domain, we introduce CryptoBench, a benchmark dataset tailored for training and evaluating novel CBS prediction methodologies. CryptoBench is constructed upon a large collection of apo–holo protein pairs, grouped by UniProtID, clustered by sequence identity, and filtered to contain only structures with substantial structural change in the binding site. CryptoBench comprises 1107 structures with predefined cross-validation splits, making it the most extensive CBS dataset to date. To establish a performance baseline, we measured the predictive power of sequence- and structure-based CBS residue prediction methods using the benchmark. We selected PocketMiner as the state-of-the-art representative of the structure-based methods for CBS detection, and P2Rank, a widely-used structure-based method for general binding site prediction that is not specifically tailored for cryptic sites. For sequence-based approaches, we trained a neural network to classify binding residues using protein language model embeddings. Our sequence-based approach outperformed PocketMiner and P2Rank across key metrics, including area under the curve, area under the precision-recall curve, Matthew’s correlation coefficient, and F1 scores. These results provide baseline benchmark results for future CBS and potentially also non-CBS prediction endeavors, leveraging CryptoBench as the foundational platform for further advancements in the field.

Availability and implementation

The CryptoBench dataset, including the benchmark model, is available on Open Science Framework—https://osf.io/pz4a9/. The code and tutorial are available at the GitHub repository—https://github.com/skrhakv/CryptoBench/.

1 Introduction

Proteins serve as the molecular workhorses of living organisms, executing a myriad of functions essential for life. Their three-dimensional structure, which serves as the blueprint for their biochemical activities, is central to the functionality of proteins. Protein structure embodies a dynamic landscape characterized by conformational flexibility and adaptability in response to environmental triggers. In this context, ligand binding sites, regions where a protein binds to its interacting partners, deserve special attention. Specifically, cryptic binding sites (CBSs) are ligand binding sites that are not readily apparent or accessible in their ligand-free (apo) state but become exposed due to external triggers, thus enabling the binding of a ligand and forming the ligand-bound (holo) state. Therefore, CBSs can be loosely defined as sites identifiable in the ligand-bound but not in the unbound structure (Vajda et al. 2018).

Due to their importance as potential targets for drug discovery and protein engineering, binding sites, in general, have gathered considerable attention, resulting in the development of a wide range of binding site prediction methods (Zhao et al. 2020). As the structure primarily drives the interaction, the structure-based approaches are typically considered superior to sequence-based methods. However, structure-based approaches are fallible with respect to protein conformational flexibility as they are developed/trained to recognize certain structural characteristics. For instance, the authors of P2Rank (Krivák and Hoksza 2018) quantified the effect of different features on the prediction quality, showing that the biggest effect by far comes from the protrusion of surface regions. The reliance on structural properties becomes problematic in the case of CBSs as their shape can differ significantly between the apo and holo forms, resulting in decreased performance when measured on the apo form of the proteins (Ehrt 2019, Škrhák et al. 2023). This observation led to the development of methods specialized in predicting CBSs. These methods are based on molecular dynamics (Kuzmanic et al. 2020, Martinez-Rosell et al. 2020, Smith and Carlson 2021, Zheng 2021, Egbert et al. 2022), machine learning (ML) (Cimermancic et al. 2016, Škrhák et al. 2023) or a combination of both (Meller et al. 2023). A commonly used dataset to evaluate CBS prediction approaches is the one introduced with the CryptoSite method (Beglov et al. 2018). The CryptoSite dataset, however, faces several shortcomings: (i) size, (ii) definition of crypticity, and (iii) lack of pockets with substantial conformational changes. Regarding size, the CryptoSite dataset comprises only 93 cryptic binding pockets. As for the second point, the CryptoSite dataset construction methodology utilizes two other protein–ligand prediction tools, FPocket and ConCavity, and declares a pocket cryptic when detected in its holo but not in its apo form by these tools. A concern with such an approach is that biases within these predictors can leak into the dataset and, therefore, propagate if the dataset is further employed for testing or training new software. Finally, the CryptoSite dataset includes a relatively small number of cryptic pockets requiring large conformational changes (Meller et al. 2023), leading to a potential underrepresentation of such pockets within the dataset. The recently introduced PocketMiner dataset (Meller et al. 2023) does not suffer from the possibility of introducing biases and also comprises apo–holo pairs with large structural rearrangements but comes with a significant trade-off in size, containing only 39 cryptic pockets.

As shown above, the current datasets are quite small, which is understandable considering that building a dataset on a large scale is a complex endeavor. It requires identifying apo–holo pairs for several ligand binding sites across the entire PDB, classifying them as cryptic or non-cryptic (i.e. by using a suitable metric), and selecting a representative apo candidate in the presence of multiple. However, without such a dataset, a substantial volume of cryptic pockets within our observed proteome remains unexplored and underrepresented. To address these issues, we present CryptoBench, the most comprehensive dataset of CBSs so far. In this work, we understand a binding site to be cryptic when there is a significant structural change between the apo and holo form (for motivation and definition, see the Dataset Construction section, and for further discussion, refer to the Discussion section). By introducing CryptoBench, we aim to expand the coverage of CBSs from protein structures available in the PDB (wwPDB Consortium 2019) thereby facilitating the detection of CBSs, a promising strategy for drug repurposing (Wakefield et al. 2022).

2 Materials and methods

2.1 Dataset construction

The assembled dataset of CBSs is built on top of AHoJ-DB (Feidakis et al. 2024), a database of precomputed apo–holo results of the AHoJ tool (Feidakis et al. 2022). AHoJ-DB links apo and holo states of a ligand binding site, which are derived from different structures in the PDB. AHoJ defines the binding site by considering protein residues within a user-defined distance threshold (default 4.5 Å) from the atoms of the specified ligand.

At the time of writing (August 2024), AHoJ-DB featured apo–holo and holo–holo pairs for 522 153 biologically relevant protein–ligand interactions (Zhang et al. 2024) across the entire PDB. AHoJ-DB is constructed by mapping the binding residues of each specified protein–ligand interaction across all other structures in the PDB that are assigned the same UniProt accession, and registering these candidate pockets as holo or apo depending on the presence or absence of bound ligands. Apo and holo structures are mapped residue-wise by linking the common binding residues by their UniProt sequence indices, thereby tracking the same pocket across multiple structures. Several metrics are reported for each pocket [including RMSD, volume, solvent accessible surface area (SASA), molecular surface] and the chains that comprise them. The resource features a substantial number of both holo–holo (∼43 million) and apo–holo pocket pairs (∼14 million) that are, however, not explicitly annotated as cryptic or non-cryptic. Furthermore, of the 522 153 holo pockets cataloged in AHoJ-DB, ∼53.6% (280 058 holo pockets) lack a corresponding apo structure. While some of these pockets may potentially be cryptic, without sufficient data for comparison of the apo and holo form, we cannot classify them as such.

Although, for a single protein–ligand structure, AHoJ-DB includes all apo and holo structures, when assembling the dataset, we considered only the apo states, ie, disregarding the alternative holo structures. Subsequently, we filtered the apo–holo pairs down by keeping only pockets from structures with a resolution of 2.5 Å or higher. Further, we refined the results by only keeping apo pockets where the number of binding residues is equal in both apo and holo states. After conducting these steps, the filtered subset of AHoJ-DB totaled 4 683 968 pairs, each representing a single binding pocket in its apo and holo state.

2.1.1 Selecting a suitable crypticity metric

To differentiate cryptic and non-cryptic apo–holo pairs, a suitable metric to distinguish between regular and CBSs has to be selected. Such a metric can then be utilized to extract a dataset comprising solely cryptic pockets from the broader AHoJ-DB dataset. The primary metrics we considered included pocket SASA (Lee and Richards 1971), pocket molecular surface (Richards 1977), pocket volume (Smith et al. 2019), and all-atom pocket RMSD (see Supplementary Information for the difference between SASA and molecular surface). All the metrics were evaluated exclusively within the scope of the pocket, i.e. the metrics were computed using only the pocket residues. These metrics were chosen for their potential to capture the dynamic nature of cryptic pockets, as opposed to the static nature of regular pockets within the apo–holo pairs. The metric values for each apo–holo pair were extracted from AHoJ-DB.

To identify the most appropriate metric and its threshold for dataset filtering, we conducted a comparative analysis between two datasets: one extracted from AHoJ-DB (a subset of AHoJ-DB was used—records with low resolution (>2.5 Å) or with pockets containing unobserved residues were filtered out; that is, the subset consisted of 4 683 968 apo–holo pairs), representing a general dataset containing a mixture of regular and potentially CBSs, and the other being the PocketMiner test dataset (Meller et al. 2023), which contained 38 hand-picked structures with exclusively CBSs. While the metrics values for the records from AHoJ-DB were already precomputed, we additionally computed the metrics values for the records from the PocketMiner test dataset. Subsequently, the metric values for every pocket in both datasets were analyzed to assess their suitability for distinguishing between regular and CBSs. The resulting violin plots in Fig. 1 illustrate the behavior of each metric within the context of these datasets. It is important to note that while molecular surface, SASA, and volume yield two values for each pair—one for apo and the other for holo—the RMSD yields only a single value for each pair. Therefore, in the context of the violin plots, while RMSD values were used as they were, the molecular surface, SASA, and volume values were computed as the difference between the apo and holo values, normalized over the number of pocket residues (pocket length):

difference = \frac{apo_value - holo_value}{pocket_length}

Figure 1.

The distribution of four metrics—pocket SASA, pocket molecular surface, pocket volume, and pocket RMSD—across two datasets: one representing cryptic-only binding sites (PocketMiner test dataset) and the other representing general binding sites (AHoJ-DB dataset). Overall, the violin plots for pocket molecular surface largely overlap, indicating similar distributions between the two datasets. In contrast, the violin plots for pocket volume and pocket SASA show partial overlap, suggesting some differences in distribution between cryptic and general binding sites. Notably, the violin plots for pocket RMSD exhibit nearly no overlap, with a clear borderline between 1.5 and 2 Å.

Open in new tab Download slide

As observed from the violin plots, the molecular surface exhibits significant overlap in its distributions, suggesting it is not practical for distinguishing between cryptic and regular binding sites. While there is less overlap in the distributions of pocket SASA and pocket volume, a clear boundary between the values of cryptic and regular datasets is not apparent. Finally, pocket RMSD shows promising results, with values for the general dataset typically not exceeding 1.5 Å, and values for the cryptic dataset generally remaining above 2 Å.

Thus, out of the considered metrics, pocket RMSD seems to be the most effective metric for distinguishing between cryptic and regular binding sites. Given the insights provided by the violin plots in Fig. 1, we established pocket RMSD bigger than 2 Å as a suitable threshold for differentiating between cryptic and regular binding sites. By selecting the upper bound of the borderline interval [1.5,2] Å, we aimed to minimize the risk of unintentionally including regular binding sites in the cryptic dataset, particularly when filtering a general dataset to create a cryptic one.

To further evaluate the selected threshold, we looked at the RMSD values within the PocketMiner dataset of CBSs. We found that 32 out of 38 pairs have higher than 2 Å RMSD and are, therefore, in agreement with our threshold. Furthermore, we manually inspected the remaining six pairs with RMSD between 1.35 and 1.93Å. There were four structure pairs with larger rearrangements of the binding sites—see, e.g. the surface representation of the 4i92 (apo) and 4i94 (holo) structure pair in Fig. 2. The substantial part of the binding sites is not present in the apo form due to the conformational change of side chains of Asn71 and Arg186. There are, however, also two structure pairs where the conformational change is less substantial (4w51-4w58, 3ppn-3ppr)—see Supplementary Data for examples. However, the degree to which these binding sites should be classified as cryptic remained uncertain. Ultimately, these findings indicated that although our selected threshold can miss some CBSs, our decision to set the threshold at 2 Å is reasonable.

Figure 2.

A comparison of the apo and holo CBS. (A) The holo structure of plant pseudokinase binding AMP (4i94) in surface representation. (B) The apo structure of the same protein (4i92) in surface representation—the right part of the binding site is altered.

Open in new tab Download slide

From the previous discussion follows the definition of a CBS for the purpose of the CryptoBench dataset. The CBS refers to a region in a protein that can bind a ligand and undergoes a significant structural change between its holo (ligand-bound) and apo (unbound) forms. A significant change is defined as a difference of at least 2 Å RMSD between the binding residues in the apo and holo forms. Binding residues are defined as those located within a 4.5 Å radius from the ligand in the holo form. The corresponding binding residues in the apo form are identified by mapping the holo-form binding residues onto the apo structure using UniProt mapping.

2.1.2 Apo–holo pairs pool filtering

Considering the lack of a consensus definition for binding site crypticity, and the inclusivity of AHoJ-DB in terms of conformational changes that occur within a given protein (UniProt ID), it is important to establish not only a minimum threshold for detecting significant conformational changes in the binding site (ie, crypticity), but also an upper limit to ensure that (i) both the global structure of the protein, beyond the binding site, remains relatively stable and recognizable between the two states, and (ii) the relative position of the binding site does not deviate significantly between the two states. While an argument can be made for the inclusion of any structure of the same UniProt, regardless of the extent of its global conformational changes that can extend to disorder-to-order transitions or domain swaps and hinge-like motions, often the difference between the global similarity (i.e. TM-score) of structures of the same sequence, can be larger than that of different sequences, or of the minimum recommended TM-score of 0.5 for structures that “assume generally the same fold in SCOP/CATH” (Xu and Zhang 2010). Such differences could also prove problematic during visual inspection where there is no visible resemblance between the global structures of the apo and holo states, but perhaps more importantly, the scope of the CBS can be overstretched or lost when the structural changes span the entire protein chain. In addition, we choose to exclude cases where the compactness of the binding site changes significantly on account of a disordered or unfolded state compared to the typically well-defined and compact holo state. To construct the cryptic dataset, the original 4 683 968 apo–holo pairs retrieved from AHoJ-DB were processed as follows:

resolution filtering: All records with a resolution worse than 2.5 Å were filtered out; on top of that, all records where pocket length in the apo state and holo state did not match were filtered out,
geometric quality assurance filtering: to ensure that the conformational changes are restricted to the binding site, we establish a threshold for the minimum accepted global similarity between the two states of the protein chain that comprises the binding site, achieved by a minimum TM-score of 0.5. Also, to filter out domain swaps and large intrachain motions, we establish a maximum distance threshold of 4 Å between the centers of apo and holo binding sites, after the global structural alignment. Furthermore, we establish a threshold for the allowed change in the level of compactness of the binding site between the apo and holo states, by allowing up to a 20% change in the radius of gyration from the holo state. Lastly, at least 50 observed residues from the protein must overlap with its UniProt sequence. All metrics were taken from AHoJ-DB,
pocket RMSD filtering: records with pocket RMSD below 2 Å were filtered out,
ligand filtering: similarly to P2Rank (Krivák and Hoksza 2018), we excluded ligands where the number of atoms is less than 5. Furthermore, the name of the PDB group is not on the list of ignored groups: HOH, DOD, WAT, UNK, ABA, MPD, GOL, SO4, PO4 (the original P2Rank ignored group list also included sugars MAN, GLC, and NAG. However, these sugars are biologically relevant, as evidenced by cases like the 1esw-5jiw apo–holo pair involved in starch synthesis. Additionally, other studies like AlphaFold3 (Abramson et al. 2024) chose to include these sugars),
clustering: unique UniProt sequences were clustered based on 40% similarity (Steinegger and Söding 2017),
selection of representatives: from each cluster from the previous step, we selected one representative apo–holo pair based on maximal pocket RMSD, aiming to include pairs with the most significant structural changes,
searching for additional pockets: considering that one apo structure can contain multiple cryptic pockets or a single cryptic pocket may bind multiple types of ligands, we conducted a second search within the output of the pocket RMSD filtering phase to identify these pockets and include them into the dataset. Therefore, within our dataset, one apo structure can be paired with more than one holo structures.

The number of apo–holo pairs outputted by each filtering phase summarizes Table 1.

Table 1.

Open in new tab

The number of apo–holo pairs left after each filtering phase.

	Number of remaining apo–holo pairs	Remaining size % compared to initial size
AHoJ-DB	14 054 029	100%
resolution filtering	4 683 968	33.328%
geometric quality assurance filtering	4 423 220	31.473%
pocket RMSD filtering	221 026	1.573%
ligand filtering	171 876	1.223%
clustering & selection of representatives	1107	0.008%
additional pockets search	5493	0.039%

	Number of remaining apo–holo pairs	Remaining size % compared to initial size
AHoJ-DB	14 054 029	100%
resolution filtering	4 683 968	33.328%
geometric quality assurance filtering	4 423 220	31.473%
pocket RMSD filtering	221 026	1.573%
ligand filtering	171 876	1.223%
clustering & selection of representatives	1107	0.008%
additional pockets search	5493	0.039%

The last column shows impact of each filter compared to the initial size in percentages.

Table 1.

Open in new tab

The number of apo–holo pairs left after each filtering phase.

	Number of remaining apo–holo pairs	Remaining size % compared to initial size
AHoJ-DB	14 054 029	100%
resolution filtering	4 683 968	33.328%
geometric quality assurance filtering	4 423 220	31.473%
pocket RMSD filtering	221 026	1.573%
ligand filtering	171 876	1.223%
clustering & selection of representatives	1107	0.008%
additional pockets search	5493	0.039%

	Number of remaining apo–holo pairs	Remaining size % compared to initial size
AHoJ-DB	14 054 029	100%
resolution filtering	4 683 968	33.328%
geometric quality assurance filtering	4 423 220	31.473%
pocket RMSD filtering	221 026	1.573%
ligand filtering	171 876	1.223%
clustering & selection of representatives	1107	0.008%
additional pockets search	5493	0.039%

The last column shows impact of each filter compared to the initial size in percentages.

3 Results

3.1 Dataset structure

Practically, the resulting dataset comprises a list of apo structures identified by their apo PDB ID, with each PDB ID associated with one or more cryptic pockets. Each cryptic pocket represents a single record in the dataset. Within the context of a single apo structure, each pocket may correspond to a different holo structure and a different holo chain as well. Consequently, each pocket in the apo structure is characterized by the PDB identifier of the holo structure, chain identifier, ligand name, ligand residue number (to distinguish between different ligands of the same type), and both apo and holo pocket residue selections.

3.2 Statistics

The apo–holo pairs may contain multiple CBSs, each with its unique set of residues. For the sake of statistics, to differentiate between distinct binding sites within a single apo structure, we applied a criterion where two binding sites are considered separate if their residues have less than a 75% overlap. Similarly, we identified CBSs capable of binding more than one type of ligand—promiscuous pockets. Using the same 75% threshold, we classified a pocket as promiscuous if it overlapped by more than 75% with another pocket binding different ligands. Next, we counted the number of CBSs spanning more than one chain. The statistics are shown in Table 2 together with a comparison of the PocketMiner and CryptoSite datasets. Lastly, the AHoJ-DB was scanned one more time to find non-CBSs for the CryptoBench apo structures. We consider non-CBSs as those that do not meet the crypticity criterion, ie with pocket RMSD below the 2 Å threshold. The statistics for these sites can be found in Table 3.

Table 2.

Open in new tab

Comparison between CryptoBench, PocketMiner, and CryptoSite datasets.

Dataset	CryptoBench	PocketMiner	CryptoSite
apo structures	1107	38	93
cryptic pockets	1361	39	98
avg. pocket RMSD	2.89 ± 0.87	3.46 ± 1.99	2.65 ± 2.36
avg. # of binding residues per protein	16.60 ± 7.22	22.92 ± 8.17	17.88 ± 7.59
avg. # of observed residues per protein^a	290.34 ± 154.00	288.37 ± 140.63	322.48 ± 179.73
promiscuous pockets	371	0	0
multi-chain pockets	197	0	0

Dataset	CryptoBench	PocketMiner	CryptoSite
apo structures	1107	38	93
cryptic pockets	1361	39	98
avg. pocket RMSD	2.89 ± 0.87	3.46 ± 1.99	2.65 ± 2.36
avg. # of binding residues per protein	16.60 ± 7.22	22.92 ± 8.17	17.88 ± 7.59
avg. # of observed residues per protein^a	290.34 ± 154.00	288.37 ± 140.63	322.48 ± 179.73
promiscuous pockets	371	0	0
multi-chain pockets	197	0	0

For multiple-chain records, each chain was considered separately to calculate the average number of observed residues.

Table 2.

Open in new tab

Comparison between CryptoBench, PocketMiner, and CryptoSite datasets.

Dataset	CryptoBench	PocketMiner	CryptoSite
apo structures	1107	38	93
cryptic pockets	1361	39	98
avg. pocket RMSD	2.89 ± 0.87	3.46 ± 1.99	2.65 ± 2.36
avg. # of binding residues per protein	16.60 ± 7.22	22.92 ± 8.17	17.88 ± 7.59
avg. # of observed residues per protein^a	290.34 ± 154.00	288.37 ± 140.63	322.48 ± 179.73
promiscuous pockets	371	0	0
multi-chain pockets	197	0	0

Dataset	CryptoBench	PocketMiner	CryptoSite
apo structures	1107	38	93
cryptic pockets	1361	39	98
avg. pocket RMSD	2.89 ± 0.87	3.46 ± 1.99	2.65 ± 2.36
avg. # of binding residues per protein	16.60 ± 7.22	22.92 ± 8.17	17.88 ± 7.59
avg. # of observed residues per protein^a	290.34 ± 154.00	288.37 ± 140.63	322.48 ± 179.73
promiscuous pockets	371	0	0
multi-chain pockets	197	0	0

For multiple-chain records, each chain was considered separately to calculate the average number of observed residues.

Table 3.

Open in new tab

Statistics for non-CBSs within CryptoBench.

Dataset	CryptoBench: non-cryptic pockets
non-cryptic pockets	1445
avg. # of binding residues per protein	11.72 ± 6.87
promiscuous pockets	410
multi-chain pockets	74

If the non-cryptic pocket significantly overlapped with the cryptic pockets (>75%), it was excluded from the statistics.

Table 3.

Open in new tab

Statistics for non-CBSs within CryptoBench.

Dataset	CryptoBench: non-cryptic pockets
non-cryptic pockets	1445
avg. # of binding residues per protein	11.72 ± 6.87
promiscuous pockets	410
multi-chain pockets	74

If the non-cryptic pocket significantly overlapped with the cryptic pockets (>75%), it was excluded from the statistics.

The conservation score (Jakubec et al. 2019) for cryptic and non-CBSs was retrieved from PDBe API (Varadi et al. 2020). It was revealed that both cryptic and non-cryptic binding residues show higher average conservation scores than non-binding residues. Furthermore, cryptic binding residues exhibit higher conservation scores than non-cryptic binding residues, see Table 4.

Table 4.

Open in new tab

Comparison of average conservation score between cryptic binding residues, non-cryptic binding residues and non-binding residues within CryptoBench.

	Avg. conservation score
Cryptic binding residues	0.997 ± 1.241
Non-cryptic binding residues	0.613 ± 1.034
Non-binding residues	0.414 ± 0.781

Table 4.

Open in new tab

Comparison of average conservation score between cryptic binding residues, non-cryptic binding residues and non-binding residues within CryptoBench.

	Avg. conservation score
Cryptic binding residues	0.997 ± 1.241
Non-cryptic binding residues	0.613 ± 1.034
Non-binding residues	0.414 ± 0.781

According to a gene ontology analysis with PANTHER (Thomas et al. 2022), the vast majority of the 618 classified genes in CryptoBench (out of 1117 total) are enzymes with catalytic activity (52.1%) such as transferases and ligand binding proteins (34.1%). For a more detailed view and comparison between the entire AHoJ-DB and CryptoBench, gene ontology analysis graphs are provided in the Supplementary Information.

3.3 CryptoBench benchmark

To be able to use the dataset for fair validation of ML-based prediction approaches, we pre-defined the train-test splits of the CryptoBench dataset. As shown in another study, the methods’ performance and superiority might be influenced by how K-fold splits are defined (Škoda and Hoksza 2016). Therefore, we also established the K-fold splits for the train set. Furthermore, unlike datasets with other media, such as images, protein datasets are vulnerable to information leakage between splits if not carefully managed (AlQuraishi 2019). To mitigate the risk of including homologous proteins in different splits, causing information leakage, we conducted another round of clustering on the dataset. This round of clustering utilized a threshold of 10% sequence identity to ensure minimal relation between records from each data split (Steinegger and Söding 2017). These clusters were then joined into the splits forming the resulting CryptoBench benchmark.

We used an 80:20 ratio for the train-test split, resulting in a test set with 222 apo structures and a train set with 885 apo structures. The train set was further divided into 4 folds, with three folds containing 222 structures each, and one fold containing 219 structures.

3.3.1 Baseline evaluation

To validate the dataset’s usability, we used it to evaluate one sequence-based and one structure-based method for detecting cryptic binding residues. We used PocketMiner (Meller et al. 2023) as the representative of structure-based methods, which, according to the presented experiments (Meller et al. 2023), has demonstrated superior performance over the previous state-of-the-art tool, CryptoSite (Cimermancic et al. 2016). For the sequence baseline, we implemented our own neural network architecture utilizing a protein language model (pLM-NN), as a similar architecture proved promising when evaluated on the CryptoSite dataset (Škrhák et al. 2023). Finally, we included P2Rank in the comparison as a representative of non-CBS-specific methods, which has been shown to perform quite well for CBS detection (Ehrt 2019, Škrhák et al. 2023).

It should be emphasized that PocketMiner, P2Rank and the pLM-NN models were trained on different datasets. While the pLM-NN model was trained on CryptoBench, we used the model available on the project GitHub page for PocketMiner without any retraining or fine-tuning, as the repository does not provide a documented way of retraining/tuning the model on new structures and labels. Similarly, we utilized P2Rank’s off-the-shelf pretrained prediction model, which was trained on P2Rank’s own training dataset, without any further adjustment. This might skew the performance of PocketMiner and P2Rank as more data might be available for training. On the other hand, we did not control for possible data leakage between PocketMiner/P2Rank training and CryptoBench test sets. Notably, in the PocketMiner training set of 37 apo structures, three apo structures have >95% sequence similarity with apo structures in the CryptoBench test set.

3.3.1.1 Protein language model classifier implementation

The implemented method is purely sequence-based, as the input for the pLM-NN consisted solely of protein embeddings from the ESM2-3B model (Lin et al. 2022, 2023), generated from whole UniProt sequences. The ESM2-3B model generates an embedding of size 2560 for each sequence residue. Although the embeddings were computed from the whole sequence, only those embeddings corresponding to the residues observed in the structure were kept for training and evaluation.

The residue labels, indicating whether a particular residue is part of a cryptic site, were generated by merging all available cryptic pockets. Therefore, if a protein structure contains multiple pockets, all of them are incorporated into the labeling, resulting in a binary classification problem where each residue is either binding or non-binding.

Figure 3 illustrates the classification process. First, embeddings were acquired from the ESM2-3B model using the entire UniProt sequence. Subsequently, only the embeddings corresponding to observed residues were kept, while unobserved residues or those not present in the PDB structure were discarded. These filtered embeddings served as input for the pLM-NN. The network outputs the probability of a residue being part of a CBS. Therefore, the pLM-NN yields results only for the observed residues, with the unobserved residues colored grey in the diagram.

Figure 3.

Overview of the pLM-NN. Grey coloring depicts unobserved residues. The prediction was not made for the unobserved residues, as the embeddings for unobserved residues were discarded for the sake of fair comparison with structure-based methods.

Open in new tab Download slide

To determine the optimal architecture of the pLM-NN, SklearnTuner (https://github.com/keras-team/keras-tuner) was used to perform the cross-validated hyperparameter search using the predefined 4-fold train split of the training subset. The final architecture was composed of 3 layers. The first two layers contained 256 neurons and employed an L2 regularizer with ReLU activation. Also, a dropout rate equal to 0.3 was applied to these two layers. The third layer consisted of 2 neurons and used softmax activation. Overall, the setup yielded 721 922 parameters. The binary cross-entropy loss function was used and optimized using the Adam optimizer, with a learning rate set to 1e−04. The training was executed for 7 epochs, with a batch size 2048. The number of epochs was determined from a separate training run using one train fold as a validation set. A second run using a validation set was conducted to determine the decision threshold, which was set at 0.95. The decision threshold was selected based on the F1 metric.

3.3.1.2 Results

It should be emphasized that the pLM-NN utilized solely sequence information and training was conducted exclusively using observed residues. This was done to ensure a fair comparison between the trained pLM-NN and PocketMiner, which is a structure-based method, therefore, it cannot evaluate unobserved residues. Both methods output a probability of a residue being cryptic. For pLM-NN, we applied the 0.95 threshold. For PocketMiner, we used the 0.7 threshold recommended by its authors (Meller et al. 2023). As for P2Rank, it directly provides a binary classification of residues into positive or negative classes, so no additional thresholding was necessary.

Using the aforementioned binary classification, true positive rate (TPR), false positive rate (FPR), F1 score (F1), accuracy (ACC), and Matthew’s correlation coefficient (MCC) were calculated for each prediction method. In the case of the area under the curve (AUC), the probabilities were utilized directly to construct the receiver operating characteristics curve from which AUC was computed. Similarly, for the area under the precision-recall curve (AUPRC), the probabilities were used to construct the precision-recall curve, and the AUPRC value was calculated from the area under this curve.

PocketMiner encountered prediction errors for 22 structures from the test subset, leading to their exclusion from the evaluation. Additionally, PocketMiner cannot make predictions for multi-chain structures, which resulted in another 38 structures being removed from the test set. Thus, in total, 60 structures were excluded during PocketMiner evaluation, representing more than a quarter of the entire test set. Furthermore, during the evaluation of P2Rank, only single-chain structures were used. Consequently, three rounds of pLM-NN evaluation were conducted: the first using the entire test set to establish the benchmark (in the first round of the evaluation, for structures with multiple chains, each chain was treated as an individual entity during the process), the second for a direct comparison with PocketMiner, excluding the structures where PocketMiner was unable to make predictions, and the third for a direct comparison with P2Rank, excluding only structures with pockets spanning more than one chain.

The respective values are reported in Table 5. Remarkably, the fairly straightforward approach utilizing the train set from the CryptoBench dataset combined with the embeddings from the ESM2-3B model resulted in a neural network that matches or even surpasses the performance of the PocketMiner tool on the CryptoBench test set. Particularly, the neural network significantly outperforms PocketMiner in key metrics such as AUC and AUPRC, which are independent of the decision threshold selection.

Table 5.

Open in new tab

Performance of the benchmark method, PocketMiner, and P2Rank was evaluated across different subsets of the CryptoBench test set.

Method	Dataset	AUC	AUPRC	ACC	FPR	TPR	MCC	F1 Score
pLM-NN	CB-full	0.86	0.36	0.93	0.05	0.48	0.39	0.92
pLM-NN	CB-PM	0.88	0.43	0.93	0.04	0.52	0.44	0.93
PocketMiner	CB-PM	0.76	0.19	0.82	0.16	0.51	0.22	0.78
pLM-NN	CB-P2RANK-apo	0.88	0.42	0.93	0.04	0.51	0.43	0.93
P2RANK	CB-P2RANK-apo	0.81	0.21	0.85	0.14	0.62	0.27	0.81
P2RANK	CB-P2RANK-holo	0.89	0.34	0.85	0.15	0.84	0.38	0.81

Method	Dataset	AUC	AUPRC	ACC	FPR	TPR	MCC	F1 Score
pLM-NN	CB-full	0.86	0.36	0.93	0.05	0.48	0.39	0.92
pLM-NN	CB-PM	0.88	0.43	0.93	0.04	0.52	0.44	0.93
PocketMiner	CB-PM	0.76	0.19	0.82	0.16	0.51	0.22	0.78
pLM-NN	CB-P2RANK-apo	0.88	0.42	0.93	0.04	0.51	0.43	0.93
P2RANK	CB-P2RANK-apo	0.81	0.21	0.85	0.14	0.62	0.27	0.81
P2RANK	CB-P2RANK-holo	0.89	0.34	0.85	0.15	0.84	0.38	0.81

In the first evaluation round, the benchmark method was assessed using the full CryptoBench test set (CB-full). In the second round, both the benchmark method and PocketMiner were evaluated on a subset that included only structures for which PocketMiner did not fail (CB-PM); see Supplementary Material for details. In the third round, the benchmark method and P2rank were tested on a subset consisting solely of single-chain apo structures (CB-P2RANK-apo). Lastly, P2Rank was evaluated on holo structures (CB-P2RANK-holo), which are the counterparts of the apo structures from the CB-P2RANK-apo subset. The comparison between P2Rank’s performance on CB-P2RANK-apo and CB-P2RANK-holo highlights the performance drop when identifying CBSs using a method not specialized for detecting such sites. The F1 score was computed using a weighted average.

Table 5.

Open in new tab

Performance of the benchmark method, PocketMiner, and P2Rank was evaluated across different subsets of the CryptoBench test set.

Method	Dataset	AUC	AUPRC	ACC	FPR	TPR	MCC	F1 Score
pLM-NN	CB-full	0.86	0.36	0.93	0.05	0.48	0.39	0.92
pLM-NN	CB-PM	0.88	0.43	0.93	0.04	0.52	0.44	0.93
PocketMiner	CB-PM	0.76	0.19	0.82	0.16	0.51	0.22	0.78
pLM-NN	CB-P2RANK-apo	0.88	0.42	0.93	0.04	0.51	0.43	0.93
P2RANK	CB-P2RANK-apo	0.81	0.21	0.85	0.14	0.62	0.27	0.81
P2RANK	CB-P2RANK-holo	0.89	0.34	0.85	0.15	0.84	0.38	0.81

Method	Dataset	AUC	AUPRC	ACC	FPR	TPR	MCC	F1 Score
pLM-NN	CB-full	0.86	0.36	0.93	0.05	0.48	0.39	0.92
pLM-NN	CB-PM	0.88	0.43	0.93	0.04	0.52	0.44	0.93
PocketMiner	CB-PM	0.76	0.19	0.82	0.16	0.51	0.22	0.78
pLM-NN	CB-P2RANK-apo	0.88	0.42	0.93	0.04	0.51	0.43	0.93
P2RANK	CB-P2RANK-apo	0.81	0.21	0.85	0.14	0.62	0.27	0.81
P2RANK	CB-P2RANK-holo	0.89	0.34	0.85	0.15	0.84	0.38	0.81

Although PocketMiner achieves fairly competitive FPR values at the decision threshold set to 0.7, it also exhibits reduced TPR values. However, as shown in Fig. 4, adjusting the decision thresholds to increase TPR also causes the FPR to rise steeply. This is particularly problematic for cryptic pocket predictions, given the imbalance between the number of binding and non-binding residues (as can be observed in Table 2—on average, the binding residues only correspond to less than 5% of all residues in the whole protein). As a result, when selecting different decision thresholds, the high FPR combined with the imbalance between binding and non-binding residues could cause the number of false positives to significantly outnumber the true positives, which may reduce the usefulness of PocketMiner’s predictions.

Figure 4.

The performance of the benchmark method (pLM-NN) and PocketMiner on the test subset (CB-PM) illustrated by the ROC and PR curves. Both curves were constructed utilizing the probabilities generated by the neural network, as opposed to the statistics from Table 5, which were mostly computed using binary classified values of 0 and 1.

Open in new tab Download slide

Although not specifically designed for CBS detection, P2Rank shows competitive performance on apo structures within the CryptoBench test set, a pattern consistent with findings from other studies (Ehrt 2019, Škrhák et al. 2023) on different CBS datasets. However, it does not surpass the benchmark method (pLM-NN), as shown in Table 5. As a validation step, P2Rank was also evaluated on holo structures to assess the performance difference of a non-CBS-specific method between apo and holo structures (any apo structure in CryptoBench can be associated with more than one holo structure - in the case of evaluating P2Rank on holo structures, a single holo structure needed to be selected for a fair comparison; therefore, holo structure with the largest pocket RMSD was selected, see Supplementary Information). In the case of structure-based methods, predicting CBSs is generally less challenging on holo structures, as their pockets are more clearly defined compared to their apo counterparts and align better with the holo-based training sets. Therefore, it is not surprising that P2Rank performs better on holo structures than on apo structures, a trend also observed in the aforementioned studies (Ehrt 2019, Škrhák et al. 2023) and also reflected in the results from the CryptoBench test set, as shown in Table 5 (comparing P2Rank performance on CB-P2RANK-apo versus CB-P2RANK-holo dataset).

4 Discussion

4.1 Crypticity definition

A universally accepted definition of what constitutes a CBS does not exist, as discussed in detail in Vajda et al. (2018). Typically, a pocket is considered cryptic if it binds a ligand in one form (holo) but not in another (apo). The RMSD-based crypticity criterion, as detailed in section Dataset Construction, aligns well with this definition since a large structural rearrangement (either closing or opening) is likely to prevent ligand binding. However, this criterion may fail to detect CBSs with small changes, such as a minor side chain rotation. While such modifications might prevent ligand binding, the overall site may not appear significantly altered. The previously introduced Cryptosite dataset has 48 structure pairs with a pocket RMSD smaller than 2 Å with the smallest difference as small as 0.43 Å. Figures 5 and 6 give two examples from the original Cryptosite dataset. Figure 5 describes a structural pair where we do not see any substantial conformational change of the binding site, while Fig. 6 shows an example of Cryptosite structure pair, where pocket RMSD is around 1 Å (similar to the previous example), but there is a substantial conformational change due to a change in conformation of one of the binding residues side chains that will likely affect the ligand binding. Although our definition of a CBS might miss some genuine cryptic sites, the main focus of the benchmark is to minimize false positives (non-CBS regions labeled as CBS) rather than false negatives (CBS not labeled as CBS). Moreover, because our primary motivation for creating CryptoBench is to aid in the development of CBS prediction methods, a CBS definition that emphasizes large structural changes is appropriate. Detecting a binding site in an apo structure that significantly differs from the holo state is more challenging and thus aligns with our objectives. Finally, a crypticity criterion independent of the results of another prediction tool (such as a docking program) prevents the introduction of a bias or dependency on a certain set of parameters.

Figure 5.

Example of a Cryptosite pair with pocket RMSD smaller than 2 Å without a major change of the binding pocket. Bacterial esterase in the apo form (1qlw, left) and in the holo form (2wkw, right) with W22 ligand. Two structures were aligned and W22 from the holo structure is also shown in the aligned apo structure binding pocket to visualize the fit of the ligand into the apo structure. The binding site in both structures is almost identical.

Open in new tab Download slide

Figure 6.

Example of Cryptosite pair with pocket RMSD smaller than 2 Å with a major change of the binding pocket. Ricin in the apo form (1rtc, left) and in the holo form (1br6, right) with PT1 ligand. Two structures were aligned and PT1 from the holo structure is shown also in the aligned apo structure binding pocket to visualize the fit of the ligand into the apo structure. Tyr 80 sidechain conformation obstructs the ligand-binding site in the apo form.

Open in new tab Download slide

To visually showcase the CryptoBench dataset, we give two examples with different RMSDs—one with RMSD close to the inclusion threshold and the other with a clear difference between the apo and holo states. Figure 7 shows cobyrinic acid a, c diamne synthase with pocket RMSD 2.21 Å. The ANP ligand does not fit into the apo binding pocket due to a conformational change or residues 21–23. Figure 8 describes the difference of binding pockets of apo and holo structures of Cap-specific mRNA methyltransferase—the pocket RMSD of these structures is 3.03 Å. A loop consisting of binding site residues 277–280 has different conformation in apo structure resulting in reduced size of the binding site. For aligned cartoon visualizations of the aforementioned apo–holo pairs, refer to the Supplementary Material.

Figure 7.

Example of a CBS from CryptoBench dataset. Cobyrinic acid a, c diamide synthase in the apo form (4pfs, left) and in the holo form (5if9, right) with ATP analog ANP. Two structures were aligned, and ANP from the holo structure is shown also in the aligned apo structure binding pocket to visualize the fit of the ligand into the apo structure. Residues Gly 21 to Ala 23 extend into one more turn of a helix in apo structure while they turn into a loop in the holo structure. This extra turn of helix in apo structure occupies the ligand-binding site.

Open in new tab Download slide

Figure 8.

Example of a CBS from CryptoBench dataset. Cap-specific mRNA methyltransferase in the apo form (4n4a, left) and in the holo form (4n49, right) with SAM in the binding pocket. Two structures were aligned and SAM from the holo structure is shown also in the aligned apo structure binding pocket to visualize the fit of the ligand into the apo structure. The loop covering residues Ala 277 to Pro 280 has a different conformation in apo structure, which changes the binding pocket.

Open in new tab Download slide

4.2 Alternative crypticity metrics

During the creation of the dataset, other criteria were considered for inclusion in the filtering pipeline, in addition to those mentioned in the section apo–holo pairs pool filtering. Records with a pocket RMSD larger than 6 Å were manually analyzed to determine if an upper bound on pocket RMSD was necessary. After visual inspection, it was concluded that even pockets with a pocket RMSD greater than 6 Å are valid cryptic pockets; therefore, it was determined that an upper bound for pocket RMSD is unnecessary. Similarly, we considered setting an upper limit for the number of atoms in ligands to ensure the biological relevance of large ligands. We manually reviewed the list of apo–holo pairs where the number of heavy atoms in ligands exceeded 60. After visual inspection, we concluded that even large ligands remain biologically relevant, and therefore, we decided to keep them in the dataset.

Further, we have decided to keep even ligands that are covalently attached as they have proven to be relevant and successful in drug design (Singh et al. 2011).

4.3 Distinguishing cryptic and non-cryptic binding sites

Given a measure of crypticity that categorizes binding sites as cryptic or non-cryptic, a natural question arises: are there certain properties that differentiate cryptic and non-cryptic pockets and would thus allow for the classification of pockets as either cryptic or non-cryptic from the apo experimental structure? This is a different question from what CryptoBench addresses, as with CryptoBench, we are interested in whether a method is able to recognize a CBS, not to differentiate between a cryptic and regular binding site.

Crypticity plays apparently a significant role in structure-based methods, as these rely on the particular conformational state of the protein, leading to varying prediction performances on apo experimental structures for regular versus cryptic pockets. This is evident from the reduced performance of the P2Rank method on apo structures (Table 5). Thus, there clearly exists a structural signal that separates CBSs from regular ones. However, can this distinction also be observed at the sequence level, where apo and holo forms of the same protein are indistinguishable? If not, then distinguishing between cryptic and regular binding sites may be irrelevant for sequence-based prediction methods.

Our analysis reveals that cryptic binding residues tend to be more conserved (Table 4), hinting at the possibility of a sequence-based signal that distinguishes cryptic from non-cryptic binding residues. To explore this, we used the non-cryptic pocket subset of the CryptoBench dataset (nc-CB) and trained a neural network. This network utilized pLM embeddings of binding residues to classify them as cryptic or non-cryptic (see Supplementary Material Section S9 for details).

The results suggest that sequence information, as captured by pLM embeddings, provides a partial signal for distinguishing between cryptic and non-cryptic binding residues. However, the distinction is not perfect, as evidenced by the AUC of 0.68 (Supplementary Material Section S9). Importantly, the inability to achieve clear separation does not necessarily imply that the sequence lacks a strong signal. Improved separation might be attainable by enhancing either the prediction method or the dataset. For the method, fine-tuning the pLM or developing an end-to-end non-CBS versus CBS prediction approach could help. However, it is crucial to recognize that the data itself is inherently noisy. For non-cryptic pockets in the nc-CB dataset, the absence of an apo structure with sufficient conformational change does not definitively indicate that the pocket is rigid. In contrast, for cryptic pockets in CryptoBench, crypticity is confirmed by the observation of an apo–holo pair exhibiting significant conformational change.

Thus, there appears to be a sequence-level signal that distinguishes CBSs from regular ones, suggesting that considering this distinction could be relevant in the context of the development of purely sequence-based methods.

5 Conclusion

Leveraging the pocket RMSD metric, we curated the CryptoBench dataset, filtering apo–holo pairs and identifying cryptic pockets in PDB. The dataset’s structural diversity enables fair validation of ML models for CBS prediction. We also provided an evaluation benchmark comprising representatives of sequence- and structure-based prediction methods. We believe that CryptoBench will prove valuable to the scientific community, further catalyzing the development of more accurate and reliable methods for identifying both general and CBSs.

Acknowledgements

We would like to thank the reviewers for their valuable feedback and constructive suggestions, which have significantly contributed to improving this manuscript.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest: No competing interest is declared.

Funding

This work was supported by the Czech Science Foundation (GAČR) grant number 23-07349S, the ELIXIR CZ Research Infrastructure (ID LM2018131, MEYS CR), and the Charles University Project SVV number 260 698. Computational resources were provided by the e-INFRA CZ Project (ID: 90254), supported by the Ministry of Education, Youth and Sports of the Czech Republic.

References

Abramson

Adler

Dunger

et al.

Accurate structure prediction of biomolecular interactions with alphafold 3

Nature

2024

;

630

493

–

500

10.1038/s41586-024-07487-w

AlQuraishi

ProteinNet: A standardized data set for machine learning of protein structure

BMC Bioinformatics

2019

;

311

10.1186/s12859-019-2932-0

Beglov

Hall

Wakefield

et al.

Exploring the structural origins of cryptic sites on proteins

Proc Natl Acad Sci U S A

2018

;

115

E3416

–

10.1073/pnas.1711490115

Cimermancic

Weinkam

Rettenmaier

et al.

Cryptosite: expanding the druggable proteome by characterization and prediction of cryptic binding sites

J Mol Biol

2016

;

428

709

–

10.1016/j.jmb.2016.01.029

Egbert

Jones

Collins

et al.

Ftmove: a web server for detection and analysis of cryptic and allosteric binding sites by mapping multiple protein structures

J Mol Biol

2022

;

434

167587

Ehrt

Protein binding site comparison. PhD Thesis. Technische Universität Dortmund,

2019

Feidakis

Krivak

Hoksza

et al.

AHoJ-DB: A PDB-wide assignment of apo & holo relationships based on individual protein-ligand interactions

J Mol Biol

2024

;

436

168545

10.1016/j.jmb.2024.168545

Feidakis

Krivak

Hoksza

et al.

Ahoj: rapid, tailored search and retrieval of apo and holo protein structures for user-defined ligands

Bioinformatics

2022

;

5452

–

Jakubec

Vondrášek

Finn

RD.

3DPatch: fast 3D structure visualization with residue conservation

Bioinformatics

2019

;

332

–

10.1093/bioinformatics/bty464

Krivák

Hoksza

P2rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure

J Cheminform

2018

;

Kuzmanic

Bowman

Juarez-Jimenez

et al.

Investigating cryptic binding sites by molecular dynamics simulations

Acc Chem Res

2020

;

654

–

10.1021/acs.accounts.9b00613

Lee

Richards

The interpretation of protein structures: estimation of static accessibility

J Mol Biol

1971

;

379

–

400

10.1016/0022-2836(71)90324-X

Lin

Akin

Rao

et al.

Evolutionary-scale prediction of atomic-level protein structure with a language model

Science

2023

;

379

1123

–

10.1126/science.ade2574

Lin

Akin

Rao

et al. Language models of protein sequences at the scale of evolution enable accurate structure prediction. bioRxiv,

10.1101/2022.07.20.500902

, 2022, preprint: not peer reviewed

Martinez-Rosell

Lovera

Sands

et al.

Playmolecule crypticscout: predicting protein cryptic sites using mixed-solvent molecular simulations

J Chem Inf Model

2020

;

2314

–

10.1021/acs.jcim.9b01209

Meller

Ward

Borowsky

et al.

Predicting locations of cryptic pockets from single protein structures using the pocketminer graph neural network

Nat Commun

2023

;

1177

10.1038/s41467-023-36699-3

Richards

FM.

Areas, volumes, packing, and protein structure

Annu Rev Biophys Bioeng

1977

;

151

–

10.1146/annurev.bb.06.060177.001055

PMID: 326146.

Singh

Petter

Baillie

et al.

The resurgence of covalent drugs

Nat Rev Drug Discov

2011

;

307

–

Škoda

Hoksza

Benchmarking platform for ligand-based virtual screening. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), p.

1220

–

, Shenzhen, China: IEEE, December

2016

10.1109/BIBM.2016.7822693

Škrhák

Riedlova

Novotny

et al. Cryptic binding site prediction with protein language models. In: 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), p.

2883

–

, Los Alamitos, CA: IEEE Computer Society, December

2023

10.1109/BIBM58861.2023.10385497

Smith

Carlson

HA.

Identification of cryptic binding sites using mixmd with standard and accelerated molecular dynamics

J Chem Inf Model

2021

;

1287

–

10.1021/acs.jcim.0c01002

Smith

RHB

Dar

Schlessinger

Pyvol: a pymol plugin for visualization, comparison, and volume calculation of drug-binding sites. bioRxiv,

10.1101/816702

2019

, preprint: not peer reviewed.

Steinegger

Söding

Mmseqs2 enables sensitive protein sequence searching for the analysis of massive data sets

Nat Biotechnol

2017

;

1026

–

Thomas

Ebert

Muruganujan

et al.

Panther: making genome-scale phylogenetics accessible to all

Protein Sci

2022

;

–

Vajda

Beglov

Wakefield

et al.

Cryptic binding sites on proteins: definition, detection, and druggability

Curr Opin Chem Biol

2018

;

–

10.1016/j.cbpa.2018.05.003

Varadi

Berrisford

Deshpande

et al.

Pdbe-kb: a community-driven resource for structural and functional annotations

Nucleic Acids Res

2020

;

D344

–

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

Wakefield

Kozakov

Vajda

Mapping the binding sites of challenging drug targets

Curr Opin Struct Biol

2022

;

102396

wwPDB Consortium

Protein data bank: the single global archive for 3D macromolecular structure data

Nucleic Acids Res

2019

;

D520

–

Zhang

How significant is a protein structure similarity with tm-score = 0.5?

Bioinformatics

2010

;

889

–

10.1093/bioinformatics/btq066

Zhang

Freddolino

et al.

BioLiP2: an updated structure database for biologically relevant ligand–protein interactions

Nucleic Acids Res

2024

;

D404

–

Zhao

Cao

Zhang

Exploring the computational methods for protein-ligand binding site prediction

Comput Struct Biotechnol J

2020

;

417

–

Zheng

Predicting cryptic ligand binding sites based on normal modes guided conformational sampling

Proteins

2021

;

416

–

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Associate Editor:

Download all slides

Month:	Total Views:
December 2024	110
January 2025	385
February 2025	401
March 2025	338
April 2025	280
May 2025	108

Article Contents

CryptoBench: cryptic protein–ligand binding sites dataset and benchmark

Abstract

1 Introduction

2 Materials and methods

2.1 Dataset construction

2.1.1 Selecting a suitable crypticity metric

2.1.2 Apo–holo pairs pool filtering

3 Results

3.1 Dataset structure

3.2 Statistics

3.3 CryptoBench benchmark

3.3.1 Baseline evaluation

3.3.1.1 Protein language model classifier implementation

3.3.1.2 Results

4 Discussion

4.1 Crypticity definition

4.2 Alternative crypticity metrics

4.3 Distinguishing cryptic and non-cryptic binding sites

5 Conclusion

Acknowledgements

Supplementary data

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

Article Contents

CryptoBench: cryptic protein–ligand binding sites dataset and benchmark

Abstract

1 Introduction

2 Materials and methods

2.1 Dataset construction

2.1.1 Selecting a suitable crypticity metric

2.1.2 Apo–holo pairs pool filtering

3 Results

3.1 Dataset structure

3.2 Statistics

3.3 CryptoBench benchmark

3.3.1 Baseline evaluation

3.3.1.1 Protein language model classifier implementation

3.3.1.2 Results

4 Discussion

4.1 Crypticity definition

4.2 Alternative crypticity metrics

4.3 Distinguishing cryptic and non-cryptic binding sites

5 Conclusion

Acknowledgements

Supplementary data

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

This Feature Is Available To Subscribers Only