AI for predicting chemical-effect associations at the chemical universe level— deepFPlearn

Number and sizes of hidden layers for each trained neural network. NN—neural network; AE—autoencoder; FNN—feed-forward NN

NN	Input	Input size	Hidden layers
AE	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 512, 1024
FNN	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 128
FNN	compressed FP	\|$L_z=256$\|	128, 64, 32

NN	Input	Input size	Hidden layers
AE	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 512, 1024
FNN	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 128
FNN	compressed FP	\|$L_z=256$\|	128, 64, 32

Table 1

Number and sizes of hidden layers for each trained neural network. NN—neural network; AE—autoencoder; FNN—feed-forward NN

NN	Input	Input size	Hidden layers
AE	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 512, 1024
FNN	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 128
FNN	compressed FP	\|$L_z=256$\|	128, 64, 32

NN	Input	Input size	Hidden layers
AE	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 512, 1024
FNN	FP	\|$L_{FP}=2048$\|	1024, 512, 256, 128
FNN	compressed FP	\|$L_z=256$\|	128, 64, 32

The SELU activation function and lecun_normal weight initialization were used in hidden layers, and the Sigmoid activation function for the output layer. The model was compiled with binary cross-entropy as loss function and Adam optimizer.

A deep neural network was constructed as a sequential FNN for the classification task. The dimensions of the stacked layers depend on the mode of action of deepFPlearn. If feature compression via the AE is enabled, the FNN is used subsequently. Then, the size of the input layer of the FNN matches the length of the latent space |$L_z$| vectors. Otherwise, the size of the input layer matches the length of the molecular fingerprint |$L_{FP}$|⁠. The hidden layers were of decreasing sizes, followed by an output layer of size 1. The number of hidden layers |$\hat{N}_H$| and their sizes |$\hat{S}_j, j \in \left [1..\hat{N}_H\right ]$| depend on the provided input size |$L_{input}$| (which is either |$L_{FP}$| or |$L_z$|⁠). The last four layers with only a few neurons (e.g. less than 32 in the example case of |$L_{FP}=2048$|⁠) were not included. See Table 1 for the applied sizes of the hidden layers.

$$\begin{align} \hat{N}_H &= \lfloor\log_2(L_{input})/2\rfloor - 4 \end{align}$$

(4)

$$\begin{align} \hat{S}_j &= L_{input}/2^j\quad j \in [1..\hat{N}_H] \end{align}$$

(5)

Dense layers were used with the SELU activation function, lecun_normal weight initialization and AlphaDropout. The Sigmoid activation function was used for the output layer. All hidden layers were followed by a dropout layer. To reflect a potential imbalance in the training data, we introduced an initial bias of |$\log (P/N)$| (with |$P$| equal to the number of 1-values and |$N$| to the number of 0-values in the target vector) to the output layer. The FNN model was compiled using the Adam optimizer and binary cross entropy as loss function

Different datasets were collected from the literature and public databases. A manually curated dataset |$S$| was downloaded from the supplemental material of [36]. It contained chemical-target associations for 7248 chemicals and six gene targets that are involved in endocrine disruption (ED) in humans (androgen receptor (AR), estrogen receptor (ER), glucochorticoid receptor (GR), thyroid receptor (TR), PPARg and Aromatase). See supplemental Fig. S1 for an overview of these data’s size and class distributions. Initially, these data had been retrieved from bioassay data of the Tox21 program [37], and carefully transformed to binary associations by [36]: Associations were considered as not available (NA) if no bioassay data were available, and as 1 or 0, if an association between chemical and gene target had been confirmed in a bioassay or not, respectively. See [36] for details. The dataset |$S$| was extended by the artificial target ED that combines all existing target associations with a logical OR operation. Chemicals in the |$S$| dataset were identified by their SMILES string.

Further, a dataset |$D$| was generated from the 719 996 chemicals listed in the CompTox Chemistry Dashboard [40,accessed on 2020/07/13]. Chemicals in the |$D$| dataset were identified by their InChI identifiers.

For benchmarking, we downloaded two datasets from MoleculeNet [41], a database of benchmarking datasets for classification problems in molecular ML. First, we selected the Tox21 Challenge dataset—|$Tox21$|⁠, which associates chemicals and gene targets. Second, we used the Side Effect Resource (⁠|$SIDER$|⁠) database that associates drugs with grouped adverse drug reactions. These datasets contained 7831 and 1427 compounds, and 12 and 27 targets, respectively, and comprised binarized associations between those compounds and the targets. We followed the recommended metric and splitting patterns [41] to generate training data from these datasets and selected targets with a 1-0-ratio of at least 0.2 and a minimal number of 200 samples in the positive class for training.

See Table 2 for an overview which of those datasets were used in which training or prediction case.

Table 2

Overview of the usage of the different datasets for training or prediction

case	train AE	use AE	train FNN	predict
1	\|$S$\|	-	-	-
2	\|$D$\|	-	-	-
3	-	-	\|$S$\|	-
4	-	\|$S$\|	\|$S$\|	-
5	-	\|$D$\|	\|$S$\|	-
6	-	\|$D$\|	\|$SIDER$\|	-
7	-	\|$D$\|	\|$Tox21$\|	-
8	-	\|$D$\|	-	\|$D$\|

case	train AE	use AE	train FNN	predict
1	\|$S$\|	-	-	-
2	\|$D$\|	-	-	-
3	-	-	\|$S$\|	-
4	-	\|$S$\|	\|$S$\|	-
5	-	\|$D$\|	\|$S$\|	-
6	-	\|$D$\|	\|$SIDER$\|	-
7	-	\|$D$\|	\|$Tox21$\|	-
8	-	\|$D$\|	-	\|$D$\|

Table 2

Overview of the usage of the different datasets for training or prediction

case	train AE	use AE	train FNN	predict
1	\|$S$\|	-	-	-
2	\|$D$\|	-	-	-
3	-	-	\|$S$\|	-
4	-	\|$S$\|	\|$S$\|	-
5	-	\|$D$\|	\|$S$\|	-
6	-	\|$D$\|	\|$SIDER$\|	-
7	-	\|$D$\|	\|$Tox21$\|	-
8	-	\|$D$\|	-	\|$D$\|

case	train AE	use AE	train FNN	predict
1	\|$S$\|	-	-	-
2	\|$D$\|	-	-	-
3	-	-	\|$S$\|	-
4	-	\|$S$\|	\|$S$\|	-
5	-	\|$D$\|	\|$S$\|	-
6	-	\|$D$\|	\|$SIDER$\|	-
7	-	\|$D$\|	\|$Tox21$\|	-
8	-	\|$D$\|	-	\|$D$\|

Implementation.deepFPlearn was implemented as a Python (version 3.9.12) package with three different usage-modes. First, convert imports the dataset (for training or prediction) and calculates molecular fingerprints for all structures from their respective SMILES or InChi representation. A data frame combines the original representation, the calculated fingerprint and all targets. It is then serialized to disc as a Pickle file to accelerate the data import for subsequent sessions. Importantly, deepFPlearn assumes that SMILES have been canonicalized and cleaned. We recommend to either use ChemAxon’s chemical structure representation toolkit (https://chemaxon.com/products/chemical-structure-representation-toolkit) or a chemical structure curation pipeline relying on RDKit [4]. The second mode is training. The neural networks can easily be (re-)trained with any dataset that associates chemical structures with an effect. All necessary information is logged during the training to validate and evaluate the trained models. The third mode is to predict the association of a provided list of chemicals with an effect using the trained models.

The user can adjust all neural network settings and the mode of action in a JSON configuration file.

Dependencies to external libraries and software are managed using a platform-independent conda (https://www.anaconda.com) environment, which we provide in the code repository. A singularity container(https://sylabs.io/) was set up that encapsulates the whole project at the state of publication for usage and reproducibility. It includes the required resources, source code, compiled package and test data.

Results

We developed the stand-alone, ready-to-use DL approach deepFPlearn to associate chemicals with gene/pathway level targets. We further evaluated the potential of feature compression to increase the applicability to substances beyond the limited amount of available training data.

Our workflow combined a pre-training strategy via a deep autoencoder to reduce the feature space and to generate a universal encoding of binary fingerpints, followed by a classification step using a deep FNN, see Figure 1.

Figure 1

The deepFPlearn workflow. (A) The molecular fingerprints serve as input for the neural networks. (B) An AE is used to compress the fingerprints. (C) An FNN) is used for direct classification of the input. (D) An FNN is used for classification of the compressed input. Sizes of layers, activation and loss functions are different for each network and depend on the input size, see methods section.

For the FNNs, we employed 5-fold cross-validation to show that the selection of the train-test-split has no significant impact on the model performance. In particular, the standard deviation of the ROC-AUC values (calculated on the validation data) was |$\sim $|1%, see supplement Fig. S2. Therefore, we used a single stratified train-test-split to finetune and train our models.

Feature compression comprehensively reduced trainable parameters while keeping comparable classification performance. We applied different training setups: First, feature compression was disabled (no AE, Figure 1 from A to C), and FNN training used the full-length molecular fingerprints. The ratio between positive (1) and negative (0) associations differed substantially between the individual targets, see supplement Fig. S1. We introduced an initial bias to the output layer of the FNN to reflect that imbalance and selected AR, ER and the artificial target ED as subsets with an acceptable imbalance to train individual FNN models. Due to the fingerprint size of 2048, the respective hidden layer sizes of the FNN were 1024, 512, 256, 128 resulting in about 2.8e6 trainable parameters. The training stopped early before |$\sim $|100 epochs. Binary accuracy values of 0.85, 0.83, 0 .78 and ROC-AUC values of 0.81, 0.83, 0.81 were reached for AR, ER and ED, respectively. See Figure 3 A (top panels) for the training histories, and Figure 3 B (lightgray bars) for the values of precision, recall, F1 scores and further metrics that describe the performance of our FNN models. See Figure 4 A for ROC and precision-recall curves of the AR target for the classification without AE, and supplement Fig. S 3–5 for confusion matrices, ROC and precision-recall curves of all three targets.

Second, we applied feature reduction before the classification by training an AE with a latent space size of |$L_z=256$|⁠. This reduced the respective hidden layer sizes of the FNN to 128, 64, 32, resulting in only 43.3e3 trainable parameters, which is 1.55% of the uncompressed case above. We trained both a specific AE using the (small) |$S$| dataset and a generic AE using the (large) |$D$| dataset. See Figure 1 from A over B to C. The training of the specific autoencoder stopped early at 28 epochs which is due to the small number of training samples. The validation loss reached a value of 0.026. The generic autoencoder trained for around 320 epochs and stopped at a validation loss of 0.159. See Figure 2 for the training histories and a UMAP visualization of the high-dimensional uncompressed feature space and the low-dimensional latent space of dataset |$S$|⁠. Coloring compounds from the uncompressed and compressed space with labels calculated on the uncompressed feature space yielded similar cluster associations in the UMAP. Therefore, the AE preserves relevant (structural) information during feature compression.

$(A) ROC-AUC and loss values during training (calculated on the training and validation data after each epoch) of the specific ($S$ – Sun et al. 2019) and the generic ($D$ – CompTox) autoencoder. The training stopped early at 28 epochs for the specific AE—due to the small number of available training samples and reached a validation loss of 0.026. The training of the generic AE stopped at $\sim $320 epochs reaching a validation loss of 0.159. (B) UMAP visualizations of uncompressed and compressed representations of all compounds from $S$ dataset; the color indicates cluster assignment of a $k$-means clustering with $k=4$ on the uncompressed features.$

Figure 2

(A) ROC-AUC and loss values during training (calculated on the training and validation data after each epoch) of the specific (⁠|$S$| – Sun et al. 2019) and the generic (⁠|$D$| – CompTox) autoencoder. The training stopped early at 28 epochs for the specific AE—due to the small number of available training samples and reached a validation loss of 0.026. The training of the generic AE stopped at |$\sim $|320 epochs reaching a validation loss of 0.159. (B) UMAP visualizations of uncompressed and compressed representations of all compounds from |$S$| dataset; the color indicates cluster assignment of a |$k$|-means clustering with |$k=4$| on the uncompressed features.

Subsequently, we trained the FNNs and used the latent space representation as input. The training stopped early before |$\sim $|400 epochs. Binary accuracy values of 0.85, 0.80 and 0 .77 and ROC-AUC values of 0.81, 0.81 and 0.78 were reached for AR, ER and ED, respectively, when the specific AE was used to encode the fingerprints. We observed no significant discrepancy in these values when using the generic AE. In particular, we reached values of 0.85, 0.80 and 0.74 for binary accuracy, and 0.80, 0.79 and 0.76 for ROC-AUC values. Therefore, when the input features are compressed with the generic AE, the FNNs may be applied to a much more comprehensive range of molecular structures without compromising on the predictive power. See Figure 3 A (middle and lower panels) for the training histories, and Figure 3 B (medium and dark gray bars) for the values of precision, recall, F1 score and further metrics that describe the performance of our FNN models that were trained with compressed fingerprints. See Figure 4 B for ROC and precision-recall curves of the AR target for the classification with the generic AE, and supplementary Fig. 3–5 for confusion matrices, ROC and precision-recall curves for all three targets.

Figure 3

(A) Training histories of the feed forward neural networks stratified by the selected targets/models for androgen (AR) and estrogen (ER) receptors, and endocrine disruption (ED), and the degree of feature compression (uncompressed, specific AE, and generic AE); the shown metrics are ROC-AUC (red), loss (orange) calculated on the training (dotted) and validation data (solid) during training. (B) Comparison of the values of balanced accuracy (Balanced ACC), area under the receiver-operator curve (AUC), precision (PREC), recall (REC), F1 score (F1), specificity (SPEC) and MCC of the individual models using no (lightgray), the specific (medium gray) and the generic AE (dark gray). (C) MCC was calculated for increasing thresholds from 0 to 1 on the predicted validation data. The threshold with maximum MCC was selected as the individual classification threshold for each model. Example generated for model: AR, uncompressed input.

Benchmarking confirmed our strategy.

We compared the results of our strategy against the results of Sun et al. [36], the publication from which we extracted our FNN training data, the introduced approaches eToxPred [32] and DeepTox [26], and the results reported by MoleculeNet [41]. Sun et al. [36] reported balanced accuracy values in their results and we reached the same range between 74 and 81% on the same data. Pu et al. [32], Mayr et al. [26] and Wu et al. [41] reported ROC-AUC values of 72, 82 and 83%, respectively, on the Tox21 data of MoleculeNet, while our models achieved ROC-AUC values of 88%. For the SIDER dataset Wu et al. [41] reported 67% ROC-AUC values, while we reached 84%. For the MoleculeNet datasets, we also observed only a slight drop in performance when using the generic AE. In summary, our models perform either in the same range as existing approaches or better, which is satisfying compared with the increased applicability of our strategy.

deepFPlearn is ready to be applied to huge datasets. We used deepFPlearn with generic feature compression and selected the trained models for AR, ER and ED to predict associations of the |$\sim 700k$| chemicals from dataset |$D$|⁠. For most of those compounds, the probability of acting as endocrine disruptors was not known. deepFPlearn predicted |$\sim $|60k with high prediction probability |$P>0.85$|⁠.

From the ED predictions of dataset |$D$|⁠, we investigated the top 200 and bottom 200 (ranked by prediction probability) and empirically investigated their biological feasibility. We found compounds among the top 200 like Estriol, 17alpha-Ethinylestradiol, 17beta-Ethinylestradiol, Mestranol, Prednisolone Dexamethasone, Betamethasone and respective derivates. These chemicals are well known to interact with the human estrogen receptors and pathways or with the glucocorticoid pathway. Interestingly, Escher et al. [11] also identified some of those to interact selectively with AR in the cell assay screenings. Also, the top 200 list contains the chemicals Ezlopitant dihydrate dihydrochloride, 5-Bromo-2, 2-diethyl-5-nitro-1,3-dioxane or Schinifoline, a metabolite of the Japanese Pepper plant Zanthoxylum schinifolium. To our knowledge, those substances have not been tested in bioassays so far. In the bottom 200 predictions (⁠|$P<0.01$|⁠) we found derivates of carbamic, acetic and amino acids. Those chemicals have never been discussed in the context of steroid hormone related ED as far as we know.

Figure 4

Receiver-operator (left of both panels) and precision-recall (right of both panels) curves of a single fold of the AR target without using feature compression (A), and with generic feature compression (B). The color indicates the value of the respective classification threshold. Supplemental Figure S2 depicts the standard deviations of the AUC for the five folds.

Recently, Escher et al. [11] categorized a selection of 355 out of 7968 investigated chemicals and their activity with the ED receptors AR and ER as selective (41), specific and unspecific (314, summarized as other) binders.

We predicted the associations for the subset of 339 chemicals that have not been part of our training data with and without generic feature compresssion. The models for ER and ED that were trained on the compressed fingerprints captured substantially more of the selective compounds with higher prediction probability than the models that used the uncompressed fingerprints. However, this was not true for the AR model. See Figure 5 B and C for probability distributions and counts of the ED model and supplement Fig. S6 for the comparison of all three models.

Figure 5

(A) Values for all metrics calculated on the validation data for the benchmarking data sets SIDER and Tox21 summarized across all targets: balanced accuracy (Balanced ACC), area under the receiver operator curve (AUC), precision (PREC), recall (REC), F1 score (F1), specificity (SPEC) and MCC of the individual models using no (light gray), the specific (medium gray) and the generic AE (dark gray). (B)deepFPlearn prediction probabilities using the ED model with generic AE on the compounds that have been experimentally measured for quantified target association and, respectively, differentiated into selective and non-specifically acting compounds by Escher et al. [11]. Probability distributions are compared using the Kolmogorow–Smirnow test, and the significance levels for rejecting the null hypotheses that both distributions are similar was ^* for P-values below 0.05. (C) Comparison of the counts of predicted 1 (active) and 0 (inactive) labels for the same compounds as described in Figure B shown for the ED model.

Discussion

There is a great need for systematic prediction of chemical-effect associations in toxicology. They are required to prioritize chemicals for experimental screening, a smart selection of chemicals for monitoring and the design of novel chemicals. Several approaches and implementations exist that partially address these challenges. However, no tools for large-scale application are available, and the option for retraining with additional data sets is absent. While MoleculeNet [41] and Deepchem [34] provide capable frameworks for developing learning applications on chemicals, readily applicable tools, e.g. for predicting ED, are missing.

With deepFPlearn we present an application to investigate sets of chemicals for their potential associations to gene targets involved in ED. It is a DL approach with the possibility of training custom models to predict different associations of interest.

The small number of labeled training data is in contrast to the high number of features necessary to describe a chemical’s molecular structure. Also, the natural interaction of chemicals and biomolecules is biased toward ‘no interaction’ (label of 0) such that the data suffer from a substantial imbalance between 1 and 0 labels. Assessing the association of chemicals and biomolecules requires measuring a range of concentrations per substance and assay and thus poses a substantial effort even with high-throughput technologies. Since the number of substances with measured associations is small compared with the universe of chemicals, there is a lack of labeled training data. Due to the high speed at which new chemicals are developed, this situation will not change in the foreseeable future. To make things worse, many positive associations (label of 1) are potentially wrong due to mistakes during screening result interpretation. Examples are unclear effect thresholds, high variability in the experimental designs and limitations in the statistics of modeling the observed effect. The imbalance of the training data together with a large number of parameters can easily lead to overfitting. This is reflected by a large discrepancy between the training and validation loss, which we still observe in the cases where we do not use feature compression. However, our strategy to initialize the output layer of the FNN with the correct bias to reflect the imbalanced class distribution, which has been recently proposed by [1], our extended hyperparameter tuning and the application of fallback mechanisms, reduced overfitting also for the uncompressed FNN. In supplemental Figure S8 we show how the model can be driven into overfitting when one of these strategies is disabled.

We reduced the discrepancy between large descriptor size and the limited training data by compressing features with a deep autoencoder. Further, this reduced the large number of trainable parameters to 1.55|$\%$| of the networks that do not use an AE. Using a large repertoire of chemicals for training the AE further improved the domain extrapolation without reducing the predictive power of the subsequent classification. We tested different training situations, (i) without feature compression, (ii) feature compression with a subset of chemicals (specific AE) and (iii) feature compression with a large set of chemicals (generic AE). We reached good training performances with ROC-AUC values above 80%, with satisfying sensitivity up to 75%, and specificity up to 97%.

Using the benchmark datasets from MoleculeNet, and reported binary accuracies and ROC-AUC values from other approaches that used the same data sets we showed that deepFPlearn performed comparably or better. However, those methods also demand significant adjustments to the training data to cope with imbalance. We found that our predictions with the generic AE captured more of the compounds that have been experimentally analyzed and classified by [11] than the models trained on the uncompressed fingerprints, which verifies our assumption on predicting unseen data.

deepFPlearn allows for selecting different usage modes depending on the classification problem: If the compounds to be classified are expected to reside within the domain of the training data the FNN without AE provides superior classification performance. However, given the overall comparable accuracy of deepFPlearn when pre-training on a large data set, we consider this the more robust, computationally efficient and generally more applicable approach in particular for large, heterogeneous and imbalanced data.

The quality of our predictions is also high on the large CompTox dataset. Among the top 1–associated predictions were chemicals that are well known to interact with human estrogen or the glucocorticoid receptor or related pathways. Likewise, among the respective top 0–associated predictions were chemicals that have never been discussed to be involved in ED, which further enhances the confidence in our models.

Our high values for specificity also suggest an application of deepFPlearn to predict secondary effects in drug design.

The deepFPlearn results on the chemicals experimentally classified as selective and unspecific also confirmed our prediction quality. Although a relatively broad distribution of prediction probabilities for selective binders suggests that there is still room for methodological improvement, many of the chemicals predicted with a very high probability are indeed selective binders.

We suggest a more detailed investigation of the predicted associations and experimental validation in upcoming studies to confirm or decline effects in endocrine disruption.

Conclusion

With deepFPlearn we model the associations between chemical structures and effects on the gene/pathway level with a deep learning approach.

In contrast to existing approaches and implementations, deepFPlearn is a ready-to-use tool. It comes as a stand-alone Python software package and (additionally) wrapped in a Singularity Container to overcome the dependency on the operating system and required software. deepFPlearn can capture a much more comprehensive range of substances than those contained in the training data of the classification network. It can be applied to classify hundreds of thousands of chemicals in seconds. Moreover, with its different application modes, we provide the flexibility to train custom models with any meaningful dataset that associates chemicals with an effect. deepFPlearn substantially contributes to the systematic in silico investigation of chemicals, even for data-driven hypothesis generation on novel substance-effect associations. With deepFPlearn we can cope with the large, constantly and rapidly growing chemical universe and support prioritization of chemicals for experimental testing, assist in the smart selection of chemicals for monitoring and contribute to the sustainable design of the future chemicals.

Key Points

All living species are exposed to a vast amount (and mixtures) of chemicals; many pose risks; this risk is not known for the majority.
To support the lab-based risk assessment and subsequent regulation of use, prioritize chemicals for experimental design and hypothesis generation, efficient and systematic tools that can evaluate the chemical-effect association on a large scale are required, but are not available so far.
We present the ready-to-use deep learning application deepFPlearn that predicts the association between the chemical’s molecular structure and the observed effect on the gene/pathway level.
We solved the discrepancy between large feature space describing the molecular structure and the low amount of labeled training data with a pre-training strategy for feature compression on the chemical inventory.
We confirmed the good performance and high prediction quality of deepFPlearn with benchmarking and experimentally validated datasets.

Availability of source code and data

The source code is available in a git repository at github: https://github.com/yigbt/deepFPlearn under the terms of the UFZ license, which is based on GNU General Public License as published by the Free Software Foundation version 3 or later. We refer to this repository for installation and usage instructions. For ease of use we also provide Docker and Singularity containers, which is accessible via this repository. These containers also contain the data used for training the models.

Author contributions statement

J.S. and J.H. planned the study; J.S. and C.L. defined the neural network architectures; J.S. preprocessed all data; J.S. and P.S. implemented the software package and analyzed the results; M.B. and P.S. built the singularity container and all github actions; all authors wrote the manuscript. All authors read and approved the final manuscript.

Acknowledgments

The authors are grateful to Martin Krauss for helpful discussions.

Funding

This work was supported in part by the Helmholtz. AI project XAI-graph and by the CEFIC Long Range Initiative through funding the project C5 - XomeTox, the Helmholtz program ``Changing Earth - Sustaining our Future'' topic 9, and the Horizon Europe Partnership for the Assessment of Risk from Chemicals.

Author Biographies

Jana Schor heads the Group Data Science in Bioinformatics, Department of Computational Biology at the Helmholtz-Centre for Environmental Research GmbH – UFZ. She has a strong background in computer science and bioinformatics. Jana implements state-of-the-art data science methods, and promotes the principles of reproducible research in the field of computational toxicology.

Jörg Hackermüller heads the Department of Computational Biology at the Helmholtz-Centre for Environmental Research GmbH – UFZ and is professor of Computational Biology at Leipzig University. His research group develops bioinformatics, systems biology, and data science approaches to advance mechanistic understanding in toxicology and environmental health. Jörg has a background in computational biology and biochemistry.

References

Classification on imbalanced data

2022

. URL https://www.tensorflow.org/tutorials/structured_data/imbalanced_data (18 April 2022, date last accessed). Online tutorial.

Abadi

Barham

Chen

, et al. TensorFlow: A system for large-scale machine learning. In:

Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016

. USENIX Association, Savannah, GA,

2016

265

–

Anderson

Mcdonnell

Ximing

, et al.

The Challenge of Micropollutants in Aquatic Systems

Science

2006

;

313

(

August

1072

–

PubMed

. https://doi.org/10.1186/s13321-020-00456-1.

Bento

Hersey

Félix

, et al.

An open source chemical structure curation pipeline using rdkit

J Chem

2020

ISSN 17582946

;

. https://doi.org/10.1177/0748233719893198.

Biewald

Experiment tracking with weights and biases

2020

. URL https://www.wandb.com/.

Software available from wandb.com

(19 April 2022, date last accessed).

Bond

Garny

Inventory and evaluation of publicly available sources of information on hazards and risks of industrial chemicals

Toxicol Ind Health

2019

ISSN 0748-2337

;

(

11-12

738

–

Busch

Schmidt

Kühne

, et al.

Micropollutants in European rivers: A mode of action survey to support the development of effect-based tools for water monitoring

Environ Toxicol Chem

2016

ISSN 15528618

;

(

1887

–

. https://doi.org/10.1002/etc.3460.

Cas. No Title

. https://www.cas.org/support/documentation/chemical-substances (13 April 2022, date last accessed).

2020

Cherkasov

Muratov

Fourches

, et al.

QSAR modeling: Where have you been? Where are you going to?

J Med Chem

2014

ISSN 15204804

;

(

4977

–

5010

. https://doi.org/10.1021/jm4004285.

10.

Desforges

Hall

McConnell

, et al.

Predicting global killer whale population collapse from PCB pollution

Science

2018

ISSN 10959203

;

361

(

6409

1373

–

. https://doi.org/10.1126/science.aat1953.

11.

Escher

Henneberger

König

, et al.

Cytotoxicity burst? Differentiating specific from nonspecific effects in tox21 in vitro reporter gene assays

Environ Health Perspect

2020

;

128

(

–

. https://doi.org/10.1289/EHP6664. URL. https://ehp.niehs.nih.gov/doi/10.1289/EHP6664.

. URL https://echa.europa.eu/documents/10162/13628/evaluation_under_reach_progress_ en.pdf.

12.

European Chemials Agency

. No Title. https://echa.europa.eu/de/universe-of-registered-substances, (13 April 2022, date last accessed) 2019.

13.

European Chemicals Agency

Evaluation under REACH: progress report 2017 - 10 years of experience

Technical report, European Chemicals Agency

Helsinki

2018

Google Preview

. https://ec.europa.eu/info/sites/info/files/european-green-deal-communication_en.pdf.

14.

European Commission

Communication from the commission to the european parliament, the European Council, the council, the European economic and social committee and the committee of the regions

The European Green Deal Technical Report COM(2019) 640 final, European Commission

2019

15.

European Environment Agency

State and Outlook 2015 the European Environment

Technical report, European Environment Agency

2015

Google Preview

. https://doi.org/10.1038/nature13531.

16.

Fischer

KEMI Market List (Version NORMAN-SLE-S17.0.1.4)

. https://doi.org/10.5281/zenodo.3959394 (13 April 2022, date last accessed).

2017

17.

Hallmann

Foppen

RPB

Van Turnhout

, et al.

Declines in insectivorous birds are associated with high neonicotinoid concentrations

Nature

2014

ISSN 14764687

;

511

(

7509

341

–

18.

Köhler

Triebskorn

Wildlife ecotoxicology of pesticides: Can we track effects to the population level and beyond?

Science

2013

ISSN 10959203

;

341

(

6147

759

–

. https://doi.org/10.1126/science.1237591.

19.

Landrigan

Fuller

Acosta

NJR

, et al.

The Lancet Commission on pollution and health

Lancet (London, England)

2018

;

391

(

10119

462

–

512

. https://doi.org/10.1016/S0140-6736(17)32345-0.

20.

Landrum

RDKit: Open-source Cheminformatics

2006

ISSN 00028282

21.

Lepailleur

Poezevara

Bureau

Automated detection of structural alerts (chemical fragments) in (eco)toxicology

Comput Struct Biotechnol J

2013

ISSN 20010370

;

(

):e201302013. https://doi.org/10.5936/csbj.201302013.

. https://doi.org/10.1016/S0140-6736(12)61766-8.

22.

Lim

Vos

Flaxman

, et al.

A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990-2010: a systematic analysis for the Global Burden of Disease Study 2010

Lancet (London, England)

2012

ISSN 1474-547X

;

380

(

9859

2224

–

23.

Liu

Gao

Peng

, et al.

TarPred: A web application for predicting therapeutic and side effect targets of chemical compounds

Bioinformatics

2015

ISSN 14602059

;

(

2049

–

. https://doi.org/10.1093/bioinformatics/btv099.

24.

Rensi

Torng

, et al.

Machine learning in chemoinformatics and drug discovery

Drug Discov Today

2018

ISSN 18785832

;

(

1538

–

. https://doi.org/10.1016/j.drudis.2018.05.010.

25.

Mattingly

Colby

Forrest

, et al.

Generating the blood exposome database using a comprehensive text mining and database fusion approach

Environ Health Perspect

2016

ISSN 13624962

;

(

769

–

. https://doi.org/10.1371/journal.pone.0154387.

. https://doi.org/10.3389/fenvs.2015.00080.

26.

Mayr

Klambauer

Unterthiner

, et al.

DeepTox: Toxicity Prediction Using Deep Learning

Front Environ Sci

2015

;

. https://doi.org/10.21105/joss.00861.

27.

McInnes

Healy

Saul

, et al.

UMAP: Uniform Manifold Approximation and Projection

Journal of Open Source Software

2018

ISSN 2475-9066

;

(

861

28.

Norman Network. EMPODAT Database

. https://www.norman-network.com/nds/empodat/chemicalStatistics.php (13 April 2022, date last accessed).

2020

29.

Fabianpedregosa

Michel

Oliviergrisel

, et al.

Matthieu Perrot

Technical report

2011

. https://doi.org/10.1897/01-171.

30.

Perkins

Fang

Tong

, et al.

Quantitative structure-activity relationship methods: Perspectives on drug discovery and toxicology

Environ Toxicol Chem

1666–1679

;

(

2003

ISSN 07307268

. https://doi.org/10.1002/etc.4373.

31.

Posthuma

van Gils

Zijp

, et al.

Species sensitivity distributions for use in environmental protection, assessment, and management of aquatic ecosystems for 12 386 chemicals

Environ Toxicol Chem

2019

ISSN 15528618

;

(

703

–

32.

Naderi

Liu

, et al.

eToxPred: a machine learning-based approach to estimate the toxicity of drug candidates

BMC Pharmacol Toxicol

2019

;

(

). https://doi.org/10.1186/s40360-018-0282-6.

. https://doi.org/10.1002/wcms.1240.

33.

Raies

Bajic

In silico toxicology: computational methods for the prediction of chemical toxicity

Wiley Interdisciplinary Reviews: Computational Molecular Science

2016

ISSN 17590884

;

(

April

147

–

PubMed

34.

Ramsundar

Eastman

Walters

, et al.

Deep Learning for the Life Sciences

O’Reilly Media, Inc

2019

ISBN 9781492039839

Google Preview

35.

Rappaport

Genetic Factors Are Not the Major Causes of Chronic Diseases

Plos One

2016

ISSN 1932-6203

;

(

):e0154387. https://doi.org/10.1371/journal.pone.0154387.

. https://doi.org/10.1021/acs.jcim.8b00551.

36.

Sun

Yang

Cai

, et al.

In Silico Prediction of Endocrine Disrupting Chemicals Using Single-Label and Multilabel Models

J Chem Inf Model

2019

;

. https://doi.org/10.14573/altex.1803011.

37.

Thomas

The US Federal Tox21 Program: A strategic and operational plan for continued leadership

ALTEX

2018

ISSN 1868596X

;

163

–

. https://doi.org/10.1016/j.yrtph.2010.04.004.

Vink

Mikkers

Bouwman

, et al.

Use of read-across and tiered exposure assessment in risk assessment under REACH–a case study on a phase-in substance

Regulatory toxicology and pharmacology : RTP

2010

ISSN 1096-0295

;

(

–

39.

Wang

Walker

Muir

, et al.

Toward a Global Understanding of Chemical Pollution: A First Comprehensive Analysis of National and Regional Chemical Inventories

Environ Sci Tech

2020

ISSN 15205851

;

(

2575

–

. https://doi.org/10.1021/ACS.EST.9B06379/SUPPL_FILE/ES9B06379_SI_001.PDF.

. https://doi.org/10.1186/s13321-017-0247-6.

40.

Williams

Grulke

Edwards

, et al.

The CompTox Chemistry Dashboard: a community data resource for environmental chemistry Open Access

J Chem

2017

;