Table 2

List of methods to predict the thermodynamic stability changes of proteins upon mutations based on machine learning approaches, developed or updated in the last 15 years and freely available to users via web servers. All the web servers listed here were available at the time of the submission of the review

Name and original referenceInput dataInput data and optional settingsType of approachOutputAvailability
I-Mutant 2.0 [21, 23]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermDirection of the free energy change and its value for either all possible mutations of a particular residue or only for a specific mutationhttp://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant2.0/I-Mutant2.0.cgi
MUpro [26]Protein sequence. Protein structure if availableSVM- and neural network-based predictors, trained on the same dataset as I-Mutant 2.0.Prediction of the value of energy change using support vector machine, using regression methods (recommended). Prediction of the sign of energy change using support vector machines and neural networks, using classification methodshttp://mupro.proteomics.ics.uci.edu/
I-Mutant 3.0 [31]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermΔΔG value and binary classification (ΔΔG ≥ 0, ΔΔG <0) or ternary classification (ΔΔG < -0.5, −0.5 ≤ ΔΔG ≤ 0.5,
ΔΔG >0.5)
http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi
mCSM [46]Protein structureSingle mutation, mutation list or systematic mutations on a single residueGraph-based distance patterns among atoms to represent the residue environment and a ‘pharmacophore count’ vector to account for the atom changes introduced by the mutation. The resulting signature vector is used to train predictive machine learning methods in regression and classification tasksPrediction of the direction of the change in stability and actual numerical experimental value. Also prediction of change in affinity of protein–protein and protein–DNA complexes upon mutationhttp://biosig.unimelb.edu.au/mcsm/
NeEMO [43]Protein structureTemperature; pH; amino acid to be substituted (one or more, manually edited)Calculation of residue–residue interaction networks where nodes represent residues and edges represent different types of physicochemical bonds. These graphs are used to train neural network for the prediction of stability changesPrediction of ΔΔG changes upon point mutationshttp://protein.bio.unipd.it/neemo/
AUTO-MUTE 2.0 (Stability changes tool) [32, 44]Protein structure (no multiple models, no gaps; no alternative conformations for alpha-carbon atoms)Temperature; pH; amino acid to be substituted (single or systematic)Two supervised classification models (random forest and SVM) to predict only the sign of ΔΔG; two regression models (tree regression and SVM regression) to predict the actual value of ΔΔGEither predicted sign of ΔΔG along with a confidence level or predicted value of ΔΔG with other information about structural features of the mutanthttp://binf2.gmu.edu/automute/AUTO-MUTE_Stability_ddG.html
INPS-MD [47, 54]Protein sequence (INPS) or structure (INPS3D)Support vector regression trained on descriptors encoding mutation type (in particular, substitution score, hydrophobicity score, mutability index of native residue, molecular weights of native and mutant residues) and evolutionary information (INPS). Addition of structural features such as relative solvent accessibility of native residue and local energy difference calculated by a contact potential (INPS3D)Changes in ΔG values upon residue substitution in the protein sequencehttps://inpsmd.biocomp.unibo.it/welcome/default/index
EASE-MM [51]Protein sequence of a single domain monomeric proteinCombination of five specialized SVM models to predict ΔΔG of mutations. Each SVM combines a different set of features encoding evolutionary conservation, amino acid parameters and predicted structural properties such as secondary structures and different levels of accessible surface areasPredicted ΔΔG and stability class: ∆∆Gu in (−inf, −1), destabilizing; ∆∆Gu in (−1, −0.5), likely destabilizing; ∆∆Gu in (−0.5, 0.5), neutral; ∆∆Gu in (0.5, 1), likely stabilizing; ∆∆Gu in (1, +inf), stabilizing. Predicted secondary structure and relative accessible surface area of the mutation sitehttps://sparks-lab.org/server/ease-mm/
STRUM [53]Protein sequenceSingle variation or multiple variationGradient boosting regression approach using different features sequence profile scores for evolutionary information, structural profile scores and different energy functions providing accurate environment informationPredicted ΔΔG of single-point mutationhttp://zhanglab.ccmb.med.umich.edu/STRUM/
PON-tstab [63]Protein sequenceTemperature; pH; single variation or multiple variationRandom forests tool based on similarity features, conservation features, amino acid features, variation type features, neighborhood features, and other sequence-based protein featuresPredicted ΔΔG of single-point mutation and predicted probabilityhttp://structure.bmc.lu.se/PON-Tstab/
DeepDDG [64]Protein structureSingle mutations or a list of mutationsNeural network-based predictor in which the parameters are shared for each target residue–neighbor residue pairPrediction of the change in folding free energy upon mutation (ΔΔG)http://protein.org.cn/ddg.html
Name and original referenceInput dataInput data and optional settingsType of approachOutputAvailability
I-Mutant 2.0 [21, 23]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermDirection of the free energy change and its value for either all possible mutations of a particular residue or only for a specific mutationhttp://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant2.0/I-Mutant2.0.cgi
MUpro [26]Protein sequence. Protein structure if availableSVM- and neural network-based predictors, trained on the same dataset as I-Mutant 2.0.Prediction of the value of energy change using support vector machine, using regression methods (recommended). Prediction of the sign of energy change using support vector machines and neural networks, using classification methodshttp://mupro.proteomics.ics.uci.edu/
I-Mutant 3.0 [31]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermΔΔG value and binary classification (ΔΔG ≥ 0, ΔΔG <0) or ternary classification (ΔΔG < -0.5, −0.5 ≤ ΔΔG ≤ 0.5,
ΔΔG >0.5)
http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi
mCSM [46]Protein structureSingle mutation, mutation list or systematic mutations on a single residueGraph-based distance patterns among atoms to represent the residue environment and a ‘pharmacophore count’ vector to account for the atom changes introduced by the mutation. The resulting signature vector is used to train predictive machine learning methods in regression and classification tasksPrediction of the direction of the change in stability and actual numerical experimental value. Also prediction of change in affinity of protein–protein and protein–DNA complexes upon mutationhttp://biosig.unimelb.edu.au/mcsm/
NeEMO [43]Protein structureTemperature; pH; amino acid to be substituted (one or more, manually edited)Calculation of residue–residue interaction networks where nodes represent residues and edges represent different types of physicochemical bonds. These graphs are used to train neural network for the prediction of stability changesPrediction of ΔΔG changes upon point mutationshttp://protein.bio.unipd.it/neemo/
AUTO-MUTE 2.0 (Stability changes tool) [32, 44]Protein structure (no multiple models, no gaps; no alternative conformations for alpha-carbon atoms)Temperature; pH; amino acid to be substituted (single or systematic)Two supervised classification models (random forest and SVM) to predict only the sign of ΔΔG; two regression models (tree regression and SVM regression) to predict the actual value of ΔΔGEither predicted sign of ΔΔG along with a confidence level or predicted value of ΔΔG with other information about structural features of the mutanthttp://binf2.gmu.edu/automute/AUTO-MUTE_Stability_ddG.html
INPS-MD [47, 54]Protein sequence (INPS) or structure (INPS3D)Support vector regression trained on descriptors encoding mutation type (in particular, substitution score, hydrophobicity score, mutability index of native residue, molecular weights of native and mutant residues) and evolutionary information (INPS). Addition of structural features such as relative solvent accessibility of native residue and local energy difference calculated by a contact potential (INPS3D)Changes in ΔG values upon residue substitution in the protein sequencehttps://inpsmd.biocomp.unibo.it/welcome/default/index
EASE-MM [51]Protein sequence of a single domain monomeric proteinCombination of five specialized SVM models to predict ΔΔG of mutations. Each SVM combines a different set of features encoding evolutionary conservation, amino acid parameters and predicted structural properties such as secondary structures and different levels of accessible surface areasPredicted ΔΔG and stability class: ∆∆Gu in (−inf, −1), destabilizing; ∆∆Gu in (−1, −0.5), likely destabilizing; ∆∆Gu in (−0.5, 0.5), neutral; ∆∆Gu in (0.5, 1), likely stabilizing; ∆∆Gu in (1, +inf), stabilizing. Predicted secondary structure and relative accessible surface area of the mutation sitehttps://sparks-lab.org/server/ease-mm/
STRUM [53]Protein sequenceSingle variation or multiple variationGradient boosting regression approach using different features sequence profile scores for evolutionary information, structural profile scores and different energy functions providing accurate environment informationPredicted ΔΔG of single-point mutationhttp://zhanglab.ccmb.med.umich.edu/STRUM/
PON-tstab [63]Protein sequenceTemperature; pH; single variation or multiple variationRandom forests tool based on similarity features, conservation features, amino acid features, variation type features, neighborhood features, and other sequence-based protein featuresPredicted ΔΔG of single-point mutation and predicted probabilityhttp://structure.bmc.lu.se/PON-Tstab/
DeepDDG [64]Protein structureSingle mutations or a list of mutationsNeural network-based predictor in which the parameters are shared for each target residue–neighbor residue pairPrediction of the change in folding free energy upon mutation (ΔΔG)http://protein.org.cn/ddg.html
Table 2

List of methods to predict the thermodynamic stability changes of proteins upon mutations based on machine learning approaches, developed or updated in the last 15 years and freely available to users via web servers. All the web servers listed here were available at the time of the submission of the review

Name and original referenceInput dataInput data and optional settingsType of approachOutputAvailability
I-Mutant 2.0 [21, 23]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermDirection of the free energy change and its value for either all possible mutations of a particular residue or only for a specific mutationhttp://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant2.0/I-Mutant2.0.cgi
MUpro [26]Protein sequence. Protein structure if availableSVM- and neural network-based predictors, trained on the same dataset as I-Mutant 2.0.Prediction of the value of energy change using support vector machine, using regression methods (recommended). Prediction of the sign of energy change using support vector machines and neural networks, using classification methodshttp://mupro.proteomics.ics.uci.edu/
I-Mutant 3.0 [31]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermΔΔG value and binary classification (ΔΔG ≥ 0, ΔΔG <0) or ternary classification (ΔΔG < -0.5, −0.5 ≤ ΔΔG ≤ 0.5,
ΔΔG >0.5)
http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi
mCSM [46]Protein structureSingle mutation, mutation list or systematic mutations on a single residueGraph-based distance patterns among atoms to represent the residue environment and a ‘pharmacophore count’ vector to account for the atom changes introduced by the mutation. The resulting signature vector is used to train predictive machine learning methods in regression and classification tasksPrediction of the direction of the change in stability and actual numerical experimental value. Also prediction of change in affinity of protein–protein and protein–DNA complexes upon mutationhttp://biosig.unimelb.edu.au/mcsm/
NeEMO [43]Protein structureTemperature; pH; amino acid to be substituted (one or more, manually edited)Calculation of residue–residue interaction networks where nodes represent residues and edges represent different types of physicochemical bonds. These graphs are used to train neural network for the prediction of stability changesPrediction of ΔΔG changes upon point mutationshttp://protein.bio.unipd.it/neemo/
AUTO-MUTE 2.0 (Stability changes tool) [32, 44]Protein structure (no multiple models, no gaps; no alternative conformations for alpha-carbon atoms)Temperature; pH; amino acid to be substituted (single or systematic)Two supervised classification models (random forest and SVM) to predict only the sign of ΔΔG; two regression models (tree regression and SVM regression) to predict the actual value of ΔΔGEither predicted sign of ΔΔG along with a confidence level or predicted value of ΔΔG with other information about structural features of the mutanthttp://binf2.gmu.edu/automute/AUTO-MUTE_Stability_ddG.html
INPS-MD [47, 54]Protein sequence (INPS) or structure (INPS3D)Support vector regression trained on descriptors encoding mutation type (in particular, substitution score, hydrophobicity score, mutability index of native residue, molecular weights of native and mutant residues) and evolutionary information (INPS). Addition of structural features such as relative solvent accessibility of native residue and local energy difference calculated by a contact potential (INPS3D)Changes in ΔG values upon residue substitution in the protein sequencehttps://inpsmd.biocomp.unibo.it/welcome/default/index
EASE-MM [51]Protein sequence of a single domain monomeric proteinCombination of five specialized SVM models to predict ΔΔG of mutations. Each SVM combines a different set of features encoding evolutionary conservation, amino acid parameters and predicted structural properties such as secondary structures and different levels of accessible surface areasPredicted ΔΔG and stability class: ∆∆Gu in (−inf, −1), destabilizing; ∆∆Gu in (−1, −0.5), likely destabilizing; ∆∆Gu in (−0.5, 0.5), neutral; ∆∆Gu in (0.5, 1), likely stabilizing; ∆∆Gu in (1, +inf), stabilizing. Predicted secondary structure and relative accessible surface area of the mutation sitehttps://sparks-lab.org/server/ease-mm/
STRUM [53]Protein sequenceSingle variation or multiple variationGradient boosting regression approach using different features sequence profile scores for evolutionary information, structural profile scores and different energy functions providing accurate environment informationPredicted ΔΔG of single-point mutationhttp://zhanglab.ccmb.med.umich.edu/STRUM/
PON-tstab [63]Protein sequenceTemperature; pH; single variation or multiple variationRandom forests tool based on similarity features, conservation features, amino acid features, variation type features, neighborhood features, and other sequence-based protein featuresPredicted ΔΔG of single-point mutation and predicted probabilityhttp://structure.bmc.lu.se/PON-Tstab/
DeepDDG [64]Protein structureSingle mutations or a list of mutationsNeural network-based predictor in which the parameters are shared for each target residue–neighbor residue pairPrediction of the change in folding free energy upon mutation (ΔΔG)http://protein.org.cn/ddg.html
Name and original referenceInput dataInput data and optional settingsType of approachOutputAvailability
I-Mutant 2.0 [21, 23]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermDirection of the free energy change and its value for either all possible mutations of a particular residue or only for a specific mutationhttp://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant2.0/I-Mutant2.0.cgi
MUpro [26]Protein sequence. Protein structure if availableSVM- and neural network-based predictors, trained on the same dataset as I-Mutant 2.0.Prediction of the value of energy change using support vector machine, using regression methods (recommended). Prediction of the sign of energy change using support vector machines and neural networks, using classification methodshttp://mupro.proteomics.ics.uci.edu/
I-Mutant 3.0 [31]Protein structure or sequenceTemperature; pHSVM-based web server, trained on a dataset derived from ProThermΔΔG value and binary classification (ΔΔG ≥ 0, ΔΔG <0) or ternary classification (ΔΔG < -0.5, −0.5 ≤ ΔΔG ≤ 0.5,
ΔΔG >0.5)
http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi
mCSM [46]Protein structureSingle mutation, mutation list or systematic mutations on a single residueGraph-based distance patterns among atoms to represent the residue environment and a ‘pharmacophore count’ vector to account for the atom changes introduced by the mutation. The resulting signature vector is used to train predictive machine learning methods in regression and classification tasksPrediction of the direction of the change in stability and actual numerical experimental value. Also prediction of change in affinity of protein–protein and protein–DNA complexes upon mutationhttp://biosig.unimelb.edu.au/mcsm/
NeEMO [43]Protein structureTemperature; pH; amino acid to be substituted (one or more, manually edited)Calculation of residue–residue interaction networks where nodes represent residues and edges represent different types of physicochemical bonds. These graphs are used to train neural network for the prediction of stability changesPrediction of ΔΔG changes upon point mutationshttp://protein.bio.unipd.it/neemo/
AUTO-MUTE 2.0 (Stability changes tool) [32, 44]Protein structure (no multiple models, no gaps; no alternative conformations for alpha-carbon atoms)Temperature; pH; amino acid to be substituted (single or systematic)Two supervised classification models (random forest and SVM) to predict only the sign of ΔΔG; two regression models (tree regression and SVM regression) to predict the actual value of ΔΔGEither predicted sign of ΔΔG along with a confidence level or predicted value of ΔΔG with other information about structural features of the mutanthttp://binf2.gmu.edu/automute/AUTO-MUTE_Stability_ddG.html
INPS-MD [47, 54]Protein sequence (INPS) or structure (INPS3D)Support vector regression trained on descriptors encoding mutation type (in particular, substitution score, hydrophobicity score, mutability index of native residue, molecular weights of native and mutant residues) and evolutionary information (INPS). Addition of structural features such as relative solvent accessibility of native residue and local energy difference calculated by a contact potential (INPS3D)Changes in ΔG values upon residue substitution in the protein sequencehttps://inpsmd.biocomp.unibo.it/welcome/default/index
EASE-MM [51]Protein sequence of a single domain monomeric proteinCombination of five specialized SVM models to predict ΔΔG of mutations. Each SVM combines a different set of features encoding evolutionary conservation, amino acid parameters and predicted structural properties such as secondary structures and different levels of accessible surface areasPredicted ΔΔG and stability class: ∆∆Gu in (−inf, −1), destabilizing; ∆∆Gu in (−1, −0.5), likely destabilizing; ∆∆Gu in (−0.5, 0.5), neutral; ∆∆Gu in (0.5, 1), likely stabilizing; ∆∆Gu in (1, +inf), stabilizing. Predicted secondary structure and relative accessible surface area of the mutation sitehttps://sparks-lab.org/server/ease-mm/
STRUM [53]Protein sequenceSingle variation or multiple variationGradient boosting regression approach using different features sequence profile scores for evolutionary information, structural profile scores and different energy functions providing accurate environment informationPredicted ΔΔG of single-point mutationhttp://zhanglab.ccmb.med.umich.edu/STRUM/
PON-tstab [63]Protein sequenceTemperature; pH; single variation or multiple variationRandom forests tool based on similarity features, conservation features, amino acid features, variation type features, neighborhood features, and other sequence-based protein featuresPredicted ΔΔG of single-point mutation and predicted probabilityhttp://structure.bmc.lu.se/PON-Tstab/
DeepDDG [64]Protein structureSingle mutations or a list of mutationsNeural network-based predictor in which the parameters are shared for each target residue–neighbor residue pairPrediction of the change in folding free energy upon mutation (ΔΔG)http://protein.org.cn/ddg.html
Close
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close

This PDF is available to Subscribers Only

View Article Abstract & Purchase Options

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Close