Network control principles for identifying personalized driver genes in cancer Free

Summary of the available data sources including the personalized samples resources and gene annotation resources for identifying PDGs

Datasets	Description	Website	References
Resources of personalized samples
TCGA	Characterization of over 20 000 primary cancer and matched normal samples spanning 33 cancer types	http://cancergenome.nih.gov	[13]
GEO	A public functional genomics data repository supporting MIAME-compliant data submissions	https://www.ncbi.nlm.nih.gov/geo/	[15]
COSMIC	The largest source of expert manually curated somatic mutation information relating to human cancers	http://cancer.sanger.ac.uk/cosmic	[59]
CCLE	A public access to genomic data, analysis and visualization for over 1457 cell lines	https://portals.broadinstitute.org/ccle/	[11, 12]
cBioPortal	An open-access resource for interactive exploration of multidimensional cancer genomics datasets	http://www.cbioportal.org	[60]
ICCG	A resource for functional roles of mutations	https://icgc.org	[14]
Resources of gene annotations
CCG	Mutations of different forms that were experimentally validated as driver genes for different cancer types	https://cancer.sanger.ac.uk/census/	[59]
BioGPS	An online gene annotation resource for enabling users to easily aggregate data on a gene from more than 150 external sources.	http://biogps.gnf.org	[61]
NCG	A list of 2372 cancer genes including 711 known cancer genes and tumor suppressor or oncogene annotations from 273 manually curated publications.	http://ncg.kcl.ac.uk/	[63]
DisGeNET	A platform containing one of the largest publicly available collections of genes and variants associated to human diseases.	http://www.disgenet.org/	[64]

Datasets	Description	Website	References
Resources of personalized samples
TCGA	Characterization of over 20 000 primary cancer and matched normal samples spanning 33 cancer types	http://cancergenome.nih.gov	[13]
GEO	A public functional genomics data repository supporting MIAME-compliant data submissions	https://www.ncbi.nlm.nih.gov/geo/	[15]
COSMIC	The largest source of expert manually curated somatic mutation information relating to human cancers	http://cancer.sanger.ac.uk/cosmic	[59]
CCLE	A public access to genomic data, analysis and visualization for over 1457 cell lines	https://portals.broadinstitute.org/ccle/	[11, 12]
cBioPortal	An open-access resource for interactive exploration of multidimensional cancer genomics datasets	http://www.cbioportal.org	[60]
ICCG	A resource for functional roles of mutations	https://icgc.org	[14]
Resources of gene annotations
CCG	Mutations of different forms that were experimentally validated as driver genes for different cancer types	https://cancer.sanger.ac.uk/census/	[59]
BioGPS	An online gene annotation resource for enabling users to easily aggregate data on a gene from more than 150 external sources.	http://biogps.gnf.org	[61]
NCG	A list of 2372 cancer genes including 711 known cancer genes and tumor suppressor or oncogene annotations from 273 manually curated publications.	http://ncg.kcl.ac.uk/	[63]
DisGeNET	A platform containing one of the largest publicly available collections of genes and variants associated to human diseases.	http://www.disgenet.org/	[64]

Table 1

Summary of the available data sources including the personalized samples resources and gene annotation resources for identifying PDGs

Datasets	Description	Website	References
Resources of personalized samples
TCGA	Characterization of over 20 000 primary cancer and matched normal samples spanning 33 cancer types	http://cancergenome.nih.gov	[13]
GEO	A public functional genomics data repository supporting MIAME-compliant data submissions	https://www.ncbi.nlm.nih.gov/geo/	[15]
COSMIC	The largest source of expert manually curated somatic mutation information relating to human cancers	http://cancer.sanger.ac.uk/cosmic	[59]
CCLE	A public access to genomic data, analysis and visualization for over 1457 cell lines	https://portals.broadinstitute.org/ccle/	[11, 12]
cBioPortal	An open-access resource for interactive exploration of multidimensional cancer genomics datasets	http://www.cbioportal.org	[60]
ICCG	A resource for functional roles of mutations	https://icgc.org	[14]
Resources of gene annotations
CCG	Mutations of different forms that were experimentally validated as driver genes for different cancer types	https://cancer.sanger.ac.uk/census/	[59]
BioGPS	An online gene annotation resource for enabling users to easily aggregate data on a gene from more than 150 external sources.	http://biogps.gnf.org	[61]
NCG	A list of 2372 cancer genes including 711 known cancer genes and tumor suppressor or oncogene annotations from 273 manually curated publications.	http://ncg.kcl.ac.uk/	[63]
DisGeNET	A platform containing one of the largest publicly available collections of genes and variants associated to human diseases.	http://www.disgenet.org/	[64]

Datasets	Description	Website	References
Resources of personalized samples
TCGA	Characterization of over 20 000 primary cancer and matched normal samples spanning 33 cancer types	http://cancergenome.nih.gov	[13]
GEO	A public functional genomics data repository supporting MIAME-compliant data submissions	https://www.ncbi.nlm.nih.gov/geo/	[15]
COSMIC	The largest source of expert manually curated somatic mutation information relating to human cancers	http://cancer.sanger.ac.uk/cosmic	[59]
CCLE	A public access to genomic data, analysis and visualization for over 1457 cell lines	https://portals.broadinstitute.org/ccle/	[11, 12]
cBioPortal	An open-access resource for interactive exploration of multidimensional cancer genomics datasets	http://www.cbioportal.org	[60]
ICCG	A resource for functional roles of mutations	https://icgc.org	[14]
Resources of gene annotations
CCG	Mutations of different forms that were experimentally validated as driver genes for different cancer types	https://cancer.sanger.ac.uk/census/	[59]
BioGPS	An online gene annotation resource for enabling users to easily aggregate data on a gene from more than 150 external sources.	http://biogps.gnf.org	[61]
NCG	A list of 2372 cancer genes including 711 known cancer genes and tumor suppressor or oncogene annotations from 273 manually curated publications.	http://ncg.kcl.ac.uk/	[63]
DisGeNET	A platform containing one of the largest publicly available collections of genes and variants associated to human diseases.	http://www.disgenet.org/	[64]

Problem formulation for network control-based PDGs identification

Cancer can be perceived as a dysfunction of molecular networks that regulate molecular communications and cellular processes [65]. Molecular networks, such as gene regulation networks or signaling networks, are highly adaptable and dynamic. To fully understand the cancer progression, we need to understand the dynamics of these networks in respect to control theory [66]. In particular, we consider the cancer progression from a normal state to a disease state as a network control problem, in which PDGs are altered genes by the input signals whose state transition can change the whole network state from the initial state to the desired state. To address this kind of problem, the following broader model class is generally considered [67]:

$$\begin{equation} \left\{\begin{array}{@{}l} dx/ dt=\mathrm{F}\left(x,A\right)+ Bu\\ {}y=\mathbf{C}x\end{array}\right., \end{equation}$$

(1)

where x∈R^N and y∈R^N° respectively denote the gene state and observable gene state at time t in an individual system; |$\mathbf{A}\in {R}^{N\times N}$| and |$\mathbf{C}\in {R}^{N_O\times N}$| respectively represent the state transition matrix and output matrix; |$\mathbf{B}\in {R}^{N\times {N}_C}$| characterizes the driving by N_C controllers with the genes. The ‘controllers’ in network control can produce the input signals to make the state transition of the whole network. As in many cases, it is assumed that one driver node can be altered by one independent input signal [68–74]. The element B_ij is nonzero if the j-th input signal directly acts on node v_i. The output and input matrices are set as |${\mathbf{C}}^T=\Big[\mathbf{I}\Big({c}_1\Big),\mathbf{I} \Big({c}_2\Big),\dots, \mathbf{I}\Big({c}_{N_O}\Big)\Big]$| and |${\mathbf{B}}^T=\Big[\mathbf{I}\Big({b}_{k_1}\Big),\mathbf{I} \Big({b}_{k_2}\Big),\dots, \mathbf{I}\Big({b}_{k_T}\Big)\Big]$|⁠, respectively; |$\Big\{{\mathrm{c}}_1,{c}_2,\dots, {c}_{N_O}\Big\}$| and |$\Big\{{b}_1,{b}_2,\dots, {b}_{N_C}\Big\}$| are the index of the set of observable genes O and constrained control genes U, respectively; |$\mathbf{I}(i)$|denotes the i-th column of the |$N\times N$| identity matrix I.

In biological system/network, the observable genes are usually identified as the differentially expressed genes, and the constrained control genes are defined as the genes that can be altered by the input signals. Generally, it is assumed that if the observable genes are changed from the initial attractor (i.e. one stable state) to the desired attractor (i.e. the other stable state), the system is controlled by the input signals (i.e., oncogene activation signal). In this case, PDGs identification is necessary to find the feasible subset of genes K from the constrained control genes set U, which are injected by proper input signals to nudge a complex nonlinear individual system in cancer from the normal state to the disease state (Figure 1), satisfying the following requirements:

$$\begin{equation} u\left({v}_i\right)=\left\{\begin{array}{l}{u}_i,\mathrm{if}\kern0.5em {v}_i\in K\\ {}0,\mathrm{otherwise}\end{array}\right., \end{equation}$$

(2)

Figure 1

Network control problem to identify PDGs. We assume that each patient has a personalized state transition network during cancer progression in which each edge denotes a pair of interacted genes, while the principles of the personalized network dynamics are unknown. The personalized state transition network is defined as a directed or undirected graph in which each edge denotes the significant interaction difference of gene pairs between the normal state and the tumor state. The network control problem for identifying PDGs is formed as how we can select a feasible subset of network nodes (PDGs) from the personalized state transition network, which can be injected through oncogene activations, to nudge a complex, nonlinear individual system from normal state to disease state. The PDGs may be oncogene activated or drug activated. If the PDGs are activated by oncogene signals, the system state will be changed from a normal state to a disease state. Meanwhile, if the PDGs are drug-activated, the system state should change from disease state to normal state.

Considering the gene expression profiles in normal and tumor samples as the respective state of a given patient, network control tools aim to detect a small number of altered genes by the input signals related with the state transition of individual patient depending on adequate knowledge of the network structure. The PDGs are the altered nodes/genes by the input signals which can make the state transition of the whole biological network. The input signals may be oncogene activation signals such as genetic mutation, gene amplification, chromosomal rearrangement or transposable elements. The ‘controllers’ in network control problem for identifying PDGs mean the genetic or environment factors which produce the oncogene activation signals.

As noted, the state transition of a system from the disease state to the normal state also can be handled when the input signals include drug activations or similar indications (Figure 1). These PDGs can also be efficient sources of drug targets for drug discovery and drug repositioning [75–78].

Obviously, applying network control methods for PDGs identification needs two key steps. One is to construct the personalized state transition networks which are involved in the state transition during disease development for each patient. Another is to design the optimal network control methods based on the structure of personalized state transition networks. Therefore, we will give the detailed discussions on how to apply network control tools for identifying PDGs from such two respects below.

Reconstruction of the personalized state transition network

The personalized state transition network represents which gene pairs are involved in the disease development for each patient. Because the principles of the personalized network dynamics are hidden, it is important to reconstruct the personalized state transition networks with the personalized genetic data (e.g. expression profiles). The ability to unravel the dynamic nature of gene regulation during a biological process is a key challenge in systems biology. Most of the studies for exploiting gene regulation, such as the Gene Network Reconstruction tool [79], the dynamic cascaded method [80], the HotNet2 [21] and the local Bayesian network [81], can describe only the dynamic gene regulation for population samples, and they are not suitable for an individual patient. Some researchers have recognized that most of the currently available methods do not adequately account for heterogeneity in the number of mutations expected by chance. Consequently, they yield many false-positive calls, particularly in cancers with high mutation rates [82–84]. Thus, it is urgently to develop more efficient methods to discover gene regulations or dysfunctional regulations for individuals in a network manner. This section will introduce several techniques to search for the personalized state transition networks (Table 2), including SCS [23], Linear Interpolation to Obtain Network Estimates for Single Samples (LIONESS) [85], Single-Sample Network (SSN) [86], Paired-Single-Sample Network (Paired-SSN) [87] and VarWalker [88].

Table 2

Summary of different methods to construct the personalized state transition networks

Methods	Description	Software website	Year	Reference
SCS	Include the significant mutated genes and the differential expressed genes and the frequently interrupted interactions in directed gene interaction network	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2018	[23]
VarWalker	Include mutation genes and their close interactions in undirected gene interaction network	http://bioinfo.mc.vanderbilt.edu/VarWalker.html	2014	[88]
SSN	Construct an individual-specific network based on statistical perturbation analysis of tumor sample against a group of given control samples	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2016	[86]
LIONESS	Model regulatory network changes over time and to characterize the regulatory processes active in individual samples.	None	2019	[85]
Paired-SSN	The differential coexpression network between normal sample network and tumor sample network for each patient	None	2019	[87]

Methods	Description	Software website	Year	Reference
SCS	Include the significant mutated genes and the differential expressed genes and the frequently interrupted interactions in directed gene interaction network	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2018	[23]
VarWalker	Include mutation genes and their close interactions in undirected gene interaction network	http://bioinfo.mc.vanderbilt.edu/VarWalker.html	2014	[88]
SSN	Construct an individual-specific network based on statistical perturbation analysis of tumor sample against a group of given control samples	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2016	[86]
LIONESS	Model regulatory network changes over time and to characterize the regulatory processes active in individual samples.	None	2019	[85]
Paired-SSN	The differential coexpression network between normal sample network and tumor sample network for each patient	None	2019	[87]

Table 2

Summary of different methods to construct the personalized state transition networks

Methods	Description	Software website	Year	Reference
SCS	Include the significant mutated genes and the differential expressed genes and the frequently interrupted interactions in directed gene interaction network	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2018	[23]
VarWalker	Include mutation genes and their close interactions in undirected gene interaction network	http://bioinfo.mc.vanderbilt.edu/VarWalker.html	2014	[88]
SSN	Construct an individual-specific network based on statistical perturbation analysis of tumor sample against a group of given control samples	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2016	[86]
LIONESS	Model regulatory network changes over time and to characterize the regulatory processes active in individual samples.	None	2019	[85]
Paired-SSN	The differential coexpression network between normal sample network and tumor sample network for each patient	None	2019	[87]

Methods	Description	Software website	Year	Reference
SCS	Include the significant mutated genes and the differential expressed genes and the frequently interrupted interactions in directed gene interaction network	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2018	[23]
VarWalker	Include mutation genes and their close interactions in undirected gene interaction network	http://bioinfo.mc.vanderbilt.edu/VarWalker.html	2014	[88]
SSN	Construct an individual-specific network based on statistical perturbation analysis of tumor sample against a group of given control samples	http://sysbio.sibcb.ac.cn/cb/chenlab/software.htm	2016	[86]
LIONESS	Model regulatory network changes over time and to characterize the regulatory processes active in individual samples.	None	2019	[85]
Paired-SSN	The differential coexpression network between normal sample network and tumor sample network for each patient	None	2019	[87]

Figure 2

Overview of SCS to identify a personalized state transition network. For the gene expression data and gene mutation profiles (SNP and CNV) for individual patient, SCS identifies the differentially expressed genes and extracted the mutation genes and their close interactors as the personalized state transition network using the RWR algorithm and a randomization-based test in the directed gene interaction network.

To provide a guide and comparison for selecting a reasonable method for reconstructing the personalized state transition networks, we grouped the above approaches into two categories according to the features of used datasets.

(i) Mutation data-based methods (i.e., VarWalker and SCS)

These methods include mutation genes and their close interactors for each sample in the human gene interaction network. VarWalker [88] first assesses the mutation probabilities of all human genes by fitting them to a generalized additive model based on the patient- or sample-specific mutational profile. Then the Random Walker with Restart (RWR) method is executed for each sample to search for interactions among the filtered mutation genes in the human interactome. Finally, VarWalker introduces a randomization-based test to evaluate the candidate interactors by utilizing multiple topologically matched random networks which form the personalized state transition networks.

SCS [23] first identifies the differential expression genes by calculating the log2 fold-change of gene expression between the paired tumor and normal samples. A significance of ±1 is used to indicate the differentially expressed genes for each patient. Then both the mutation genes and their interactors are extracted from each patient by using the RWR algorithm for each patient. Finally, the individual mutation genes, the individual Differentially Expressed Genes method (DEGs) and the interactors are formed as the personalized state transition networks (Figure 2).

(ii) Bulk gene expression data-based methods (i.e., LIONESS, SSN and Paired-SSN)

These methods reconstruct the personalized state transition networks in the bulk gene expression data of tumor samples. LIONESS [85] does not rely upon differential analysis between the tumor sample and a group of normal samples but reconstructs the individual specific network in a population of tumor samples as the personalized state transition networks for each tumor sample. LIONESS constructs the personalized state transition networks by calculating the edge statistical significance between all the tumor samples and the tumor samples without a given single sample. Furthermore, LIONESS can use multiple aggregate network reconstruction techniques for constructing single sample network including Pearson correlation coefficient (PCC) [89], Passing Attributes between Networks for Data Assimilation (PANDA) [90], Mutual Information (MI) [91] and Context Likelihood of Relatedness (CLR) [92].

SSN [86] is a statistical method to construct an individual-specific network solely based on expression data of a single sample, rather than the aggregated network for a group of samples, based on statistical perturbation analysis of a single sample against a group of given control samples. In particular, the SSN method quantifies the individual-specific network of each sample against a group of given control samples in terms of statistical significance in an accurate manner. The SSN needs to have expression data for a group of normal samples, which serve as the reference samples. By using this group of samples, SSN firstly constructs the co-expression network by PCCs and then adds the single sample to the reference samples for constructing the perturbed network. Thus, the SSN method can construct the personalized state transition networks by quantifying the differential network between the reference and perturbed networks. To more precisely demonstrate the regulatory mechanism of the personalized state transition networks, SSN usually uses the protein interaction network to filter the noise of PCC between gene pairs. Note that, SSN method constructs actually a differential co-expression network between groups of normal and a single disease sample [93–97], and the advantage of SSN is that it gives a mathematical criterion to evaluate whether the edge is differential significantly or not. It can also be generalized to other calculation of edge relationship methods, such as conditional mutual inclusive information [98] and Part MI [99] and Partial MI [100].

For the Paired-SSN method [87], the co-expression network of the tumor sample network and normal sample network for each patient is constructed based on statistical perturbation analysis of one sample against a group of given reference samples (e.g. choosing the normal samples data of all of the patients as the reference data here) with the SSN method [86]. In addition, the P-value of an edge for the tumor sample or normal sample also can be obtained by using the SSN method. All of the edges with significant differential correlations (e.g., P-value <0.05) were used to constitute the normal sample network or tumor sample network. Then, the personalized differential co-expression network can be constructed in which edges exist if the P-value of the gene pairs is less than (greater than) 0.05 in the tumor network but greater than (less than) 0.05 in the normal network for each patient. Note that by using Paired-SSN, the protein interaction network is also usually used to filter the noise of co-expression network of the tumor sample network or normal sample network. Finally, personalized state transition networks can be obtained, whose edges are those existing in both the gene interaction network and personalized differential co-expression network for each patient (Figure 3).

Figure 3

Overview of using the Paired-SSN to construct the personalized state transition network. First, select all of the normal data as the reference data, and construct the tumor network and normal network based on the reference data with the SSN method [86]. Then, construct the single sample network with SSN, in which edges are those existing in both the gene–gene interaction network and the co-expression network for each sample. Finally, construct the personalized state transition network in which the edge between gene i and gene j exists if the P-value of the edge is less than (greater than) 0.05 in the tumor network, but greater than (less than) 0.05 in the normal network.

The SSN and Paired-SSN methods quantify the individual-specific network against a group of given control samples in terms of statistical significance of PCC in an accurate manner. Both SSN and Paired-SSN need a group of normal samples in terms of expression data, which serve as the reference samples. The SSN constructs the personalized state transition networks based on statistical perturbation analysis of a tumor sample against a group of given reference samples while the Paired-SSN method constructs personalized state transition networks based on statistical differential analysis of the normal sample network and tumor sample network. Therefore compared with LIONESS method, SSN and Paired-SSN consider more information of individual patients and are suitable for identifying the PDGs of individual cancer patients.

Table 3

Summary of different structure-based control methods including MMS-based control methods (i.e. full control, target control and constrained target control), MDS-based control method and FVS-based control methods (i.e. DFVS and NCUA)

Methods	Categories	Targeted states	Network Styles	Dynamics	Year	Reference
MMS-based full controllability	MMS control	Any	Directed	Local nonlinear	2011	[52]
MMS-based target controllability	MMS control	Any	Directed	Local nonlinear	2014	[101,102]
MMS-based constrained target controllability	MMS control	Any	Directed	Local nonlinear	2017	[49]
MDS	MDS control	Any	Undirected	Nonlinear	2012	[55]
DFVS	FVS control	Attractors	Directed	Nonlinear	2017	[53]
NCUA	FVS control	Attractors	Undirected	Nonlinear	2019	[87]

Methods	Categories	Targeted states	Network Styles	Dynamics	Year	Reference
MMS-based full controllability	MMS control	Any	Directed	Local nonlinear	2011	[52]
MMS-based target controllability	MMS control	Any	Directed	Local nonlinear	2014	[101,102]
MMS-based constrained target controllability	MMS control	Any	Directed	Local nonlinear	2017	[49]
MDS	MDS control	Any	Undirected	Nonlinear	2012	[55]
DFVS	FVS control	Attractors	Directed	Nonlinear	2017	[53]
NCUA	FVS control	Attractors	Undirected	Nonlinear	2019	[87]

Table 3

Summary of different structure-based control methods including MMS-based control methods (i.e. full control, target control and constrained target control), MDS-based control method and FVS-based control methods (i.e. DFVS and NCUA)

Methods	Categories	Targeted states	Network Styles	Dynamics	Year	Reference
MMS-based full controllability	MMS control	Any	Directed	Local nonlinear	2011	[52]
MMS-based target controllability	MMS control	Any	Directed	Local nonlinear	2014	[101,102]
MMS-based constrained target controllability	MMS control	Any	Directed	Local nonlinear	2017	[49]
MDS	MDS control	Any	Undirected	Nonlinear	2012	[55]
DFVS	FVS control	Attractors	Directed	Nonlinear	2017	[53]
NCUA	FVS control	Attractors	Undirected	Nonlinear	2019	[87]

Methods	Categories	Targeted states	Network Styles	Dynamics	Year	Reference
MMS-based full controllability	MMS control	Any	Directed	Local nonlinear	2011	[52]
MMS-based target controllability	MMS control	Any	Directed	Local nonlinear	2014	[101,102]
MMS-based constrained target controllability	MMS control	Any	Directed	Local nonlinear	2017	[49]
MDS	MDS control	Any	Undirected	Nonlinear	2012	[55]
DFVS	FVS control	Attractors	Directed	Nonlinear	2017	[53]
NCUA	FVS control	Attractors	Undirected	Nonlinear	2019	[87]

Design of the structure-based network control methods

The control process usually is determined by an intrinsic structure and dynamic propagation. Although we have adequate knowledge of the underlying wiring diagram, we lack knowledge of the specific functional forms for biological systems [53]. Analyzing such complicated systems requires the concepts and approaches of structure-based control, which can be used to investigate the controllability of complex networks through a minimum set of driver nodes, even though the edge weights are precisely unknown. Table 3 summarizes the differences among the various Maximum Matching Sets (MMS)-based control methods, including full control, target-control, constrained target control, Minimum Dominating sets (MDSs)-based control method and Feedback Vertex Sets (FVSs)-based control methods. This comparison makes the structure-based control concepts and methods easier to understand. These tools may give a specific view of the network control properties of a system with linear or nonlinear dynamics, which are introduced in detail as follows.

Figure 4

Demonstration of the structure-based control methods. (A) Demonstration of MMS-based control method (full control). MMS-based control method can identify the unmatched nodes {v₃,v₄,v₅,v₆,v₇,v₁₀} (red nodes) on the right side of the bipartite graph transferred from the directed network, as the driver nodes. (B) Demonstration of MDS-based control method. The network is structurally controllable by selecting an MDS {v₁,v₄} because each dominated node has its own control signal. (C) Demonstration of DFVS. For DFVS algorithm, the controllability is determined by the FVS {v₉} and the source nodes of the network {v₃}. (D) Demonstration of NCUA. The NCUA assumes that the edges of the undirected networks are modeled as the bidirected edges. NCUA first constructs a bipartite graph from the original undirected network, in which the nodes of the top side are the nodes of the original graph, and the nodes of the bottom side are the edges of the original graph. Then, it determines the MDS of the top-side nodes {v₁,v₄,v₉} to cover the bottom-side nodes in the bipartite graph using ILP. The red nodes in (A–D) are the driver nodes identified by MMS, MDS, DFVS and NCUA, respectively.

The first type is the MMS-based control methods. It enables us to investigate the controllability of linear or local nonlinear structural networks through a minimum set of driver nodes [52]. In the MMS-based control methods, a network with the canonical linear time-invariant dynamics can be considered, and thus the system (1) is reduced as follows:

$$\begin{equation} \left\{\begin{array}{l}\frac{\mathrm{d}\dot{x}}{\mathrm{d}t}=\mathbf{A}x+\mathbf{B}u\\ {}y=\mathbf{C}x\end{array}\right.. \end{equation}$$

(3)

For the MMS-based control methods, we need to find the minimum set K among the constrained control nodes set U to satisfy the following criterion:

$$\begin{equation} \max \left\{\operatorname{rank}\left(\left[\mathbf{CB},\mathbf{CAB},{\mathbf{CA}}^{\mathbf{2}}\mathbf{B},\dots, {\mathbf{CA}}^{\mathbf{N}-\mathbf{1}}\mathbf{B}\right]\right)\right\}={N}_O. \end{equation}$$

(4)

When Equation (4) is satisfied, the system in Equation (3) is structurally controllable. Note that the maximum in Equation (4) implies that given the input and output matrices B and C, we need to choose the proper nonzero weights in A to satisfy Equation (4)—that is when|${N}_O=N,{N}_C=N$|⁠, both |$\Big\{{\mathrm{c}}_1,{c}_2,\dots, {c}_{N_O}\Big\}$| and |$\Big\{{b}_1,{b}_2,\dots, {b}_{N_C}\Big\}$| are all of the nodes in the network, and the controllability is the full controllability [52]. The full controllability concerns the ability to drive the state of all of the nodes of the network to their desired values by selecting driver nodes from all of the nodes. Thus, when |${N}_C=N$|⁠,|$\Big\{{b}_1,{b}_2,\dots, {b}_{N_C}\Big\}$| are all of the nodes of the network, and the controllability is the output controllability or the target controllability [101, 102]. This target controllability aims to control the state of the target nodes by choosing driver nodes from all of the nodes. In contrast, the objective of constraint target controllability (CTC) is to control the state of target nodes by choosing driver nodes only from the set of constrained control nodes (i.e. the candidate driver nodes) [49]. The CTC can be viewed as a more general framework than full controllability [52] or target controllability [102]. So far, this kind of structure-based network control approach has been applied widely to analyze the mechanism of biological networks in diverse fields [103–107]. In Figure 4A, we intuitively explained how the MMS-based method (full control) can be used to analyze the controllability of complex networks.

The second type is the MDS-based control methods. It is another important model for the controllability of dynamic networks. Nacher and Akutsu [55] introduced the MDS method to the controllability study for undirected networks by assuming that each edge in a network is bidirectional, and they showed that a network is structurally controllable by selecting the nodes in the MDS as the driver nodes (Figure 4B). Nacher and Akutsu [55] observed that only driver nodes in the MMS model can be controlled directly through external signals, whereas each driver node in the MDS model can control its associated edges independently. Furthermore, each non-driver node is controllable if it is at least adjacent to a driver node. The MDS-based model may lead to the identification of important nodes for the control of networks [55]. It also has been recognized that the MDS model is capable of controlling an undirected network, by assuming that each node in the MDS can control all of its outgoing edges separately [55, 108]. Despite its success and widespread application in searching for the important genes in the protein interaction network [107, 109–112], the MDS-based model may be more expensive, because each driver node controls its outgoing links independently and thus more powerful control is required (Figure 4B).

The third type is the FVS-based control methods. Network dynamics are commonly nonlinear, especially at the level of nodes or small groups of nodes in the network [113]. In past decades, the focus of network control research has shifted from linear dynamics to nonlinear dynamics [50, 67, 114–117], and one of these methods, i.e. the FVS-based control [50, 118] can be reliably applied to large complex networks in which the structure is well known and the functional form of the governing equations is not arbitrary but must satisfy some specific properties. To drive the state of a network to any one of its naturally occurring end states (i.e. dynamical attractors), Feedback Control needs to manipulate a set of FVS that intersects every feedback loop in the network. For this Feedback Control, F(x,A) in Equation (1) is not any nonlinear function but requires only a few conditions (e.g. continuous, dissipative and decaying) on the nonlinear functions [50, 118]. Precisely, the definition of FVS is a subset of nodes in the graph, such that the removal of the set leaves the graph without feedback loops. Note that Akutsu et al. [119] gave a preliminary result for Feedback Control. For Boolean networks, they showed that singleton attractors (stable steady states) can be determined only from the states of nodes in FVS. Therefore, the Feedback Control can be used to search the minimum set of nodes which drive the state of a network to any one of its attractors. Given a directed graph G = (V‚ E), the FVS can be calculated using integer linear programming (ILP) formalization [120]. This algorithm utilizes a scheme that enables weights to be assigned to vertices to capture an ordering relationship among the vertices. The ILP is formulized is as follows:

$$\begin{equation} {\displaystyle \begin{array}{l}\min \sum \limits_i{x}_i\\ {}s.t.{w}_i-{w}_j+{nx}_i\ge 1\kern0.5em \left(\ {v}_i\to {v}_j\in E\right)\\ {}0\le {w}_i\le 1,{x}_i=\left\{0,1\right\}\end{array}}, \end{equation}$$

(5)

where each constrained condition is set for each edge in E. Then the noted ILP can be used to perform the FVS calculation in the directed network.

Recently, the Directed FVS-based control method (DFVS) is proposed under the framework of the Feedback Control to study dynamic models of direct networks. DFVS illustrates that controllability can be determined by the cycle structure and the source nodes of a directed FVS network [53]. DFVS, however, mainly focuses on the structural control of direct networks with nonlinear dynamics. Therefore, to solve the control problem of nonlinear undirected networks, Guo et al. developed a novel algorithm, Nonlinear Control of Undirected networks Algorithm (NCUA) under the framework of Feedback Control, which is based on the assumption that the edges of these undirected networks are modeled as bi-directed edges [87]. For a given undirected network G (V, E), Guo et al. assumed that each edge is bidirectional, and G (V, E) is converted into a bipartite graph G (V_T,V_⊥,E₁), where V_T≡V and V_⊥≡E. If v_i∈V_T is one of the nodes for v_j∈V_⊥, they added an edge connecting v_i and v_j into set E₁. After the bipartite graph is obtained, they adopted a modified version of the dominating set, in which the dominating set must be selected from V_T and is also sufficient to dominate all of the nodes in V_⊥. The MDS cover problem can be solved by the following ILP model:

$$\begin{equation} \displaystyle \begin{array}{l}\min \sum \limits_{v\in {V}_{\mathrm{T}}}{x}_v\kern0.5em \\ {}s.t.\sum \limits_{\left\{v,u\right\}\in E}{x}_v\ge 1\left(\mathrm{every}u\in {V}_{\perp}\right),{x}_v\in \left\{0,1\right\}\end{array},\end{equation}$$

(6)

where x_i will take the value 1 when node i belongs to the dominating cover set; the objective is to obtain the minimum number of nodes in |${V}_{\mathrm{T}}$| to cover all the nodes in |${V}_{\perp }$|⁠.

For FVS-based control methods, including DFVS and NCUA, the optional solution can be obtained efficiently for moderate sizes of graphs with up to a few tens of thousands of nodes by utilizing an algorithm that applies the classic branch and bound method [121, 122] to determine an optimal solution, although the search for FVS is a non-deterministic polynomial-hard problem. Figure 4C and D illustrates the corresponding processes of DFVS and NCUA to discover the driver nodes, respectively.

In fact, the target control of complex networks is usually useful for the practical application in biological networks [102, 103]. However, as seen from the above discussions, MDS-based control method and FVS-based control methods (DFVS and NCUA) focus on the full control of networks. That is they assume the observable nodes O and constrained control nodes U are the whole network nodes. Therefore, in the future, for more actual control purpose in biological networks it will be necessary to introduce the target control [102, 103] and constrained target control [104] into MDS-based control method and FVS-based control methods.

Performance assessment of the structure control-based PDGs identification methods

According to the requirement of network control principles, normal–disease paired samples needed to be obtained. That is each individual should have paired samples (i.e. a normal sample and a tumor sample). To demonstrate the usage of the structure control principles, we used those cancer datasets that contain sufficient number of normal–disease paired samples (>20 paired samples) in TCGA for case study here. By searching TCGA, we found 13 cancer datasets which can meet the requirements. In Table 4, we gave a summary of the sample information including the number of normal samples, tumor samples and paired samples in all 33 cancer datasets of TCGA.

Based on the collected 13 cancer datasets, we used LIONESS, SSN and Paired-SSN for constructing personalized state transition networks. As LIONESS can be applied to multiple aggregate network reconstruction approaches, we here used PCC in the LIONESS to guarantee the fair comparison with SSN and Paired-SSN methods. To construct the personalized state transition networks, after obtaining the PCC distribution S of all gene pairs by using LIONESS, we chose a threshold to determine the differential expression edges in the sample specific network as follows:

$$\begin{equation} w=\mu (S)+2\delta (S), \end{equation}$$

(7)

where |$\mu (S)$| and |$\delta (S)$| are the mean and standard variance of the PCC absolute value distribution S of all gene pairs, respectively.

To keep the edge direction and filter the noise of PCC correlation in the personalized state transition networks, the directed protein interaction network was also used on the LIONESS and SSN and Paired-SSN, which was obtained from the literature [28] by integrating multiple types of datasets, including Mutual Exclusivity Modules in cancer [123, 124], Reactome [125], NCI-Nature Curated PID [126] and Kyoto Encyclopedia of Genes and Genomes [127]. The directed protein interaction network consists of 11 648 proteins and 211 794 edges, including self-loops within the network to account for auto-regulation events.

Table 4

Summary of the sample information including the number of normal samples, tumor samples and paired samples in all 33 TCGA cancer datasets

Abbreviations	Full name	Number of normal samples	Number of tumor samples	Number of paired samples
LAML	Acute myeloid leukemia	0	151	0
ACC	Adrenocortical carcinoma	0	79	0
BLCA	Bladder urothelial carcinoma	19	413	≤19
LGC	Breast lobular carcinoma	0	529	0
BRCA	Breast ductal carcinoma	113	1102	112
CESC	Cervical carcinoma	3	304	≤3
CHOL	Cholangiocarcinoma	9	36	≤9
COAD	Colorectal adenocarcinoma	50	478	50
ESCA	Esophageal carcinoma	11	161	≤11
GBM	Glioblastoma multiforme	5	156	≤5
HNSC	Head and neck squamous cell carcinoma	44	500	43
KICH	Kidney chromophobe carcinoma	24	65	23
KIRC	Kidney clear cell carcinoma	72	538	72
KIRP	Kidney papillary cell carcinoma	32	288	31
LIHC	Liver hepatocellular carcinoma	50	371	50
LUAD	Lung adenocarcinoma	59	533	57
LUSC	Lung squamous cell carcinoma	49	502	49
DLBC	Lymphoid neoplasm diffuse large B-cell lymphoma	0	48	0
MESO	Mesothelioma	0	86	0
OV	Ovarian serous adenocarcinoma	0	379	0
PAAD	Pancreatic ductal adenocarcinoma	4	177	≤4
PCPG	Paraganglioma & pheochromocytoma	3	178	≤3
PRAD	Prostate adenocarcinoma	52	498	52
READ	Adenocarcinoma	10	166	≤10
SARC	Sarcoma	2	259	≤2
SKCM	Skin cutaneous melanoma	1	103	≤1
STAD	Stomach adenocarcinoma	32	375	32
TGCT	Testicular germ cell cancer	0	156	0
THYM	Thymoma	2	119	≤2
THCA	Thyroid papillary carcinoma	58	502	58
UCS	Uterine carcinosarcoma	0	56	0
UCEC	Uterine corpus endometrioid carcinoma	35	551	23

Abbreviations	Full name	Number of normal samples	Number of tumor samples	Number of paired samples
LAML	Acute myeloid leukemia	0	151	0
ACC	Adrenocortical carcinoma	0	79	0
BLCA	Bladder urothelial carcinoma	19	413	≤19
LGC	Breast lobular carcinoma	0	529	0
BRCA	Breast ductal carcinoma	113	1102	112
CESC	Cervical carcinoma	3	304	≤3
CHOL	Cholangiocarcinoma	9	36	≤9
COAD	Colorectal adenocarcinoma	50	478	50
ESCA	Esophageal carcinoma	11	161	≤11
GBM	Glioblastoma multiforme	5	156	≤5
HNSC	Head and neck squamous cell carcinoma	44	500	43
KICH	Kidney chromophobe carcinoma	24	65	23
KIRC	Kidney clear cell carcinoma	72	538	72
KIRP	Kidney papillary cell carcinoma	32	288	31
LIHC	Liver hepatocellular carcinoma	50	371	50
LUAD	Lung adenocarcinoma	59	533	57
LUSC	Lung squamous cell carcinoma	49	502	49
DLBC	Lymphoid neoplasm diffuse large B-cell lymphoma	0	48	0
MESO	Mesothelioma	0	86	0
OV	Ovarian serous adenocarcinoma	0	379	0
PAAD	Pancreatic ductal adenocarcinoma	4	177	≤4
PCPG	Paraganglioma & pheochromocytoma	3	178	≤3
PRAD	Prostate adenocarcinoma	52	498	52
READ	Adenocarcinoma	10	166	≤10
SARC	Sarcoma	2	259	≤2
SKCM	Skin cutaneous melanoma	1	103	≤1
STAD	Stomach adenocarcinoma	32	375	32
TGCT	Testicular germ cell cancer	0	156	0
THYM	Thymoma	2	119	≤2
THCA	Thyroid papillary carcinoma	58	502	58
UCS	Uterine carcinosarcoma	0	56	0
UCEC	Uterine corpus endometrioid carcinoma	35	551	23

Table 4

Summary of the sample information including the number of normal samples, tumor samples and paired samples in all 33 TCGA cancer datasets

Abbreviations	Full name	Number of normal samples	Number of tumor samples	Number of paired samples
LAML	Acute myeloid leukemia	0	151	0
ACC	Adrenocortical carcinoma	0	79	0
BLCA	Bladder urothelial carcinoma	19	413	≤19
LGC	Breast lobular carcinoma	0	529	0
BRCA	Breast ductal carcinoma	113	1102	112
CESC	Cervical carcinoma	3	304	≤3
CHOL	Cholangiocarcinoma	9	36	≤9
COAD	Colorectal adenocarcinoma	50	478	50
ESCA	Esophageal carcinoma	11	161	≤11
GBM	Glioblastoma multiforme	5	156	≤5
HNSC	Head and neck squamous cell carcinoma	44	500	43
KICH	Kidney chromophobe carcinoma	24	65	23
KIRC	Kidney clear cell carcinoma	72	538	72
KIRP	Kidney papillary cell carcinoma	32	288	31
LIHC	Liver hepatocellular carcinoma	50	371	50
LUAD	Lung adenocarcinoma	59	533	57
LUSC	Lung squamous cell carcinoma	49	502	49
DLBC	Lymphoid neoplasm diffuse large B-cell lymphoma	0	48	0
MESO	Mesothelioma	0	86	0
OV	Ovarian serous adenocarcinoma	0	379	0
PAAD	Pancreatic ductal adenocarcinoma	4	177	≤4
PCPG	Paraganglioma & pheochromocytoma	3	178	≤3
PRAD	Prostate adenocarcinoma	52	498	52
READ	Adenocarcinoma	10	166	≤10
SARC	Sarcoma	2	259	≤2
SKCM	Skin cutaneous melanoma	1	103	≤1
STAD	Stomach adenocarcinoma	32	375	32
TGCT	Testicular germ cell cancer	0	156	0
THYM	Thymoma	2	119	≤2
THCA	Thyroid papillary carcinoma	58	502	58
UCS	Uterine carcinosarcoma	0	56	0
UCEC	Uterine corpus endometrioid carcinoma	35	551	23

Abbreviations	Full name	Number of normal samples	Number of tumor samples	Number of paired samples
LAML	Acute myeloid leukemia	0	151	0
ACC	Adrenocortical carcinoma	0	79	0
BLCA	Bladder urothelial carcinoma	19	413	≤19
LGC	Breast lobular carcinoma	0	529	0
BRCA	Breast ductal carcinoma	113	1102	112
CESC	Cervical carcinoma	3	304	≤3
CHOL	Cholangiocarcinoma	9	36	≤9
COAD	Colorectal adenocarcinoma	50	478	50
ESCA	Esophageal carcinoma	11	161	≤11
GBM	Glioblastoma multiforme	5	156	≤5
HNSC	Head and neck squamous cell carcinoma	44	500	43
KICH	Kidney chromophobe carcinoma	24	65	23
KIRC	Kidney clear cell carcinoma	72	538	72
KIRP	Kidney papillary cell carcinoma	32	288	31
LIHC	Liver hepatocellular carcinoma	50	371	50
LUAD	Lung adenocarcinoma	59	533	57
LUSC	Lung squamous cell carcinoma	49	502	49
DLBC	Lymphoid neoplasm diffuse large B-cell lymphoma	0	48	0
MESO	Mesothelioma	0	86	0
OV	Ovarian serous adenocarcinoma	0	379	0
PAAD	Pancreatic ductal adenocarcinoma	4	177	≤4
PCPG	Paraganglioma & pheochromocytoma	3	178	≤3
PRAD	Prostate adenocarcinoma	52	498	52
READ	Adenocarcinoma	10	166	≤10
SARC	Sarcoma	2	259	≤2
SKCM	Skin cutaneous melanoma	1	103	≤1
STAD	Stomach adenocarcinoma	32	375	32
TGCT	Testicular germ cell cancer	0	156	0
THYM	Thymoma	2	119	≤2
THCA	Thyroid papillary carcinoma	58	502	58
UCS	Uterine carcinosarcoma	0	56	0
UCEC	Uterine corpus endometrioid carcinoma	35	551	23

Based on these personalized state transition networks constructed by LIONESS, SSN and Paired-SSN, respectively, we applied the network control methods for identifying a subset of genes as the potential PDGs. On the one hand, the MMS and DFVS use the directed information but the MDS and NCUA do not consider the directed information for identifying driver nodes. On the other hand, as conventional methods selecting the PDGs, the DEG–FoldChange selects the PDGs by calculating the fold-change between a normal sample and a tumor sample (|log2(fold-change)|>1); the DEG–P-value and DEG–FDR select the PDGs by calculating P-value and FDR, respectively, between a cancer tumor sample and a group of control samples; the hub genes selection method (Network-Degree) regards the hub genes in the constructed network as cancer driver genes, where we obtained the degree distribution of all genes D in the personalized state transition networks and we also used a threshold as introduced in formula (7) to obtain the hub genes.

To give comprehensive comparisons of the network control methods with the traditional methods, the key cancer genes annotated in three lists of CCG, NCG and CSD were adopted to assess the F-measure considering both the precision and the recall of the predicted driver genes using the formula

$$\begin{equation} {F}_i=2{P}_i\ast {R}_i/\left({P}_i+{R}_i\right), \end{equation}$$

(8)

where P_i denotes the fraction of correctly predicted PDGs among all the predicted PDGs; R_i denotes the fraction of correctly predicted PDGs among all the CCG genes, NCG genes or CSD genes. Note that when using the NCG genes, we only chose the 711 known cancer driver genes. The CSD list was obtained by integrating Disease Ontology database [128] and DisGeNET database [64].

As shown in Figures 5–9, we carried on computational comparisons for evaluating the performance of these personalized state transition network construction methods (i.e. LIONESS, SSN and Paired-SSN) and network control methods (i.e. MMS, DFVS, MDS and NCUA) for identifying PDGs. The conclusions drawn from Figures 5 and 6 can be summarized as follows:

Figure 5

Performance comparisons of the network structure-based control methods (i.e. NMS, MDS, DFVS and NCUA) and the traditional methods (i.e. DEG–Foldchange, DEG–P-value, DEG–FDR and Network-Degree) on the different personalized state transition networks constructed by respectively using LIONESS, SSN and Paired-SSN methods. The personalized driver cancer genes from 13 cancer datasets are annotated in the lists of CCG, NCG and CSD databases. (A–C) are the results of the network structure-based control methods and the traditional methods in CCG list by respectively using the LIONESS, SSN and Paired-SSN network constructing methods; (D–F) are the results of the network structure-based control methods and the traditional methods in NCG list by respectively using the LIONESS, SSN and Paired-SSN network constructing methods.

(i) The network control methods are more effective for discovering the PDGs than the traditional DEG and Hub-gene selection method (Network-Degree). For example, in Figures 5 and 6, the F-measures of the network control methods are all higher than those of traditional methods, according to the enrichment of predicted personalized driver genes in the gold-standard cancer gene lists like CCG, NCG and CSD.

(ii) For the CCG and NCG lists, the F-measures of network control methods are dependent on the constructed personalized state transition networks, and we suggest SSN and Paired-SSN method as the preferred sample network construction method. For example, in Figure 5, we can see that for the CCG and NCG lists, the F-measures of network control methods would be higher when SSN and Paired-SSN networks rather than LIONESS are used.

Figure 6

Performance comparisons of the network structure-based control methods (i.e. NMS, MDS, DFVS and NCUA) and the traditional methods (i.e. DEG–Foldchange, DEG–P-value, DEG–FDR and Network-Degree) in the CSD list on the different personalized state transition networks which are constructed by respectively using (A) LIONESS, (B) SSN and (C) Paired-SSN methods.

(iii) For the CSD lists, the F-measures of network control methods show strong cancer sample heterogeneity in different cancer datasets. For example in Figure 6B and C, we can see that the F-measures of NCUA on SSN and Paired-SSN networks for BRCA cancer dataset are above 0.1, while the F-measures for lung squamous cell carcinoma (LUSC), kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP) and uterine corpus endometrial carcinoma (UCEC) cancer datasets are much lower than 0.05. Such sample heterogeneity can also be observed by other network control methods, although they would detect different cancer sites or samples.

Table 5

Summary of how to choose the network construction methods and network control methods for 13 kinds of cancer patients

	CCG	NCG	CSD
BRCA	SSN_MMS	Paired-SSN_NCUA	Paired-SSN_NCUA
COAD	Paired-SSN_NCUA	Paired-SSN_NCUA	SSN_MMS
KICH	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUSC	Paired-SSN_MDS	Paired-SSN_NCUA	Paired-SSN_DFVS
KIRC	SSN_MMS	SSN_MMS	LIONESS_MDS
KIRP	SSN_MMS	SSN_MMS	SSN_DFVS
LIHC	SSN_MMS	SSN_MMS	SSN_MMS
UCEC	SSN_MMS	SSN_NCUA	LIONESS_MDS
STAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
THCA	Paired-SSN_MMS	Paired-SSN_NCUA	SSN_MMS
PRAD	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS
HNSC	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS

	CCG	NCG	CSD
BRCA	SSN_MMS	Paired-SSN_NCUA	Paired-SSN_NCUA
COAD	Paired-SSN_NCUA	Paired-SSN_NCUA	SSN_MMS
KICH	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUSC	Paired-SSN_MDS	Paired-SSN_NCUA	Paired-SSN_DFVS
KIRC	SSN_MMS	SSN_MMS	LIONESS_MDS
KIRP	SSN_MMS	SSN_MMS	SSN_DFVS
LIHC	SSN_MMS	SSN_MMS	SSN_MMS
UCEC	SSN_MMS	SSN_NCUA	LIONESS_MDS
STAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
THCA	Paired-SSN_MMS	Paired-SSN_NCUA	SSN_MMS
PRAD	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS
HNSC	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS

Table 5

Summary of how to choose the network construction methods and network control methods for 13 kinds of cancer patients

	CCG	NCG	CSD
BRCA	SSN_MMS	Paired-SSN_NCUA	Paired-SSN_NCUA
COAD	Paired-SSN_NCUA	Paired-SSN_NCUA	SSN_MMS
KICH	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUSC	Paired-SSN_MDS	Paired-SSN_NCUA	Paired-SSN_DFVS
KIRC	SSN_MMS	SSN_MMS	LIONESS_MDS
KIRP	SSN_MMS	SSN_MMS	SSN_DFVS
LIHC	SSN_MMS	SSN_MMS	SSN_MMS
UCEC	SSN_MMS	SSN_NCUA	LIONESS_MDS
STAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
THCA	Paired-SSN_MMS	Paired-SSN_NCUA	SSN_MMS
PRAD	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS
HNSC	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS

	CCG	NCG	CSD
BRCA	SSN_MMS	Paired-SSN_NCUA	Paired-SSN_NCUA
COAD	Paired-SSN_NCUA	Paired-SSN_NCUA	SSN_MMS
KICH	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
LUSC	Paired-SSN_MDS	Paired-SSN_NCUA	Paired-SSN_DFVS
KIRC	SSN_MMS	SSN_MMS	LIONESS_MDS
KIRP	SSN_MMS	SSN_MMS	SSN_DFVS
LIHC	SSN_MMS	SSN_MMS	SSN_MMS
UCEC	SSN_MMS	SSN_NCUA	LIONESS_MDS
STAD	SSN_MMS	Paired-SSN_NCUA	SSN_MMS
THCA	Paired-SSN_MMS	Paired-SSN_NCUA	SSN_MMS
PRAD	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS
HNSC	Paired-SSN_MMS	Paired-SSN_NCUA	Paired-SSN_MMS

By summarizing the comparison results of methods and datasets from Figures 5 and 6, we gave Table 5 to show how to choose the network construction method and network control method for 13 kinds of cancer patients according to the gene lists of CCG, NCG and CSD. In addition, the case of patient ‘TCGA-BJ-A28W’ in THCA cancer is used to show the performance of the network control approaches of MMS, MDS, DFVS and NCUA by choosing a sub-network of the personalized state transition network which contains FGR gene and its neighborhood genes. The results are shown in Figure 7. In this case, the genes of {BTK, ABL1, EPHA7} in CCG and NCG gene lists are considered as the gold standard driver genes. For MMS method, it identifies 10 genes of {EPHA7, DOK1, EPHA5, EGF, EFNB1, BLNK, CCL11, EDN1, FGR, EFNA1} as the driver genes of ‘TCGA-BJ-A28W’ patient, and there is one gold standard driver gene (EPHA7) among 10 driver genes. For MDS method, there are no gold standard driver genes among the identified driver genes. For DFVS method, it identifies the FVS node of DOK1 that intersects two feedback loops in the network as driver genes, and there are no gold standard driver genes among the identified driver genes. For the NCUA method, it identifies four driver genes of {FGR, ABL1, EDN1, CSK} which can cover all the edges (feedback loop) in the undirected network, and there is one gold standard driver gene (‘ABL1’) among these four driver genes.

To further support the efficiency of network control methods by statistical significance, the enrichment P-values (calculated by using the hyper geometric test [129]) of the predicted driver genes in CCG and NCG and CSD lists were evaluated for different network control methods as shown in Figures 8 and 9.

(i) From the results in Figure 8, we can see that network control methods are actually significant for predicting the PDGs enriched in the CCG and NCG lists.
(ii) Furthermore, from Figure 9, we found the following facts: for the CSD list, the enrichment P-values of network control methods vary in the different cancer datasets. For example for UCEC cancer dataset, all the network control methods (i.e. MMS, MDS, DFVS and NCUA) on SSN, Paired-SSN and LIONESS networks do not have significant enrichment results (Figure 9). But for other cancer datasets, there are significant P-value enriched in the CSD list by using the NCUA method on the Paired-SSN networks (Figure 9C).
(iii) For the CSD list, the enrichment P-value results of network control methods are dependent on the constructed personalized state transition networks, and we suggest the Paired-SSN method as the preferred sample network construction method. For example in THCA cancer dataset, MMS and DFVS and NCUA methods on the Paired-SSN networks are significantly enriched in CSD list (Figure 9C), while all the network control methods on the LIONESS networks are not significant (Figure 9A). Furthermore, for LUSC cancer dataset, the results of DFVS and NCUA on Paired-SSN network are significantly enriched in the CSD list (Figure 9C), while the P-values of all the network control methods on SSN and LIONESS networks are not significant for predicting the PDGs enriched in the CSD list (Figure 9A and B).

The paired samples of the 13 cancer datasets and gene annotation datasets (CCG, NCG and CSD gene lists) used in this work, and the evaluation pipeline called Cancer_Network_control package, can be freely downloaded at https://github.com/NWPU-903PR/Cancer_Network_Control. Especially, the evaluation pipeline can implement three kinds of the single-sample network construction methods (i.e. LIONESS, SSN and Paired-SSN) and four kinds of the network control methods (i.e. MMS, MDS, DFVS and NCUA) for detecting PDGs in cancer.

Figure 7

Schematic demonstration of MMS, MDS, DFVS and NCUA methods on the personalized state transition network constructed with the Paired-SSN method for patient ‘TCGA-BJ-A28W’ in THCA cancer. A sub-network (containing FGR gene and its neighborhood genes) of the personalized state transition network for the ‘TCGA-BJ-A28W’ cancer patient is chosen to evaluate the performance of MMS, MDS, DFVS and NCUA. The genes {BTK, ABL1, EPHA7} in the CCG gene list (within the rectangular box in blue color) and NCG gene list (within the rectangular box in green color) are considered as the gold standard driver genes. (A) MMS method found the matching edges (red color edges) which results 10 unmatched nodes (red color nodes). These 10 genes are considered as the driver genes in which there is one standard driver gene (EPHA7). (B) MDS method identified the MDS {FGR} (red color node) as driver genes in which there are no standard driver genes. MDS assumes that the driver node can independently control its associated edges (red color edges). (C) DFVS method identified the FVS node {DOK1} (red color node) that intersects two feedback loops as the driver genes in which there are no standard driver genes.(D) NCUA method identified four driver genes (red color nodes) which can cover all the edges (feedback loop) in the undirected network. There is one standard driver gene among four driver genes.

Figure 8

The enrichment significance scores of the PDGs identified with the network structure-based control methods. The enrichment significant P-value is calculated by using the hyper geometric test [129]. The enrichment score ESg is defined as ESg = −log10(P-value). The personalized driver cancer genes from 13 cancer datasets are annotated in the lists of CCG, NCG and CSD databases. (A–C) are the results of the MMS, MDS, DFVS and NCUA methods in the CCG list by respectively using the LIONESS, SSN and Paired-SSN network constructing methods; (D–F) are the results of the MMS, MDS, DFVS and NCUA methods in the NCG list by respectively using the LIONESS, SSN and Paired-SSN network constructing methods; the red line denotes the significant threshold value ESG = 2. If the ESG for the predicted PDGs is larger than this threshold value, we think that the enrichment result is significant.

Figure 9

The enrichment significance scores of MMS, MDS, DFVS and NCUA methods in the CSD list by respectively using the (A) LIONESS, (B) SSN and (C) Paired-SSN network constructing methods. The enrichment significant P-value is calculated by using the hyper geometric test [129]. The enrichment score ESg is defined as ESg = −log10(P-value). The red line denotes the significant threshold value ESG = 2. If the ESG for the predicted PDGs is larger than this threshold value, we think that the enrichment result is significant.

Remarks on the structure control methods

Many studies have focused on controlling the system through any minimum driver-node set, but little work has been conducted for multiple driver-node sets to control the network. From the perspective of identifying PDGs, the existing structural controllability may not be efficient or optimal, and this limitation hinders the application of network control methods in the fields of biology and biomedicine. With the recent rapid development of structural controllability, many applications to complex biological networks have verified that structural controllability can provide meaningful results. Liu et al. [101] defined the classification of nodes as being critical, redundant or ordinary if the node’s absence increases, decreases or is equal to the driver nodes, respectively. This classification has been applied to the identification of disease genes and drug targets in a directed protein interaction network [130]. The control capacity is introduced as a full controllability measure to quantify the importance of a node in networks [131]. Furthermore, the control capacity can be applied to the detection of driver metabolites in the human liver metabolic network [132], driver proteins in a human signaling network [133] and critical regulatory genes in a cancer signaling network [134]. Similarly, Guo et al. [23] applied control capacity in constrained target controllability to evaluate the controlling role of a single gene to detect driver genes in gene interaction networks. In addition, Bao et al. [135] presented an algorithm to compute and evaluate the critical, intermittent and redundant vertices for controlling direct networks under the Feedback control framework. Furthermore, physical controllability [136, 137] is an important metric to evaluate how much energy is required to achieve a control purpose for the structure-based control methods.

Recently, Ronquist et al. [138] developed the data guided control (DGC) method to identify the key temporal transcription factors (TFs) in cellular reprogramming. The DGC first constructed the temporal state transition network using the principle that the state transition network should differ as little from the identity as possible. The temporal state transition network can be constructed as follows:

$$\begin{equation} {A}_k={I}_N+\frac{\left({x}_{k+1}-{x}_k\right){x}_k^T}{x_k^T{x}_k},k=1,2,\dots, T, \end{equation}$$

(9)

where N is the number of genes, x_k is the temporal expression data at time k and T is the desired time. Then, given a set of TF p, the DGC method can determine the quality and timing of TF, u, which minimizes the difference between the desired state x_T and the predicted final state |${z}_T$| as follows:

$$\begin{equation} {\displaystyle \begin{array}{l}\begin{array}{cc}\underset{u}{\min }& \left\Vert {x}_T-{z}_T(u)\right\Vert \end{array}\\ {}s.t.\left\{\begin{array}{l}{u}_{m,k}\ge 0,k\in \left[1,T-1\right]\\ {}{u}_{m,k}=0,k\notin p\\ {}{u}_{m,k+1}\ge {u}_{m,k}\\ {}{z}_T={A}_{T-1}{A}_{T-2}\dots {A}_1{x}_1+\sum \limits_{j=2}^{T-1}{A}_j\dots {A}_{T-1}{Bu}_{j-1}+{Bu}_{T-1}\end{array}\right.\end{array}}, \end{equation}$$

(10)

where B denotes the control matrix encoding the relationship between TF and genes. In this DGC method, it is assumed that the TF is a constant controller that produces constant quality signal (greater than zero) to change the network state from the initial state to the desired state in cell reprogramming. The control matrix B should be determined according to prior information, unlike the previously mentioned structure-based control methods. This DGC method provides a new perspective for investigating the controllability of networks from data and can be applied successfully to identify TF in cellular reprogramming. The DGC method, however, ignores the prior gene interaction data and also suffers from higher computational cost. Therefore, it is highly required that we will be able to identify PDGs in complex diseases with more accuracy and efficiency in the future.

Future directions

Human genomics is an area rightly shifting toward integrative analysis, in which accurately and reliably identifying PDGs is a key focus. Mathematical and computational modeling techniques are critical tools that enable us to design more personalized therapy schemes, as they allow for the analysis of large datasets and complex dynamic processes [57, 65]. This review introduced state-of-the-art techniques to enhance the connections between network control theory and the PDGs identification related to the biological system transition from the normal state to the disease state. These insights may provide novel clues for identifying individual disease genes for precision medicine. However, advances in molecular biology such as single-cell RNA sequencing (i.e. scRNA-seq) techniques can make it possible for developing more efficient methods to improve the PDGs discovery. This section will discuss future directions for the application of these controllability principles to benefit the PDGs identification.

Characterizing controllability of ncRNA regulatory network

As well known to us, cancer drivers can be divided into two types: coding drivers (genes) and non-coding drivers (e.g. miRNA, lncRNA). In this review, we mainly discuss the identification of personalized coding driver genes by using network control principles. However, computational identification of non-coding drivers is in many ways even more challenging than coding drivers owing to their complex and varied modes of action as well as our poor understanding of non-coding regions in general. The non-coding regions contain a wealth of regulatory sequences and non-coding RNAs (ncRNAs) whose role in cancer remains largely unknown [139]. No network control studies have investigated roles of ncRNAs in cancer, in particular at an individual level; we expect that future studies will discover novel candidate individual-specific ncRNAs drivers by using network control principles on the ncRNAs datasets [140]. Furthermore, it was reported that ncRNAs are highly expressed in a subset of cells in a population, while their expression tends to be lower than protein-coding mRNAs [141, 142]. Therefore, reconstructing the personalized state transition networks and designing the corresponding structure-based network control methods at the single cell level may efficiently identify non-coding drivers that are important in cancer.

Characterizing controllability of biological networks in single cells

The change of gene expression networks is dependent on a number of factors, including cell types and genetic signatures. The influence of each of these expressions can be determined by scRNA-seq data [84, 143]. Therefore, state transition networks found from bulk-based expression data cannot truly represent those found at the single-cell level, and single-cell transcriptome measurements can reveal unexplored biological diversity [144, 145]. Furthermore, scRNA-seq techniques allow researchers to quantify gene expression in up to thousands of individual cells [146]. For example to depict the baseline landscape of the composition, lineage and functional states of tumor infiltrating lymphocytes, Zhang et al. obtained 12 346 T cells scRNA-seq data from 14 treatment-naïve non-small-cell lung cancer patients [147]. Meanwhile they obtained transcriptomes of 11 138 single T cells from 12 patients with colorectal cancer and developed single T cell analysis by RNA sequencing and TCR tracking indices to quantitatively analyze the dynamic relationships among 20 identified T cell subsets with distinct functions and clonalities [148]. However, these scRNA-seq technologies suffer from many sources of technical noise and bias that must be modeled to account for the resulting uncertainty in downstream analyses, including undersampling of mRNA molecules (often called ‘dropout’), which can severely obscure important gene–gene relationships [149, 150]. The unique features of scRNA-seq data may provide opportunities to enhance personalized state transition networks, which will require the development of new algorithms to take these data features into account when constructing state transition networks to further improve PDGs identification at a single-cell level [151, 152].

Characterizing controllability of biological networks in edge dynamics

All of the above described works mainly focus on the identification of PDGs by exploiting dynamic analysis of node-networks. Recently, edge-network dynamic analysis has become a hot topic in both the edge (gene pairs) state network construction of single samples [97] and control principles of edge dynamic [51, 153–156]. Furthermore, recent studies have highlighted the importance of targeting key gene interactions for designing drug combinations in disease control [75, 157–159]. Therefore, how to identify the PDGs with proper network control methods by considering the edge dynamic of individual state transition network is an important direction in the future.

Characterizing controllability of biological network in temporal dynamics

The above-introduced techniques construct personalized state transition networks mainly by using personalized normal and tumor expression data; however, these methods ignore the personalized temporal data information. The usage of temporal expression data is important for identifying PDGs because it can provide additional dynamic information for inferring developmental trajectories of PDGs in cancer propagation [160]. Meanwhile, it remains a significant challenge to use temporal expression data to identify PDGs. The first issue is to obtain the personalized temporal data in complex diseases. Although modern sequencing technology has provided significant resource data to study human diseases, we still lack temporal data for a single patient. The second issue is to construct the personalized state transition networks and to further develop control models to identify PDGs by using such temporal expression data. Recently, some techniques on temporal data may provide new clues for identifying the PDGs with temporal data information, including detection for disease tipping points [161–162], dynamic network biomarker (DNB) discovery [163–164], DGC method [138], structural controllability of temporal networks [165–168] and personalized DNB identification [169].

Characterizing optional decision of network controllers

The existing research has focused mainly on the control roles of a single node and has ignored the control roles of different driver-node sets. When one actually expects to control a system for optimization, different driver-node sets may not participate equally in control and different sets of driver nodes may be related preferentially to specific biological function and progress [170]. To measure such control roles of different sets of driver nodes demands the ability to combine controllability and other control measures, such as time cost and energy cost, to obtain the optimal driver node set. Notably, the energy cost is a typical measure used to analyze the control objective, which is related with the usage of the controllers.

The existing structure-based control methods do not consider how to obtain the actual control by designing optimal controller or driver signal. Although feedback and closed-loop controllers have been applied to many physical systems [67, 171, 172], the task of designing a controller for structure-based control methods in biological networks remains unresolved. The constant controllers have been applied to identify the key TFs in cell reprogramming [138]. Therefore, we expect that such discussed controllers also will be able to enhance the key choice of different sets of driver nodes and to identify PDGs in personalized medicine or in the study of general biological transitions.

Identification of the personalized driver pathways by using network control principles

It is well known that different gene mutations may target the same pathway [173]. Therefore, it is necessary to shift the study from gene to pathway level, which is helpful in capturing the heterogeneous functional patterns in cancer. There have been several studies to discover the mutation patterns at the pathway level [174–176]. Different cancer types may have different network control principles at the pathway level, which is critical for personalized therapy (e.g. precision medicine in cancer treatment). Furthermore, several studies suggest that there are functional differences between different cancer types, even different samples with the same cancer [177]. Taking into account such cancer heterogeneity, it is indispensable to develop new algorithms from network control perspectives to discover personalized driver pathways.

Conclusions

Cancer can be perceived as a dysfunction in molecular networks that regulate molecular communications and cellular processes. Molecular networks, such as gene regulation networks or signaling networks, are highly adaptable and dynamic. Thus, to fully understand the mechanism of cancer progression, we need to understand the dynamics of these networks in accordance with control theory. Tumor heterogeneity differs in survival fitness, invasive potential and adaptability to the tumor microenvironment, which has been a primary obstacle for understanding the functional importance in personalized therapies. The ability to identify PDGs of individual patients can provide an efficient source of the PDGs and drug targets and is essential for understanding tumor heterogeneity. Most of the existing computational methods for analyzing driver genes, however, have mainly focused on the cohort information rather than the individual PDGs. Recently, structure-based network control approaches have given us the ability to investigate the controllability of the complex networks with an adequate knowledge of the network structure through a minimum set of the driver nodes. These structure-based network control approaches can make it possible to deeply understand the mechanism of PDGs in cancer progression.

In this review, we considered cancer progression of individual patients from a normal state to a disease state as a network control problem and gave a comprehensive review of the recent work including the network reconstruction methods on single samples and network control models to solve such problem. In addition, we gave a performance assessment of structure-based control methods on multiple cancer datasets from TCGA. We found that to identify the PDGs, network control methods are more effective than the traditional DEG methods, which are remarkably dependent on the single-sample network construction methods. However, there are still promising and on-going efforts for the accurate PDGs identifications with network control approaches, such as the usage of personalized data (e.g. single cells), identification of personalized non-coding drivers and personalized driver pathways.

Key Points

Considering cancer progression from a normal state to a disease state as a network control problem to identify PDGs
Introducing several network construction techniques on single samples to bridge network control theory and PDGs identification
Providing a comprehensive review of the network control methods and assessing the performance of the network control methods for identifying PDGs
Providing evaluation data and pipeline for users to easily apply network control methods on PDGs identifications

Acknowledgements

We thank the reviewers for giving us valuable comments, which benefit for the improvement of this paper. We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript. We thank the members of Tatsuya Akutsu’s laboratory in Bioinformatics Center, Institute for Chemical Research, Kyoto University for valuable discussions on this topic. We apologize that we cannot cite all related studies owing to the limited space of manuscript space.

Funding

This paper was supported by the National Natural Science Foundation of China (grant nos. 61873202, 61473232, 91430111, 31771476, 81471047, 11871456), National Key R&D Program (grant nos. 2017YFA0505500, 2016YFC0903400), Strategic Priority Research Program of the Chinese Academy of Sciences (grant no. XDB13040700), Natural Science Foundation of Shanghai (grant no. 17ZR1446100) and Shanghai Municipal Science and Technology Major Project (grant no. 2017SHZDZX01).

Wei-Feng Guo is currently working toward a PhD degree at the School of Automation from Northwestern Polytechnical University, China. His work is currently co-supervised by Professor Shao-Wu Zhang and Professor Luonan Chen. He does researches in design of complex network control algorithms and its applications to human cancer genomics.

Shao-Wu Zhang received a PhD degree at the School of Automation from Northwestern Polytechnical University, China, at 2004. He has been a professor in Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, China. His current research interests include bioinformatics, complex networks and machine learning.

Tao Zeng received BS, MS and PhD degrees from Wuhan University, Wuhan, China, in 2003, 2006 and 2010, respectively. Since 2013, he has been an associate professor in Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences and now in Shanghai Research Center for Brain Science and Brain-Inspired Intelligence, Shanghai, China. His research interests include bioinformatics, network biology, computational biology, machine learning and graph theory.

Tatsuya Akutsu received BE and ME degrees in aeronautics and a DE degree in information engineering from The University of Tokyo, Tokyo, Japan, in 1984, 1986, and 1989, respectively. He has been a professor with the Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan, since 2001. His current research interests include bioinformatics and discrete algorithms.

Luonan Chen received a BS degree from Huazhong University of Science and Technology, Wuhan, China, in 1984, and MS and PhD degrees from Tohoku University, Sendai, Japan, in 1988 and 1991, respectively. Since 2010, he has been a professor and the executive director at Key Laboratory of Systems Biology, Center for Excellence in Animal Evolution and Genetics, Center for Excellence in Molecular Cell Science, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences. His fields of interest are computational systems biology, bioinformatics and nonlinear dynamics. In recent years, he published over 300 journal papers and 2 monographs in the area of computational systems biology.

References

1.

Jang

HS

,

Shah

NM

,

Du

AY

, et al.

Transposable elements drive widespread expression of oncogenes in human cancers

.

Nat Genet

2019

;

51

:

611

–

7

.

2.

Waisberg

M

,

Joseph

P

,

Hale

B

, et al.

Molecular and cellular mechanisms of cadmium carcinogenesis

.

Toxicology

2003

;

192

:

95

–

117

.

3.

Vasavi

S

,

Yong

C

,

Zhihai

M

, et al.

Widespread contribution of transposable elements to the innovation of gene regulatory networks

.

Genome Res

2014

;

24

:

1963

–

76

.

PubMed

4.

Vogelstein

B

,

Papadopoulos

N

,

Velculescu

VE

, et al.

Cancer genome landscapes

.

Science

2013

;

339

:

1546

–

58

.

5.

Ozturk

K

,

Dow

M

,

Carlin

DE

, et al.

The emerging potential for network analysis to inform precision cancer medicine

.

J Mol Biol

2018

;

430

:

2875

–

99

.

6.

Pleasance

ED

,

Cheetham

RK

,

Stephens

PJ

, et al.

A comprehensive catalogue of somatic mutations from a human cancer genome

.

Nature

2010

;

463

:

191

–

6

.

7.

Klein

CA

.

Selection and adaptation during metastatic cancer progression

.

Nature

2013

;

501

:

365

–

72

.

8.

Ren

X

,

Kang

B

,

Zhang

Z

.

Understanding tumor ecosystems by single-cell sequencing: promises and limitations

.

Genome Biol

2018

;

19

:

211

.

9.

Sun

D

,

Ren

X

,

Ari

E

, et al.

Discovering cooperative biomarkers for heterogeneous complex disease diagnoses

.

Brief Bioinform

2017

;

20

:

89

–

101

.

10.

Network

CGA

.

Comprehensive molecular portraits of human breast tumours

.

Nature

2012

;

490

:

61

–

70

.

11.

Consortium CCLE

,

Consortium GDSC

.

Pharmacogenomic agreement between two cancer cell line data sets

.

Nature

2015

;

528

:

84

–

7

.

PubMed

12.

Barretina

J

,

Caponigro

G

,

Stransky

N

, et al.

The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity

.

Nature

2012

;

483

:

603

–

7

.

13.

Chin

L

,

Andersen

JN

,

Futreal

PA

.

Cancer genomics: from discovery science to personalized medicine

.

Nat Med

2011

;

17

:

297

–

303

.

14.

Gonzalez-Perez

A

,

Mustonen

V

,

Reva

B

, et al.

Computational approaches to identify functional genetic variants in cancer genomes

.

Nat Methods

2013

;

10

:

723

–

9

.

15.

Edgar

R

,

Domrachev

M

,

Lash

AE

.

Gene Expression Omnibus: NCBI gene expression and hybridization array data repository

.

Nucleic Acids Res

2002

;

30

:

207

–

10

.

16.

Wong

WC

,

Kim

D

,

Carter

H

, et al.

CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer

.

Bioinformatics

2011

;

27

:

2147

–

8

.

17.

Shihab

HA

,

Gough

J

,

Cooper

DN

, et al.

Predicting the functional consequences of cancer-associated amino acid substitutions

.

Bioinformatics

2013

;

29

:

1504

–

10

.

18.

Mao

Y

,

Chen

H

,

Liang

H

, et al.

CanDrA: cancer-specific driver missense mutation annotation with optimized features

.

PLoS One

2013

;

8

:

e77945

.

19.

Dees

ND

,

Zhang

Q

,

Kandoth

C

, et al.

MuSiC: identifying mutational significance in cancer genomes

.

Genome Res

2012

;

22

:

1589

–

98

.

20.

Reva

B

,

Antipin

Y

,

Sander

C

.

Predicting the functional impact of protein mutations: application to cancer genomics

.

Nucleic Acids Res

2011

;

39

:

e118

.

21.

Leiserson

MD

,

Vandin

F

,

Wu

H-T

, et al.

Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes

.

Nat Genet

2015

;

47

:

106

–

14

.

22.

Bashashati

A

,

Haffari

G

,

Ding

J

, et al.

DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer

.

Genome Biol

2012

;

13

:

R124

.

23.

Guo

W-F

,

Zhang

S-W

,

Liu

L-L

, et al.

Discovering personalized driver mutation profiles of single samples in cancer by network control strategy

.

Bioinformatics

2018

;

34

:

1893

–

903

.

24.

Ng

S

,

Collisson

EA

,

Sokolov

A

, et al.

PARADIGM-SHIFT predicts the function of mutations in multiple cancers using pathway impact analysis

.

Bioinformatics

2012

;

28

:

i640

–

6

.

25.

Shrestha

R

,

Hodzic

E

,

Sauerwald

T

, et al.

HIT'nDRIVE: patient-specific multidriver gene prioritization for precision oncology

.

Genome Res

2017

;

27

:

1573

–

88

.

26.

Bertrand

D

,

Chng

KR

,

Sherbaf

FG

, et al.

Patient-specific driver gene prediction and risk assessment through integrated network analysis of cancer omics profiles

.

Nucleic Acids Res

2015

;

43

:

e44

.

27.

Dinstag

G

,

Shamir

R

.

PRODIGY: personalized prioritization of driver genes

.

bioRxiv

2018

,

doi: 10.1101/456723

.

28.

Hou

JP

,

Ma

J

.

DawnRank: discovering personalized driver genes in cancer

.

Genome Med

2014

;

6

:

56

.

29.

Tamborero

D

,

Gonzalez-Perez

A

,

Lopez-Bigas

N

.

OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes

.

Bioinformatics

2013

;

29

:

2238

–

44

.

30.

Lawrence

MS

,

Stojanov

P

,

Polak

P

, et al.

Mutational heterogeneity in cancer and the search for new cancer-associated genes

.

Nature

2013

;

499

:

214

–

8

.

31.

Luo

P

,

Ding

Y

,

Lei

X

, et al.

deepDriver: predicting cancer driver genes based on somatic mutations using deep convolutional neural networks

.

Front Genet

2019

;

10

:

13

.

32.

Zhang

S-Y

,

Zhang

S-W

,

Liu

L

, et al.

m6A-Driver: identifying context-specific mRNA m6A methylation-driven gene interaction networks

.

PLoS Comput Biol

2016

;

12

:

e1005287

.

33.

Kroschinsky

F

,

Stölzel

F

,

von Bonin

S

, et al.

New drugs, new toxicities: severe side effects of modern targeted and immunotherapy of cancer and their management

.

Crit Care

2017

;

21

:

89

.

34.

Boccaletti

S

,

Latora

V

,

Moreno

Y

, et al.

Complex networks: structure and dynamics

.

Phys Rep

2006

;

424

:

175

–

308

.

35.

Zeng

T

,

Zhang

W

,

Yu

X

, et al.

Big-data-based edge biomarkers: study on dynamical drug sensitivity and resistance in individuals

.

Brief Bioinform

2015

;

17

:

576

–

92

.

36.

Martelotto

LG

,

Ng

CK

,

De Filippo

MR

, et al.

Benchmarking mutation effect prediction algorithms using functionally validated cancer-related missense mutations

.

Genome Biol

2014

;

15

:

484

.

37.

Kanhaiya

K

,

Czeizler

E

,

Gratie

C

, et al.

Controlling directed protein interaction networks in cancer

.

Sci Rep

2017

;

7

:

10327

.

38.

Guo

WF

,

Zhang

SW

,

Shi

QQ

, et al.

A novel algorithm for finding optimal driver nodes to target control complex networks and its applications for drug targets identification

.

BMC Genomics

2018

;

19

:

924

.

39.

Li

A

,

Cornelius

SP

,

Liu

YY

, et al.

The fundamental advantages of temporal networks

.

Science

2017

;

358

:

1042

–

6

.

40.

Nacher

JC

,

Akutsu

T

.

Minimum dominating set-based methods for analyzing biological networks

.

Methods

2016

;

102

:

57

–

63

.

41.

Guo

WF

,

Zhang

SW

,

Zeng

T

, et al.

Identifying drug combinations of individual cancer patients by personalized drug controller method

.

bioRxiv

2019

.

doi:10.1101/571620

.

42.

Asgari

Y

,

Salehzadeh-Yazdi

A

,

Schreiber

F

, et al.

Controllability in cancer metabolic networks according to drug targets as driver nodes

.

PLoS One

2013

;

8

:

e79397

.

43.

Srihari

S

,

Raman

V

,

Leong

HW

, et al.

Evolution and controllability of cancer networks: a Boolean perspective

.

IEEE/ACM Trans Comput Biol Bioinform

2014

;

11

:

83

–

94

.

44.

Wang

B

,

Gao

L

,

Zhang

Q

, et al.

Diversified control paths: a significant way disease genes perturb the human regulatory network

.

PLoS One

2015

;

10

:

e0135491

.

45.

Lin

CT

.

Structural controllability

.

IEEE Trans Automat Contr

1974

;

19

:

201

–

8

.

46.

Lombardi

A

,

Hörnquist

M

.

Controllability analysis of networks

.

Phys Rev E

2007

;

75

:

056110

.

47.

Wu

FX

,

Wu

L

,

Wang

J

, et al.

Transittability of complex networks and its applications to regulatory biomolecular networks

.

Sci Rep

2014

;

4

:

4819

.

48.

Wang

B

,

Gao

L

,

Gao

Y

.

Control range: a controllability-based index for node significance in directed networks

.

J Stat Mech

2012

;

4

:

P04011

.

49.

Guo

WF

,

Zhang

SW

,

Wei

ZG

, et al.

Constrained target controllability of complex networks

.

J Stat Mech

2017

;

6

:

063402

.

50.

Fiedler

B

,

Kurosawa

G

,

Saito

D

.

Dynamics and control at feedback vertex sets. I: informative and determining nodes in regulatory networks

.

J Dyn Differ Equ

2013

;

25

:

563

–

604

.

51.

Nepusz

T

,

Vicsek

T

.

Controlling edge dynamics in complex networks

.

Nat Phys

2012

;

8

:

568

–

73

.

52.

Liu

Y

,

Slotine

JJ

,

Barabasi

AL

.

Controllability of complex networks

.

Nature

2011

;

473

:

167

.

53.

Zañudo

JGT

,

Yang

G

,

Albert

R

.

Structure-based control of complex networks with nonlinear dynamics

.

Proc Natl Acad Sci U S A

2017

;

114

:

7234

–

9

.

54.

Yuan

Z

,

Zhao

C

,

Di

Z

, et al.

Exact controllability of complex networks

.

Nat Commun

2013

;

4

:

2447

.

55.

Nacher

JC

,

Akutsu

T

.

Dominating scale-free networks with variable scaling exponent: heterogeneous networks are not difficult to control

.

New J Phys

2012

;

14

:

073005

.

56.

Nacher

JC

,

Akutsu

T

.

Structural controllability of unidirectional bipartite networks

.

Sci Rep

2013

;

3

:

1647

.

57.

Cheng

F

,

Zhao

J

,

Zhao

Z

.

Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes

.

Brief Bioinform

2015

;

17

:

642

–

56

.

58.

Turanli

B

,

Karagoz

K

,

Gulfidan

G

, et al.

A network-based cancer drug discovery: from integrated multi-omics approaches to precision medicine

.

Curr Pharm Des

2018

;

24

:

3778

–

90

.

59.

Forbes

SA

,

Beare

D

,

Gunasekaran

P

, et al.

COSMIC: exploring the world's knowledge of somatic mutations in human cancer

.

Nucleic Acids Res

2014

;

43

:

D805

–

11

.

60.

Gao

J

,

Aksoy

BA

,

Dogrusoz

U

, et al.

Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal

.

Sci Signal

2013

;

6

:

l1

.

61.

Wu

C

,

Orozco

C

,

Boyer

J

, et al.

BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources

.

Genome Biol

2009

;

10

:

R130

.

62.

Futreal

PA

,

Coin

L

,

Marshall

M

, et al.

A census of human cancer genes [J]

.

Nat Rev Cancer

2004

;

4

:

177

–

83

.

63.

Repana

D

,

Nulsen

J

,

Dressler

L

, et al.

The Network of Cancer Genes (NCG): a comprehensive catalogue of known and candidate cancer genes from cancer sequencing screens

.

Genome Biol

2019

;

20

:

1

.

64.

Piñero

J

,

Bravo

À

,

Queralt-Rosinach

N

, et al.

DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants

.

Nucleic Acids Res

2017

;

45

:

D833

–

9

.

65.

Kolch

W

,

Halasz

M

,

Granovskaya

M

, et al.

The dynamic control of signal transduction networks in cancer cells

.

Nat Rev Cancer

2015

;

15

:

515

–

27

.

66.

Wisdom

JO

.

Cybernetics, or control and communication in the animal and the machine

.

Int J Psychoanal

1949

;

30

:

133

–

7

.

67.

Sun

YZ

,

Leng

SY

,

Lai

YC

, et al.

Closed-loop control of complex networks: a trade-off between time and energy

.

Phys Rev Lett

2017

;

119

:

198301

.

68.

Cornelius

SP

,

Kath

WL

,

Motter

AE

.

Realistic control of network dynamics

.

Nat Commun

2013

;

4

:

1942

.

69.

Kim

JB

,

Sebastiano

V

,

Wu

G

, et al.

Oct4-induced pluripotency in adult neural stem cells

.

Cell

2009

;

136

:

411

–

9

.

70.

Takahashi

K

,

Yamanaka

S

.

Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors

.

Cell

2006

;

126

:

663

–

76

.

71.

Vierbuchen

T

,

Ostermeier

A

,

Pang

ZP

, et al.

Direct conversion of fibroblasts to functional neurons by defined factors

.

Nature

2010

;

463

:

1035

–

41

.

72.

Ieda

M

,

Fu

J-D

,

Delgado-Olguin

P

, et al.

Direct reprogramming of fibroblasts into functional cardiomyocytes by defined factors

.

Cell

2010

;

142

:

375

–

86

.

73.

Szabo

E

,

Rampalli

S

,

Risueno

RM

, et al.

Direct conversion of human fibroblasts to multilineage blood progenitors

.

Nature

2010

;

468

:

521

–

6

.

74.

Huang

P

,

He

Z

,

Ji

S

, et al.

Induction of functional hepatocyte-like cells from mouse fibroblasts by defined factors

.

Nature

2011

;

475

:

386

–

9

.

75.

Han

K

,

Jeng

EE

,

Hess

GT

, et al.

Synergistic drug combinations for cancer identified in a CRISPR screen for pairwise genetic interactions

.

Nat Biotechnol

2017

;

35

:

463

–

74

.

76.

Quan

Y

,

Liu

M-Y

,

Liu

Y-M

, et al.

Facilitating anti-cancer combinatorial drug discovery by targeting epistatic disease genes

.

Molecules

2018

;

23

:

736

.

77.

Cheng

F

,

Kovács

IA

,

Barabási

A-L

.

Network-based prediction of drug combinations

.

Nat Commun

2019

;

10

:

1197

.

78.

Hu

Y

,

Chen

C

,

Ding

Y

, et al.

Optimal control nodes in disease-perturbed networks as targets for combination therapy

.

Nat Commun

2019

;

10

:

2180

.

79.

Wang

Y

,

Joshi

T

,

Zhang

X-S

, et al.

Inferring gene regulatory networks from multiple microarray datasets

.

Bioinformatics

2006

;

22

:

2413

–

20

.

80.

Zhu

H

,

Rao

RSP

,

Zeng

T

, et al.

Reconstructing dynamic gene regulatory networks from sample-based transcriptional data

.

Nucleic Acids Res

2012

;

40

:

10657

–

67

.

81.

Liu

F

,

Zhang

S-W

,

Guo

W-F

, et al.

Inference of gene regulatory network based on local Bayesian networks

.

PLoS Comput Biol

2016

;

12

:

e1005024

.

82.

Tokheim

CJ

,

Papadopoulos

N

,

Kinzler

KW

, et al.

Evaluating the evaluation of cancer driver genes

.

Proc Natl Acad Sci

2016

;

113

:

14330

–

5

.

83.

Gamazon

ER

,

Segrè

AV

,

van de Bunt

M

, et al.

Using an atlas of gene regulation across 44 human tissues to inform complex disease-and trait-associated variation

.

Nat Genet

2018

;

50

:

956

.

84.

Van der Wijst

MG

,

de Vries

DH

, et al.

An integrative approach for building personalized gene regulatory networks for precision medicine

.

Genome Med

2018

;

10

:

96

.

85.

Kuijjer

ML

,

Tung

M

,

Yuan

G

, et al.

Estimating sample-specific regulatory networks

.

iScience

2019

;

14

:

226

–

40

.

86.

Liu

X

,

Wang

Y

,

Ji

H

, et al.

Personalized characterization of diseases using sample-specific networks

.

Nucleic Acids Res

2016

;

44

:

e164

.

87.

Guo

W-F

,

Zhang

S-W

,

Zeng

T

, et al.

A novel network control model for identifying personalized driver genes in cancer

.

bioRxiv

2019

.

doi: 10.1101/503565

.

88.

Jia

P

,

Zhao

Z

.

VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data

.

PLoS Comput Biol

2014

;

10

:

e1003460

.

89.

Carter

SL

,

Brechbühler

CM

,

Michael

G

, et al.

Gene co-expression network topology provides a framework for molecular characterization of cellular state

.

Bioinformatics

2004

;

20

:

2242

–

50

.

90.

Kimberly

G

,

Curtis

H

,

John

Q

, et al.

Passing messages between biological networks to refine predicted interactions

.

PLoS One

2013

;

8

:

59

.

91.

Kraskov

A

,

Stögbauer

H

,

Grassberger

P

.

Estimating mutual information

.

Phys Rev E

2004

;

69

:

066138

.

92.

Faith

JJ

,

Hayete

B

,

Thaden

JT

, et al.

Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles

.

PLoS Biol

2007

;

5

:

e8

.

93.

Yu

X

,

Zeng

T

,

Wang

X

, et al.

Unravelling personalized dysfunctional gene network of complex diseases based on differential network model

.

J Transl Med

2015

;

13

:

189

.

94.

Zhang

W

,

Zeng

T

,

Liu

X

, et al.

Diagnosing phenotypes of single-sample individuals by edge biomarkers

.

J Mol Cell Biol

2015

;

7

:

231

–

41

.

95.

Zhang

W

,

Zeng

T

,

Chen

L

.

EdgeMarker: identifying differentially correlated molecule pairs as edge-biomarkers

.

J Theor Biol

2014

;

362

:

35

–

43

.

96.

Zeng

T

,

Zhang

W

,

Yu

X

, et al.

Edge biomarkers for classification and prediction of phenotypes

.

Sci China Life Sci

2014

;

57

:

1103

–

14

.

97.

Yu

X

,

Zhang

J

,

Sun

S

, et al.

Individual-specific edge-network analysis for disease prediction

.

Nucleic Acids Res

2017

;

45

:

e170

.

98.

Zhang

X

,

Zhao

J

,

Hao

J-K

, et al.

Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks

.

Nucleic Acids Res

2014

;

43

:

e31

.

99.

Zhao

J

,

Zhou

Y

,

Zhang

X

, et al.

Part mutual information for quantifying direct associations in networks

.

Proc Natl Acad Sci

2016

;

113

:

5130

–

5

.

100.

Frenzel

S

,

Pompe

B

.

Partial mutual information for coupling analysis of multivariate time series

.

Phys Rev Lett

2007

;

99

:

204101

.

101.

Wu

L

,

Shen

Y

,

Li

M

, et al.

Network output controllability-based method for drug target identification

.

IEEE Trans Nanobioscience

2015

;

14

:

184

–

91

.

102.

Gao

J

,

Liu

Y-Y

,

D'Souza

RM

, et al.

Target control of complex networks

.

Nat Commun

2014

;

5

:

5415

.

103.

Wang

B

,

Gao

L

,

Gao

Y

, et al.

Controllability and observability analysis for vertex domination centrality in directed networks

.

Sci Rep

2014

;

4

:

5399

.

104.

Yan

G

,

Vértes

PE

,

Towlson

EK

, et al.

Network control principles predict neuron function in the Caenorhabditis elegans connectome

.

Nature

2017

;

550

:

519

–

23

.

105.

Liu

X

,

Pan

L

.

Detection of driver metabolites in the human liver metabolic network using structural controllability analysis

.

BMC Syst Biol

2014

;

8

:

51

.

106.

Ravindran

V

,

Nacher

JC

,

Akutsu

T

, et al.

Network controllability analysis of intracellular signalling reveals viruses are actively controlling molecular systems

.

Sci Rep

2019

;

9

:

2066

.

107.

Wuchty

S

.

Controllability in protein interaction networks

.

Proc Natl Acad Sci U S A

2014

;

111

:

7156

.

108.

Liu

YY

,

Barabási

AL

.

Control principles of complex networks

.

Rev Mod Phys

2016

;

88

:

035006

.

109.

Sun

PG

,

Ma

X

.

Understanding the controllability of complex networks from the microcosmic to the macrocosmic

.

New J Phys

2017

;

19

:

013022

.

110.

Sun

PG

.

Controllability and modularity of complex networks

.

Inform Sci

2015

;

325

:

20

–

32

.

111.

Ishitsuka

M

,

Akutsu

T

,

Nacher

JC

.

Critical controllability in proteome-wide protein interaction network integrating transcriptome

.

Sci Rep

2016

;

6

:

23541

.

112.

Wuchty

S

,

Boltz

T

,

Küçükmcginty

H

.

Links between critical proteins drive the controllability of protein interaction networks

.

Proteomics

2017

;

17

:

1700056

.

113.

Lai

YC

.

Controlling complex, non-linear dynamical networks

.

Natl Sci Rev

2014

;

1

:

339

.

114.

Schiff

S

,

Whalen

A

,

Brennan

S

, et al.

Observability and controllability of nonlinear networks: the role of symmetry

.

Phys Rev X

2015

;

5

:

011005

.

115.

Wang

LZ

,

Su

RQ

,

Huang

ZG

, et al.

A geometrical approach to control and controllability of nonlinear dynamical networks

.

Nat Commun

2016

;

7

:

11323

.

116.

Karl

S

,

Dandekar

T

.

Convergence behaviour and control in non-linear biological networks

.

Sci Rep

2015

;

5

:

9746

.

117.

Li

M

,

Gao

H

,

Wang

J

, et al.

Control principles for biological networks

.

Brief Bioinform

2018

;

bby088

doi:10.1093/bib/bby088

.

118.

Mochizuki

A

,

Fiedler

B

,

Kurosawa

G

, et al.

Dynamics and control at feedback vertex sets. II: a faithful monitor to determine the diversity of molecular activities in regulatory networks

.

J Theor Biol

2013

;

335

:

130

–

46

.

119.

Akutsu

T

,

Kuhara

S

,

Maruyama

O

, et al.

A system for identifying genetic networks from gene expression patterns produced by gene disruptions and overexpressions

.

Genome Inform

1998

;

9

:

151

–

60

.

120.

Chakradhar

ST

,

Balakrishnan

A

,

Agrawal

VD

.

An exact algorithm for selecting partial scan flip-flops

.

J. Electron Test

1995

;

7

:

83

–

93

.

121.

Lenstra

HW

.

Integer programming with a fixed number of variables

.

Math Oper Res

1983

;

8

:

538

–

48

.

122.

Williams

HP

.

Integer and combinatorial optimization

.

J Oper Res Soc

1990

;

41

:

177

–

8

.

123.

Ciriello

G

,

Cerami

E

,

Sander

C

, et al.

Mutual exclusivity analysis identifies oncogenic network modules

.

Genome Res

2012

;

22

:

398

–

406

.

124.

Wu

G

,

Feng

X

,

Stein

L

.

A human functional protein interaction network and its application to cancer data analysis

.

Genome Biol

2010

;

11

:

R53

.

125.

Croft

D

,

O’Kelly

G

,

Wu

G

, et al.

Reactome: a database of reactions, pathways and biological processes

.

Nucleic Acids Res

2010

;

39

:

D691

–

7

.

126.

Schaefer

CF

,

Anthony

K

,

Krupa

S

, et al.

PID: the pathway interaction database

.

Nucleic Acids Res

2009

;

37

:

D674

–

9

.

127.

Kanehisa

M

,

Goto

S

,

Sato

Y

, et al.

KEGG for integration and interpretation of large-scale molecular data sets

.

Nucleic Acids Res

2011

;

40

:

D109

–

14

.

128.

Kibbe

WA

,

Arze

C

,

Felix

V

, et al.

Disease ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data

.

Nucleic Acids Res

2014

;

43

:

D1071

–

8

.

129.

Rivals

I

,

Personnaz

L

,

Taing

L

, et al.

Enrichment or depletion of a GO category within a class of genes: which test?

Bioinformatics

2007

;

23

:

401

–

7

.

130.

Vinayagam

A

,

Gibson

TE

,

Lee

H-J

, et al.

Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets

.

Proc Natl Acad Sci

2016

;

113

:

4976

–

81

.

131.

Jia

T

,

Barabási

A-L

.

Control capacity and a random sampling method in exploring controllability of complex networks

.

Sci Rep

2013

;

3

:

2354

.

132.

Nacher

JC

,

Akutsu

T

.

Analysis of critical and redundant nodes in controlling directed and undirected complex networks using dominating sets

.

J Complex Netw

2014

;

2

:

394

–

412

.

133.

Liu

X

,

Pan

L

.

Identifying driver nodes in the human signaling network using structural controllability analysis

.

IEEE/ACM Trans Comput Biol Bioinform

2015

;

12

:

467

–

72

.

134.

Ravindran

V

,

Sunitha

V

,

Bagler

G

.

Identification of critical regulatory genes in cancer signaling network using controllability analysis

.

Physica A

2017

;

474

:

134

–

43

.

135.

Bao

Y

,

Hayashida

M

,

Liu

P

, et al.

Analysis of critical and redundant vertices in controlling directed complex networks using feedback vertex sets

.

J Comput Biol

2018

;

25

:

1071

–

90

.

136.

Wang

L-Z

,

Chen

Y-Z

,

Wang

W-X

, et al.

Physical controllability of complex networks

.

Sci Rep

2017

;

7

:

40198

.

137.

Chen

YZ

,

Wang

LZ

,

Wang

WX

, et al.

Energy scaling and reduction in controlling complex networks

.

R Soc Open Sci

2016

;

3

:

160064

.

138.

Ronquist

S

,

Patterson

G

,

Muir

LA

, et al.

Algorithm for cellular reprogramming

.

Proc Natl Acad Sci

2017

;

114

:

11832

–

7

.

139.

Gutschner

T

,

Diederichs

S

.

The hallmarks of cancer: a long non-coding RNA point of view

.

RNA Biol

2012

;

9

:

703

–

19

.

140.

Hu

J

,

Zeng

T

,

Xia

Q

, et al.

Unravelling miRNA regulation in yield of rice (Oryza sativa) based on differential network model

.

Sci Rep

2018

;

8

:

8498

.

141.

Liu

SJ

,

Nowakowski

TJ

,

Pollen

AA

, et al.

Single-cell analysis of long non-coding RNAs in the developing human neocortex

.

Genome Biol

2016

;

17

:

67

.

142.

Lanzós

A

,

Carlevaro-Fita

J

,

Mularoni

L

, et al.

Discovery of cancer driver long noncoding RNAs across 1112 tumour genomes: new candidates and distinguishing features

.

Sci Rep

2017

;

7

:

41544

.

143.

Ulirsch

JC

,

Lareau

CA

,

Bao

EL

, et al.

Interrogation of human hematopoiesis at single-cell and single-variant resolution

.

Nat Genet

2019

;

51

:

683

–

93

.

144.

Da Rocha

EL

,

Rowe

RG

,

Lundin

V

, et al.

Reconstruction of complex single-cell trajectories using CellRouter

.

Nat Commun

2018

;

9

:

892

.

145.

Qiu

X

,

Mao

Q

,

Tang

Y

, et al.

Reversed graph embedding resolves complex single-cell trajectories

.

Nat Methods

2017

;

14

:

979

.

146.

Kiselev

VY

,

Andrews

TS

,

Hemberg

M

.

Challenges in unsupervised clustering of single-cell RNA-seq data

.

Nat Rev Genet

2019

;

20

:

273

–

82

.

147.

Guo

X

,

Zhang

Y

,

Zheng

L

, et al.

Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing

.

Nat Med

2018

;

24

:

978

.

148.

Zhang

L

,

Yu

X

,

Zheng

L

, et al.

Lineage tracking reveals dynamic relationships of T cells in colorectal cancer

.

Nature

2018

;

564

:

268

–

72

.

149.

Van Dijk

D

,

Sharma

R

,

Nainys

J

, et al.

Recovering gene interactions from single-cell data using data diffusion

.

Cell

2018

;

174

:

716

–

729.e27

.

150.

Lopez

R

,

Regier

J

,

Cole

MB

, et al.

Deep generative modeling for single-celltranscriptomics

.

Nat Methods

2018

;

15

:

1053

.

151.

Dai

H

,

Li

L

,

Zeng

T

, et al.

Cell-specific network constructed by single-cell RNA sequencing data

.

Nucleic Acids Res

2019

;

gkz172

doi: 10.1093/nar/gkz172

.

152.

Karikomi

M

,

Wang

S

,

MacLean

AL

, et al.

Cell lineage and communication network inference via optimization for single-cell transcriptomics

.

Nucleic Acids Res

2019

;

gkz204

doi: 10.1093/nar/gkz204

.

153.

Nguyen

DH

.

Reduced-order distributed consensus controller design via edge dynamics

.

IEEE Trans Automat Contr

2017

;

62

:

475

–

80

.

154.

DeLellis

P

,

di Bernardo

M

,

Porfiri

M

.

Pinning control of complex networks via edge snapping

.

Chaos

2011

;

21

:

033119

.

155.

Pang

S-P

,

Hao

F

.

Target control of edge dynamics in complex networks

.

Physica A

2018

;

512

:

14

–

26

.

156.

Pang

S-P

,

Hao

F

.

Controllable subspace of edge dynamics in complex networks

.

Physica A

2017

;

481

:

209

–

23

.

157.

Azmi

AS

.

Network pharmacology for cancer drug discovery: are we there yet?

Future Med Chem

2012

;

4

:

939

–

41

.

158.

Yip

DK

,

Chan

LL

,

Pang

IK

, et al.

A network approach to exploring the functional basis of gene-gene epistatic interactions in disease susceptibility

.

Bioinformatics

2018

;

34

:

1741

–

9

.

159.

Phillips

PC

.

Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems

.

Nat Rev Genet

2008

;

9

:

855

.

160.

Caravagna

G

,

Graudenzi

A

,

Ramazzotti

D

, et al.

Algorithmic methods to infer the evolutionary trajectories in cancer progression

.

Proc Natl Acad Sci

2016

;

113

:

E4025

–

34

.

161.

Liu

X

,

Chang

X

,

Leng

S

, et al.

Detection for disease tipping points by landscape dynamic network biomarkers

.

Natl Sci Rev

2018

.

doi:10.1093/nsr/nwy162

.

162.

Liu

R

,

Wang

J

,

Ukai

M

, et al.

Hunt for the tipping point during endocrine resistance process in breast cancer by dynamic network biomarkers

.

J Mol Cell Biol

2018

;

mjy059

doi:10.1093/jmcb/mjy059

.

163.

Chen

L

,

Liu

R

,

Liu

Z-P

, et al.

Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers

.

Sci Rep

2012

;

2

:

342

.

164.

Liu

R

,

Wang

X

,

Aihara

K

, et al.

Early diagnosis of complex diseases by molecular biomarkers, network biomarkers, and dynamical network biomarkers

.

Med Res Rev

2014

;

34

:

455

–

78

.

165.

Pósfai

M

,

Hövel

P

.

Structural controllability of temporal networks

.

New J Phys

2014

;

16

:

123055

.

166.

Yao

P

,

Hou

B-Y

,

Pan

Y-J

, et al.

Structural controllability of temporal networks with a single switching controller

.

PLoS One

2017

;

12

:

e0170584

.

167.

Li

A

,

Cornelius

SP

,

Liu

Y-Y

, et al.

The fundamental advantages of temporal networks

.

Science

2017

;

358

:

1042

–

6

.

168.

Pósfai

M

,

Gao

J

,

Cornelius

SP

, et al.

Controllability of multiplex, multi-time-scale networks

.

Phys Rev E

2016

;

94

:

032316

.

169.

Liu

X

,

Chang

X

,

Liu

R

, et al.

Quantifying critical states of complex diseases using single-sample dynamic network biomarkers

.

PLoS Comput Biol

2017

;

13

:

e1005633

.

170.

Wang

P

,

Wang

D

,

Lu

J

.

Controllability analysis of a gene network for Arabidopsis thaliana reveals characteristics of functional gene families

.

IEEE/ACM Trans Comput Biol Bioinform

2019

;

16

:

912

–

24

.

171.

Chen

F

,

Chen

Z

,

Xiang

L

, et al.

Reaching a consensus via pinning control

.

Automatica

2009

;

45

:

1215

–

20

.

172.

Wang

XF

,

Chen

G

.

Pinning control of scale-free dynamical networks

.

Physica A

2002

;

310

:

521

–

31

.

173.

Hahn

WC

,

Weinberg

RA

.

Modelling the molecular circuitry of cancer

.

Nat Rev Cancer

2002

;

2

:

331

–

41

.

174.

Vandin

F

,

Upfal

E

,

Raphael

BJ

.

De novo discovery of mutated driver pathways in cancer

.

Genome Res

2012

;

22

:

375

–

85

.

175.

Zhao

J

,

Zhang

S

,

Wu

L-Y

, et al.

Efficient methods for identifying mutated driver pathways in cancer

.

Bioinformatics

2012

;

28

:

2940

–

7

.

176.

Feng

L

,

Gao

L

,

Wang

B

.

Detection of driver modules with rarely mutated genes in cancers

.

IEEE/ACM Trans Comput Biol Bioinform

2018

.

doi:10.1109/TCBB.2018.2846262

.

177.

Li

F

,

Gao

L

,

Wang

P

, et al.

Identifying cancer specific driver modules using a network-based method

.

Molecules

2018

;

23

:

1114

.