A physics-informed neural SDE network for learning cellular dynamics from time-series scRNA-seq data

Held-one-out performance across five seeds on Veres data.

	Held-out t = 1		Held-out t = 2		Held-out t = 3		Held-out t = 4
Model	Train	Test	Train	Test	Train	Test	Train	Test
TrajectoryNet	10.59 ± 1.08	12.81 ± 0.08	10.71 ± 1.01	11.69 ± 0.24	11.51 ± 0.49	9.62 ± 0.14	11.09 ± 0.83	10.38 ± 0.13
MIOFlow	10.09 ± 0.50	10.91 ± 0.02	10.33 ± 0.47	10.98 ± 0.22	10.37 ± 0.35	9.31 ± 0.18	10.34 ± 0.51	10.31 ± 0.11
PRESCIENT(+g)^a	10.18 ± 1.19	11.45 ± 0.04	9.86 ± 1.16	9.54 ± 0.09	10.46 ± 1.10	8.11 ± 0.06	9.93 ± 1.08	9.06 ± 0.09
PRESCIENT	7.83 ± 0.37	10.45 ± 0.11	7.96 ± 0.37	8.91 ± 0.02	8.22 ± 0.37	7.52 ± 0.08	8.17 ± 0.34	7.79 ± 0.09
PI-SDE	7.36 ± 0.32∗	10.36 ± 0.05	7.36 ± 0.40	8.35 ± 0.12	7.65 ± 0.39	7.41 ± 0.03	7.69 ± 0.44	7.61 ± 0.35

	Held-out t = 1		Held-out t = 2		Held-out t = 3		Held-out t = 4
Model	Train	Test	Train	Test	Train	Test	Train	Test
TrajectoryNet	10.59 ± 1.08	12.81 ± 0.08	10.71 ± 1.01	11.69 ± 0.24	11.51 ± 0.49	9.62 ± 0.14	11.09 ± 0.83	10.38 ± 0.13
MIOFlow	10.09 ± 0.50	10.91 ± 0.02	10.33 ± 0.47	10.98 ± 0.22	10.37 ± 0.35	9.31 ± 0.18	10.34 ± 0.51	10.31 ± 0.11
PRESCIENT(+g)^a	10.18 ± 1.19	11.45 ± 0.04	9.86 ± 1.16	9.54 ± 0.09	10.46 ± 1.10	8.11 ± 0.06	9.93 ± 1.08	9.06 ± 0.09
PRESCIENT	7.83 ± 0.37	10.45 ± 0.11	7.96 ± 0.37	8.91 ± 0.02	8.22 ± 0.37	7.52 ± 0.08	8.17 ± 0.34	7.79 ± 0.09
PI-SDE	7.36 ± 0.32∗	10.36 ± 0.05	7.36 ± 0.40	8.35 ± 0.12	7.65 ± 0.39	7.41 ± 0.03	7.69 ± 0.44	7.61 ± 0.35

The table presents results from four distinct held-out tasks, where Day 1, Day 2, Day 3, and Day 4 were excluded from the training process, respectively. For each task, we compute the average Wasserstein distance between observed data and predicted data (training loss) and Wasserstein distance between unseen data and predicted data (test loss).

The bold values imply the best performance.

PRESCIENT with estimated growth rate.

Table 1.

Held-one-out performance across five seeds on Veres data.

	Held-out t = 1		Held-out t = 2		Held-out t = 3		Held-out t = 4
Model	Train	Test	Train	Test	Train	Test	Train	Test
TrajectoryNet	10.59 ± 1.08	12.81 ± 0.08	10.71 ± 1.01	11.69 ± 0.24	11.51 ± 0.49	9.62 ± 0.14	11.09 ± 0.83	10.38 ± 0.13
MIOFlow	10.09 ± 0.50	10.91 ± 0.02	10.33 ± 0.47	10.98 ± 0.22	10.37 ± 0.35	9.31 ± 0.18	10.34 ± 0.51	10.31 ± 0.11
PRESCIENT(+g)^a	10.18 ± 1.19	11.45 ± 0.04	9.86 ± 1.16	9.54 ± 0.09	10.46 ± 1.10	8.11 ± 0.06	9.93 ± 1.08	9.06 ± 0.09
PRESCIENT	7.83 ± 0.37	10.45 ± 0.11	7.96 ± 0.37	8.91 ± 0.02	8.22 ± 0.37	7.52 ± 0.08	8.17 ± 0.34	7.79 ± 0.09
PI-SDE	7.36 ± 0.32∗	10.36 ± 0.05	7.36 ± 0.40	8.35 ± 0.12	7.65 ± 0.39	7.41 ± 0.03	7.69 ± 0.44	7.61 ± 0.35

	Held-out t = 1		Held-out t = 2		Held-out t = 3		Held-out t = 4
Model	Train	Test	Train	Test	Train	Test	Train	Test
TrajectoryNet	10.59 ± 1.08	12.81 ± 0.08	10.71 ± 1.01	11.69 ± 0.24	11.51 ± 0.49	9.62 ± 0.14	11.09 ± 0.83	10.38 ± 0.13
MIOFlow	10.09 ± 0.50	10.91 ± 0.02	10.33 ± 0.47	10.98 ± 0.22	10.37 ± 0.35	9.31 ± 0.18	10.34 ± 0.51	10.31 ± 0.11
PRESCIENT(+g)^a	10.18 ± 1.19	11.45 ± 0.04	9.86 ± 1.16	9.54 ± 0.09	10.46 ± 1.10	8.11 ± 0.06	9.93 ± 1.08	9.06 ± 0.09
PRESCIENT	7.83 ± 0.37	10.45 ± 0.11	7.96 ± 0.37	8.91 ± 0.02	8.22 ± 0.37	7.52 ± 0.08	8.17 ± 0.34	7.79 ± 0.09
PI-SDE	7.36 ± 0.32∗	10.36 ± 0.05	7.36 ± 0.40	8.35 ± 0.12	7.65 ± 0.39	7.41 ± 0.03	7.69 ± 0.44	7.61 ± 0.35

The bold values imply the best performance.

PRESCIENT with estimated growth rate.

Table 2.

Held-one-out performance across five seeds on Veres data (continued).

	Held-out t = 5		Held-out t = 6		Held-out t = 7
Model	Train	Test	Train	Test	Train	Test
TrajectoryNet	11.06 ± 0.97	11.33 ± 0.08	10.99 ± 1.09	11.39 ± 0.16	10.76 ± 0.94	13.06 ± 0.45
MIOFlow	10.19 ± 0.53	10.59 ± 0.14	10.35 ± 0.52	10.68 ± 0.07	10.26 ± 0.59	11.01 ± 0.07
PRESCIENT(+g)^a	9.77 ± 1.12	9.76 ± 0.36	9.76 ± 1.08	11.13 ± 0.20	9.85 ± 1.18	13.43 ± 0.19
PRESCIENT	8.08 ± 0.32	7.87 ± 0.10	8.10 ± 0.40	8.27 ± 0.14	7.92 ± 0.41	9.19 ± 0.04
PI-SDE	7.51 ± 0.31∗	7.40 ± 0.17	7.43 ± 0.30	7.66 ± 0.15	7.34 ± 0.33	8.61 ± 0.17

	Held-out t = 5		Held-out t = 6		Held-out t = 7
Model	Train	Test	Train	Test	Train	Test
TrajectoryNet	11.06 ± 0.97	11.33 ± 0.08	10.99 ± 1.09	11.39 ± 0.16	10.76 ± 0.94	13.06 ± 0.45
MIOFlow	10.19 ± 0.53	10.59 ± 0.14	10.35 ± 0.52	10.68 ± 0.07	10.26 ± 0.59	11.01 ± 0.07
PRESCIENT(+g)^a	9.77 ± 1.12	9.76 ± 0.36	9.76 ± 1.08	11.13 ± 0.20	9.85 ± 1.18	13.43 ± 0.19
PRESCIENT	8.08 ± 0.32	7.87 ± 0.10	8.10 ± 0.40	8.27 ± 0.14	7.92 ± 0.41	9.19 ± 0.04
PI-SDE	7.51 ± 0.31∗	7.40 ± 0.17	7.43 ± 0.30	7.66 ± 0.15	7.34 ± 0.33	8.61 ± 0.17

The table presents results from three distinct held-out tasks where Day 5, Day 6, and Day 7 were excluded from the training process, respectively. For each task, we compute the average Wasserstein distance between observed data and predicted data (training loss) and Wasserstein distance between unseen data and predicted data (test loss).

The bold values imply the best performance.

PRESCIENT with estimated growth rate.

Table 2.

Held-one-out performance across five seeds on Veres data (continued).

	Held-out t = 5		Held-out t = 6		Held-out t = 7
Model	Train	Test	Train	Test	Train	Test
TrajectoryNet	11.06 ± 0.97	11.33 ± 0.08	10.99 ± 1.09	11.39 ± 0.16	10.76 ± 0.94	13.06 ± 0.45
MIOFlow	10.19 ± 0.53	10.59 ± 0.14	10.35 ± 0.52	10.68 ± 0.07	10.26 ± 0.59	11.01 ± 0.07
PRESCIENT(+g)^a	9.77 ± 1.12	9.76 ± 0.36	9.76 ± 1.08	11.13 ± 0.20	9.85 ± 1.18	13.43 ± 0.19
PRESCIENT	8.08 ± 0.32	7.87 ± 0.10	8.10 ± 0.40	8.27 ± 0.14	7.92 ± 0.41	9.19 ± 0.04
PI-SDE	7.51 ± 0.31∗	7.40 ± 0.17	7.43 ± 0.30	7.66 ± 0.15	7.34 ± 0.33	8.61 ± 0.17

	Held-out t = 5		Held-out t = 6		Held-out t = 7
Model	Train	Test	Train	Test	Train	Test
TrajectoryNet	11.06 ± 0.97	11.33 ± 0.08	10.99 ± 1.09	11.39 ± 0.16	10.76 ± 0.94	13.06 ± 0.45
MIOFlow	10.19 ± 0.53	10.59 ± 0.14	10.35 ± 0.52	10.68 ± 0.07	10.26 ± 0.59	11.01 ± 0.07
PRESCIENT(+g)^a	9.77 ± 1.12	9.76 ± 0.36	9.76 ± 1.08	11.13 ± 0.20	9.85 ± 1.18	13.43 ± 0.19
PRESCIENT	8.08 ± 0.32	7.87 ± 0.10	8.10 ± 0.40	8.27 ± 0.14	7.92 ± 0.41	9.19 ± 0.04
PI-SDE	7.51 ± 0.31∗	7.40 ± 0.17	7.43 ± 0.30	7.66 ± 0.15	7.34 ± 0.33	8.61 ± 0.17

The bold values imply the best performance.

PRESCIENT with estimated growth rate.

Tables 1 and 2 detailed the performance for each held-one-out task. In all subproblems, PI-SDE not only achieved the best fit on the training data, but also showed superior predictive ability on the test data, particularly excelling at predicting later time points. For instance, when Day 6 was excluded, PRESCIENT achieved a mean loss of 8.27 (without estimated growth rate), outperforming MIOFlow (10.68) and TrajectoryNet (11.39). PI-SDE achieved a test loss of 7.66, marking a 7.6% improvement over that of PRESCIENT and showcasing its long-term prediction accuracy. Additionally, PI-SDE clearly outperformed the baselines in three held-multi-tasks, which held out two time points (Supplementary Table S2).

3.2 PI-SDE reconstructs a biologically interpretable cellular potential energy landscape

We next trained PI-SDE using data from all time points and focused on the potential energy landscape learned by PI-SDE across two scRNA-seq datasets. Overall, the estimated potential energy values clearly recapitulated the developmental processes (Figs 2c and 3c). We hypothesized that the potential energy landscape derived by PI-SDE should be able to quantify cell differentiation potency. Specifically, we expected cells at earlier time points to exhibit higher potential energy when compared to more differentiated cells at later time points. Supplementary Fig. S2 confirmed this hypothesis, showing a clear trend of decreasing cell potential over time. To validate this observation statistically, we followed the testing approach proposed by Shi et al. (2019), and used the one-sided Wilcoxon rank-sum test between consecutive time points. Our results were highly significant, affirming that cells at earlier stages consistently had higher potential energy than those at later stages. For instance, in MH data, P-values were less than 2.95e-248 and 7.52e-62 when comparing Day 2–Day 4 and Day 4–Day 6, respectively.

For Veres data, the potential energy landscape reconstructed by PI-SDE exhibited progressive decline trend over developmental time (right panel of Fig. 2c). This trend was consistent with the underlying cellular dynamics. In contrast, the potential landscape reconstructed by PRESCIENT showed a sharp drop of potential energy for the cells collected at the final time point (Day 7) (middle panel of Fig. 2c). Moreover, this landscape by PRESCIENT seemed to be overly dominated by the data collected on Day 7, as shown by the red circled region in the left and middle panels of Fig. 2c. This phenomenon was likely due to the entropic regularization used by PRESCIENT, which aims to minimize the potential energy function at the final time point, possibly leading to an overfitting of the potential energy for cells at this stage. In addition, in the PRESCIENT’s landscape, we found that a handful of cells with high potential energy were scattered around the low potential energy area (the red circled region in the middle panel of Fig. 2c).

To further illustrate the superiority of our potential landscape, we visualized cellular velocity derived from the negative gradient of the resulting potential function. Since TrajectoryNet can directly infer cellular velocity, we added TrajectoryNet to the comparative analysis here (Fig. 2d). The cellular velocity predicted by PI-SDE correctly oriented cells toward terminal fates, whereas the vector fields inferred by both TrajectoryNet and PRESCIENT occasionally showed cells deviating from the data manifolds (e.g. the regions circled in red dashed lines in Fig. 2d). PI-SDE also successfully recapitulated the two differentiation branches during pancreatic β-cell differentiation, as shown in Fig. 2a. Specifically, for branch 1, our results clearly showed distinct directional differences in prog_sox2 and prog_nkx61 at Day 0, which stands in contrast to PRESCIENT’s results which indicated a homogeneous pattern, while TrajectoryNet displayed chaotic vectors.

As for MH data, we compared the potential energy and cellular velocity generated by PI-SDE, PRESCIENT (with growth rate), and PRESCIENT. In general, the estimated potential energy by PI-SDE clearly recapitulated the developmental process of mouse hematopoiesis, showing a transcriptional continuum from undifferentiated cells to mature cells. Similarly, the cellular velocity inferred by PI-SDE delineated the expected directional flow along the differentiation path when visualized on UMAP embedding (left panel of Fig. 3c).

By comparing the velocity for cells near the earliest time point (the regions circled in red dashed lines in Fig. 3c), we found that the cellular velocity inferred by PI-SDE clearly delineated the main cell lineages of monocyte and neutrophil. In contrast, the cellular velocity predicted by PRESCIENT (with growth rate) indicated a slightly trend of bifurcation, while PRESCIENT predicted chaotic movement in the early stages. As explored by PRESCIENT, the addition of cell growth rate altered the potential landscape near the earliest time point, improving the performance of fate prediction. We showed that our proposed physics-informed loss function could further improve the potential landscape near the earliest time point. This suggested that PI-SDE tended to capture cell fate information much earlier in time than PRESCIENT, even accounting for cell growth rate. To further demonstrate the superiority of our velocity, we followed the work of Yeo et al. (2021) to introduce in silico perturbations at the initial time point. Our results showed that PI-SDE was able to predict the expected outcome of transcription factor perturbations (Supplementary Fig. S3). This suggested that the cellular velocity generated by PI-SDE aligned well with the directional dynamics of cell movement (Supplementary Note S7).

3.3 HJ regularization stabilizes the training process

Next, we demonstrated the power of HJ regularization in the training process of PI-SDE. To illustrate its utility, we used the Veres data as an example and conducted a hyperparameter sensitivity analysis for the held-one-out task that removes data at Day 6. We set the diffusion coefficient as a vector parameter to be optimized during model training and explored a range of learning rates during optimization (lr = 0.001, 0.002, 0.005, and 0.01) and regularization strengths (λ = 0, 0.001, 0.01, 0.1, 0.5, and 1) to assess their impact on the model’s training and test performance.

As depicted in Supplementary Fig. S4, we observed that higher learning rates often led to instability, particularly when regularization strength λ did not exceed 0.01. For instance, when $λ = 0.001$ ⁠, an increase in the learning rate led to an increase in the best test loss for the held-out time (Day 6), from 42.880 (lr = 0.001) to 51.755 (lr = 0.01). However, the situation was reversed when $λ > 0.01$ ⁠. For example, when $λ = 0.05$ ⁠, the best test loss in held-out time (Day 6) decreased from 33.548 (lr = 0.001) to 27.377 (lr = 0.01), as learning rate increased. Since the magnitude of λ reflects the strength of HJ regularization in our physics-informed loss function, these results suggested that HJ regularization guided PI-SDE to find the optimal potential energy landscape with accurate predictions.

Beyond achieving superior test results, our comprehensive hyperparameter sensitivity analysis across a broad spectrum of parameter configurations revealed that PI-SDE achieved consistent and robust performance. As HJ regularization became more pronounced (λ ranging from 0 to 1), the model’s learning curves tended to converge and no longer fluctuated sharply (Supplementary Fig. S4). Such stability during the training process can be largely attributed to the integration of HJ regularization, since the physical principle, HJ equation, can be used to deduce that potential function is a Lyapunov function. A fundamental property of a Lyapunov function is its decreasing pattern over time, a characteristic that facilitates the analysis of global stability (Bressloff 2021). Consequently, the presence of HJ regularization (⁠ $λ > 0.01$ ⁠) significantly ensured that our learned potential function would satisfy HJ equation, thus contributing to a more stable convergence.

4 Discussion

In this paper, we propose PI-SDE, a physics-informed Neural-SDE framework, to reconstruct the underlying cellular potential energy landscape from time-series scRNA-seq data. PI-SDE extends the framework of PRESCIENT by leveraging the physical laws that govern the potential energy function–HJ equation. By integrating the HJ regularization, the resulting physics-informed loss function guides our model to accurately capture complex dynamics, while maintaining robustness and biological interpretability.

Generally, PI-SDE effectively combines theoretical principles with machine learning capabilities, thereby reducing overfitting and enabling the exploitation of large-scale, high-dimensional biological data. We have demonstrated the benefits of leveraging the physical insights.

First, PI-SDE maintained superior performance at unseen time points, especially on long-term prediction tasks. This implies that HJ regularization guides PI-SDE to a more accurate potential energy function space to faithfully recapitulate the underlying cellular dynamics.

Second, the potential energy landscape reconstructed by PI-SDE exhibited progressive decline trend over developmental time. The clear gradient trend highlights PI-SDE’s ability to capture the gradual evolution of cellular states from the undifferentiated states at higher energy levels to the differentiated states at lower energy levels. In contrast, PRESCIENT’s potential energy landscape appeared to be overfitted at the final time point on Veres data, resulting in cells at high and low potential being overlapped in gene expression space. This overlap could imply a lesser degree of change as detected by PRESCIENT, or a less sensitive capture of the dynamic cellular processes. Meanwhile, PI-SDE tended to capture cell fate information (e.g. trend toward bifurcation) in the earlier stage on both pancreatic β-cell differentiation and mouse hematopoiesis.

Third, solving SDEs is challenging owing to the inherent stochasticity. Therefore, careful design of the diffusion term is essential to improve stability and efficacy of modeling (Oh et al. 2024). Our results showed that the incorporation of HJ regularization can stabilize the training process, which is also theoretically substantiated. Overall, PI-SDE provides an effective and interpretable tool to model cellular dynamics for predicting gene expressions on unseen data and reconstruct the underlying potential energy landscape.

Nonetheless, some aspects still need to be improved. First, the current PI-SDE does not account for cell growth rate during cell development, possibly resulting in the assumption of conserved cell mass over time. Interestingly, the performance of PRESCIENT without growth rate was superior to PRESCIENT with growth rate, implying that the estimated growth rate may be inaccurate and thus inadvertently introducing bias into the model. We can solve this problem by adopting the unbalanced OT framework of Wasserstein–Fisher–Rao distance introduced by TIGON (Sha et al. 2024). Second, modeling of cell–cell communication in the SDE models can be challenging (Jiang et al. 2022), and we plan to pursue this topic in our future work.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest

None declared.

Funding

This work was supported by National Key Research and Development Program of China [grant number 2022YFA1004801] and National Natural Science Foundation of China [grant number 12071466]. This paper was published as part of a supplement financially supported by ECCB2024.

Data availability

All the datasets used for analysis in this study are publicly available. For Veres data, we downloaded the data from GEO (GSE114412). For MH data, we obtained the raw data from the lineage_tracing folder in https://github.com/AllonKleinLab/paper-data. We provide a more detailed description of these datasets in the Supplementary Note S1.

References

Bressloff

PC.

Stochastic Processes in Cell Biology

. Vol.

, 2nd edn. Heidelberg:

Springer

2021

Fang

Kruse

et al.

Nonequilibrium physics in biology

Rev Mod Phys

2019

;

045004

Crossref

Goldstein

, Poole C, Safko J.

Classical Mechanics

. 3rd edn. New York:

Pearson Education

2011

Huguet

Magruder

Tong

et al.

Manifold interpolating optimal-transport flows for trajectory inference

Adv Neural Inf Process Syst

2022

;

29705

–

PubMed

Jiang

Zhang

Wan

Dynamic inference of cell developmental complex energy landscape from time series single-cell transcriptomic data

PLoS Comput Biol

2022

;

e1009821

Kidger

Foster

et al. Neural SDEs as infinite-dimensional GANs. In: Proceedings of the 38th International Conference on Machine Learning, Vienna, Austria (virtual).

PMLR

2021

, pp.

5453

–

Wong

T-KL

Chen

et al. Scalable gradients for stochastic differential equations. In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, Palermo, Italy (virtual).

PMLR

2020

, pp.

3870

–

Lim

Kim

Stable neural stochastic differential equations in analyzing irregular time series data. In: Proceedings of the 12th International Conference on Learning Representations. Vienna, Austria,

2024

Onken

Fung

et al.

OT-Flow: fast and accurate continuous normalizing flows via optimal transport

AAAI

2021

;

9223

–

Crossref

Ruthotto

Osher

et al.

A machine learning framework for solving high-dimensional mean field game and mean field control problems

Proc Natl Acad Sci USA

2020

;

117

9183

–

Schiebinger

Shu

Tabaka

et al.

Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming

Cell

2019

;

176

928

–

e22

Sha

Qiu

Zhou

et al.

Reconstructing growth and dynamic trajectories from single-cell transcriptomics data

Nat Mach Intell

2024

;

–

Shi

Chen

et al.

Quantifying pluripotency landscape of cell differentiation from scRNA-seq data by continuous birth-death process

PLoS Comput Biol

2019

;

e1007488

Tong

Huang

Wolf

et al. TrajectoryNet: a dynamic optimal transport network for modeling cellular dynamics. In: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria (virtual).

PMLR

2020

, pp.

9526

–

Veres

Faust

Bushnell

et al.

Charting cellular identity during human in vitro β-cell differentiation

Nature

2019

;

569

368

–

Waddington

CH.

The Strategy of the Genes: A Discussion of Some Aspects of Theoretical Biology

. London:

G. Allen and Unwin

1957

Wang

Landscape and flux theory of non-equilibrium dynamical systems with application to biology

Adv Phys

2015

;

–

137

Crossref

E Weinan

Vanden-Eijnden

Applied Stochastic Analysis

. Vol.

199

. New York:

American Mathematical Society

2021