Bayesian Multi-level Mixed-effects Model for Influenza Dynamics

Summary of posterior probabilities over 100 simulation studies for eight different strain-level random effect models

$p_{l 0}$	$\sum_{S}$	null	(a)	(b)	(c)	(a, b)	(a, c)	(b, c)	(a, b, c)
0.2	$Λ_{1} Γ_{1}$	0.008	$\underline{0.713}$	0.003	0.007	0.099	0.116	0.006	0.048
	$Λ_{1} Γ_{2}$	0.008	$\underline{0.703}$	0.003	0.012	0.102	0.110	0.009	0.054
	$Λ_{2} Γ_{1}$	0.005	0.004	0.050	0.116	0.050	0.076	$\underline{0.450}$	0.249
	$Λ_{2} Γ_{2}$	0.008	0.041	0.014	0.044	0.161	0.082	$\underline{0.208}$	$0.443$
	$Λ_{3} Γ_{1}$	0.004	0.024	0.005	0.014	0.032	0.162	0.026	$\underline{0.735}$
	$Λ_{3} Γ_{2}$	0.004	0.056	0.012	0.007	0.099	0.145	0.032	$\underline{0.647}$
0.5	$Λ_{1} Γ_{1}$	0.025	$\underline{0.799}$	0.003	0.011	0.067	0.077	0.004	0.015
	$Λ_{1} Γ_{2}$	0.029	$\underline{0.793}$	0.004	0.014	0.065	0.075	0.006	0.015
	$Λ_{2} Γ_{1}$	0.018	0.032	0.077	0.188	0.050	0.057	$\underline{0.451}$	0.127
	$Λ_{2} Γ_{2}$	0.010	0.092	0.025	0.092	0.177	0.115	$\underline{0.217}$	$0.272$
	$Λ_{3} Γ_{1}$	0.014	0.046	0.014	0.027	0.074	0.205	0.037	$\underline{0.584}$
	$Λ_{3} Γ_{2}$	0.016	0.117	0.025	0.019	0.109	0.187	0.023	$\underline{0.504}$
0.8	$Λ_{1} Γ_{1}$	0.054	$\underline{0.847}$	0.005	0.015	0.034	0.034	0.006	0.004
	$Λ_{1} Γ_{2}$	0.044	$\underline{0.865}$	0.003	0.012	0.035	0.033	0.004	0.004
	$Λ_{2} Γ_{1}$	0.045	0.036	0.173	0.246	0.061	0.022	$\underline{0.359}$	0.058
	$Λ_{2} Γ_{2}$	0.051	0.133	0.052	0.110	0.157	0.040	$\underline{0.192}$	$0.262$
	$Λ_{3} Γ_{1}$	0.037	0.064	0.021	0.042	0.115	0.208	0.028	$\underline{0.485}$
	$Λ_{3} Γ_{2}$	0.028	0.161	0.049	0.030	0.098	0.150	0.050	$\underline{0.435}$

$p_{l 0}$	$\sum_{S}$	null	(a)	(b)	(c)	(a, b)	(a, c)	(b, c)	(a, b, c)
0.2	$Λ_{1} Γ_{1}$	0.008	$\underline{0.713}$	0.003	0.007	0.099	0.116	0.006	0.048
	$Λ_{1} Γ_{2}$	0.008	$\underline{0.703}$	0.003	0.012	0.102	0.110	0.009	0.054
	$Λ_{2} Γ_{1}$	0.005	0.004	0.050	0.116	0.050	0.076	$\underline{0.450}$	0.249
	$Λ_{2} Γ_{2}$	0.008	0.041	0.014	0.044	0.161	0.082	$\underline{0.208}$	$0.443$
	$Λ_{3} Γ_{1}$	0.004	0.024	0.005	0.014	0.032	0.162	0.026	$\underline{0.735}$
	$Λ_{3} Γ_{2}$	0.004	0.056	0.012	0.007	0.099	0.145	0.032	$\underline{0.647}$
0.5	$Λ_{1} Γ_{1}$	0.025	$\underline{0.799}$	0.003	0.011	0.067	0.077	0.004	0.015
	$Λ_{1} Γ_{2}$	0.029	$\underline{0.793}$	0.004	0.014	0.065	0.075	0.006	0.015
	$Λ_{2} Γ_{1}$	0.018	0.032	0.077	0.188	0.050	0.057	$\underline{0.451}$	0.127
	$Λ_{2} Γ_{2}$	0.010	0.092	0.025	0.092	0.177	0.115	$\underline{0.217}$	$0.272$
	$Λ_{3} Γ_{1}$	0.014	0.046	0.014	0.027	0.074	0.205	0.037	$\underline{0.584}$
	$Λ_{3} Γ_{2}$	0.016	0.117	0.025	0.019	0.109	0.187	0.023	$\underline{0.504}$
0.8	$Λ_{1} Γ_{1}$	0.054	$\underline{0.847}$	0.005	0.015	0.034	0.034	0.006	0.004
	$Λ_{1} Γ_{2}$	0.044	$\underline{0.865}$	0.003	0.012	0.035	0.033	0.004	0.004
	$Λ_{2} Γ_{1}$	0.045	0.036	0.173	0.246	0.061	0.022	$\underline{0.359}$	0.058
	$Λ_{2} Γ_{2}$	0.051	0.133	0.052	0.110	0.157	0.040	$\underline{0.192}$	$0.262$
	$Λ_{3} Γ_{1}$	0.037	0.064	0.021	0.042	0.115	0.208	0.028	$\underline{0.485}$
	$Λ_{3} Γ_{2}$	0.028	0.161	0.049	0.030	0.098	0.150	0.050	$\underline{0.435}$

Notes: The subset of ordinary differential equations coefficients that have the strain-level random effects are listed. The bold number denotes the largest posterior probability and the underline denotes the true specification.

TABLE 1

Summary of posterior probabilities over 100 simulation studies for eight different strain-level random effect models

$p_{l 0}$	$\sum_{S}$	null	(a)	(b)	(c)	(a, b)	(a, c)	(b, c)	(a, b, c)
0.2	$Λ_{1} Γ_{1}$	0.008	$\underline{0.713}$	0.003	0.007	0.099	0.116	0.006	0.048
	$Λ_{1} Γ_{2}$	0.008	$\underline{0.703}$	0.003	0.012	0.102	0.110	0.009	0.054
	$Λ_{2} Γ_{1}$	0.005	0.004	0.050	0.116	0.050	0.076	$\underline{0.450}$	0.249
	$Λ_{2} Γ_{2}$	0.008	0.041	0.014	0.044	0.161	0.082	$\underline{0.208}$	$0.443$
	$Λ_{3} Γ_{1}$	0.004	0.024	0.005	0.014	0.032	0.162	0.026	$\underline{0.735}$
	$Λ_{3} Γ_{2}$	0.004	0.056	0.012	0.007	0.099	0.145	0.032	$\underline{0.647}$
0.5	$Λ_{1} Γ_{1}$	0.025	$\underline{0.799}$	0.003	0.011	0.067	0.077	0.004	0.015
	$Λ_{1} Γ_{2}$	0.029	$\underline{0.793}$	0.004	0.014	0.065	0.075	0.006	0.015
	$Λ_{2} Γ_{1}$	0.018	0.032	0.077	0.188	0.050	0.057	$\underline{0.451}$	0.127
	$Λ_{2} Γ_{2}$	0.010	0.092	0.025	0.092	0.177	0.115	$\underline{0.217}$	$0.272$
	$Λ_{3} Γ_{1}$	0.014	0.046	0.014	0.027	0.074	0.205	0.037	$\underline{0.584}$
	$Λ_{3} Γ_{2}$	0.016	0.117	0.025	0.019	0.109	0.187	0.023	$\underline{0.504}$
0.8	$Λ_{1} Γ_{1}$	0.054	$\underline{0.847}$	0.005	0.015	0.034	0.034	0.006	0.004
	$Λ_{1} Γ_{2}$	0.044	$\underline{0.865}$	0.003	0.012	0.035	0.033	0.004	0.004
	$Λ_{2} Γ_{1}$	0.045	0.036	0.173	0.246	0.061	0.022	$\underline{0.359}$	0.058
	$Λ_{2} Γ_{2}$	0.051	0.133	0.052	0.110	0.157	0.040	$\underline{0.192}$	$0.262$
	$Λ_{3} Γ_{1}$	0.037	0.064	0.021	0.042	0.115	0.208	0.028	$\underline{0.485}$
	$Λ_{3} Γ_{2}$	0.028	0.161	0.049	0.030	0.098	0.150	0.050	$\underline{0.435}$

$p_{l 0}$	$\sum_{S}$	null	(a)	(b)	(c)	(a, b)	(a, c)	(b, c)	(a, b, c)
0.2	$Λ_{1} Γ_{1}$	0.008	$\underline{0.713}$	0.003	0.007	0.099	0.116	0.006	0.048
	$Λ_{1} Γ_{2}$	0.008	$\underline{0.703}$	0.003	0.012	0.102	0.110	0.009	0.054
	$Λ_{2} Γ_{1}$	0.005	0.004	0.050	0.116	0.050	0.076	$\underline{0.450}$	0.249
	$Λ_{2} Γ_{2}$	0.008	0.041	0.014	0.044	0.161	0.082	$\underline{0.208}$	$0.443$
	$Λ_{3} Γ_{1}$	0.004	0.024	0.005	0.014	0.032	0.162	0.026	$\underline{0.735}$
	$Λ_{3} Γ_{2}$	0.004	0.056	0.012	0.007	0.099	0.145	0.032	$\underline{0.647}$
0.5	$Λ_{1} Γ_{1}$	0.025	$\underline{0.799}$	0.003	0.011	0.067	0.077	0.004	0.015
	$Λ_{1} Γ_{2}$	0.029	$\underline{0.793}$	0.004	0.014	0.065	0.075	0.006	0.015
	$Λ_{2} Γ_{1}$	0.018	0.032	0.077	0.188	0.050	0.057	$\underline{0.451}$	0.127
	$Λ_{2} Γ_{2}$	0.010	0.092	0.025	0.092	0.177	0.115	$\underline{0.217}$	$0.272$
	$Λ_{3} Γ_{1}$	0.014	0.046	0.014	0.027	0.074	0.205	0.037	$\underline{0.584}$
	$Λ_{3} Γ_{2}$	0.016	0.117	0.025	0.019	0.109	0.187	0.023	$\underline{0.504}$
0.8	$Λ_{1} Γ_{1}$	0.054	$\underline{0.847}$	0.005	0.015	0.034	0.034	0.006	0.004
	$Λ_{1} Γ_{2}$	0.044	$\underline{0.865}$	0.003	0.012	0.035	0.033	0.004	0.004
	$Λ_{2} Γ_{1}$	0.045	0.036	0.173	0.246	0.061	0.022	$\underline{0.359}$	0.058
	$Λ_{2} Γ_{2}$	0.051	0.133	0.052	0.110	0.157	0.040	$\underline{0.192}$	$0.262$
	$Λ_{3} Γ_{1}$	0.037	0.064	0.021	0.042	0.115	0.208	0.028	$\underline{0.485}$
	$Λ_{3} Γ_{2}$	0.028	0.161	0.049	0.030	0.098	0.150	0.050	$\underline{0.435}$

Notes: The subset of ordinary differential equations coefficients that have the strain-level random effects are listed. The bold number denotes the largest posterior probability and the underline denotes the true specification.

To evaluate the performance of our method in different scenarios, we define the average relative estimation error (ARE) of a parameter $θ$ as

\begin{align} ARE = \frac{1}{N} \sum_{i = 1}^{N} \frac{| {\hat{θ}}_{i} - θ |}{| θ |}, \end{align}

where ${\hat{θ}}_{i}$ is the estimate of $θ$ in the ith simulation and $N$ is the number of simulation runs (here $N = 100$ ⁠). Table 2 presents the posterior summaries for the fixed effect ODE parameters over 100 simulated samples including mean, SE, and ARE, which are properties associated with the population level. Here we choose non-informative prior $p_{l 0} = 0.5$ ⁠. It can be seen that the estimated values are quite close to the true values, suggesting good performance of the algorithm. Parameter $a$ is slightly underestimated. The reason is that $a$ is mainly determined by the decay rate of uninfected cell $U$ ⁠. As shown in the upper panel of Figure 2, $U$ decays very fast and quickly becomes flat. Therefore, only the measurements at very beginning time are useful in the estimation of $a$ ⁠. We can increase the accuracy of parameter estimation for $a$ by including more measurements and choosing more knots at the beginning. But in practice, only virus load $V$ is observed, we can only choose knots according to the patterns of virus load changes as described in Section 5.

TABLE 2

Posterior estimates of population-level ordinary differential equations dynamic parameters based on 100 simulated data sets for six different choices of strain-level random effect covariance matrix

	$Λ_{1} Γ_{1}$	$Λ_{1} Γ_{2}$	$Λ_{2} Γ_{1}$	$Λ_{2} Γ_{2}$	$Λ_{3} Γ_{1}$	$Λ_{3} Γ_{2}$
a	7.69	7.75	7.62	6.80	7.29	6.85
	1.43	1.50	0.47	0.92	1.47	1.59
	0.18	0.18	0.15	0.24	0.21	0.26
b	0.73	0.73	0.73	0.76	0.78	0.74
	0.05	0.05	0.13	0.21	0.23	0.17
	0.09	0.09	0.17	0.22	0.23	0.21
c	0.66	0.68	0.72	0.60	0.84	0.68
	0.11	0.16	0.18	0.22	0.29	0.20
	0.07	0.08	0.21	0.25	0.33	0.21

	$Λ_{1} Γ_{1}$	$Λ_{1} Γ_{2}$	$Λ_{2} Γ_{1}$	$Λ_{2} Γ_{2}$	$Λ_{3} Γ_{1}$	$Λ_{3} Γ_{2}$
a	7.69	7.75	7.62	6.80	7.29	6.85
	1.43	1.50	0.47	0.92	1.47	1.59
	0.18	0.18	0.15	0.24	0.21	0.26
b	0.73	0.73	0.73	0.76	0.78	0.74
	0.05	0.05	0.13	0.21	0.23	0.17
	0.09	0.09	0.17	0.22	0.23	0.21
c	0.66	0.68	0.72	0.60	0.84	0.68
	0.11	0.16	0.18	0.22	0.29	0.20
	0.07	0.08	0.21	0.25	0.33	0.21

Notes: For each parameter, the first, second and third rows denote its mean, SE and average relative estimation error, respectively. Here $p_{l 0} = 0.5$ and the true parameters $a = 9$ ⁠, $b = 2 / 3$ ⁠, $c = 2 / 3$ ⁠.

TABLE 2

Open in new tab Download slide

Posterior estimates of population-level ordinary differential equations dynamic parameters based on 100 simulated data sets for six different choices of strain-level random effect covariance matrix

	$Λ_{1} Γ_{1}$	$Λ_{1} Γ_{2}$	$Λ_{2} Γ_{1}$	$Λ_{2} Γ_{2}$	$Λ_{3} Γ_{1}$	$Λ_{3} Γ_{2}$
a	7.69	7.75	7.62	6.80	7.29	6.85
	1.43	1.50	0.47	0.92	1.47	1.59
	0.18	0.18	0.15	0.24	0.21	0.26
b	0.73	0.73	0.73	0.76	0.78	0.74
	0.05	0.05	0.13	0.21	0.23	0.17
	0.09	0.09	0.17	0.22	0.23	0.21
c	0.66	0.68	0.72	0.60	0.84	0.68
	0.11	0.16	0.18	0.22	0.29	0.20
	0.07	0.08	0.21	0.25	0.33	0.21

	$Λ_{1} Γ_{1}$	$Λ_{1} Γ_{2}$	$Λ_{2} Γ_{1}$	$Λ_{2} Γ_{2}$	$Λ_{3} Γ_{1}$	$Λ_{3} Γ_{2}$
a	7.69	7.75	7.62	6.80	7.29	6.85
	1.43	1.50	0.47	0.92	1.47	1.59
	0.18	0.18	0.15	0.24	0.21	0.26
b	0.73	0.73	0.73	0.76	0.78	0.74
	0.05	0.05	0.13	0.21	0.23	0.17
	0.09	0.09	0.17	0.22	0.23	0.21
c	0.66	0.68	0.72	0.60	0.84	0.68
	0.11	0.16	0.18	0.22	0.29	0.20
	0.07	0.08	0.21	0.25	0.33	0.21

Notes: For each parameter, the first, second and third rows denote its mean, SE and average relative estimation error, respectively. Here $p_{l 0} = 0.5$ and the true parameters $a = 9$ ⁠, $b = 2 / 3$ ⁠, $c = 2 / 3$ ⁠.

We have also conducted simulations to study how the performance of our algorithm is influenced by the number of subjects per strain, that is, $n_{s}$ ⁠. We found that increasing $n_{s}$ can reduce the bias in the estimation of population level ODE parameters. For example, under setting $Λ_{1} Γ_{1}$ with $p_{l 0} = 0.5$ ⁠, if the number of subjects per strain $n_{s} = 20$ ⁠, we obtain that the AREs are 0.13, 0.08 and 0.05 for the fixed effect coefficients a, b and c, respectively, which are smaller than the corresponding values of 0.18, 0.09 and 0.07 obtained based on $n_{s} = 6$ ⁠. We also studied the effect of the number of virus load measurements $m$ for each subject and found that the conclusion is pretty similar, that is, the bias for the estimation of the fixed effect coefficients decreases with $m$ ⁠.

5 APPLICATION TO THE EXPERIMENT OF INFLUENZA INFECTIONS IN DUCKS

In this section, we apply the proposed methodology to the data set of viral load dynamics following influenza infections in ducks as shown in Figure 1. Cubic B-splines with 10 equally spaced knots between day 1 and day 4 and 10 equally spaced knots between day 4 and day 14 are used to approximate the ODE solution. The knots are chosen in this way in order to capture the patterns of virus load changes which increase sharply from day 1 to day 3 followed by a slow and continuous decrease until day 14. We have tried different number of knots in the analysis and obtained very similar results. For example, if we choose seven knots between day 1 and day 4 and five knots between day 4 and day 14, the conclusion is pretty similar for both visualisation and parameter estimation. Since no previous information is available about the strain level variation in duck flu, we choose non-informative prior $p_{l 0} = 0.5$ on the set $λ_{l} = 0$ for $l = 1, 2, 3$ to reflect a balance among outcomes. For all the other hyper-parameters, we use the same values as in the simulation study to induce relatively diffuse priors on those parameters. We also follow the similar way as in the simulation study for choosing the initial values of $θ_{s j}$ and $C_{s j}$ ⁠. We discard the first 5000 iterations as burn-in. Posterior probabilities are calculated based on 20,000 iterations collected thereafter. To compute posterior summaries of the parameters, we thin the chain by a factor of 10.

Figure 3 displays the individual fitted curves based on the solutions of the viral load $V$ to the ODE equations. The virus load of the jth subject in the sth strain is estimated from ODE using parameters ${\hat{θ}}_{s j}$ which is taken as the average of the 2000 posterior samples for $θ_{s j}$ ⁠. We found that the model provided a good fit to the observed data for all of the subjects.

FIGURE 3

Fits of the ordinary differential equations (ODE) models to the virus load data for each infected duck. The solid lines represent the ODE trajectories based on estimated coefficients for each individual using the Bayesian hierarchical mixed-effect model. [Colour figure can be viewed at https://dbpia.nl.go.kr]

To assess sensitivity to prior model probability, the analysis was repeated for $p_{l 0} = 0.2$ and $p_{l 0} = 0.8$ ⁠, respectively. Table 3 lists the posterior probabilities of the eight sub-models. There is evidence that the ODE mixed model which includes between-strain random variations for the parameter $a$ alone is optimal. This conclusion holds for three choices of the prior probability for $λ_{l} = 0$ ⁠. The marginal probabilities of including the strain level random effects for the parameter $a$ are 83%, 62% and 82%. In contrast, the corresponding marginal probabilities are 27.9%, 9.4% 29.85% for including the parameter $b$ and 24.75%, 8.6% 26.6% for including the parameter $c$ ⁠. This indicates that the cells are infected at different rates for different strains, but both the death rates of the infected cell and the clearance rates of the free virus show strong homogeneity across different strains.

TABLE 3

Posterior probabilities of eight different strain-level random effect models in the duck influenza applications

Model	$p_{l 0} = 0.2$	$p_{l 0} = 0.5$	$p_{l 0} = 0.8$
No random effects	0.090	0.300	0.079
$(a)$	$0.451$	$0.531$	$0.420$
$(b)$	0.037	0.043	0.050
$(c)$	0.029	0.032	0.033
$(a, b)$	0.18	0.042	0.19
$(b, c)$	0.016	0.006	0.018
$(a, c)$	0.152	0.045	0.170
$(a, b, c)$	0.052	0.005	0.045

Model	$p_{l 0} = 0.2$	$p_{l 0} = 0.5$	$p_{l 0} = 0.8$
No random effects	0.090	0.300	0.079
$(a)$	$0.451$	$0.531$	$0.420$
$(b)$	0.037	0.043	0.050
$(c)$	0.029	0.032	0.033
$(a, b)$	0.18	0.042	0.19
$(b, c)$	0.016	0.006	0.018
$(a, c)$	0.152	0.045	0.170
$(a, b, c)$	0.052	0.005	0.045

Note: The subset of ODE coefficients that have non-zero strain level random effects are listed. The values in bold indicate the highest posterior probabilities.

TABLE 3

Posterior probabilities of eight different strain-level random effect models in the duck influenza applications

Model	$p_{l 0} = 0.2$	$p_{l 0} = 0.5$	$p_{l 0} = 0.8$
No random effects	0.090	0.300	0.079
$(a)$	$0.451$	$0.531$	$0.420$
$(b)$	0.037	0.043	0.050
$(c)$	0.029	0.032	0.033
$(a, b)$	0.18	0.042	0.19
$(b, c)$	0.016	0.006	0.018
$(a, c)$	0.152	0.045	0.170
$(a, b, c)$	0.052	0.005	0.045

Model	$p_{l 0} = 0.2$	$p_{l 0} = 0.5$	$p_{l 0} = 0.8$
No random effects	0.090	0.300	0.079
$(a)$	$0.451$	$0.531$	$0.420$
$(b)$	0.037	0.043	0.050
$(c)$	0.029	0.032	0.033
$(a, b)$	0.18	0.042	0.19
$(b, c)$	0.016	0.006	0.018
$(a, c)$	0.152	0.045	0.170
$(a, b, c)$	0.052	0.005	0.045

Note: The subset of ODE coefficients that have non-zero strain level random effects are listed. The values in bold indicate the highest posterior probabilities.

In addition to investigating differences among strains, investigators are interested in assessing the overall population-level effect of the dynamic parameters. The population posterior means and the corresponding SE as well as the 95% highest density interval (HDI) for the three parameters are summarised in Table 4. This result is also robust to the various prior choices. Table 5 displays the posterior estimation of the dynamic parameters for each strain, that is, $μ + α_{s}$ in (3). It can be seen that, for parameters $b$ and $c$ ⁠, there are no distinct between-strain differences. But for parameter $a$ ⁠, the between-strain differences are quite clear especially for sub-types H4N8 and H4N6* whose values are much larger and smaller, respectively, than the values of the remaining sub-types. This result is consistent with the posterior probability pattern shown in Table 3.

TABLE 4

Posterior estimates of population-level dynamic parameters including mean, SE and 95% highest density interval (HDI)

	$a$	$b$	$c$
Mean	8.01	1.11	0.93
SE	2.11	0.26	0.34
HDI	(3.47,12.01)	(0.65,1.62)	(0.30,1.64)

TABLE 4

Posterior estimates of population-level dynamic parameters including mean, SE and 95% highest density interval (HDI)

	$a$	$b$	$c$
Mean	8.01	1.11	0.93
SE	2.11	0.26	0.34
HDI	(3.47,12.01)	(0.65,1.62)	(0.30,1.64)

TABLE 5

Posterior mean and SE (in parenthesis) of ordinary differential equations parameters for each individual strain including fixed effect and strain-specific random effect

Strain	$a$	$b$	$c$
H6N8	8.90 (2.39)	1.15 (0.30)	0.91 (0.38)
H3N8	7.89 (2.30)	1.11 (0.27)	1.01 (0.46)
H4N8	10.41 (3.21)	1.10 (0.28)	0.90 (0.38)
H6N1	7.94 (2.22)	1.12 (0.27)	0.95 (0.37)
H4N6*	4.83 (3.77)	1.06 (0.33)	0.89 (0.40)
H3N8*	7.44 (2.40)	1.16 (0.30)	0.95 (0.37)
H6N2	8.56 (2.52)	1.08 (0.31)	0.90 (0.39)

Strain	$a$	$b$	$c$
H6N8	8.90 (2.39)	1.15 (0.30)	0.91 (0.38)
H3N8	7.89 (2.30)	1.11 (0.27)	1.01 (0.46)
H4N8	10.41 (3.21)	1.10 (0.28)	0.90 (0.38)
H6N1	7.94 (2.22)	1.12 (0.27)	0.95 (0.37)
H4N6*	4.83 (3.77)	1.06 (0.33)	0.89 (0.40)
H3N8*	7.44 (2.40)	1.16 (0.30)	0.95 (0.37)
H6N2	8.56 (2.52)	1.08 (0.31)	0.90 (0.39)

TABLE 5

Posterior mean and SE (in parenthesis) of ordinary differential equations parameters for each individual strain including fixed effect and strain-specific random effect

Strain	$a$	$b$	$c$
H6N8	8.90 (2.39)	1.15 (0.30)	0.91 (0.38)
H3N8	7.89 (2.30)	1.11 (0.27)	1.01 (0.46)
H4N8	10.41 (3.21)	1.10 (0.28)	0.90 (0.38)
H6N1	7.94 (2.22)	1.12 (0.27)	0.95 (0.37)
H4N6*	4.83 (3.77)	1.06 (0.33)	0.89 (0.40)
H3N8*	7.44 (2.40)	1.16 (0.30)	0.95 (0.37)
H6N2	8.56 (2.52)	1.08 (0.31)	0.90 (0.39)

Strain	$a$	$b$	$c$
H6N8	8.90 (2.39)	1.15 (0.30)	0.91 (0.38)
H3N8	7.89 (2.30)	1.11 (0.27)	1.01 (0.46)
H4N8	10.41 (3.21)	1.10 (0.28)	0.90 (0.38)
H6N1	7.94 (2.22)	1.12 (0.27)	0.95 (0.37)
H4N6*	4.83 (3.77)	1.06 (0.33)	0.89 (0.40)
H3N8*	7.44 (2.40)	1.16 (0.30)	0.95 (0.37)
H6N2	8.56 (2.52)	1.08 (0.31)	0.90 (0.39)

Since the seven sub-types are identified according to the genomic characterisation, our random effect model selection analysis shows that the genetic differences do have some effects on the transmission dynamics of IAV through changing the infection rate parameter $a$ ⁠. As our analysis was based on a limited number of tested viruses, such a finding should be reinforced with additional studies focusing on other virus genotypes, at other locations and in different avian hosts. This result nevertheless provides some insight to help biologist for further investigation.

6 DISCUSSION

In this paper, we propose a Bayesian multi-level mix-effects ODE model to describe the dynamics of influenza virus infection. Both the population and individual dynamic parameters can be estimated. In comparison with fixed-effect modelling, mixed-effects modelling has the advantages of obtaining more precise parameter estimation by pooling all data together. By relaxing the ODE constraint with a probability expression, our method does not need to solve ODE directly. The variations at different levels are simultaneously considered by incorporating both strain-specific and subject-specific random effects for each ODE coefficient. One advantage of the proposed method is that the closed form conditional posterior distributions for all variables and parameters can be obtained which substantially facilitate the convergence process. To identify the variation at strain level, we perform random-effects model selection based on a decomposition of the covariance matrix of the strain-specific random-effects distribution. The decomposition enables us to specify a prior distribution which can exclude one or more random effects by setting their variances to zero. This re-parameterisation of the covariance matrix results in straightforward and efficient posterior computation via a Gibbs sampler.

We have presented large-scale simulation examples and an actual long-term surveillance study of IAV circulation in wild birds to illustrate how the Bayesian procedures can be applied to influenza virus dynamics investigation. For the simulation studies, it was seen that the models provided fairly good fits to both ODE random effects and ODE fixed effects. The estimates for both population and individual dynamic parameters are also very close to the true values. For the real data, the proposed model fitted the observed viral load reasonably well for all subjects in our study. Using random effects variable selection, we identify that the cell infection rate $a$ varies among different strains while the other two ODE parameters are quite homogeneous across the strains.

We use a simplified model to explain the observed patterns and characterise the biological mechanisms of IAV infection with the main goals of retaining crucial features of influenza dynamics. We plan to generalise it to more complicated models with consideration of more infected cell and virus compartments in order to provide more accurate descriptions for long-term IAV dynamics. One challenge is the identifiability problem of model parameters due to the complexity of the models and the factor that only viral loads are observed among ODE variables. We need to consider the trade-off between the complexity and applicability of influenza dynamic models. It has been shown in Lebarbenchon et al. (2011) that the environmental factors, particularly temperature and humidity, appear to be important determinant of IAV persistence and therefore within-host fitness. We will consider modelling ODE parameters as function of environmental factors in order to integrate data and models at both the infection dynamics and the persistence of the virus in the environment within a combined ecological and within-host framework.

DATA AVAILABILITY STATEMENT

Data and code can be obtained from the github repository https://github.com/hwhuanghep/random_effect_ODE.

ACKNOWLEDGEMENTS

The author thanks the editor, associate editor and two referees for many helpful comments and suggestions which led to a much improved presentation. The author also thanks Andreas Handel for sharing the data set and Xiao Song for helpful discussion. This research is supported by Division of Mathematical Sciences (National Science Foundation) grant DMS-1916411.

REFERENCES

Baccam

,

P.

,

Beauchemin

,

C.

,

Macken

,

C.A.

,

Hayden

,

F.G.

&

Perelson

,

A.S.

(

2006

)

Kinetics of influenza a virus infection in humans

.

Journal of Virology

,

80

,

7590

–

7599

.

Beal

,

S.

&

Sheiner

,

L.

(

1980

)

The NONMEM system

.

The American Statistician

,

34

,

118

–

119

.

Campbell

,

D.

&

Steele

,

R.J.

(

2012

)

Smooth functional tempering for nonlinear differential equation models

.

Statistics and Computing

,

22

,

429

–

443

.

Chen

,

Z.

&

Dunson

,

D.B.

(

2003

)

Random effects selection in linear mixed models

.

Biometrics

,

59

,

762

–

769

.

Dondelinger

,

F.

,

Filippone

,

M.

,

Rogers

,

S.

&

Husmeier

,

D.

(

2013

)

ODE parameter inference using adaptive gradient matching with Gaussian processes. Proceedings of the 16th international conference on artificial intelligence and statistics, pp. 216–228

.

Handel

,

A.

,

Lebarbenchon

,

C.

,

Stallknecht

,

D.

&

Rohani

,

P.

(

2014

)

Trade-offs between and within scales: environmental persistence and within-host fitness of avian influenza viruses

.

Proceedings of the Royal Society B: Biological Sciences

,

281

, 20133051.

Huang

,

H.

,

Handel

,

A.

&

Song

,

X.

(

2020

)

A new Bayesian approach to estimate parameters of ordinary differential equation

.

Computational Statistics

,

35

,

1481

–

1499

.

Huang

,

Y.

,

Liu

,

D.

&

Wu

,

H.

(

2006

)

Hierarchical Bayesian methods for estimation of parameters in a longitudinal HIV dynamic system

.

Biometrics

,

62

,

413

–

423

.

Huang

,

Y.

&

Wu

,

H.

(

2006

)

A Bayesian approach for estimating antiviral efficacy in HIV dynamic models

.

Journal of Applied Statistics

,

33

,

155

–

174

.

Keeler

,

S.P.

,

Lebarbenchon

,

C.

&

Stallknecht

,

D.E.

(

2013

)

Strain-related variation in the persistence of influenza a virus in three types of water: distilled water, filtered surface water, and intact surface water

.

Virology Journal

,

10

,

13

.

Kuhn

,

E.

&

Lavielle

,

M.

(

2005

)

Maximum likelihood estimation in nonlinear mixed effects models

.

Computational Statistics and Data Analysis

,

49

,

1020

–

1038

.

Lebarbenchon

,

C.

,

Sreevatsan

,

S.

,

Lefvre

,

T.

,

Yang

,

M.

,

Ramakrishnan

,

M.A.

,

Brown

,

J.D.

et al. (

2012

)

Reassortant influenza a viruses in wild duck populations: effects on viral shedding and persistence in water

.

Proceedings of the Biological Sciences

,

279

(

1744

),

3967

–

3975

.

Lebarbenchon

,

C.

,

Yang

,

M.

,

Keeler

,

S.P.

,

Ramakrishnan

,

M.A.

,

Brown

,

J.D.

,

Stallknecht

,

D.E.

et al. (

2011

)

Viral replication, persistence in water and genetic characterization of two influenza a viruses isolated from surface lake water

.

PLoS One

,

6

(

10

), e26566.

Liang

,

H.

&

Wu

,

H.

(

2008

)

Parameter estimation for differential equation models using a framework of measurement error in regression models

.

Journal of the American Statistical Association

,

103

,

1570

–

1583

.

Liu

,

B.

,

Wang

,

L.

,

Nie

,

Y.

&

Cao

,

J.

(

2019

)

Bayesian inference of mixed-effects ordinary differential equations models using heavy-tailed distributions

.

Computational Statistics & Data Analysis

,

137

,

233

–

246

.

Lixoft

. (

2012

)

Monolix 4.2

. Accessed at: http://www.lixoft.eu/products/monolix/product-monolix-overview.

Macdonald

,

B.

,

Niu

,

M.

,

Rogers

,

S.

,

Filippone

,

M.

&

Husmeier

,

D.

(

2016

)

Approximate parameter inference in systems biology using gradient matching: a comparative evaluation

.

Biomedical Engineering Online

,

15

,

80

.

Miao

,

H.

,

Dykes

,

C.

,

Demeter

,

L.M.

&

Wu

,

H.

(

2009

)

Differential equation modeling of HIV viral fitness experiments: Model identification, model selection, and multimodel inference

.

Biometrics

,

65

,

292

–

300

.

Nowak

,

M.

&

May

,

R.M.

(

2000

)

Virus dynamics: mathematical principles of immunology and virology: mathematical principles of immunology and virology

.

Oxford

:

Oxford University Press

.

Google Preview

Ramsay

,

J.O.

,

Hooker

,

G.

,

Campbell

,

D.

&

Cao

,

J.

(

2007

)

Parameter estimation for differential equations: a generalized smoothing approach

.

Journal of the Royal Statistical Society, Series B: Statistical Methodology

,

69

,

741

–

796

.

Smith

,

A.M.

&

Perelson

,

A.S.

(

2011

) Influenza a virus infection kinetics: quantitative data and models. In:

Wiley interdisciplinary reviews: systems biology and medicine

, Vol.

3

.

Hoboken

:

Wiley

.

Google Preview

Wang

,

L.

,

Cao

,

J.

,

Ramsay

,

J.

,

Burger

,

D.

,

Laporte

,

C.

&

Rockstroh

,

J.

(

2014

)

Estimating mixed-effects differential equation models

.

Statistics and Computing

,

24

,

111

–

121

.

Wilcox

,

B.R.

,

Knutsen

,

G.A.

,

Berdeen

,

J.

,

Goekjian

,

V.

,

Poulson

,

R.

,

Goyal

,

S.

et al. (

2011

)

Influenza-a viruses in ducks in northwestern minnesota: ne scale spatial and temporal variation in prevalence and subtype diversity

.

PLoS One

,

6

(

9

), e24010.