Dynamic landmark prediction for mixture data Free

Estimation performance at landmark times of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) compared to the estimator that ignores landmarking (NPNA estimator assuming |$t_0=0$|⁠, denoted as NPNA|$_{t_0=0}$|⁠), and the Kaplan–Meier estimator that ignores covariate influence (KM) and modified to adjust for landmarking

	\|$\widehat F_1(\cdot\|t_0,z,w)$\|			\|$\widehat F_2(\cdot\|t_0,z,w)$\|
	KM	NPNA	NPNA\|$_{t_0=0}$\|	KM	NPNA	NPNA\|$_{t_0=0}$\|
	Setting 1: \|$t_0=0.5,z=1,w=0.5$\|
abs bias	7.0813	0.2352	2.9457	3.1558	0.2564	13.9247
emp var	0.0465	0.3472	0.3647	0.0554	0.4489	0.4199
est var	0.0481	0.3732	0.3813	0.0492	0.3681	0.3755
95% cov	10.3333	94.7222	92.7778	68.3333	90.5556	37.7778
MSE	0.5570	0.3741	0.4750	0.1530	0.3690	2.3430
	Setting 1: \|$t_0=1,z=1,w=0.5$\|
abs bias	4.4770	0.3764	11.3412	2.1490	0.2543	24.5883
emp var	0.0523	0.3809	0.3848	0.0487	0.4277	0.4342
est var	0.0538	0.3664	0.4007	0.0500	0.3324	0.3865
95% cov	51.8125	92.3125	54.1250	84.3125	89.8125	3.9375
MSE	0.2680	0.3683	1.7336	0.1015	0.3333	6.4955
	Setting 2: \|$t_0=20,z=1,w=45$\|
abs bias	2.0295	0.7752	4.0646	1.8468	1.5081	1.5619
emp var	0.0220	0.1795	0.2064	0.0262	0.1659	0.1889
est var	0.0218	0.1582	0.1773	0.0231	0.1455	0.1594
95% cov	67.7500	93.0000	87.2500	77.2500	89.2500	91.5000
MSE	0.0682	0.1702	0.4119	0.0573	0.1765	0.1890
	Setting 2: \|$t_0=30,z=1,w=45$\|
abs bias	1.7798	2.0042	1.5612	2.2796	2.6579	2.1070
emp var	0.0080	0.0562	0.0406	0.0401	0.2249	0.2155
est var	0.0087	0.0521	0.0378	0.0346	0.1957	0.1825
95% cov	52.5000	93.0000	95.0000	79.0000	90.0000	90.0000
MSE	0.0404	0.0922	0.0622	0.0866	0.2664	0.2269

	\|$\widehat F_1(\cdot\|t_0,z,w)$\|			\|$\widehat F_2(\cdot\|t_0,z,w)$\|
	KM	NPNA	NPNA\|$_{t_0=0}$\|	KM	NPNA	NPNA\|$_{t_0=0}$\|
	Setting 1: \|$t_0=0.5,z=1,w=0.5$\|
abs bias	7.0813	0.2352	2.9457	3.1558	0.2564	13.9247
emp var	0.0465	0.3472	0.3647	0.0554	0.4489	0.4199
est var	0.0481	0.3732	0.3813	0.0492	0.3681	0.3755
95% cov	10.3333	94.7222	92.7778	68.3333	90.5556	37.7778
MSE	0.5570	0.3741	0.4750	0.1530	0.3690	2.3430
	Setting 1: \|$t_0=1,z=1,w=0.5$\|
abs bias	4.4770	0.3764	11.3412	2.1490	0.2543	24.5883
emp var	0.0523	0.3809	0.3848	0.0487	0.4277	0.4342
est var	0.0538	0.3664	0.4007	0.0500	0.3324	0.3865
95% cov	51.8125	92.3125	54.1250	84.3125	89.8125	3.9375
MSE	0.2680	0.3683	1.7336	0.1015	0.3333	6.4955
	Setting 2: \|$t_0=20,z=1,w=45$\|
abs bias	2.0295	0.7752	4.0646	1.8468	1.5081	1.5619
emp var	0.0220	0.1795	0.2064	0.0262	0.1659	0.1889
est var	0.0218	0.1582	0.1773	0.0231	0.1455	0.1594
95% cov	67.7500	93.0000	87.2500	77.2500	89.2500	91.5000
MSE	0.0682	0.1702	0.4119	0.0573	0.1765	0.1890
	Setting 2: \|$t_0=30,z=1,w=45$\|
abs bias	1.7798	2.0042	1.5612	2.2796	2.6579	2.1070
emp var	0.0080	0.0562	0.0406	0.0401	0.2249	0.2155
est var	0.0087	0.0521	0.0378	0.0346	0.1957	0.1825
95% cov	52.5000	93.0000	95.0000	79.0000	90.0000	90.0000
MSE	0.0404	0.0922	0.0622	0.0866	0.2664	0.2269

We report average absolute bias (abs bias), empirical variance (emp var), estimated bootstrap variance (est var), 95% coverage, and MSE for |$\widehat{\boldsymbol F}(t|t_0,z,w)$| at specified |$z,w,$| and landmark times |$t_0$|⁠. Results are averaged over |$t\in[t_0,3]$| for Setting 1 and |$t\in[t_0,80]$| in Setting 2 and summarized over 200 simulations with 40% censoring. Sample size is 2000. All results are multiplied by 100.

Table 1.

Estimation performance at landmark times of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) compared to the estimator that ignores landmarking (NPNA estimator assuming |$t_0=0$|⁠, denoted as NPNA|$_{t_0=0}$|⁠), and the Kaplan–Meier estimator that ignores covariate influence (KM) and modified to adjust for landmarking

	\|$\widehat F_1(\cdot\|t_0,z,w)$\|			\|$\widehat F_2(\cdot\|t_0,z,w)$\|
	KM	NPNA	NPNA\|$_{t_0=0}$\|	KM	NPNA	NPNA\|$_{t_0=0}$\|
	Setting 1: \|$t_0=0.5,z=1,w=0.5$\|
abs bias	7.0813	0.2352	2.9457	3.1558	0.2564	13.9247
emp var	0.0465	0.3472	0.3647	0.0554	0.4489	0.4199
est var	0.0481	0.3732	0.3813	0.0492	0.3681	0.3755
95% cov	10.3333	94.7222	92.7778	68.3333	90.5556	37.7778
MSE	0.5570	0.3741	0.4750	0.1530	0.3690	2.3430
	Setting 1: \|$t_0=1,z=1,w=0.5$\|
abs bias	4.4770	0.3764	11.3412	2.1490	0.2543	24.5883
emp var	0.0523	0.3809	0.3848	0.0487	0.4277	0.4342
est var	0.0538	0.3664	0.4007	0.0500	0.3324	0.3865
95% cov	51.8125	92.3125	54.1250	84.3125	89.8125	3.9375
MSE	0.2680	0.3683	1.7336	0.1015	0.3333	6.4955
	Setting 2: \|$t_0=20,z=1,w=45$\|
abs bias	2.0295	0.7752	4.0646	1.8468	1.5081	1.5619
emp var	0.0220	0.1795	0.2064	0.0262	0.1659	0.1889
est var	0.0218	0.1582	0.1773	0.0231	0.1455	0.1594
95% cov	67.7500	93.0000	87.2500	77.2500	89.2500	91.5000
MSE	0.0682	0.1702	0.4119	0.0573	0.1765	0.1890
	Setting 2: \|$t_0=30,z=1,w=45$\|
abs bias	1.7798	2.0042	1.5612	2.2796	2.6579	2.1070
emp var	0.0080	0.0562	0.0406	0.0401	0.2249	0.2155
est var	0.0087	0.0521	0.0378	0.0346	0.1957	0.1825
95% cov	52.5000	93.0000	95.0000	79.0000	90.0000	90.0000
MSE	0.0404	0.0922	0.0622	0.0866	0.2664	0.2269

	\|$\widehat F_1(\cdot\|t_0,z,w)$\|			\|$\widehat F_2(\cdot\|t_0,z,w)$\|
	KM	NPNA	NPNA\|$_{t_0=0}$\|	KM	NPNA	NPNA\|$_{t_0=0}$\|
	Setting 1: \|$t_0=0.5,z=1,w=0.5$\|
abs bias	7.0813	0.2352	2.9457	3.1558	0.2564	13.9247
emp var	0.0465	0.3472	0.3647	0.0554	0.4489	0.4199
est var	0.0481	0.3732	0.3813	0.0492	0.3681	0.3755
95% cov	10.3333	94.7222	92.7778	68.3333	90.5556	37.7778
MSE	0.5570	0.3741	0.4750	0.1530	0.3690	2.3430
	Setting 1: \|$t_0=1,z=1,w=0.5$\|
abs bias	4.4770	0.3764	11.3412	2.1490	0.2543	24.5883
emp var	0.0523	0.3809	0.3848	0.0487	0.4277	0.4342
est var	0.0538	0.3664	0.4007	0.0500	0.3324	0.3865
95% cov	51.8125	92.3125	54.1250	84.3125	89.8125	3.9375
MSE	0.2680	0.3683	1.7336	0.1015	0.3333	6.4955
	Setting 2: \|$t_0=20,z=1,w=45$\|
abs bias	2.0295	0.7752	4.0646	1.8468	1.5081	1.5619
emp var	0.0220	0.1795	0.2064	0.0262	0.1659	0.1889
est var	0.0218	0.1582	0.1773	0.0231	0.1455	0.1594
95% cov	67.7500	93.0000	87.2500	77.2500	89.2500	91.5000
MSE	0.0682	0.1702	0.4119	0.0573	0.1765	0.1890
	Setting 2: \|$t_0=30,z=1,w=45$\|
abs bias	1.7798	2.0042	1.5612	2.2796	2.6579	2.1070
emp var	0.0080	0.0562	0.0406	0.0401	0.2249	0.2155
est var	0.0087	0.0521	0.0378	0.0346	0.1957	0.1825
95% cov	52.5000	93.0000	95.0000	79.0000	90.0000	90.0000
MSE	0.0404	0.0922	0.0622	0.0866	0.2664	0.2269

We report average absolute bias (abs bias), empirical variance (emp var), estimated bootstrap variance (est var), 95% coverage, and MSE for |$\widehat{\boldsymbol F}(t|t_0,z,w)$| at specified |$z,w,$| and landmark times |$t_0$|⁠. Results are averaged over |$t\in[t_0,3]$| for Setting 1 and |$t\in[t_0,80]$| in Setting 2 and summarized over 200 simulations with 40% censoring. Sample size is 2000. All results are multiplied by 100.

These deficiencies of the NPNA|$_{t_0=0}$| estimator and KM estimator resulted in lower AUC and higher BS than the NPNA estimator (Table 2). However, only the AUC differences between the NPNA and KM estimators were significantly different. Still, these results indicate that the NPNA|$_{t_0=0}$| and KM estimators do not discriminate or calibrate well.

Table 2.

Prediction accuracy at landmark times of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) compared to the estimator that ignores landmarking (NPNA estimator assuming |$t_0=0$|⁠, denoted as NPNA|$_{t_0=0}$|⁠), and the Kaplan–Meier estimator that ignores covariate influence (KM) and modified to adjust for landmarking

	KM	NPNA	NPNA\|$_{t_0=0}$\|	NPNA-KM\|$^\dagger$\|	NPNA-NPNA\|$_{t_0=0}^\dagger$\|
	AUC, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	51.2854	64.4163	61.4384	(7.4368, 18.8251)	(⁠\|$-$\|0.6716, 6.6274)
\|$t=2$\|	55.1505	64.5175	62.8209	(5.0543, 13.6799)	(⁠\|$-$\|0.3956, 3.7888)
\|$t=3$\|	57.652	65.9349	64.7988	(4.0615, 12.5044)	(⁠\|$-$\|0.5791, 2.8514)
	BS, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	13.5879	13.084	15.2423	(⁠\|$-$\|0.9247, \|$-$\|0.0831)	(⁠\|$-$\|2.8356, \|$-$\|1.4809)
\|$t=2$\|	23.3485	22.0813	23.2354	(⁠\|$-$\|2.0813, \|$-$\|0.4531)	(⁠\|$-$\|1.7654, \|$-$\|0.5428)
\|$t=3$\|	24.369	22.9809	23.6991	(⁠\|$-$\|2.3385, \|$-$\|0.4377)	(⁠\|$-$\|1.2936, \|$-$\|0.1428)
	AUC, Setting 1: \|$t_0=1$\|
\|$t=2$\|	56.718	64.7069	61.1501	(3.0307, 12.9472)	(0.204, 6.9096)
\|$t=3$\|	58.876	65.9604	63.6971	(2.7056, 11.4633)	(⁠\|$-$\|0.2317, 4.7584)
	BS, Setting 1: \|$t_0=1$\|
\|$t=2$\|	19.0458	18.2571	22.9205	(⁠\|$-$\|1.5142, \|$-$\|0.0632)	(⁠\|$-$\|5.7315, \|$-$\|3.5954)
\|$t=3$\|	23.6235	22.4671	25.2915	(⁠\|$-$\|2.1805, \|$-$\|0.1323)	(⁠\|$-$\|3.833, \|$-$\|1.816)
	AUC, Setting 2: \|$t_0=20$\|
\|$t=35$\|	68.672	82.9663	83.0024	(9.9418, 18.6468)	(⁠\|$-$\|1.3709, 1.2988)
\|$t=45$\|	69.2237	82.9825	83.0218	(10.1455, 17.372)	(⁠\|$-$\|0.5834, 0.5048)
\|$t=55$\|	70.476	83.3167	83.3574	(8.9964, 16.685)	(⁠\|$-$\|0.4441, 0.3628)
	BS, Setting 2: \|$t_0=20$\|
\|$t=35$\|	15.7572	12.7593	12.8608	(⁠\|$-$\|4.0232, \|$-$\|1.9726)	(⁠\|$-$\|0.5754, 0.3724)
\|$t=45$\|	21.0976	16.6319	16.5867	(⁠\|$-$\|5.8972, \|$-$\|3.0343)	(⁠\|$-$\|0.2675, 0.3579)
\|$t=55$\|	18.859	15.3791	15.3327	(⁠\|$-$\|4.7729, \|$-$\|2.1868)	(⁠\|$-$\|0.1358, 0.2287)
	AUC, Setting 2: \|$t_0=30$\|
\|$t=35$\|	66.1443	78.9501	79.3607	(6.4, 19.2115)	(⁠\|$-$\|4.5741, 3.7529)
\|$t=45$\|	66.8676	79.7724	79.9264	(9.0036, 16.806)	(⁠\|$-$\|1.3163, 1.0083)
\|$t=55$\|	68.1474	80.6619	80.7738	(8.7062, 16.3229)
	BS, Setting 2: \|$t_0=30$\|
\|$t=35$\|	9.2491	8.3604	10.3928	(⁠\|$-$\|1.5438, \|$-$\|0.2337)	(⁠\|$-$\|3.2139, \|$-$\|0.8508)
\|$t=45$\|	20.6777	17.2407	17.3808	(⁠\|$-$\|4.8153, \|$-$\|2.0585)	(⁠\|$-$\|0.9188, 0.6387)
\|$t=55$\|	20.6045	17.2658	17.1442	(⁠\|$-$\|4.6848, \|$-$\|1.9927)	(⁠\|$-$\|0.299, 0.5421)

	KM	NPNA	NPNA\|$_{t_0=0}$\|	NPNA-KM\|$^\dagger$\|	NPNA-NPNA\|$_{t_0=0}^\dagger$\|
	AUC, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	51.2854	64.4163	61.4384	(7.4368, 18.8251)	(⁠\|$-$\|0.6716, 6.6274)
\|$t=2$\|	55.1505	64.5175	62.8209	(5.0543, 13.6799)	(⁠\|$-$\|0.3956, 3.7888)
\|$t=3$\|	57.652	65.9349	64.7988	(4.0615, 12.5044)	(⁠\|$-$\|0.5791, 2.8514)
	BS, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	13.5879	13.084	15.2423	(⁠\|$-$\|0.9247, \|$-$\|0.0831)	(⁠\|$-$\|2.8356, \|$-$\|1.4809)
\|$t=2$\|	23.3485	22.0813	23.2354	(⁠\|$-$\|2.0813, \|$-$\|0.4531)	(⁠\|$-$\|1.7654, \|$-$\|0.5428)
\|$t=3$\|	24.369	22.9809	23.6991	(⁠\|$-$\|2.3385, \|$-$\|0.4377)	(⁠\|$-$\|1.2936, \|$-$\|0.1428)
	AUC, Setting 1: \|$t_0=1$\|
\|$t=2$\|	56.718	64.7069	61.1501	(3.0307, 12.9472)	(0.204, 6.9096)
\|$t=3$\|	58.876	65.9604	63.6971	(2.7056, 11.4633)	(⁠\|$-$\|0.2317, 4.7584)
	BS, Setting 1: \|$t_0=1$\|
\|$t=2$\|	19.0458	18.2571	22.9205	(⁠\|$-$\|1.5142, \|$-$\|0.0632)	(⁠\|$-$\|5.7315, \|$-$\|3.5954)
\|$t=3$\|	23.6235	22.4671	25.2915	(⁠\|$-$\|2.1805, \|$-$\|0.1323)	(⁠\|$-$\|3.833, \|$-$\|1.816)
	AUC, Setting 2: \|$t_0=20$\|
\|$t=35$\|	68.672	82.9663	83.0024	(9.9418, 18.6468)	(⁠\|$-$\|1.3709, 1.2988)
\|$t=45$\|	69.2237	82.9825	83.0218	(10.1455, 17.372)	(⁠\|$-$\|0.5834, 0.5048)
\|$t=55$\|	70.476	83.3167	83.3574	(8.9964, 16.685)	(⁠\|$-$\|0.4441, 0.3628)
	BS, Setting 2: \|$t_0=20$\|
\|$t=35$\|	15.7572	12.7593	12.8608	(⁠\|$-$\|4.0232, \|$-$\|1.9726)	(⁠\|$-$\|0.5754, 0.3724)
\|$t=45$\|	21.0976	16.6319	16.5867	(⁠\|$-$\|5.8972, \|$-$\|3.0343)	(⁠\|$-$\|0.2675, 0.3579)
\|$t=55$\|	18.859	15.3791	15.3327	(⁠\|$-$\|4.7729, \|$-$\|2.1868)	(⁠\|$-$\|0.1358, 0.2287)
	AUC, Setting 2: \|$t_0=30$\|
\|$t=35$\|	66.1443	78.9501	79.3607	(6.4, 19.2115)	(⁠\|$-$\|4.5741, 3.7529)
\|$t=45$\|	66.8676	79.7724	79.9264	(9.0036, 16.806)	(⁠\|$-$\|1.3163, 1.0083)
\|$t=55$\|	68.1474	80.6619	80.7738	(8.7062, 16.3229)
	BS, Setting 2: \|$t_0=30$\|
\|$t=35$\|	9.2491	8.3604	10.3928	(⁠\|$-$\|1.5438, \|$-$\|0.2337)	(⁠\|$-$\|3.2139, \|$-$\|0.8508)
\|$t=45$\|	20.6777	17.2407	17.3808	(⁠\|$-$\|4.8153, \|$-$\|2.0585)	(⁠\|$-$\|0.9188, 0.6387)
\|$t=55$\|	20.6045	17.2658	17.1442	(⁠\|$-$\|4.6848, \|$-$\|1.9927)	(⁠\|$-$\|0.299, 0.5421)

|$^\dagger$|These columns correspond to the 95% bootstrap confidence intervals for the differences between estimators. We report the AUC and the BS for |$\widehat{\boldsymbol F}(t|t_0,z,w)$| at specified |$t$| and landmark times |$t_0$|⁠. Results are summarized over 200 simulations with 40% censoring. Note that better prediction accuracy is indicated by higher AUC and lower BS. Sample size is 2000. All results are multiplied by 100.

Table 2.

Prediction accuracy at landmark times of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) compared to the estimator that ignores landmarking (NPNA estimator assuming |$t_0=0$|⁠, denoted as NPNA|$_{t_0=0}$|⁠), and the Kaplan–Meier estimator that ignores covariate influence (KM) and modified to adjust for landmarking

	KM	NPNA	NPNA\|$_{t_0=0}$\|	NPNA-KM\|$^\dagger$\|	NPNA-NPNA\|$_{t_0=0}^\dagger$\|
	AUC, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	51.2854	64.4163	61.4384	(7.4368, 18.8251)	(⁠\|$-$\|0.6716, 6.6274)
\|$t=2$\|	55.1505	64.5175	62.8209	(5.0543, 13.6799)	(⁠\|$-$\|0.3956, 3.7888)
\|$t=3$\|	57.652	65.9349	64.7988	(4.0615, 12.5044)	(⁠\|$-$\|0.5791, 2.8514)
	BS, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	13.5879	13.084	15.2423	(⁠\|$-$\|0.9247, \|$-$\|0.0831)	(⁠\|$-$\|2.8356, \|$-$\|1.4809)
\|$t=2$\|	23.3485	22.0813	23.2354	(⁠\|$-$\|2.0813, \|$-$\|0.4531)	(⁠\|$-$\|1.7654, \|$-$\|0.5428)
\|$t=3$\|	24.369	22.9809	23.6991	(⁠\|$-$\|2.3385, \|$-$\|0.4377)	(⁠\|$-$\|1.2936, \|$-$\|0.1428)
	AUC, Setting 1: \|$t_0=1$\|
\|$t=2$\|	56.718	64.7069	61.1501	(3.0307, 12.9472)	(0.204, 6.9096)
\|$t=3$\|	58.876	65.9604	63.6971	(2.7056, 11.4633)	(⁠\|$-$\|0.2317, 4.7584)
	BS, Setting 1: \|$t_0=1$\|
\|$t=2$\|	19.0458	18.2571	22.9205	(⁠\|$-$\|1.5142, \|$-$\|0.0632)	(⁠\|$-$\|5.7315, \|$-$\|3.5954)
\|$t=3$\|	23.6235	22.4671	25.2915	(⁠\|$-$\|2.1805, \|$-$\|0.1323)	(⁠\|$-$\|3.833, \|$-$\|1.816)
	AUC, Setting 2: \|$t_0=20$\|
\|$t=35$\|	68.672	82.9663	83.0024	(9.9418, 18.6468)	(⁠\|$-$\|1.3709, 1.2988)
\|$t=45$\|	69.2237	82.9825	83.0218	(10.1455, 17.372)	(⁠\|$-$\|0.5834, 0.5048)
\|$t=55$\|	70.476	83.3167	83.3574	(8.9964, 16.685)	(⁠\|$-$\|0.4441, 0.3628)
	BS, Setting 2: \|$t_0=20$\|
\|$t=35$\|	15.7572	12.7593	12.8608	(⁠\|$-$\|4.0232, \|$-$\|1.9726)	(⁠\|$-$\|0.5754, 0.3724)
\|$t=45$\|	21.0976	16.6319	16.5867	(⁠\|$-$\|5.8972, \|$-$\|3.0343)	(⁠\|$-$\|0.2675, 0.3579)
\|$t=55$\|	18.859	15.3791	15.3327	(⁠\|$-$\|4.7729, \|$-$\|2.1868)	(⁠\|$-$\|0.1358, 0.2287)
	AUC, Setting 2: \|$t_0=30$\|
\|$t=35$\|	66.1443	78.9501	79.3607	(6.4, 19.2115)	(⁠\|$-$\|4.5741, 3.7529)
\|$t=45$\|	66.8676	79.7724	79.9264	(9.0036, 16.806)	(⁠\|$-$\|1.3163, 1.0083)
\|$t=55$\|	68.1474	80.6619	80.7738	(8.7062, 16.3229)
	BS, Setting 2: \|$t_0=30$\|
\|$t=35$\|	9.2491	8.3604	10.3928	(⁠\|$-$\|1.5438, \|$-$\|0.2337)	(⁠\|$-$\|3.2139, \|$-$\|0.8508)
\|$t=45$\|	20.6777	17.2407	17.3808	(⁠\|$-$\|4.8153, \|$-$\|2.0585)	(⁠\|$-$\|0.9188, 0.6387)
\|$t=55$\|	20.6045	17.2658	17.1442	(⁠\|$-$\|4.6848, \|$-$\|1.9927)	(⁠\|$-$\|0.299, 0.5421)

	KM	NPNA	NPNA\|$_{t_0=0}$\|	NPNA-KM\|$^\dagger$\|	NPNA-NPNA\|$_{t_0=0}^\dagger$\|
	AUC, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	51.2854	64.4163	61.4384	(7.4368, 18.8251)	(⁠\|$-$\|0.6716, 6.6274)
\|$t=2$\|	55.1505	64.5175	62.8209	(5.0543, 13.6799)	(⁠\|$-$\|0.3956, 3.7888)
\|$t=3$\|	57.652	65.9349	64.7988	(4.0615, 12.5044)	(⁠\|$-$\|0.5791, 2.8514)
	BS, Setting 1: \|$t_0=0.5$\|
\|$t=1$\|	13.5879	13.084	15.2423	(⁠\|$-$\|0.9247, \|$-$\|0.0831)	(⁠\|$-$\|2.8356, \|$-$\|1.4809)
\|$t=2$\|	23.3485	22.0813	23.2354	(⁠\|$-$\|2.0813, \|$-$\|0.4531)	(⁠\|$-$\|1.7654, \|$-$\|0.5428)
\|$t=3$\|	24.369	22.9809	23.6991	(⁠\|$-$\|2.3385, \|$-$\|0.4377)	(⁠\|$-$\|1.2936, \|$-$\|0.1428)
	AUC, Setting 1: \|$t_0=1$\|
\|$t=2$\|	56.718	64.7069	61.1501	(3.0307, 12.9472)	(0.204, 6.9096)
\|$t=3$\|	58.876	65.9604	63.6971	(2.7056, 11.4633)	(⁠\|$-$\|0.2317, 4.7584)
	BS, Setting 1: \|$t_0=1$\|
\|$t=2$\|	19.0458	18.2571	22.9205	(⁠\|$-$\|1.5142, \|$-$\|0.0632)	(⁠\|$-$\|5.7315, \|$-$\|3.5954)
\|$t=3$\|	23.6235	22.4671	25.2915	(⁠\|$-$\|2.1805, \|$-$\|0.1323)	(⁠\|$-$\|3.833, \|$-$\|1.816)
	AUC, Setting 2: \|$t_0=20$\|
\|$t=35$\|	68.672	82.9663	83.0024	(9.9418, 18.6468)	(⁠\|$-$\|1.3709, 1.2988)
\|$t=45$\|	69.2237	82.9825	83.0218	(10.1455, 17.372)	(⁠\|$-$\|0.5834, 0.5048)
\|$t=55$\|	70.476	83.3167	83.3574	(8.9964, 16.685)	(⁠\|$-$\|0.4441, 0.3628)
	BS, Setting 2: \|$t_0=20$\|
\|$t=35$\|	15.7572	12.7593	12.8608	(⁠\|$-$\|4.0232, \|$-$\|1.9726)	(⁠\|$-$\|0.5754, 0.3724)
\|$t=45$\|	21.0976	16.6319	16.5867	(⁠\|$-$\|5.8972, \|$-$\|3.0343)	(⁠\|$-$\|0.2675, 0.3579)
\|$t=55$\|	18.859	15.3791	15.3327	(⁠\|$-$\|4.7729, \|$-$\|2.1868)	(⁠\|$-$\|0.1358, 0.2287)
	AUC, Setting 2: \|$t_0=30$\|
\|$t=35$\|	66.1443	78.9501	79.3607	(6.4, 19.2115)	(⁠\|$-$\|4.5741, 3.7529)
\|$t=45$\|	66.8676	79.7724	79.9264	(9.0036, 16.806)	(⁠\|$-$\|1.3163, 1.0083)
\|$t=55$\|	68.1474	80.6619	80.7738	(8.7062, 16.3229)
	BS, Setting 2: \|$t_0=30$\|
\|$t=35$\|	9.2491	8.3604	10.3928	(⁠\|$-$\|1.5438, \|$-$\|0.2337)	(⁠\|$-$\|3.2139, \|$-$\|0.8508)
\|$t=45$\|	20.6777	17.2407	17.3808	(⁠\|$-$\|4.8153, \|$-$\|2.0585)	(⁠\|$-$\|0.9188, 0.6387)
\|$t=55$\|	20.6045	17.2658	17.1442	(⁠\|$-$\|4.6848, \|$-$\|1.9927)	(⁠\|$-$\|0.299, 0.5421)

|$^\dagger$|These columns correspond to the 95% bootstrap confidence intervals for the differences between estimators. We report the AUC and the BS for |$\widehat{\boldsymbol F}(t|t_0,z,w)$| at specified |$t$| and landmark times |$t_0$|⁠. Results are summarized over 200 simulations with 40% censoring. Note that better prediction accuracy is indicated by higher AUC and lower BS. Sample size is 2000. All results are multiplied by 100.

These observations were highly evident in Setting 1 and less so in Setting 2. For Setting 2, the NPNA estimator had lower bias and MSEs compared to the NPNA|$_{t_0=0}$| estimator for |$F_1(t\mid t_0,z,w)$| (carrier group), but not for |$F_2(t\mid t_0,z,w)$| (non-carrier group) at |$t_0=20,30$|⁠. This result is reasonable given that Setting 2 mimics the HD study where knowledge that a non-carrier survived to age |$t_0=20$| or |$t_0=30$| years is uninformative because healthy populations (i.e., non-carriers) do not normally die in the first three decades of life. That is, knowing a non-carrier lived to ages |$t_0=20$| or |$t_0=30$| years does not improve survival predictions, nor prediction accuracy (the AUC and BS for NPNA and NPNA|$_{t_0=0}$| were similar at |$t_0=20$| and |$t_0=30$|⁠).

Last, when patient-specific predictions are not the primary goal, we show in Table 3 that the marginal version of our estimator, NPNA|$_{\rm marg}$|⁠, and KM have negligible bias when estimating |${\boldsymbol F}(t)$|⁠. For this table, the true |${\boldsymbol F}(t)$| is the marginal form of the distributions in Settings 1 and 2 obtained by integrating over |$(z,w)$| at |$t_0=0$|⁠. We do observe a slightly higher bias in the NPNA estimator because, before marginalizing over covariates, the NPNA estimator can have small subgroup sizes as it adjusts for covariates. Though the increased bias is relatively small in this simulation study, caution should be used in practice when using this approach to obtain a marginal estimate if there are small subgroup sizes. Results in Table 3 also show that the NPNA estimator has roughly 6–22% higher efficiency than does the KM estimator implying that the NPNA yields more precise estimates. This helps improve power in hypothesis testing when, for example, comparing the marginal survival distributions between carriers and non-carriers.

Table 3.

Marginal estimation results of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA|$_{\rm marg}$|⁠) and the Kaplan–Meier based estimator (KM)

	\|$\widehat F_1(t)$\|		\|$\widehat F_2(t)$\|
	KM	NPNA\|$_{\rm marg}$\|	KM	NPNA\|$_{\rm marg}$\|
	Setting 1: \|$t=1$\|
bias	0.1964	-0.0544	0.1509	-0.2704
emp var	0.0324	0.0332	0.0409	0.0397
est var	0.0332	0.0302	0.0358	0.0331
95% cov	96.5000	94.0000	93.5000	92.5000
MSE	0.0336	0.0303	0.0360	0.0339
rel eff	100.0000	90.9975	100.0000	92.5993
	Setting 1: \|$t=2$\|
bias	-0.0003	-0.5523	0.1674	-0.4324
emp var	0.0485	0.0492	0.0498	0.0495
est var	0.0471	0.0422	0.0439	0.0405
95% cov	96.5000	91.5000	93.0000	89.5000
MSE	0.0471	0.0453	0.0441	0.0424
rel eff	100.0000	89.6270	100.0000	92.3248
	Setting 2: \|$t=30$\|
bias	0.2651	-0.2741	-0.0983	-0.1784
emp var	0.0387	0.0318	0.0232	0.0195
est var	0.0402	0.0323	0.0207	0.0176
95% cov	93.0000	94.0000	92.5000	93.0000
MSE	0.0409	0.0330	0.0208	0.0179
rel eff	100.0000	80.3382	100.0000	84.8547
	Setting 2: \|$t=40$\|
bias	0.0914	-0.7803	-0.1364	-0.3885
emp var	0.0571	0.0439	0.0448	0.0400
est var	0.0523	0.0404	0.0400	0.0329
95% cov	93.0000	93.0000	90.0000	90.0000
MSE	0.0524	0.0465	0.0402	0.0344
rel eff	100.0000	77.3532	100.0000	82.2426

	\|$\widehat F_1(t)$\|		\|$\widehat F_2(t)$\|
	KM	NPNA\|$_{\rm marg}$\|	KM	NPNA\|$_{\rm marg}$\|
	Setting 1: \|$t=1$\|
bias	0.1964	-0.0544	0.1509	-0.2704
emp var	0.0324	0.0332	0.0409	0.0397
est var	0.0332	0.0302	0.0358	0.0331
95% cov	96.5000	94.0000	93.5000	92.5000
MSE	0.0336	0.0303	0.0360	0.0339
rel eff	100.0000	90.9975	100.0000	92.5993
	Setting 1: \|$t=2$\|
bias	-0.0003	-0.5523	0.1674	-0.4324
emp var	0.0485	0.0492	0.0498	0.0495
est var	0.0471	0.0422	0.0439	0.0405
95% cov	96.5000	91.5000	93.0000	89.5000
MSE	0.0471	0.0453	0.0441	0.0424
rel eff	100.0000	89.6270	100.0000	92.3248
	Setting 2: \|$t=30$\|
bias	0.2651	-0.2741	-0.0983	-0.1784
emp var	0.0387	0.0318	0.0232	0.0195
est var	0.0402	0.0323	0.0207	0.0176
95% cov	93.0000	94.0000	92.5000	93.0000
MSE	0.0409	0.0330	0.0208	0.0179
rel eff	100.0000	80.3382	100.0000	84.8547
	Setting 2: \|$t=40$\|
bias	0.0914	-0.7803	-0.1364	-0.3885
emp var	0.0571	0.0439	0.0448	0.0400
est var	0.0523	0.0404	0.0400	0.0329
95% cov	93.0000	93.0000	90.0000	90.0000
MSE	0.0524	0.0465	0.0402	0.0344
rel eff	100.0000	77.3532	100.0000	82.2426

We report bias, empirical variance (emp var), estimated bootstrap variance (est var), 95% coverage and MSE, and relative efficiency (in terms of variance relative to KM) for |$\widehat{\boldsymbol F}(t)=(\widehat F_1(t),\widehat F_2(t)) $| at specified |$t$|⁠. Results summarized over 200 simulations with 40% censoring. Improved efficiencies from the NPNA|$_{\rm marg}$| relative to KM are boldfaced. All values are multiplied by 100.

Table 3.

Open in new tab Download slide

Marginal estimation results of the proposed nonparametric kernel Nelson-Aalen estimator (NPNA|$_{\rm marg}$|⁠) and the Kaplan–Meier based estimator (KM)

	\|$\widehat F_1(t)$\|		\|$\widehat F_2(t)$\|
	KM	NPNA\|$_{\rm marg}$\|	KM	NPNA\|$_{\rm marg}$\|
	Setting 1: \|$t=1$\|
bias	0.1964	-0.0544	0.1509	-0.2704
emp var	0.0324	0.0332	0.0409	0.0397
est var	0.0332	0.0302	0.0358	0.0331
95% cov	96.5000	94.0000	93.5000	92.5000
MSE	0.0336	0.0303	0.0360	0.0339
rel eff	100.0000	90.9975	100.0000	92.5993
	Setting 1: \|$t=2$\|
bias	-0.0003	-0.5523	0.1674	-0.4324
emp var	0.0485	0.0492	0.0498	0.0495
est var	0.0471	0.0422	0.0439	0.0405
95% cov	96.5000	91.5000	93.0000	89.5000
MSE	0.0471	0.0453	0.0441	0.0424
rel eff	100.0000	89.6270	100.0000	92.3248
	Setting 2: \|$t=30$\|
bias	0.2651	-0.2741	-0.0983	-0.1784
emp var	0.0387	0.0318	0.0232	0.0195
est var	0.0402	0.0323	0.0207	0.0176
95% cov	93.0000	94.0000	92.5000	93.0000
MSE	0.0409	0.0330	0.0208	0.0179
rel eff	100.0000	80.3382	100.0000	84.8547
	Setting 2: \|$t=40$\|
bias	0.0914	-0.7803	-0.1364	-0.3885
emp var	0.0571	0.0439	0.0448	0.0400
est var	0.0523	0.0404	0.0400	0.0329
95% cov	93.0000	93.0000	90.0000	90.0000
MSE	0.0524	0.0465	0.0402	0.0344
rel eff	100.0000	77.3532	100.0000	82.2426

	\|$\widehat F_1(t)$\|		\|$\widehat F_2(t)$\|
	KM	NPNA\|$_{\rm marg}$\|	KM	NPNA\|$_{\rm marg}$\|
	Setting 1: \|$t=1$\|
bias	0.1964	-0.0544	0.1509	-0.2704
emp var	0.0324	0.0332	0.0409	0.0397
est var	0.0332	0.0302	0.0358	0.0331
95% cov	96.5000	94.0000	93.5000	92.5000
MSE	0.0336	0.0303	0.0360	0.0339
rel eff	100.0000	90.9975	100.0000	92.5993
	Setting 1: \|$t=2$\|
bias	-0.0003	-0.5523	0.1674	-0.4324
emp var	0.0485	0.0492	0.0498	0.0495
est var	0.0471	0.0422	0.0439	0.0405
95% cov	96.5000	91.5000	93.0000	89.5000
MSE	0.0471	0.0453	0.0441	0.0424
rel eff	100.0000	89.6270	100.0000	92.3248
	Setting 2: \|$t=30$\|
bias	0.2651	-0.2741	-0.0983	-0.1784
emp var	0.0387	0.0318	0.0232	0.0195
est var	0.0402	0.0323	0.0207	0.0176
95% cov	93.0000	94.0000	92.5000	93.0000
MSE	0.0409	0.0330	0.0208	0.0179
rel eff	100.0000	80.3382	100.0000	84.8547
	Setting 2: \|$t=40$\|
bias	0.0914	-0.7803	-0.1364	-0.3885
emp var	0.0571	0.0439	0.0448	0.0400
est var	0.0523	0.0404	0.0400	0.0329
95% cov	93.0000	93.0000	90.0000	90.0000
MSE	0.0524	0.0465	0.0402	0.0344
rel eff	100.0000	77.3532	100.0000	82.2426

We report bias, empirical variance (emp var), estimated bootstrap variance (est var), 95% coverage and MSE, and relative efficiency (in terms of variance relative to KM) for |$\widehat{\boldsymbol F}(t)=(\widehat F_1(t),\widehat F_2(t)) $| at specified |$t$|⁠. Results summarized over 200 simulations with 40% censoring. Improved efficiencies from the NPNA|$_{\rm marg}$| relative to KM are boldfaced. All values are multiplied by 100.

Reported results were from simulated data with a sample size of |$n=2000$| which is similar to the HD study (Section 4). We ran the analysis at sample size |$n=500$| and found that comparative results were similar (see Tables S.3. to S.7. of the Supplementary material available at Biostatistics online).

4. Application to kin-cohort mortality study of HD

4.1. Clinical research problem

We applied our NPNA estimator to a large, multicenter, observational study of HD: the Cooperative Huntington Observational Research Trial (COHORT; Dorsey and Huntington Study Group COHORT Investigators, 2012). COHORT was conducted at 38 sites in the USA, Canada, and Australia from 2005 to 2011, but site information was not available to protect the identity of participants. COHORT collected clinical and genetic data from symptomatic and presymptomatic HD mutation carriers (probands) and their family members. During this time period, probands were systematically interviewed either face-to-face or over the telephone to ascertain neurological disease and mortality information about first-degree relatives. This included relatives who had died prior to the study and those who died during the study period. Relatives were thus “followed” from birth to death or to the end of the study period where if they were still alive, their age of death was censored. The family history interviews were administered by a second person to ensure validity of the information collected.

We analyzed data for the 3291 first-degree relatives of the probands. There were 51% males and 49% females composed of 29% kids, 32% parents, and 39% siblings of the probands. Genotype information for relatives was not available due to resource constraints in obtaining in-person blood samples. Thus, exact knowledge of whether a relative was a carrier or non-carrier of the genetic mutation that causes HD was unknown. But because HD is genetically inheritable, we could compute the probability a relative inherited the gene mutation (see Section S.1. of the Supplementary material available at Biostatistics online for details). We ultimately had |$m=3$| distinct mixture probabilities: 64.18% of the population had a mixture proportion of (0.5,0.5), 27.89% of the population had a mixture proportion of (0.97,0.03), and 7.93% of the population had a mixture proportion of (0.25,0.75). We do assume these mixture proportions are correctly estimated. In Section 5, we discuss the impact when the mixture proportions are measured with uncertainty.

Using data from the relatives, we dynamically predicted the survival age (⁠|$T$|⁠) distributions for carrier and non-carrier relatives after adjusting for gender of the relative (⁠|$Z$|⁠) and the CAG repeat-length of the relative’s proband (⁠|$W$|⁠). The relative and his proband may not have the same CAG repeat-length, but because HD is a hereditary disorder, we hypothesize that the proband’s CAG repeat-length could impact the relative’s survival distribution.

Results from this analysis will lead to two predictions of survival for each individual, one if they are truly a carrier and one if they are truly a non-carrier. In genetic counseling sessions, where individuals are interested in learning the impact of HD on their lifestyle, this information can help in three ways. First, for individuals who prefer not to pursue genetic testing, these predictions can help them understand the natural history of the disease based on their specific information. This is because our models personalize the predicted probabilities of survival based the individual’s gender, familial genetic information, and updated mortality information (e.g., if an individual previously visited 5 years ago and is still alive today). This personalization improves the accuracy of mortality rate information given to individuals. Second, hearing these predictions may influence the decision to obtain genetic testing so that an individual can precisely know his/her predicted risk of mortality and begin to seek out care management if needed. Third, results from our models could help recruit participants for future studies. Specifically, results from our models will identify participants who have a specific probability of inheriting the gene mutation and a specific rate of mortality, but their exact gene mutation status is unknown. This information could help recruit participants for an observational study of disease progression and mortality where neither the participants nor clinical investigators know the patients’ gene mutation status to avoid subjective bias. This idea was actually the exact premise for the PHAROS study of HD (Huntington Study Group PHAROS Investigators, 2006) which helped to objectively assess disease progression in adult participants who had a 50% chance of being a carrier.

One possible concern of our analysis is the feasibility of making dynamic predictions of survival for relatives when the baseline is birth and the proband was born after the relative. Consider the following example. Suppose the proband is a child and the relative is a parent, so that at baseline (the parent’s birth), the proband was not even born. The concern is that in this scenario, we would not know the child-proband’s genotype information (⁠|$W$|⁠) and thus could not use it to make dynamic predictions for the parent-relative. But this would never arise in our scenario. We collect data on relatives from the proband. That is, we first sample the child-proband, and then collect mortality information about his parent which is fully available because we can retrospectively collect this information from the child-proband. Thus, we would indeed have the data to assess the mortality information about the parent-relative.

4.2. Results

The proband’s CAG repeat-length impacted the relative’s likelihood of survival (Figure 1). Figure 1 shows the predicted probability that a relative who is a male carrier survives 10 years after having survived to age |$t_0$|⁠, |$t_0=20,40,60$| years. These probabilities are displayed as a function of the proband’s CAG repeat-length. Results are shown for the NPNA estimator which adjusts for CAG repeat-length and gender, and its marginalized version (NPNA|$_{\rm marg}$|⁠) which does not. Comparing these two helps evaluate the impact of a proband’s CAG repeat-length on the relative’s mortality. For relatives who survived up to |$t_0=20$| or |$t_0=40$| years, the predicted probability of surviving another 10 years did not change as the proband’s CAG repeat-length changed (i.e., the estimates from the NPNA and NONA|$_{\rm marg}$| were similar across CAG repeat-length). Because HD typically onsets in the fourth decade of life (Ross and others, 2014) and death typically occurs 15–20 years after (Rinaldi and others, 2014), it is reasonable to see that CAG repeat-length had little impact on survival during the first 30–50 years of life. However, we found that the probability of survival at older ages depend on the proband’s CAG repeat-length. The probability of survival past 70 years for a male carrier who survived to age 60 and whose proband has 40 CAG repeats is 0.84 (95% CI 0.72–0.96) using the NPNA estimator and 0.59 (95% CI 0.55–0.63) for NPNA|$_{\rm marg}$|⁠.

$Predicted likelihood of survival 10 years past $t_0$ for relatives who are mutation carrier males in the COHORT study. Results are shown as a function of the proband’s CAG repeat-lengths. Results displayed are for the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) and its marginalized version (NPNA$_{\rm marg}$) to evaluate the impact of CAG repeat-length on predicted survival.$

Fig. 1.

Predicted likelihood of survival 10 years past |$t_0$| for relatives who are mutation carrier males in the COHORT study. Results are shown as a function of the proband’s CAG repeat-lengths. Results displayed are for the proposed nonparametric kernel Nelson-Aalen estimator (NPNA) and its marginalized version (NPNA|$_{\rm marg}$|⁠) to evaluate the impact of CAG repeat-length on predicted survival.

Figure 2 illustrates the benefit of incorporating information about survival up to a landmark point when predicting future survival. The curves in Figure 2 display the dynamic probability of survival to age |$t$| given that an individual has survived to age |$t_0$| for |$t_0=0,20,40,60$| years. The curves are left-truncated at |$t_0$| because the probability an individual survives to age |$t$| given he has survived to age |$t_0$| only make sense when |$t\geq t_0$|⁠. Results are shown for relatives who are male carriers and their probands had 46 CAG repeats; results for female relatives were similar. Results are based on estimates from the NPNA estimator. Accounting for survival to |$t_0$| through landmarking changes the predicted probability of survival, as would be expected. For example, at baseline (⁠|$t_0=0$|⁠) the predicted probability of surviving to age 70 (⁠|$t=70$|⁠) is 0.19 (95% CI 0.07–0.3), shown by the solid curve. However, if this relative survives to age 60 (⁠|$t_0=60$|⁠), the updated probability that he will survival to age 70 is much higher at 0.5 (95% CI 0.27–0.73), shown by the lightest dashed curve. That is, without the use of a landmark prediction approach, this patient would be told that their probability of surviving to age 70 is 0.19 when in fact, it is 0.50, given that they have already survived to age 60. This difference in predictions can be very meaningful particularly when treatment decisions and/or life decisions are informed by these estimates.

$Dynamic landmark predictions of survival past age $t$ given survival to age $t_0$ in the COHORT study, $t\geq t_0$. Estimates are shown as a function of age (in years) for relatives who are male, mutation carriers whose proband had 46 CAG repeats. Result displayed is from the proposed nonparametric kernel Nelson-Aalen estimator (NPNA).$

Fig. 2.

Dynamic landmark predictions of survival past age |$t$| given survival to age |$t_0$| in the COHORT study, |$t\geq t_0$|⁠. Estimates are shown as a function of age (in years) for relatives who are male, mutation carriers whose proband had 46 CAG repeats. Result displayed is from the proposed nonparametric kernel Nelson-Aalen estimator (NPNA).

Open in new tab Download slide

Figure S.2. of the Supplementary material available at Biostatistics online displays the predicted survival using the NPNA estimator to compare differences between carrier and non-carrier relatives (top-half), and differences between carrier males and females (bottom-half). The probability of survival for non-carriers was nearly one up until age 60, with a steady decrease thereafter. The carriers, however, had steadily decreasing probabilities of survival beginning at age 20 and rapidly decreasing thereafter. These differences underscore the insidious effect of genetic mutation on mortality. Mortality rates appeared similar for males and females (the 95% confidence bands for carrier males and females overlapped; see bottom-half of Figure S.2. of the Supplementary material available at Biostatistics online). The result agrees with earlier studies where gender did not significantly impact the mean survival time of HD patients (Harper, 1996).

5. Discussion

We developed a new nonparametric prediction model for mixture data that incorporates covariates and dynamic landmark prediction, and we showed how it improves prediction accuracy. There are several extensions of our work.

A straightforward extension is the incorporation of clinically important intermediate event information, such as hospitalization. The estimator |$\widehat S_j(t \mid t_0,z, w)$|⁠, |$j=1,\ldots,m$|⁠, in (2.1) can be adapted to incorporate the timing of intermediate events through |$W$|⁠. Specifically, let |$M_i$| indicate the time of the intermediate event. Similar to the failure time |$T_i$|⁠, |$M_i$| is subject to censoring and thus we may only observe |$M_i^* = \min(M_i, C_i, T_i)$| and |$\Delta_i^* = I(M_i \leq C_i)$|⁠. Among individuals in |$\Omega_{t_0}=\{i:X_i>t_0\}$|⁠, if |$M_i<t_0$| then |$M_i^* = M_i$|⁠. That is, for those who have survived to |$t_0$| and are still under observation at |$t_0$|⁠, it is known that |$C_i > t_0$| and thus if the intermediate event occurred before |$t_0$|⁠, then it was observed. Therefore, when we condition on |$\Omega_{t_0}$|⁠, we can re-define |$W_i$| as |$W_i(t_0) = \min(M_i, t_0)$| and thus, |$W_i(t_0)$| will be equal to |$t_0$| for those who have not yet experienced the intermediate event by |$t_0$| or equal to |$M_i^*$| for those who have experienced the intermediate event by |$t_0$|⁠. Estimation of |${\boldsymbol F}(t \mid t_0, Z_i,W_i)$| then follows the procedure in Section 2, where |$W_i = W_i(t_0) = \min(M_i, t_0)$| and with or without an additional discrete covariate |$Z_i$|⁠.

Another extension is with competing risks. In the HD study, one could be interested in different causes of death due to cardiovascular disease, pneumonia, or suicide (Sørensen and Fenger, 1992 ). To handle competing risks, we could re-define |${\boldsymbol F}(t\mid t_0,z,w)$| as a cumulative incidence function: |${\boldsymbol F}_j(t\mid t_0,z,w)$||$=\{F_{j1}(t\mid t_0,z,w),\ldots,F_{jp}(t\mid t_0,z,w)\}^T$| where |$ F_{jk}(t\mid t_0, z,w) = P(T\leq t, D= j \mid T> t_0, L=k, z, w)$| and |$D$| denotes the type of event that occurred where |$j=1,...,J$| and |$J$| is the number of distinct competing event types. Estimation would focus on a cause-specific version of cumulative hazard in (2.2) and for estimation of each event, individuals experiencing other events would be censored at the event time.

Other extensions apply to the HD example. In the kin-cohort study, mortality outcomes between relatives in the same family are correlated. One could account for this correlation using frailties that adjust for correlations between outcomes from family members. These frailties would be incorporated into our modified nonparametric kernel Nelson–Aalen estimator. We could also explicitly incorporate left truncation of failure times given that survival times for relatives is collected retrospectively and some of these relatives have died prior to the study onset. However, incorporating left truncation requires a new framework beyond the scope of this article.

In practice, it may be of interest to use the estimates obtained to test for differences in predictions at some time |$t$| or differences in the survival curves up to time |$t$| between two groups (e.g., carrier vs. non-carriers, or male carriers vs. female carriers). To test at a particular time |$t$| this difference, e.g., |$H_0: F_1(t|t_0, z, w) - F_2(t|t_0, z, w) = 0$|⁠, we could perform a Wald-type test using |$\mathcal{Z}(t|t_0,z,w) = \{\widehat{F}_1(t|t_0, z, w) - \widehat{F}_2(t|t_0, z, w)\}/\widehat{\sigma}_{12}(t|t_0,z,w)$|⁠. Here, |$k=1$| indicates carrier and |$k=2$| indicates non-carrier and |$\widehat{\sigma}_{12}(t|t_0,z,w)$| is an estimate of the standard error of the difference between |$\widehat{F}_1(t|t_0, z, w)$| and |$\widehat{F}_2(t|t_0, z, w)$| which we can obtain using our bootstrap samples. To compare survival curves, we could define the function |$\widehat{\mathcal{D}}_{12}(t|t_0, z, w) = \widehat{F}_1(t|t_0, z, w) - \widehat{F}_2(t|t_0, z, w)$| for |$t>t_0$| in some range |$\mathcal{T}$|⁠, and calculate a simultaneous confidence band |$\{\widehat{\mathcal{D}}_{12}(t|t_0, z, w) \pm \widehat{c}_{\alpha}\widehat{\sigma}_{12}(t|t_0,z,w), t\in \mathcal{T}\}$| where |$\widehat{c}_{\alpha}$| is the |$(1-\alpha)$| empirical quantile of |$\sup_{t\in \mathcal{T}} || $||$ \widehat{\mathcal{D}}_{12}^{b}(t|t_0, z, w) - \widehat{\mathcal{D}}_{12}(t|t_0, z, w) || / \widehat{\sigma}_{12}(t|t_0,z,w), b=1,...,B \}$| where |$b$| indicates the bootstrap replication.

Our proposed approach does have some limitations. Because we use nonparametric kernel smoothing, we require a relatively large sample size and in particular, there must be adequate sample size within each |${\boldsymbol u}_j$| group. In settings with smaller sample sizes, parametric approaches may be more stable. Second, our proposed estimators were presented assuming covariates |$Z$| and |$W$| were univariate. Our method can extend to the multivariate case using, for example, a dimension reduction approach whereby a working model is used to reduce the multivariate covariate information into a single scalar summary. After this dimension reduction, our proposed approach could then be directly applied. Third, though we note that our method can incorporate information on time-dependent covariates, where |$Z$| and |$W$| are updated at the landmark time |$t_0$| resulting in |$Z(t_0)$| and |$W(t_0)$|⁠, we do not have access to mixture data with time-dependent covariates and thus, are unable to illustrate this feature. However, this methodological development and the availability of software to implement these methods would allow others with such data to apply these methods. Fourth, the prediction accuracy using AUC and BS requires knowing the population identifier, which is only known in simulated data but not in practice. We thus cannot compute prediction accuracy for our HD study, but results from our simulation study suggest that our proposed estimator indeed has better prediction accuracy than competing methods.

6. Software

All R code is available at https://github.com/tpgarcia/landmix.

Supplementary material

Supplementary material is available at http://biostatistics.oxfordjournals.org.

Acknowledgments

Data from the COHORT study, which received support from HP Therapeutics, Inc., were used. We thank the Huntington Study Group COHORT investigators and coordinators who collected the data, and participants and their families who made this work possible.

Conflict of Interest: None declared.

Funding

The Huntington’s Disease Society of America Human Biology Project Fellowship, the National Institute of Neurological Disorders and Stroke (K01NS099343); and the National Institute of Diabetes and Digestive and Kidney Diseases (R21DK103118).

References

Brier,

G. W.

(

1950

).

Verification of forecasts expressed in terms of probability

.

Monthly Weather Review

78

,

1

–

3

.

Cai,

T.

,

Tian,

L.

,

Uno,

H.

,

Solomon,

S. D.

and

Wei,

L. J.

(

2010

).

Calibrating parametric subject-specific risk estimation

.

Biometrika

97

,

389

–

404

.

Cai,

T.

and

Zheng,

Y.

(

2011

).

Nonparametric evaluation of biomarker accuracy under nested case-control studies

.

Journal of the American Statistical Association

106

,

569

–

580

.

Carroll,

R. J.

,

Ruppert,

D.

,

Stefanski,

L. A.

and

Crainiceanu,

C.

(

2006

).

Measurement Error in Nonlinear Models: A Modern Perspective

, 2nd edition.

London

:

CRC Press

.

Chatterjee,

N.

and

Wacholder,

S.

(

2001

).

A marginal likelihood approach for estimating penetrance from kin-cohort designs

.

Biometrics

57

,

245

–

252

.

Dabrowska,

D. M.

(

1989

).

Uniform consistency of the kernel conditional Kaplan–Meier estimate

.

The Annals of Statistics

17

,

1157

–

1167

.

Dorsey,

E. R. and The Huntington Study Group COHORT Investigators.

(

2012

).

Characterization of a large group of individuals with Huntington disease and their relatives enrolled in the COHORT study

.

PLoS One

7

,

e29522

.

Du,

Y.

and

Akritas,

M. G.

(

2002

).

Uniform strong representation of the conditional Kaplan–Meier process

.

Mathematical Methods of Statistics

11

,

152

–

182

.

Fine,

J. P.

,

Zou,

F.

and

Yandell,

B. S.

(

2004

).

Nonparametric estimation of the effects of quantitative trait loci

.

Biostatistics

5

,

501

–

513

.

Garcia,

T. P.

,

Marder,

K.

and

Wang,

Y.

(

2016

). Natural history of Huntington’s disease: evolution of modeling onset. In:

Feigin,

A.

and

Anderson,

K. E.

(editors),

Handbook of Clinical Neurology

,

3rd Series

.

Atlanta

:

Elsevier

.

Gerds,

T. A.

and

Schumacher,

M.

(

2006

).

Consistent estimation of the expected Brier score in general survival models with right-censored event times

.

Biometrical Journal

48

,

1029

–

1040

.

Harper,

P. S.

(

1996

).

Huntington’s Disease

, 2nd edition.

Philadelphia

:

W.B. Saunders

.

Heagerty,

P. J.

and

Zheng,

Y.

(

2005

).

Survival model predictive accuracy and ROC curves

.

Biometrics

61

,

92

–

105

.

Huntington Study Group PHAROS Investigators. (

2006

).

At risk for Huntington disease: the PHAROS (Prospective Huntington At Risk Observational Study) cohort enrolled

.

Archives of Neurology

63

,

991

–

996

.

Khoury,

M. J.

,

Beaty,

T. H.

and

Cohen,

B. H.

(

1993

).

Fundamentals of Genetic Epidemiology

.

New York

:

Oxford University Press

.

Langbehn,

D. R.

,

Hayden,

M. R.

,

Paulsen,

J. S.

and of the Huntington Study Group, PREDICT-HD Investigators. (

2010

).

CAG-repeat length and the age of onset in Huntington disease (HD): a review and validation study of statistical approaches

.

American Journal of Medical Genetics

153B

,

397

–

408

.

Ma,

Y.

and

Wang,

Y.

(

2014a

).

Estimating disease onset distribution functions in mutation carriers with censored mixture data

.

Journal of the Royal Statistical Society C

63

,

1

–

23

.

Ma,

Y.

and

Wang,

Y.

(

2014b

).

Nonparametric modeling and analysis of association between Huntington’s disease onset and CAG repeats

.

Statistics in Medicine

33

,

1369

–

1382

.

Marder,

K.

,

Levy,

G.

,

Louis,

E.D.

,

Mejia-Santana,

H.

,

Cote,

L.

,

Andrews,

H.

,

Harris,

J.

,

Waters,

C.

,

Ford,

B.

,

Frucht,

S.

, and others. (

2003

).

Accuracy of family history data on Parkinson’s disease

.

Neurology

61

,

18

–

23

.

Parast,

L.

,

Cheng,

S.

and

Cai,

T.

(

2011

).

Incorporating short-term outcome information to predict long-term survival with discrete markers

.

Biometrical Journal

53

,

294

–

307

.

Parast,

L.

,

Tian,

L.

and

Cai,

T.

(

2014

).

Landmark estimation of survival and treatment effect in a randomized clinical trial

.

Journal of American Statistical Association

109

,

384

–

394

.

Rinaldi,

C.

,

Salvatore,

E.

,

Giordano,

S. I.

Cinzia,

V.R.

Rossi,

F.

,

Castaldo,

I.

,

Morra,

V.B.

,

Di Maio,

L.

,

Filla,

A.

, and

De Michele,

G.

(

2014

).

Predictors of survival in a Huntington’s disease population from Southern Italy

.

The Canadian Journal of Neurological Sciences

39

,

48

–

51

.

Ross,

C.

,

Pantelyat,

A.

,

Kogan,

J.

and

Brandt,

J.

(

2014

).

Determinants of functional disability in Huntington’s disease: role of cognitive and motor dysfunction

.

Movement Disorders

29

,

1359

–

1358

.

Rubinsztein,

D. C.

,

Leggo,

J.

,

Coles,

R.

Almqvist,

E.

,

Biancalana,

V.

,

Cassiman,

J.J.

,

Chotai,

K.

,

Connarty,

M.

,

Crauford,

D.

,

Curtis,

A.

, and others. (

1996

).

Phenotypic characterization of individuals with 30-40 CAG repeats in the Huntington disease (HD) gene reveals HD cases with 36 repeats and apparently normal elderly individuals with 36-39 repeats

.

American Journal of Human Genetics

59

,

16

–

22

.

Scott,

D. W.

(

1992

).

Multivariate Density Estimation

.

Hoboken, New Jersey

:

John Wiley & Sons

.

Sørensen,

S. A.

and

Fenger,

K.

(

1992

).

Causes of death in patients with Huntington’s disease and in unaffected first degree relatives

.

Journal of medical genetics

29

,

911

–

914

.

Tsiatis,

A. A.

(

2006

).

Semiparametric Theory and Missing Data

.

New York

:

Springer

.

van Houwelingen,

H.

and

Putter,

H.

(

2011

).

Dynamic Prediction in Clinical Survival Analysis

.

Boca Raton, Florida

:

CRC Press

.

Wacholder,

S.

,

Hartge,

P.

,

Struewing,

J.

,

Pee,

D.

,

McAdams,

M.

,

Brody,

L.

and

Tucker,

M.

(

1998

).

The kin-cohort study for estimating penetrance

.

American Journal of Epidemiology

148

,

623

–

630

.

Wang,

Q.

,

Ma,

Y.

and

Wang,

Y.

(

2017

). Predicting disease risk by transformation models in the presence of missing subgroup identifiers. Statistica Sinica

27

,

1857

–

1878

.

Wang,

Y.

,

Garcia,

T. P.

and

Ma,

Y.

(

2012

).

Nonparametric estimation for uncensored mixture data with application to the Cooperative Huntington’s Observational Research Trial

.

Journal of the American Statistical Association

107

,

1324

–

1338

.

Wu,

R.

,

Chang,

M.

and others. (

2002

).

A logistic mixture model for characterizing genetic determinants conserved synteny in rat and mouse for a blood pressure causing differentiation in growth trajectories

.

Genetics Research

79

,

235

–

245

.

Zeng,

D.

and

Lin,

D. Y.

(

2007

).

Maximum likelihood estimation in semiparametric regression models with censored data

.

Journal of the Royal Statistical Society, Series B

69

,

507

–

564

.