Semiparametric Estimation of the Transformation Model by Leveraging External Aggregate Data in the Presence of Population Heterogeneity

Summary of simulation results for the Cox model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	−6	26(25)	94.6	6.68	5	124(123)	94.2	1.01	–	–	–	–
N = 400
	−8	62(63)	94.8	1.07	6	124(123)	94.8	1.00	–	–	–	–
	−8	62(63)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	−7	41(40)	95.6	2.44	5	124(123)	94.6	1.00	–	–	–	–
	−5	39(39)	94.4	2.66	5	124(123)	94.7	1.00	–	–	–	–
	Scenario (II):
	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	−6	27(27)	94.6	12.46	9	127(127)	94.9	1.03	−3	99(98)	93.9	1.79
N = 400
	−10	90(87)	94.0	1.15	10	129(128)	95.4	1.00	0	129(124)	94.0	1.06
	−10	90(87)	94.1	1.15	10	129(128)	95.4	1.00	0	128(124)	93.8	1.06
N = 10,000
	−7	46(45)	95.0	4.33	9	128(127)	94.1	1.02	−2	105(103)	94.7	1.60
	−6	44(43)	94.1	4.85	9	127(127)	95.4	1.03	−3	104(103)	95.2	1.63

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	−6	26(25)	94.6	6.68	5	124(123)	94.2	1.01	–	–	–	–
N = 400
	−8	62(63)	94.8	1.07	6	124(123)	94.8	1.00	–	–	–	–
	−8	62(63)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	−7	41(40)	95.6	2.44	5	124(123)	94.6	1.00	–	–	–	–
	−5	39(39)	94.4	2.66	5	124(123)	94.7	1.00	–	–	–	–
	Scenario (II):
	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	−6	27(27)	94.6	12.46	9	127(127)	94.9	1.03	−3	99(98)	93.9	1.79
N = 400
	−10	90(87)	94.0	1.15	10	129(128)	95.4	1.00	0	129(124)	94.0	1.06
	−10	90(87)	94.1	1.15	10	129(128)	95.4	1.00	0	128(124)	93.8	1.06
N = 10,000
	−7	46(45)	95.0	4.33	9	128(127)	94.1	1.02	−2	105(103)	94.7	1.60
	−6	44(43)	94.1	4.85	9	127(127)	95.4	1.03	−3	104(103)	95.2	1.63

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 1

Summary of simulation results for the Cox model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	−6	26(25)	94.6	6.68	5	124(123)	94.2	1.01	–	–	–	–
N = 400
	−8	62(63)	94.8	1.07	6	124(123)	94.8	1.00	–	–	–	–
	−8	62(63)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	−7	41(40)	95.6	2.44	5	124(123)	94.6	1.00	–	–	–	–
	−5	39(39)	94.4	2.66	5	124(123)	94.7	1.00	–	–	–	–
	Scenario (II):
	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	−6	27(27)	94.6	12.46	9	127(127)	94.9	1.03	−3	99(98)	93.9	1.79
N = 400
	−10	90(87)	94.0	1.15	10	129(128)	95.4	1.00	0	129(124)	94.0	1.06
	−10	90(87)	94.1	1.15	10	129(128)	95.4	1.00	0	128(124)	93.8	1.06
N = 10,000
	−7	46(45)	95.0	4.33	9	128(127)	94.1	1.02	−2	105(103)	94.7	1.60
	−6	44(43)	94.1	4.85	9	127(127)	95.4	1.03	−3	104(103)	95.2	1.63

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	−6	26(25)	94.6	6.68	5	124(123)	94.2	1.01	–	–	–	–
N = 400
	−8	62(63)	94.8	1.07	6	124(123)	94.8	1.00	–	–	–	–
	−8	62(63)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	−7	41(40)	95.6	2.44	5	124(123)	94.6	1.00	–	–	–	–
	−5	39(39)	94.4	2.66	5	124(123)	94.7	1.00	–	–	–	–
	Scenario (II):
	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	−6	27(27)	94.6	12.46	9	127(127)	94.9	1.03	−3	99(98)	93.9	1.79
N = 400
	−10	90(87)	94.0	1.15	10	129(128)	95.4	1.00	0	129(124)	94.0	1.06
	−10	90(87)	94.1	1.15	10	129(128)	95.4	1.00	0	128(124)	93.8	1.06
N = 10,000
	−7	46(45)	95.0	4.33	9	128(127)	94.1	1.02	−2	105(103)	94.7	1.60
	−6	44(43)	94.1	4.85	9	127(127)	95.4	1.03	−3	104(103)	95.2	1.63

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 2

Summary of simulation results for the proportional odds model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	−6	28(27)	95.1	13.19	9	188(188)	94.8	1.01	–	–	–	–
N = 400
	−9	89(90)	95.7	1.14	10	189(188)	94.9	0.99	–	–	–	–
	−9	89(90)	95.8	1.15	10	189(188)	94.9	0.99	–	–	–	–
N = 10,000
	−5	49(49)	94.8	3.80	9	189(188)	94.8	1.00	–	–	–	–
	−3	47(47)	94.8	4.21	9	188(188)	94.8	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	–	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	−6	28(27)	94.7	24.07	9	193(191)	94.4	1.01	−10	145(142)	95.0	1.79
N = 400
	−7	122(120)	95.0	1.31	13	194(192)	94.1	0.99	−10	184(180)	94.7	1.11
	−7	121(120)	95.1	1.31	13	194(192)	94.2	0.99	−10	184(180)	94.6	1.11
N = 10,000
	−4	53(52)	94.5	6.91	11	193(191)	94.4	1.01	−12	151(148)	94.6	1.64
	−3	50(50)	94.6	7.66	11	193(191)	94.4	1.01	−13	151(147)	94.4	1.64

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	−6	28(27)	95.1	13.19	9	188(188)	94.8	1.01	–	–	–	–
N = 400
	−9	89(90)	95.7	1.14	10	189(188)	94.9	0.99	–	–	–	–
	−9	89(90)	95.8	1.15	10	189(188)	94.9	0.99	–	–	–	–
N = 10,000
	−5	49(49)	94.8	3.80	9	189(188)	94.8	1.00	–	–	–	–
	−3	47(47)	94.8	4.21	9	188(188)	94.8	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	–	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	−6	28(27)	94.7	24.07	9	193(191)	94.4	1.01	−10	145(142)	95.0	1.79
N = 400
	−7	122(120)	95.0	1.31	13	194(192)	94.1	0.99	−10	184(180)	94.7	1.11
	−7	121(120)	95.1	1.31	13	194(192)	94.2	0.99	−10	184(180)	94.6	1.11
N = 10,000
	−4	53(52)	94.5	6.91	11	193(191)	94.4	1.01	−12	151(148)	94.6	1.64
	−3	50(50)	94.6	7.66	11	193(191)	94.4	1.01	−13	151(147)	94.4	1.64

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 2

Summary of simulation results for the proportional odds model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	−6	28(27)	95.1	13.19	9	188(188)	94.8	1.01	–	–	–	–
N = 400
	−9	89(90)	95.7	1.14	10	189(188)	94.9	0.99	–	–	–	–
	−9	89(90)	95.8	1.15	10	189(188)	94.9	0.99	–	–	–	–
N = 10,000
	−5	49(49)	94.8	3.80	9	189(188)	94.8	1.00	–	–	–	–
	−3	47(47)	94.8	4.21	9	188(188)	94.8	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	–	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	−6	28(27)	94.7	24.07	9	193(191)	94.4	1.01	−10	145(142)	95.0	1.79
N = 400
	−7	122(120)	95.0	1.31	13	194(192)	94.1	0.99	−10	184(180)	94.7	1.11
	−7	121(120)	95.1	1.31	13	194(192)	94.2	0.99	−10	184(180)	94.6	1.11
N = 10,000
	−4	53(52)	94.5	6.91	11	193(191)	94.4	1.01	−12	151(148)	94.6	1.64
	−3	50(50)	94.6	7.66	11	193(191)	94.4	1.01	−13	151(147)	94.4	1.64

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	−6	28(27)	95.1	13.19	9	188(188)	94.8	1.01	–	–	–	–
N = 400
	−9	89(90)	95.7	1.14	10	189(188)	94.9	0.99	–	–	–	–
	−9	89(90)	95.8	1.15	10	189(188)	94.9	0.99	–	–	–	–
N = 10,000
	−5	49(49)	94.8	3.80	9	189(188)	94.8	1.00	–	–	–	–
	−3	47(47)	94.8	4.21	9	188(188)	94.8	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	–	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	−6	28(27)	94.7	24.07	9	193(191)	94.4	1.01	−10	145(142)	95.0	1.79
N = 400
	−7	122(120)	95.0	1.31	13	194(192)	94.1	0.99	−10	184(180)	94.7	1.11
	−7	121(120)	95.1	1.31	13	194(192)	94.2	0.99	−10	184(180)	94.6	1.11
N = 10,000
	−4	53(52)	94.5	6.91	11	193(191)	94.4	1.01	−12	151(148)	94.6	1.64
	−3	50(50)	94.6	7.66	11	193(191)	94.4	1.01	−13	151(147)	94.4	1.64

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 3

Summary of simulation results for the Cox model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	128	20(19)	0	10.01	−15	120(123)	95.5	1.07	–	–	–	–
N = 400
	8	62(62)	94.7	1.08	4	124(123)	94.8	1.01	–	–	–	–
	−6	62(64)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	99	38(36)	24.2	2.85	−10	120(123)	95.6	1.07	–	–	–	–
	−2	42(42)	92.7	2.37	5	124(123)	94.6	1.01	–	–	–	–

	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	134	20(19)	0	23.34	−14	125(127)	95.3	1.06	−132	96(96)	73.4	1.90
N = 400
	20	89(85)	92.7	1.19	4	128(128)	95.3	1.02	−27	127(123)	93.0	1.08
	-9	93(89)	93.8	1.09	10	129(128)	95.2	1.01	−2	130(125)	94.4	1.04
N = 10,000
	119	40(39)	15.0	5.81	−12	125(127)	95.7	1.06	−118	101(101)	80.1	1.73
	−4	49(48)	94.6	3.85	9	127(127)	95.4	1.03	−5	105(105)	94.5	1.60

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	128	20(19)	0	10.01	−15	120(123)	95.5	1.07	–	–	–	–
N = 400
	8	62(62)	94.7	1.08	4	124(123)	94.8	1.01	–	–	–	–
	−6	62(64)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	99	38(36)	24.2	2.85	−10	120(123)	95.6	1.07	–	–	–	–
	−2	42(42)	92.7	2.37	5	124(123)	94.6	1.01	–	–	–	–

	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	134	20(19)	0	23.34	−14	125(127)	95.3	1.06	−132	96(96)	73.4	1.90
N = 400
	20	89(85)	92.7	1.19	4	128(128)	95.3	1.02	−27	127(123)	93.0	1.08
	-9	93(89)	93.8	1.09	10	129(128)	95.2	1.01	−2	130(125)	94.4	1.04
N = 10,000
	119	40(39)	15.0	5.81	−12	125(127)	95.7	1.06	−118	101(101)	80.1	1.73
	−4	49(48)	94.6	3.85	9	127(127)	95.4	1.03	−5	105(105)	94.5	1.60

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 3

Summary of simulation results for the Cox model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	128	20(19)	0	10.01	−15	120(123)	95.5	1.07	–	–	–	–
N = 400
	8	62(62)	94.7	1.08	4	124(123)	94.8	1.01	–	–	–	–
	−6	62(64)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	99	38(36)	24.2	2.85	−10	120(123)	95.6	1.07	–	–	–	–
	−2	42(42)	92.7	2.37	5	124(123)	94.6	1.01	–	–	–	–

	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	134	20(19)	0	23.34	−14	125(127)	95.3	1.06	−132	96(96)	73.4	1.90
N = 400
	20	89(85)	92.7	1.19	4	128(128)	95.3	1.02	−27	127(123)	93.0	1.08
	-9	93(89)	93.8	1.09	10	129(128)	95.2	1.01	−2	130(125)	94.4	1.04
N = 10,000
	119	40(39)	15.0	5.81	−12	125(127)	95.7	1.06	−118	101(101)	80.1	1.73
	−4	49(48)	94.6	3.85	9	127(127)	95.4	1.03	−5	105(105)	94.5	1.60

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−7	64(66)	95.4	–	6	124(123)	94.7	–	–	–	–	–
	−7	64(66)	95.4	1.00	6	124(123)	94.7	1.00	–	–	–	–
	128	20(19)	0	10.01	−15	120(123)	95.5	1.07	–	–	–	–
N = 400
	8	62(62)	94.7	1.08	4	124(123)	94.8	1.01	–	–	–	–
	−6	62(64)	95.3	1.07	6	124(123)	94.8	1.00	–	–	–	–
N = 10,000
	99	38(36)	24.2	2.85	−10	120(123)	95.6	1.07	–	–	–	–
	−2	42(42)	92.7	2.37	5	124(123)	94.6	1.01	–	–	–	–

	−9	97(95)	94.3	–	10	129(129)	95.6	–	−2	132(129)	93.8	–
	−9	97(95)	94.3	1.00	10	129(129)	95.6	1.00	−2	132(129)	93.8	1.00
	134	20(19)	0	23.34	−14	125(127)	95.3	1.06	−132	96(96)	73.4	1.90
N = 400
	20	89(85)	92.7	1.19	4	128(128)	95.3	1.02	−27	127(123)	93.0	1.08
	-9	93(89)	93.8	1.09	10	129(128)	95.2	1.01	−2	130(125)	94.4	1.04
N = 10,000
	119	40(39)	15.0	5.81	−12	125(127)	95.7	1.06	−118	101(101)	80.1	1.73
	−4	49(48)	94.6	3.85	9	127(127)	95.4	1.03	−5	105(105)	94.5	1.60

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 4

Summary of simulation results for the proportional odds model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	138	20(19)	0	22.31	1	185(188)	95.2	1.03	–	–	–	–
N = 400
	16	89(89)	93.8	1.15	8	188(188)	94.7	1.00	–	–	–	–
	−7	91(92)	93.9	1.09	10	189(188)	94.8	0.99	–	–	–	–
N = 10,000
	117	44(43)	24.1	4.68	2	186(188)	95.0	1.03	–	–	–	–
	−3	54(53)	95.0	3.17	9	188(188)	94.9	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	−	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	141	20(19)	0	48.02	0	191(191)	94.0	1.02	−152	143(140)	81.2	1.84
N = 400
	36	119(117)	93.7	1.36	10	194(192)	94.3	1.00	−52	181(178)	94.0	1.14
	−3	127(125)	95.1	1.19	13	194(192)	94.2	0.99	−14	186(183)	94.7	1.08
N = 10,000
	132	46(45)	18.3	9.21	1	191(191)	94.2	1.02	−143	148(146)	83.8	1.71
	1	57(58)	95.3	5.88	11	193(191)	94.2	1.00	−17	153(150)	94.3	1.60

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	138	20(19)	0	22.31	1	185(188)	95.2	1.03	–	–	–	–
N = 400
	16	89(89)	93.8	1.15	8	188(188)	94.7	1.00	–	–	–	–
	−7	91(92)	93.9	1.09	10	189(188)	94.8	0.99	–	–	–	–
N = 10,000
	117	44(43)	24.1	4.68	2	186(188)	95.0	1.03	–	–	–	–
	−3	54(53)	95.0	3.17	9	188(188)	94.9	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	−	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	141	20(19)	0	48.02	0	191(191)	94.0	1.02	−152	143(140)	81.2	1.84
N = 400
	36	119(117)	93.7	1.36	10	194(192)	94.3	1.00	−52	181(178)	94.0	1.14
	−3	127(125)	95.1	1.19	13	194(192)	94.2	0.99	−14	186(183)	94.7	1.08
N = 10,000
	132	46(45)	18.3	9.21	1	191(191)	94.2	1.02	−143	148(146)	83.8	1.71
	1	57(58)	95.3	5.88	11	193(191)	94.2	1.00	−17	153(150)	94.3	1.60

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

Table 4

Summary of simulation results for the proportional odds model assuming formula

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	138	20(19)	0	22.31	1	185(188)	95.2	1.03	–	–	–	–
N = 400
	16	89(89)	93.8	1.15	8	188(188)	94.7	1.00	–	–	–	–
	−7	91(92)	93.9	1.09	10	189(188)	94.8	0.99	–	–	–	–
N = 10,000
	117	44(43)	24.1	4.68	2	186(188)	95.0	1.03	–	–	–	–
	−3	54(53)	95.0	3.17	9	188(188)	94.9	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	−	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	141	20(19)	0	48.02	0	191(191)	94.0	1.02	−152	143(140)	81.2	1.84
N = 400
	36	119(117)	93.7	1.36	10	194(192)	94.3	1.00	−52	181(178)	94.0	1.14
	−3	127(125)	95.1	1.19	13	194(192)	94.2	0.99	−14	186(183)	94.7	1.08
N = 10,000
	132	46(45)	18.3	9.21	1	191(191)	94.2	1.02	−143	148(146)	83.8	1.71
	1	57(58)	95.3	5.88	11	193(191)	94.2	1.00	−17	153(150)	94.3	1.60

	β₁				β₂				β₃
	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE	Bias	ESD(ASE)	CP	RE
	Scenario (I):
	−5	95(97)	95.3	–	8	188(188)	94.8	–	–	–	–	–
	−6	94(96)	95.5	1.01	−3	187(188)	95.6	1.01
	138	20(19)	0	22.31	1	185(188)	95.2	1.03	–	–	–	–
N = 400
	16	89(89)	93.8	1.15	8	188(188)	94.7	1.00	–	–	–	–
	−7	91(92)	93.9	1.09	10	189(188)	94.8	0.99	–	–	–	–
N = 10,000
	117	44(43)	24.1	4.68	2	186(188)	95.0	1.03	–	–	–	–
	−3	54(53)	95.0	3.17	9	188(188)	94.9	1.00	–	–	–	–
	Scenario (II):
	−1	139(137)	94.0	−	9	193(192)	94.3	–	−9	193(191)	94.9	–
	−4	137(137)	94.6	1.02	7	193(192)	94.7	1.01	5	193(191)	95.2	1.01
	141	20(19)	0	48.02	0	191(191)	94.0	1.02	−152	143(140)	81.2	1.84
N = 400
	36	119(117)	93.7	1.36	10	194(192)	94.3	1.00	−52	181(178)	94.0	1.14
	−3	127(125)	95.1	1.19	13	194(192)	94.2	0.99	−14	186(183)	94.7	1.08
N = 10,000
	132	46(45)	18.3	9.21	1	191(191)	94.2	1.02	−143	148(146)	83.8	1.71
	1	57(58)	95.3	5.88	11	193(191)	94.2	1.00	−17	153(150)	94.3	1.60

Note: formula ⁠, the pseudopartial likelihood estimator; formula ⁠, the maximum likelihood estimator; formula ⁠, the empirical likelihood estimator; formula ⁠, the proposed estimator accounting for uncertainty in auxiliary information; formula ⁠, the proposed estimator accounting for population heterogeneity and uncertainty in auxiliary information. Bias, ESD, ASE, and CP are empirical bias (× 1000), empirical standard deviation (× 1000), the average of the estimated asymptotic standard error (× 1000) over 1000 simulated datasets, and the 95% coverage probability. RE, the empirical variance of formula divided by that of the proposed estimators.

In the absence of population heterogeneity, all methods yield a small bias in the parameter estimation and the coverage rates of the 95% confidence intervals based on the estimated asymptotic standard errors are very close to the nominal level (0.95). Compared with ⁠, the proposed methods enjoy efficiency gains in estimating β₁ in Scenario (I) and both β₁ and β₃ in Scenario (II) with the upper bound given by that of ⁠, but not β₂ in either case. Note that the external information consists of two exclusive subgroups differed by their values in X₁ but not X₂. Thus, the efficiency gain is mainly observed for effects involving X₁. On the other hand, is slightly more efficient than ⁠, with the relative efficiency ranging from 1.01 to 1.02. Yet, it is computationally more costly than its competitors. The computation burden of the MLE is 52 times higher than that of (26,382 s vs. 509 s for analyzing 10 datasets with and ⁠).

When heterogeneity between internal and external studies is present, Table 3 and 4 show that and ⁠, the augmented empirical likelihood estimators without accounting for heterogeneity, can yield large biases. When a density ratio model is employed to characterize population heterogeneity, yields small biases while enjoying efficiency gains under all scenarios. When information from a large external study with was exploited, the relative efficiency in estimating β₁ under Scenario (I) is 2.19 and 3.28 for the Cox model and the proportional odds model, respectively. On the other hand, the relative efficiency in Scenario (II) can be as high as 3.86 and 1.59 in estimating β₁ and β₃ under the Cox model, and 5.87 and 1.60 under the proportional odds model.

To investigate the robustness of the proposed methods against model misspecification, we carried out additional simulation studies with incorrect choices of in the semiparametric density ratio model. The results are presented in Tables S1–S2 of the Supporting Information. In the case where fails to include X₂, the bias in estimating β remains negligible, and the efficiency gain is similar to that under the correctly specified model. On the other hand, failing to include X₁ or in leads to larger biases in parameter estimation. The results can be explained by the fact that the external information consists of two exclusive subgroups differed by their values in X₁ but not in X₂. Thus model misspecification has a minor impact on the estimation of β when X₂ is not included in ⁠.

Following the suggestions of the reviewers, we expanded simulation studies by including the smaller internal sample sizes and 200, varying the external sample size N from 200 to 10,000, and considered different censoring rates. The results show that the proposed methods perform well in all situations. Details of additional simulation studies are reported in Section S2 of the Supporting Information. Moreover, since may not be available in practice, we studied the asymptotic properties and investigated numerical performance of the proposed method when its estimate is employed instead. As expected, replacing with its estimate yields a larger asymptotic variance. Interestingly, simulation studies show that two estimators have a similar numerical performance in estimating β but not γ. Details can be found in Section 3 of the Supplementary Information.

5 Pancreatic Cancer Data Analysis

Pancreatic cancer is a highly aggressive disease. According to Global Cancer Statistics 2020 (Sung et al., 2021), pancreatic cancer is the seventh leading cause of cancer death in the world, and its incidence rate is on the rise. Late diagnosis, early metastasis, and lack of effective therapy have contributed to the dismal overall prognosis, with only 6% of the patients surviving more than 5 years after diagnosis. Despite recent advances in cancer diagnosis and treatment, surgical resection remains the only possible curative option for pancreatic cancer. However, less than 20% of the patients are eligible for resection when diagnosed, as they often present at advanced stages. Moreover, most patients with resectable pancreatic cancer have an unfavorable outcome due to recurrent disease within a few years. The 5-year survival probability after resection is reported to be around 34.5% (Yamamoto et al., 2015). Hence, it is crucial to identify prognostic factors for pancreatic cancer patients to improve disease management.

In the Johns Hopkins Hospital pancreatic cancer study described in Section 1, patients' demographic information, treatments, and clinical and pathological exam were collected via a retrospective chart review. All-cause and cancer-specific deaths were determined by a combined review of clinical information, Social Security Death Index, and the National Cancer Database. Prognostic factors of interest in our analysis included resection margin status, lymph node involvement, invasion of the surrounding nerves, age, and gender. After excluding patients with missing data, the mean age at the time of surgery among the 204 remaining patients was 64.2 years. About half of the patients were male (51.9%) and had positive resection margins (48.0%). The majority of the patients had perineural invasion (PNI) (94.1%) and lymph node involvement (86.2%).

To evaluate the effects of prognostic factors on survival after pancreatectomy, we fit two special cases of the semiparametric transformation model: the Cox model and the proportional odds model. As reported in Table 5, both sets of analysis concluded that female sex, older age, PNI, node positivity, and margin positivity were associated with poorer prognosis. However, the effect of node status, a known prognostic factor, did not reach statistical significance in both models, most likely due to the small sample size of patients without lymph node involvement.

Table 5

Estimated regression coefficients of the Cox model and the proportional odds model for the pancreatic cancer study

	Node positive		Margin positive		PNI		>65 years		Male		γ₀		γ₁		γ₂
	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE
Cox model
	0.363	0.226	0.407	0.153	1.124	0.373	0.282	0.153	−0.295	0.154	–	–	–	–	–	–
	0.509	0.158	0.393	0.153	1.128	0.373	0.275	0.153	−0.291	0.154	1.141	0.226	−0.867	0.269	−1.098	0.290
Proportional odds model
	0.654	0.373	0.916	0.256	2.096	0.744	0.385	0.246	−0.341	0.248	–	–	–	–	–	–
	0.877	0.282	0.899	0.255	2.089	0.743	0.377	0.245	−0.340	0.249	1.143	0.226	−0.867	0.269	−1.100	0.290

	Node positive		Margin positive		PNI		>65 years		Male		γ₀		γ₁		γ₂
	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE
Cox model
	0.363	0.226	0.407	0.153	1.124	0.373	0.282	0.153	−0.295	0.154	–	–	–	–	–	–
	0.509	0.158	0.393	0.153	1.128	0.373	0.275	0.153	−0.291	0.154	1.141	0.226	−0.867	0.269	−1.098	0.290
Proportional odds model
	0.654	0.373	0.916	0.256	2.096	0.744	0.385	0.246	−0.341	0.248	–	–	–	–	–	–
	0.877	0.282	0.899	0.255	2.089	0.743	0.377	0.245	−0.340	0.249	1.143	0.226	−0.867	0.269	−1.100	0.290

Note: formula ⁠, the conventional pseudopartial likelihood estimator without synthesizing auxiliary information; formula ⁠, the proposed estimator accounting for uncertainty and heterogeneity in the external information. Est denotes the estimate, SE denotes the standard error, which is calculated by the square root of the asymptotic variance. formula ⁠, and γ₂, the regression parameters of the density ratio model corresponding to intercept, margin positivity, and node positivity, respectively.

Table 5

Estimated regression coefficients of the Cox model and the proportional odds model for the pancreatic cancer study

	Node positive		Margin positive		PNI		>65 years		Male		γ₀		γ₁		γ₂
	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE
Cox model
	0.363	0.226	0.407	0.153	1.124	0.373	0.282	0.153	−0.295	0.154	–	–	–	–	–	–
	0.509	0.158	0.393	0.153	1.128	0.373	0.275	0.153	−0.291	0.154	1.141	0.226	−0.867	0.269	−1.098	0.290
Proportional odds model
	0.654	0.373	0.916	0.256	2.096	0.744	0.385	0.246	−0.341	0.248	–	–	–	–	–	–
	0.877	0.282	0.899	0.255	2.089	0.743	0.377	0.245	−0.340	0.249	1.143	0.226	−0.867	0.269	−1.100	0.290

	Node positive		Margin positive		PNI		>65 years		Male		γ₀		γ₁		γ₂
	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE	Est	SE
Cox model
	0.363	0.226	0.407	0.153	1.124	0.373	0.282	0.153	−0.295	0.154	–	–	–	–	–	–
	0.509	0.158	0.393	0.153	1.128	0.373	0.275	0.153	−0.291	0.154	1.141	0.226	−0.867	0.269	−1.098	0.290
Proportional odds model
	0.654	0.373	0.916	0.256	2.096	0.744	0.385	0.246	−0.341	0.248	–	–	–	–	–	–
	0.877	0.282	0.899	0.255	2.089	0.743	0.377	0.245	−0.340	0.249	1.143	0.226	−0.867	0.269	−1.100	0.290

Note: formula ⁠, the conventional pseudopartial likelihood estimator without synthesizing auxiliary information; formula ⁠, the proposed estimator accounting for uncertainty and heterogeneity in the external information. Est denotes the estimate, SE denotes the standard error, which is calculated by the square root of the asymptotic variance. formula ⁠, and γ₂, the regression parameters of the density ratio model corresponding to intercept, margin positivity, and node positivity, respectively.

To improve estimation efficiency, we seek to incorporate information from Ahmad et al. (2001), which reported 3-year survival probabilities with respect to lymph node status based on data from 116 patients. From this study, we exploited two sets of 3-year subgroup survival probabilities: 14% for patients with positive node status and 38% for patients with negative node status. An examination of the available covariate summary statistics revealed that the proportions of margin-positive and node-positive patients in the external study were only 24% and 62%, respectively, which were much lower than that in the internal study (48% and 86%, respectively), indicating strong heterogeneity between the covariate distributions between the internal and external study. As discussed in Section 3.2, we constructed a density ratio model with margin status and node status to account for the heterogeneity between the covariate distributions. Note that we applied the estimator described in Section 3 of the Supporting Information to account for variability in the covariate summary statistics.

Table 5 summarizes the fitted Cox model and the fitted proportional odds model using the proposed methods. We extracted the standard errors of the reported 3-year survival probabilities from Ahmad et al. (2001) to evaluate the degree of uncertainty in the external information. In particular, a significant efficiency gain is observed in estimating the effects of node positivity, which determined the two subgroups in the external information. The estimated coefficients of margin positivity and node positivity in the density ratio model are −0.867 (95% CI, [0.340]) and −1.098 (95% CI, [0.529]), respectively, under the Cox model, and −0.867 (95% CI, [0.339]) and −1.100 (95% CI, [0.531]), respectively, under the proportional odds model, indicating significant heterogeneity between internal and external studies. Interestingly, the efficiency loss due to estimating an additional set of parameters in the density ratio model is minimal. As a result, compared with ⁠, the relative efficiency in estimating node positivity is 2.046 under the Cox model and 1.750 under the proportional odds model. Notably, the effect of lymph node involvement reaches statistical significance after incorporating the external information. Finally, following Zeng and Lin (2007), the final model was selected based on the Akaike information criterion (AIC). In this data example, the proportional odds model fits slightly better than the Cox model (AIC 1623 vs. 1625). Hence, we conclude that node-positive patients had 2.404 (⁠⁠) times higher odds of dying before any given time t compared to node-negative patients.

6 Discussion

In this article, we propose empirical likelihood-based methods to improve the estimation efficiency of the semiparametric transformation model by incorporating three types of external information: t-year subgroup survival probabilities, variance–covariance matrix of the estimated survival probabilities, and covariate summary statistics. With externally reported t-year subgroup survival probabilities and a consistently estimated variance–covariance matrix, information synthesis can be performed under a homogeneity assumption between internal and external studies. When the homogeneity condition fails to hold, a density ratio model is used to account for population heterogeneity. Therefore, additional information on the distribution of the external covariates is needed to estimate the extra set of parameters in the density ratio model.

When IPD from external sources is available, a pooled IPD analysis can be performed to combine information from internal and external study. However, when ⁠, the pooled analysis may not yield better efficiency than properly combining information from the subgroup survival probabilities evaluated in entire external cohort. On the other hand, when t-year subgroup survival probabilities and a random sample of the covariates are available from external sources, the population estimating equation (2) can be approximated using the available external data. Specifically, let denote the jump size of the marginal distribution of X* at the data point ⁠. One can maximize the empirical likelihood obtained by multiplying the pseudopartial likelihood and the marginal likelihood of X* with respect to the constraints:

(28)

It is worth pointing out that this approach gives valid inference even when the covariate distribution differs between internal and external studies. Moreover, as argued as in Chatterjee et al. (2016) and Han and Lawless (2019), when ⁠, it can be shown that this approach is asymptotically as or more efficient than under a homogeneity assumption between internal and external studies.

In practice, adding more covariates in the density ratio model usually leads to increased computation burden and instability when implementing the empirical likelihood estimation. Alternatively, one may consider a two-step procedure as follows. In the first step, estimate γ by solving when the number of estimating equations constructed based on covariate summary statistics equals q, that is, the dimension of γ. When the number of estimating equations is greater than q, the generalized method of moments (GMMs) can be applied to estimate γ. In the second step, the parameter β of interest can be obtained by using the estimating procedure proposed in (17) with the constraint replaced by ⁠, where is the solution obtained in Step 1. In our simulation experience, this approach shares the advantage of the computational speed, and the results are similar to the empirical likelihood estimation.

It is worthwhile to point out that the use of the density ratio model to account for population heterogeneity is different than what has been proposed in the literature. For example, Huang et al. (2016) required the covariate distributions between internal and external studies to be the same but assumed that the baseline hazard function in the external study is proportional (but may not be identical) to that in the internal study up to a constant factor. Recently, Zheng et al. (2022) proposed a calibration weighting method to reduce the bias for individual risk prediction in the presence of population heterogeneity. In contrast to the density ratio model, the calibration weights ⁠, with ξ being the Lagrange multiplier, also reflect the difference of the covariate distributions between internal and external studies. Our conjecture is that the calibration weight adjustment can be less efficient than the density ratio weight adjustment when important covariates are included in the density ratio model. Further research is warranted and will be investigated in our future work.

Data Availability Statement

The application study data in this paper are not publicly available due to patient privacy and confidentiality issues.

Open Research Badges

This article has earned an Open Materials badge for making publicly available the components of the research methodology needed to reproduce the reported procedure and analysis. All materials are available at the Biometrics website on Wiley Online Library https://github.com/kgolmakani/CumulativeRisk.git.

Acknowledgments

This work was partially supported by Taiwan Ministry of Science and Technology MOST 110-2628-M-007-003-MY2 (Cheng and Liu), MOST 110-2811-M-007-560 (Tsai), and National Institutes of Health grant R01CA193888 (Huang). The authors thank Dr. Lei Zheng for kindly sharing the Johns Hopkins Pancreatic Cancer Study data. They also thank Dr. Ying Sheng for discussion and computing assistance.

References

Ahmad

,

N.A.

,

Lewis

,

J.D.

,

Ginsberg

,

G.G.

,

Haller

,

D.G.

,

Morris

,

J.B.

,

Williams

,

N.N.

,

Rosato

,

E.F.

&

Kochman

,

M.L.

(

2001

)

Long term survival after pancreatic resection for pancreatic adenocarcinoma

.

The American Journal of Gastroenterology

,

96

(

9

),

2609

–

2615

.

Chatterjee

,

N.

,

Chen

,

Y.H.

,

Maas

,

P.

&

Carroll

,

R.J.

(

2016

)

Constrained maximum likelihood estimation for model calibration using summary-level information from external big data sources

.

Journal of the American Statistical Association

,

111

(

513

),

107

–

117

.

Chen

,

Z.

,

Ning

,

J.

,

Shen

,

Y.

&

Qin

,

J.

(

2021

)

Combining primary cohort data with external aggregate information without assuming comparability

.

Biometrics

,

77

(

3

),

1024

–

1036

.

Gao

,

F.

&

Chan

,

K.

(

2022

)

Noniterative adjustment to regression estimators with population-based auxiliary information for semiparametric models

.

Biometrics

,

to appear

.

Guyatt

,

G.

,

Cairns

,

J.

,

Churchill

,

D.

,

Cook

,

D.

,

Haynes

,

B.

,

Hirsh

,

J.

,

Irvine

,

J.

,

Levine

,

M.

,

Levine

,

M.

,

Nishikawa

,

J.

et al. (

1992

)

Evidence-based medicine: a new approach to teaching the practice of medicine

.

Journal of the American Medical Association

,

268

(

17

),

2420

–

2425

.

Han

,

P.

&

Lawless

,

J.F.

(

2019

)

Empirical likelihood estimation using auxiliary summary information with different covariate distributions

.

Statistica Sinica

,

29

(

3

),

1321

–

1342

.

Huang

,

C.Y.

,

Qin

,

J.

&

Tsai

,

H.T.

(

2016

)

Efficient estimation of the Cox model with auxiliary subgroup survival information

.

Journal of the American Statistical Association

,

111

(

514

),

787

–

799

.

Imbens

,

G.W.

&

Lancaster

,

T.

(

1994

)

Combining micro and macro data in microeconometric models

.

The Review of Economic Studies

,

61

(

4

),

655

–

680

.

Liu

,

D.

,

Zheng

,

Y.

,

Prentice

,

R.L.

&

Hsu

,

L.

(

2014

)

Estimating risk with time-to-event data: an application to the women's health initiative

.

Journal of the American Statistical Association

,

109

(

506

),

514

–

524

.

Owen

,

A.B.

(

1988

)

Empirical likelihood ratio confidence intervals for a single functional

.

Biometrika

,

75

(

2

),

237

–

249

.

Qin

,

J.

(

2000

)

Combining parametric and empirical likelihoods

.

Biometrika

,

87

(

2

),

484

–

490

.

Qin

,

J.

&

Lawless

,

J.

(

1994

)

Empirical likelihood and general estimating equations

.

The Annals of Statistics

,

72

(

1

),

300

–

325

.

Sheng

,

Y.

,

Sun

,

Y.

,

Huang

,

C.Y.

&

Kim

,

M.O.

(

2021

)

Synthesizing external aggregated information in the penalized Cox regression under population heterogeneity

.

Statistics in Medicine

,

40

(

23

),

4915

–

4930

.

Shimodaira

,

H.

(

2000

)

Improving predictive inference under covariate shift by weighting the log-likelihood function

.

Journal of Statistical Planning and Inference

,

90

(

2

),

227

–

244

.

Sung

,

H.

,

Ferlay

,

J.

,

Siegel

,

R.L.

,

Laversanne

,

M.

,

Soerjomataram

,

I.

,

Jemal

,

A.

&

Bray

,

F.

(

2021

)

Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries

.

CA: A Cancer Journal for Clinicians

,

71

(

3

),

209

–

249

.

PubMed

Sutton

,

A.J.

,

Abrams

,

K.R.

,

Jones

,

D.R.

,

Jones

,

D.R.

,

Sheldon

,

T.A.

&

Song

,

F.

(

2000

)

Methods for meta-analysis in medical research

.

Chichester

:

Wiley

.

Google Preview

Thomas

,

D.R.

&

Grunkemeier

,

G.L.

(

1975

)

Confidence interval estimation of survival probabilities for censored data

.

Journal of the American Statistical Association

,

70

(

352

),

865

–

871

.

Whitehead

,

A.

(

2002

)

Meta-analysis of controlled clinical trials

.

Chichester

:

John Wiley & Sons

.

Yamamoto

,

T.

,

Yagi

,

S.

,

Kinoshita

,

H.

,

Sakamoto

,

Y.

,

Okada

,

K.

,

Uryuhara

,

K.

,

Morimoto

,

T.

,

Kaihara

,

S.

&

Hosotani

,

R.

(

2015

)

Long-term survival after resection of pancreatic cancer: a single-center retrospective analysis

.

World Journal of Gastroenterology

,

21

(

1

),

262

–

268

.

Zeng

,

D.

&

Lin

,

D.Y.

(

2006

)

Efficient estimation of semiparametric transformation models for counting processes

.

Biometrika

,

93

(

3

),

627

–

640

.

Zeng

,

D.

&

Lin

,

D.Y.

(

2007

)

Maximum likelihood estimation in semiparametric regression models with censored data

.

Journal of the Royal Statistical Society: Series B

,

69

(

4

),

507

–

564

.

Zhang

,

H.

,

Deng

,

L.

,

Schiffman

,

M.

,

Qin

,

J.

&

Yu

,

K.

(

2020

)

Generalized integration model for improved statistical inference by leveraging external summary data

.

Biometrika

,

107

(

3

),

689

–

703

.

Zheng

,

J.

,

Zheng

,

Y.

&

Hsu

,

L.

(

2022

)

Risk projection for time-to-event outcome leveraging summary statistics with source individual-level data

.

Journal of the American Statistical Association

,

to appear

.

Zucker

,

D.M.

(

2005

)

A pseudo–partial likelihood method for semiparametric survival regression with covariate errors

.

Journal of the American Statistical Association

,

100

(

472

),

1264

–

1277

.