Abstract

The semiparametric Cox proportional hazards model, together with the partial likelihood principle, has been widely used to study the effects of potentially time-dependent covariates on a possibly censored event time. We propose a computationally efficient method for fitting the Cox model to big data involving millions of study subjects. Specifically, we perform maximum partial likelihood estimation on a small subset of the whole data and improve the initial estimator by incorporating the remaining data through one-step estimation with estimated efficient score functions. We show that the final estimator has the same asymptotic distribution as the conventional maximum partial likelihood estimator using the whole dataset but requires only a small fraction of computation time. We demonstrate the usefulness of the proposed method through extensive simulation studies and an application to the UK Biobank data.

1 INTRODUCTION

Large biobanks, such as the UK Biobank (Bycroft et al., 2018), provide unprecedented opportunities to explore the genetic basis of the onset and progression of complex human diseases. In particular, time-to-event analyses of the UK Biobank data have identified novel genetic variants associated with hypertension, heart disease, breast cancer, and Alzheimer’s disease (Bi et al., 2020; Dey et al., 2022). The semiparametric Cox (1972) proportional hazards model has been widely used in such time-to-event analyses. The regression parameters are commonly estimated by maximizing the partial likelihood (Cox, 1975). This maximization is time-consuming and often infeasible for big data involving a very large number of subjects, especially in the presence of time-dependent covariates.

One possible solution is to randomly divide the full dataset into several subsets, analyze each subset separately, and then combine the estimates. Two versions of this divide-and-conquer (DAC) approach have been suggested: The linearization-based DAC method calculates an initial maximum partial likelihood estimate for 1 block and iteratively updates the estimate with the score function and information matrix on each individual subset in 2 iterations (Wang et al., 2021); the weighted DAC method calculates the maximum partial likelihood estimate for each block and produces a weighted average of the estimates (Wang et al., 2022). The DAC approach is computationally efficient under the distributed computing architecture but requires separate estimation of the baseline hazard function for each subset of data, which may cause numerical instability and reduce statistical efficiency.

In this paper, we propose a computationally fast method for fitting the Cox regression model to big data without sacrificing statistical efficiency. Specifically, we divide the whole dataset into 2 blocks, with the first block being much smaller than the second block. We perform maximum partial likelihood estimation on the first block of data. We then improve this initial estimator by using one-step estimation with semiparametric efficient scores on the second block of data. Since maximum partial likelihood estimation is performed only on a small subset of the whole data, and the one-step update does not involve any iterations or calculation of the Hessian matrix, the proposed method takes only a small fraction of the computation time that is required to maximize the partial likelihood on the whole dataset. Using the counting-process martingale theory and modern empirical process theory, we show that the proposed estimator achieves the same asymptotic efficiency as the maximum partial likelihood estimator on the whole dataset.

2 METHODS

2.1 Maximum partial likelihood estimation

The Cox (1972) proportional hazards model specifies that the hazard function of an event time T conditional on a p-vector of potentially time-dependent covariates |$\boldsymbol {X}$| takes the form

$$\lambda(t \mid \boldsymbol{X}) = \lambda_0(t)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}(t)},$$
where |$\boldsymbol {\beta }$| is a p-vector of unknown regression parameters, and |$\lambda _{0}(t)$| is an arbitrary baseline hazard function. When T is subject to right censoring by C, we observe |$\boldsymbol {O}= (\widetilde{T}, \Delta , \boldsymbol {X})$|⁠, where |$\widetilde{T} = \min (T, C)$|⁠, |$\Delta = I(T \le C)$|⁠, and |$I(\cdot )$| is the indicator function. It is assumed that C is independent of T conditional on |$\boldsymbol {X}$|⁠.

We consider a random sample of n subjects and use the subscript i to indicate any variable associated with the ith subject. Given the data |$(\widetilde{T}_i, \Delta _i, \boldsymbol {X}_i)\, (i=1,\ldots ,n)$|, the partial likelihood for |$\boldsymbol {\beta }$| takes the form

$$L(\boldsymbol{\beta}) = \prod_{i=1}^{n} \left\{ \frac{e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_i(\widetilde{T}_i)}}{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}} \right\}^{\Delta_i}.$$
The corresponding score function and information matrix are

$$\boldsymbol{U}(\boldsymbol{\beta}) = \sum_{i=1}^{n} \Delta_i \left\{ \boldsymbol{X}_i(\widetilde{T}_i) - \frac{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}\, \boldsymbol{X}_j(\widetilde{T}_i)}{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}} \right\}$$

and

$$\boldsymbol{I}(\boldsymbol{\beta}) = \sum_{i=1}^{n} \Delta_i \left[ \frac{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}\, \boldsymbol{X}_j(\widetilde{T}_i)^{\otimes 2}}{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}} - \left\{ \frac{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}\, \boldsymbol{X}_j(\widetilde{T}_i)}{\sum_{j=1}^{n} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\boldsymbol{\beta}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}} \right\}^{\otimes 2} \right],$$
respectively. Here and in the sequel, |$\boldsymbol {a}^{\otimes 0} = 1, \boldsymbol {a}^{\otimes 1} = \boldsymbol {a}$|⁠, and |$\boldsymbol {a}^{\otimes 2} = \boldsymbol {a}\boldsymbol {a}^{{ \mathrm{\scriptscriptstyle T} }}$|⁠. The standard method is to solve the score equation |$\boldsymbol {U}(\boldsymbol {\beta })=0$| via the Newton-Raphson algorithm. The resulting estimator |$\widehat{\boldsymbol {\beta }}$| is consistent and asymptotically normal, with a covariance matrix that can be estimated by |$\boldsymbol {I}^{-1}(\widehat{\boldsymbol {\beta }})$| (Andersen and Gill, 1982).

The time required to obtain |$\widehat{\boldsymbol {\beta }}$| is determined by the evaluation of |$\boldsymbol {U}(\boldsymbol {\beta })$| and |$\boldsymbol {I}(\boldsymbol {\beta })$| during the Newton-Raphson iterations. When |$\boldsymbol {X}$| is time-independent, the summations in the numerators and denominators of |$\boldsymbol {U}(\boldsymbol {\beta })$| and |$\boldsymbol {I}(\boldsymbol {\beta })$| take the form |$\sum _{j=1}^{n}I(\widetilde{T}_{j} \ge \widetilde{T}_{i}) \boldsymbol {g}(\boldsymbol {X}_{j})$| for some function |$\boldsymbol {g}$|⁠. Once the observed event times are sorted in descending order, each summation can be calculated as a cumulative sum of |$\boldsymbol {g}(\boldsymbol {X}_{j})\, (j=1,\ldots ,n)$| in reverse order, and the summations can be performed for all |$\widetilde{T}_i$| in 1 loop. Thus, the computation time for evaluating |$\boldsymbol {U}(\boldsymbol {\beta })$| and |$\boldsymbol {I}(\boldsymbol {\beta })$| is linear in |$n p^2$|⁠, since the dimension of |$\boldsymbol {X}$| is p. Therefore, the total time complexity for the standard method is |$O(n\log n + np^2 )$|⁠, provided that the sorting algorithm has a time complexity |$O(n\log n)$|⁠. When |$\boldsymbol {X}$| is a continuous function of t, however, the summation |$\sum _{j=1}^{n}I(\widetilde{T}_{j} \ge \widetilde{T}_{i})\boldsymbol {g}\lbrace \boldsymbol {X}_{j}(\widetilde{T}_{i})\rbrace$| must be calculated over all the subjects in the risk set |$\lbrace j: \widetilde{T}_j \ge \widetilde{T}_{i}\rbrace$| at each observed event time |$\widetilde{T}_{i}$|⁠. Because each calculation is linear in n, the total time complexity for evaluating |$\boldsymbol {U}(\boldsymbol {\beta })$| and |$\boldsymbol {I}(\boldsymbol {\beta })$| becomes |$O(mnp^2)$|⁠, where m is the total number of distinct observed event times. Consequently, the computation can be formidable when n and m are very large.
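
For time-independent covariates, the cumulative-sum evaluation described above can be sketched in a few lines of R. The function below is only an illustration of the mechanism (ties are ignored, and the argument names are placeholders), not the implementation used in the paper.

# Partial-likelihood score for time-independent covariates, computed with reverse
# cumulative sums over the risk sets (ties ignored for simplicity).
cox_score <- function(beta, time, status, X) {
  ord    <- order(time, decreasing = TRUE)   # largest observed time first
  time   <- time[ord]; status <- status[ord]
  X      <- X[ord, , drop = FALSE]
  w      <- exp(drop(X %*% beta))            # exp(beta' X_j)
  S0     <- cumsum(w)                        # sum of exp(beta' X_j) over {j: T_j >= T_i}
  S1     <- apply(X * w, 2, cumsum)          # same sum with X_j exp(beta' X_j)
  colSums(status * (X - S1 / S0))            # sum over events of X_i - S1/S0
}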

2.2 Improving computational efficiency

We propose a method to reduce computation time while maintaining statistical efficiency. To this end, we randomly divide the entire dataset into 2 blocks indexed by |${\cal J}_{1}$| and |${\cal J}_{2}$|⁠, with |$n_1 (\lt n)$| observations in |${\cal J}_{1}$|⁠. We maximize the partial likelihood with the data from the first block to obtain an initial estimator of |$\boldsymbol {\beta }$|⁠, denoted by |$\widehat{\boldsymbol {\beta }}_{(1)}$|⁠. We then incorporate the data from the second block by using the one-step estimator from semiparametric efficiency theory (Bickel et al., 1993).

The efficient score function for |$\boldsymbol {\beta }$| based on a single observation |$\boldsymbol {O}=(\widetilde{T}, \Delta , \boldsymbol {X})$| is

$$\boldsymbol{S}(\boldsymbol{O}; \boldsymbol{\beta}, \Lambda_0) = \int_0^{\infty} \left[ \boldsymbol{X}(t) - E\{\boldsymbol{X}(t) \mid \widetilde{T} \ge t\} \right] dM(t),$$
where |$M(t) = N(t) - \int _{0}^t I(\widetilde{T} \ge s)e^{\boldsymbol {\beta }^{{ \mathrm{\scriptscriptstyle T} }}\boldsymbol {X}(s)}d\Lambda _{0}(s)$|⁠, |$N(t) = \Delta I(\widetilde{T}\le t)$|⁠, and |$\Lambda _{0}(t) = \int _0^t \lambda _0(s)ds$|⁠. We obtain an empirical counterpart |$\widehat{\boldsymbol {S}}(\boldsymbol {O}; \widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)})$| by replacing |$\boldsymbol {\beta }$|⁠, |$\Lambda _0(t)$|⁠, and |$E\lbrace \boldsymbol {X}(t) \mid \widetilde{T} \ge t\rbrace$| in |$\boldsymbol {S}(\boldsymbol {O}; \boldsymbol {\beta }, \Lambda _0)$| with |$\widehat{\boldsymbol {\beta }}_{(1)}$|⁠,

$$\widehat{\Lambda}_{(1)}(t) = \sum_{i \in {\cal J}_1} \frac{\Delta_i\, I(\widetilde{T}_i \le t)}{\sum_{j \in {\cal J}_1} I(\widetilde{T}_j \ge \widetilde{T}_i)\, e^{\widehat{\boldsymbol{\beta}}_{(1)}^{\mathrm{T}} \boldsymbol{X}_j(\widetilde{T}_i)}}$$

(Breslow, 1972), and

$$\widehat{\boldsymbol{E}}(t) = \frac{\sum_{j \in {\cal J}_1} I(\widetilde{T}_j \ge t)\, e^{\widehat{\boldsymbol{\beta}}_{(1)}^{\mathrm{T}} \boldsymbol{X}_j(t)}\, \boldsymbol{X}_j(t)}{\sum_{j \in {\cal J}_1} I(\widetilde{T}_j \ge t)\, e^{\widehat{\boldsymbol{\beta}}_{(1)}^{\mathrm{T}} \boldsymbol{X}_j(t)}},$$

respectively. Our final estimator for |$\boldsymbol {\beta }$| is given by

$$\widehat{\boldsymbol{\beta}}_f = \widehat{\boldsymbol{\beta}}_{(1)} + \frac{n_1}{n}\, \widehat{\boldsymbol{I}}_{(1)}^{-1} \sum_{i \in {\cal J}_2} \widehat{\boldsymbol{S}}(\boldsymbol{O}_i; \widehat{\boldsymbol{\beta}}_{(1)}, \widehat{\Lambda}_{(1)}),$$

where |$\widehat{\boldsymbol {I}}_{(1)}$| is the information matrix for the first block. We refer to |$\widehat{\boldsymbol {\beta }}_f$| as the Cox one-step efficient score (Cox-OSES) estimator, reflecting the fact that one-step estimation with the efficient scores on the second block is used to boost the efficiency of the initial estimator |$\widehat{\boldsymbol {\beta }}_{(1)}$|. We show in the next section that the Cox-OSES estimator |$\widehat{\boldsymbol {\beta }}_{f}$| has the same asymptotic distribution as the standard estimator |$\widehat{\boldsymbol {\beta }}$|. Thus, we estimate the covariance matrix of |$\widehat{\boldsymbol {\beta }}_{f}$| by |$n_{1} \widehat{\boldsymbol {I}}_{(1)}^{-1} / n$|.
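
To make the procedure concrete, the following R sketch carries out the two-block estimation for time-independent covariates, using coxph from the survival package with the Breslow tie convention for the first block and the one-step update displayed above for the second block. The data frames block1 and block2, with columns time, status, and numeric covariate columns, are assumed inputs; tie handling and time-dependent covariates are omitted, so this is an illustrative sketch rather than the authors' implementation.

library(survival)

n1 <- nrow(block1); n2 <- nrow(block2); n <- n1 + n2

# Block 1: maximum partial likelihood estimation (time-independent covariates).
fit1  <- coxph(Surv(time, status) ~ ., data = block1, ties = "breslow")
beta1 <- coef(fit1)
p     <- length(beta1)
Iinv1 <- vcov(fit1)                        # inverse of the block-1 information matrix
X1    <- as.matrix(block1[, names(beta1)])
r1    <- exp(drop(X1 %*% beta1))

# Breslow increments and risk-weighted covariate means at block-1 event times.
tev  <- sort(unique(block1$time[block1$status == 1]))
dLam <- numeric(length(tev))
Ebar <- matrix(0, length(tev), p)
for (k in seq_along(tev)) {
  rs        <- block1$time >= tev[k]       # block-1 risk set at tev[k]
  dLam[k]   <- sum(block1$time == tev[k] & block1$status == 1) / sum(r1[rs])
  Ebar[k, ] <- colSums(X1[rs, , drop = FALSE] * r1[rs]) / sum(r1[rs])
}
cumL  <- cumsum(dLam)                      # estimated cumulative baseline hazard
cumEL <- apply(Ebar * dLam, 2, cumsum)     # cumulative sum of Ebar(t) dLambda(t)

# Block 2: estimated efficient scores, then the one-step update.
X2  <- as.matrix(block2[, names(beta1)])
r2  <- exp(drop(X2 %*% beta1))
pos <- findInterval(block2$time, tev)      # last block-1 event time <= follow-up time
S   <- matrix(0, n2, p)
d   <- block2$status == 1 & pos > 0        # observed events (counting-process part)
S[d, ] <- X2[d, , drop = FALSE] - Ebar[pos[d], , drop = FALSE]
a   <- pos > 0                             # subjects with a nonzero compensator part
S[a, ] <- S[a, ] - r2[a] * (X2[a, , drop = FALSE] * cumL[pos[a]] -
                            cumEL[pos[a], , drop = FALSE])
beta_f <- beta1 + (n1 / n) * drop(Iinv1 %*% colSums(S))
cov_f  <- (n1 / n) * Iinv1                 # covariance estimate for beta_f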

 
Remark 1

A statistically more efficient covariance matrix estimator is given by |$\boldsymbol {I}(\widehat{\boldsymbol {\beta }}_{f})^{-1}$|⁠, which makes use of the entire data. However, its computation is more demanding than scaling |$\widehat{\boldsymbol {I}}_{(1)}$|⁠. For statistical inference, only a consistent estimator of the covariance matrix is required, and estimating the covariance matrix of |$\widehat{\boldsymbol {\beta }}_{f}$| by |$n_{1} \widehat{\boldsymbol {I}}_{(1)}^{-1} / n$| is both sufficient and practical.

According to the discussion in Section 2.1, the time complexity for calculating |$(\widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)}, \widehat{\boldsymbol {I}}_{(1)})$| is |$O(n_{1}\log n_{1} + n_{1}p^2)$| when |$\boldsymbol {X}$| is time-independent and |$O(m_{1}n_{1}p^2)$| when |$\boldsymbol {X}$| contains continuously time-dependent covariates, where |$m_{1}$| is the total number of distinct observed event times in the first block. For the second block, we need to evaluate the term |$\sum _{i\in {\cal J}_2}\widehat{\boldsymbol {S}}(\boldsymbol {O}_{i}; \widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)})$|⁠, which is equal to

$$\sum_{i \in {\cal J}_2} \Delta_i \{\boldsymbol{X}_i(\widetilde{T}_i) - \widehat{\boldsymbol{E}}(\widetilde{T}_i)\} - \sum_{i \in {\cal J}_2} \sum_{j \in {\cal J}_1} I(\widetilde{T}_i \ge \widetilde{T}_j)\, e^{\widehat{\boldsymbol{\beta}}_{(1)}^{\mathrm{T}} \boldsymbol{X}_i(\widetilde{T}_j)} \{\boldsymbol{X}_i(\widetilde{T}_j) - \widehat{\boldsymbol{E}}(\widetilde{T}_j)\}\, d\widehat{\Lambda}_{(1)}(\widetilde{T}_j), \tag{1}$$

where |$d\widehat{\Lambda }_{(1)}(\widetilde{T}_{j}) = \Delta _j/\sum _{l \in {\cal J}_1}I(\widetilde{T}_{l} \ge \widetilde{T}_{j})e^{\widehat{\boldsymbol {\beta }}_{(1)}^{{ \mathrm{\scriptscriptstyle T} }}\boldsymbol {X}_{l}(\widetilde{T}_{j})}$|. Note that |$\widehat{\boldsymbol {E}}(t)$| is a step function with |$m_{1}$| jumps, which is obtained from the first block analysis. Thus, the time complexity for the first term in Equation 1 is |$O\lbrace \max (n_{1}, n-n_{1})p\rbrace =O\lbrace (n-n_1)p\rbrace$| after the event times in the second block are sorted. The time complexity for calculating the second term in Equation 1 depends on the type of covariates. If |$\boldsymbol {X}$| is time-independent, this term can be written as

$$\sum_{j \in {\cal J}_1} d\widehat{\Lambda}_{(1)}(\widetilde{T}_j) \sum_{i \in {\cal J}_2} I(\widetilde{T}_i \ge \widetilde{T}_j)\, e^{\widehat{\boldsymbol{\beta}}_{(1)}^{\mathrm{T}} \boldsymbol{X}_i} \{\boldsymbol{X}_i - \widehat{\boldsymbol{E}}(\widetilde{T}_j)\}.$$
Given the sorted event times in the second block, the time complexity of computing this term is |$O(np)$|⁠. When |$\boldsymbol {X}$| is continuously time-dependent, however, we need to evaluate the summation for each of the |$m_{1}$| distinct event times in the first block, with a time complexity of |$O\lbrace m_{1}(n-n_{1})p\rbrace$|⁠. Combining the calculations in the first and second blocks, we conclude that the total time complexity of the proposed method is |$O\lbrace n_{1}\log n_{1} + n_{1}p^2 + (n-n_{1})\log (n-n_{1}) + np\rbrace$| when covariates are all time-independent and |$O\lbrace m_{1}n_{1}p^2 + m_{1}(n-n_{1})p\rbrace$| when covariates are continuously time-dependent.

According to the discussion in Section 2.1, the total time complexity of the DAC methods with M blocks is |$O\lbrace n \log (n / M) + np^2\rbrace$| when covariates are all time-independent and |$O(m_{1}np^2)$| when covariates are continuously time-dependent. Here, we assume the DAC methods evenly divide the full dataset into M subsets and the block size is of the same magnitude as |$n_1$|⁠. Then the time complexity of the proposed method is of the same order as the standard method and the DAC methods when covariates are all time-independent and is much lower than both the standard method and the DAC methods when covariates are continuously time-dependent and p is large. In terms of actual computation, the proposed method is always less demanding than both the standard method and the DAC methods since the one-step estimation in the second block does not involve any iterations or calculation of the Hessian matrix.

The Cox-OSES estimator can be used for variable selection. Motivated by Wang and Leng (2007) and Zou (2006), we minimize the objective function:

where |$\lambda \ge 0$| is a tuning parameter controlling the level of penalization, and |$\gamma \gt 0$| is a pre-specified positive number. We set |$\gamma = 1$| in our simulation studies and real-data example. This optimization can be solved by using the R package glmnet (Friedman et al., 2010). Let |$\widetilde{\boldsymbol {\beta }}_{f, \lambda }$| denote the minimizer of |$Q(\boldsymbol {\beta })$|⁠. As |$n^{1 / 2}\lambda \rightarrow 0$| and |$n^{(1 + \gamma ) / 2} \lambda \rightarrow \infty$|⁠, |$\widetilde{\boldsymbol {\beta }}_{f, \lambda }$| satisfies both selection consistency and the oracle property (Wang and Leng, 2007).
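
As an illustration of how this penalized criterion can be handed to glmnet through the least-squares approximation of Wang and Leng (2007), the sketch below rewrites the quadratic form around |$\widehat{\boldsymbol {\beta }}_f$| as a squared-error loss and supplies adaptive-lasso weights through penalty.factor. The objects I1, beta_f, n, n1, and gamma are assumed to be available from the preceding steps, and the exact normalization of the objective is not reproduced here; this is a schematic illustration rather than the exact procedure used in the paper.

library(glmnet)

# Least-squares approximation: the quadratic form (beta - beta_f)' Omega (beta - beta_f),
# with Omega the inverse of the estimated covariance of beta_f, is rewritten as a
# squared-error loss so that glmnet can apply an adaptive lasso penalty.
Omega <- (n / n1) * I1                    # inverse of n1 * I1^{-1} / n
R     <- chol(Omega)                      # Omega = t(R) %*% R
ystar <- drop(R %*% beta_f)
w     <- 1 / abs(beta_f)^gamma            # adaptive weights; gamma = 1 in the paper
fit   <- glmnet(R, ystar, family = "gaussian", intercept = FALSE,
                standardize = FALSE, penalty.factor = w)
beta_path <- as.matrix(coef(fit))[-1, ]   # coefficient paths; lambda chosen separately (eg, by BIC)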

2.3 Asymptotic properties of the Cox-OSES estimator

We impose the following regularity conditions:

 
Condition 1

The vector of covariates |$\boldsymbol {X}(t)$| has finite total variation over the time interval |$[0, \tau ]$|⁠, where |$\tau \lt \infty$| denotes the end of the study.

 
Condition 2

The true baseline hazard function satisfies that |$\int _{0}^{\tau }\lambda _{0}(t)dt \lt \infty$|⁠.

 
Condition 3

The probability |$P(\widetilde{T} \ge \tau ) \gt 0$|⁠.

 
Condition 4

The matrix |$\boldsymbol {\Sigma }= \int _{0}^{\tau }\boldsymbol {v}(\boldsymbol {\beta }_{0}, t)s_{0}(\boldsymbol {\beta }_{0}, t)\lambda _{0}(t)dt$| is positive definite, where |$\boldsymbol {v}(\boldsymbol {\beta }, t) = \boldsymbol {s}_{2}(\boldsymbol {\beta }, t) / s_{0}(\boldsymbol {\beta }, t) - \lbrace \boldsymbol {s}_{1}(\boldsymbol {\beta }, t) / s_{0}(\boldsymbol {\beta }, t)\rbrace ^{\otimes 2}$|⁠, |$\boldsymbol {s}_{k}(\boldsymbol {\beta }, t) = E\lbrace I(\widetilde{T} \ge t)e^{\boldsymbol {\beta }^{{ \mathrm{\scriptscriptstyle T} }}\boldsymbol {X}(t)} \boldsymbol {X}(t)^{\otimes k}\rbrace \, (k=0,1,2)$|⁠, and |$\boldsymbol {\beta }_{0}$| is the true value of |$\boldsymbol {\beta }$|⁠.

 
Condition 5

The size of the first block |$n_1$| satisfies that |$n/n_1^2\rightarrow 0$| as |$n\rightarrow \infty$|⁠.

The first 4 conditions ensure that the maximum partial likelihood estimator is consistent, asymptotically normal, and asymptotically efficient (Andersen and Gill, 1982). The last condition allows |$n_1$| to be much smaller than n but requires it to be of larger order than |$n^{1/2}$|. We state below the asymptotic properties of the Cox-OSES estimator.

 
Theorem 1

Under Conditions 1-5, |$n^{1/2}(\widehat{\boldsymbol {\beta }}_{f} - \boldsymbol {\beta }_0)$| converges in distribution to a zero-mean normal random vector with covariance matrix |$\boldsymbol {\Sigma }^{-1}$|⁠.

The theorem is proved in Web Appendix A.

3 SIMULATION STUDIES

We conducted extensive simulation studies to compare the proposed method with the standard method and the DAC methods. We refer to the linearization-based DAC method as DAC-Linearization, and to the weighted DAC methods that use the block size or the information matrix as the weight as DAC-Size and DAC-Information, respectively. We considered one setting involving only time-independent covariates and one setting involving time-dependent covariates. In the first setting, we generated 100 covariates from the multivariate normal distribution with mean 0 and covariance matrix |$\lbrace I(i = j) + 0.2I(i \ne j)\rbrace _{i,j=1,\dots ,100}$|. We set |$\lambda _{0}(t) = t$| and |$\boldsymbol {\beta }= (-\boldsymbol {0.5}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, -\boldsymbol {0.2}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.2}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.5}_{25}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$|, and we generated C from |$\text{Exp}(1.8)$|, where |$\boldsymbol {a}_{q}$| is a |$q \times 1$| vector with all elements being a, and |$\text{Exp}(1.8)$| denotes the exponential distribution with mean |$1/1.8$|. In the second setting, we included 50 time-independent covariates |$\boldsymbol {X}_{\text{ind}}$| and 50 time-dependent covariates |$\boldsymbol {X}_{\text{dep}}$|. We considered 5 time intervals |$R_{1} = [0, 0.05)$|, |$R_{2} = [0.05, 0.1)$|, |$R_{3} = [0.1, 0.15)$|, |$R_{4} = [0.15, 0.2)$|, and |$R_{5} = [0.2, \infty )$|, where the time-dependent covariates |$\boldsymbol {X}_{\text{dep}}$| are constant within each interval but may vary between intervals. We generated |$\lbrace \boldsymbol {X}_{\text{ind}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}, R_{1}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}, R_{2}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}, R_{3}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}, R_{4}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}, R_{5}}^{{ \mathrm{\scriptscriptstyle T} }}\rbrace ^{{ \mathrm{\scriptscriptstyle T} }}$| from the multivariate normal distribution with mean |$\boldsymbol {0}$| and covariance matrix |$\lbrace I(i = j) + 0.2I(i \ne j)\rbrace _{i,j=1,\dots ,300}$|. The final covariates are |$(\boldsymbol {X}_{\text{ind}}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {X}_{\text{dep}}^{{ \mathrm{\scriptscriptstyle T} }})$|, where |$\boldsymbol {X}_{\text{dep}} = \sum _{k=1}^{5}\boldsymbol {X}_{\text{dep}, R_{k}}I(t \in R_{k}).$| We set |$\lambda _{0}(t) = t$|, |$\boldsymbol {\beta }= (-\boldsymbol {0.5}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, -\boldsymbol {0.2}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.2}_{25}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.5}_{25}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$|, and |$C = 0.25$|. In both settings, approximately 70% of the event times were censored. A total sample size of |$10^{6}$| or |$10^{7}$| was considered for both settings.
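
As an illustration of the first setting, the R sketch below generates equicorrelated covariates and event times from the model with |$\lambda _{0}(t) = t$| by inverting the conditional survival function; the function name and the smaller sample size are for illustration only.

# Data generation for the first setting: 100 equicorrelated normal covariates
# (pairwise correlation 0.2), baseline hazard lambda_0(t) = t, and censoring times
# from the exponential distribution with mean 1/1.8.
simulate_setting1 <- function(n, p = 100, rho = 0.2) {
  beta <- c(rep(-0.5, 25), rep(-0.2, 25), rep(0.2, 25), rep(0.5, 25))
  Z0   <- rnorm(n)                                  # shared factor giving equicorrelation
  X    <- sqrt(1 - rho) * matrix(rnorm(n * p), n, p) + sqrt(rho) * Z0
  eta  <- drop(X %*% beta)
  # lambda_0(t) = t gives Lambda_0(t) = t^2 / 2, so inverting
  # S(t | x) = exp{-Lambda_0(t) exp(eta)} yields:
  T_ev  <- sqrt(2 * rexp(n) * exp(-eta))
  C_cen <- rexp(n, rate = 1.8)
  data.frame(time = pmin(T_ev, C_cen), status = as.integer(T_ev <= C_cen), X = I(X))
}
dat <- simulate_setting1(1e5)   # smaller n than in the paper, for illustration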

For the proposed method, we set |$n_{1} = kn$|⁠, where |$k=0.1$|⁠, 0.2, 0.3, or 0.4. For the DAC methods, we set |$M = 5$|⁠, 10, 20, 50, 100, 200, 500, or 1000. The convergence criterion for the Newton-Raphson algorithm was that the absolute relative change in the log-partial likelihood between 2 successive iterations is less than |$10^{-6}$|⁠. For each method, we recorded the total CPU time and calculated the squared error |$(\widehat{\boldsymbol {\beta }} - \boldsymbol {\beta })^{{ \mathrm{\scriptscriptstyle T} }}(\widehat{\boldsymbol {\beta }} - \boldsymbol {\beta })$|⁠, where |$\widehat{\boldsymbol {\beta }}$| denotes the estimate of |$\boldsymbol {\beta }$|⁠.

Figure 1 displays the simulation results based on 1000 replicates. In both settings, the Cox-OSES estimator has almost the same statistical efficiency as the standard estimator but takes much less time to compute. In the second setting with |$n=10^{7}$|⁠, the standard method took approximately 2.04 hours whereas the Cox-OSES method with |$k=0.2$| took only 26.91 minutes, with only 0.07% efficiency loss. Compared with the DAC-Size estimator and DAC-Information estimator, the Cox-OSES estimator always takes less time to compute. When the Cox-OSES estimator with some k and the DAC-Linearization estimator with some M have the same squared error, the Cox-OSES estimator takes less time to compute. When the computation time is the same between the 2 methods, the Cox-OSES estimator has a smaller squared error.

FIGURE 1

Average ratios of squared error and total CPU time for the Cox-OSES and DAC methods to the standard method. The solid curve pertains to Cox-OSES, and the numbers along it indicate the proportion of the first block to the entire dataset. The dashed curve pertains to DAC-Linearization, and the numbers along it indicate the number of blocks. The dotted-dashed and dotted curves pertain to DAC-Size and DAC-Information, respectively. The numbers along the curves indicate the corresponding numbers of blocks.

Figures S1 and S2 in Web Appendix B display the bias and standard error of the proposed estimator in the second setting with |$n = 10^{7}$|⁠. Neither the bias nor the standard error is influenced by the type of covariate, the true coefficient value, or the choice of |$n_{1}$|⁠.

We further compared the proposed method with the standard method and DAC methods in the context of rare events. We adopted the first simulation setting, but with C generated from |$\text{Un}(0, 0.045)$|⁠. Approximately 98% of the event times were censored. We considered |$n = 10^{7}$|⁠. Tables S1 and S2 in Web Appendix B present the simulation results based on 1000 replicates. The basic conclusions are the same as those of the first 2 settings.

The simulation results suggest that with |$n_{1} = 0.2 \times n$|⁠, the proposed estimator achieves the same statistical efficiency as the standard estimator while reducing the computation time by 80%. Nevertheless, the choice of |$n_{1}$| should depend on both n and the event rate. A smaller n or a lower event rate requires a larger |$n_{1}$|⁠.

We also conducted simulation studies to evaluate the variable selection performance of the regularized estimation approach with the Cox-OSES estimator. We adopted the first 2 simulation settings to generate |$\boldsymbol {X}$| and used the same |$\lambda _{0}(t)$|⁠. For each setting, we considered 2 choices of |$\boldsymbol {\beta }$| to reflect the effect sizes. When only time-independent covariates are involved, we set |$\boldsymbol {\beta }= (\boldsymbol {0.1}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.2}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.5}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0}_{94}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$| and generated C from |$\text{Un}(0, 1.5)$| or set |$\boldsymbol {\beta }= (\boldsymbol {0.02}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.04}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0.06}_{2}^{{ \mathrm{\scriptscriptstyle T} }}, \boldsymbol {0}_{94}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$| and generated C from |$\text{Un}(0, 1.6)$|⁠. When time-dependent covariates are involved, we set |$\boldsymbol {\beta }= (0.1, 0.2, 0.5, \boldsymbol {0}_{47}^{{ \mathrm{\scriptscriptstyle T} }}, 0.1, 0.2, 0.5, \boldsymbol {0}_{47}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$| and |$C = 0.75$| or set |$\boldsymbol {\beta }= (0.02, 0.04, 0.06, \boldsymbol {0}_{47}^{{ \mathrm{\scriptscriptstyle T} }}, 0.02, 0.04, 0.06, \boldsymbol {0}_{47}^{{ \mathrm{\scriptscriptstyle T} }})^{{ \mathrm{\scriptscriptstyle T} }}$| and |$C = 0.85$|⁠. In all scenarios, approximately 70% of the event times were censored. We set |$n = 10^{6}$|⁠.

For the proposed method and the DAC methods, we used the same block sizes as in the previous simulation studies. For the DAC methods, we adopted the regularized estimation approach of Wang et al. (2021, 2022). For the standard method, we employed the unified lasso estimation of Wang and Leng (2007). The tuning parameter |$\lambda$| was chosen by minimizing the Bayesian information criterion. For each method, we reported the average true discoveries and false discoveries based on 1000 replicates. As shown in Table S3 in Web Appendix B, the regularized estimation approach with the Cox-OSES estimator performs similarly to the standard method.

4 AN EXAMPLE

We considered the UK Biobank, a cohort study of approximately half a million subjects (Bycroft et al., 2018). We related the age-at-onset of circulatory-system diseases to 1100 genetic variants, of which 50 were selected from each chromosome by univariate screening, with adjustment for sex and the top 10 genetic principal components representing population substructures. To construct the time-to-event outcome, we converted the International Classification of Diseases (version 10) codes to PheWAS codes (Denny et al., 2013) and then calculated the difference between the date of diagnosis and the date of birth (Dey et al., 2022). The dataset contains a total of 487 252 subjects, with an event rate of 39%. We performed variable selection on the 1100 genetic variants using the regularized estimation approach with the Cox-OSES estimator. We randomly chose 20% of the whole dataset as the first block, and we calculated the Cox-OSES estimates and covariance matrix estimates. We repeated this process 10 times and averaged the 10 sets of estimates to adjust for the randomness introduced by data splitting. We then used the averaged estimates for variable selection among the 1100 genetic variants. The final analysis dataset contains 140 genetic variants.

We then randomly chose 10%, 20%, 30%, or 40% of the whole dataset as the first block for the proposed method and varied M from 5 to 1000 for the DAC methods. For each method, we reported the total CPU time and calculated the squared error |$(\widehat{\boldsymbol {\beta }} - \widehat{\boldsymbol {\beta }}_{\text{std}})^{{ \mathrm{\scriptscriptstyle T} }}(\widehat{\boldsymbol {\beta }} - \widehat{\boldsymbol {\beta }}_{\text{std}})$|⁠, where |$\widehat{\boldsymbol {\beta }}$| denotes a DAC estimate or the Cox-OSES estimate and |$\widehat{\boldsymbol {\beta }}_{\text{std}}$| is the standard estimate. We repeated the process one hundred times. All calculations were done on the same computer.

The results are presented in Figure 2. Compared to the standard method, the average squared errors for the Cox-OSES estimates are |$3.61 \times 10^{-5}$|, |$1.21 \times 10^{-5}$|, and |$4.93 \times 10^{-6}$| under |$n_{1} = 0.2 \times n$|, |$0.3 \times n$|, and |$0.4 \times n$|, respectively. Under |$M = 10$| and 50, the average squared errors are |$7.76 \times 10^{-7}$| and |$5.53 \times 10^{-6}$| for the DAC-Linearization estimates, |$1.99 \times 10^{-5}$| and |$1.29 \times 10^{-4}$| for the DAC-Size estimates, and |$1.22 \times 10^{-6}$| and |$1.54 \times 10^{-5}$| for the DAC-Information estimates, respectively. The standard method took about 245.45 seconds to converge. The DAC-Size and DAC-Information methods took more time to compute than the standard method for all choices of M because the Newton-Raphson algorithm sometimes required more iterations to converge on small blocks than on the entire UK Biobank dataset. The DAC-Linearization method took 150 and 128 seconds under |$M = 10$| and 50, respectively. By contrast, the Cox-OSES method took only 30, 57, and 84 seconds under |$n_{1}=0.2 \times n$|, |$0.3 \times n$|, and |$0.4 \times n$|, respectively.

FIGURE 2

Average squared errors and ratios of total CPU time for the Cox-OSES and DAC methods to the standard method. The solid curve pertains to Cox-OSES, and the numbers along it indicate the proportion of the first block to the entire dataset. The dashed curve pertains to the DAC-Linearization, and the numbers along it indicate the number of blocks. The dotted-dashed and dotted curves pertain to DAC-Size and DAC-Information, respectively. The numbers along the curves indicate the corresponding numbers of blocks.

Figure S3 displays the estimated hazard ratios and the corresponding 95% confidence intervals for the selected 140 genetic variants. The most significant genetic variant is 20:8603950:G:C with a P-value of |$1.5 \times 10^{-9}$|⁠. This variant was identified by Vuckovic et al. (2020) to be associated with white blood cell count, plateletcrit, and platelet count. A second variant, 19:11526765:G:T, has a P-value of |$1.0 \times 10^{-8}$|⁠, and it was previously linked to hypertension by Liu et al. (2016). In addition, 3:124337402:G:A, with a P-value of |$2.4 \times 10^{-5}$|⁠, was associated with mean platelet volume and platelet count by Vuckovic et al. (2020). Finally, 4:144122227:A:G has a P-value of |$1.2 \times 10^{-4}$| and was identified by Stanzick et al. (2021) as being associated with blood urea nitrogen levels and glomerular filtration rate.

We divided the study participants into 3 groups based on their polygenic risk scores (The International Schizophrenia Consortium, 2009): Group 1 consists of the lowest 1/3, Group 2 the middle 1/3, and Group 3 the highest 1/3. The polygenic risk scores were constructed as |$\boldsymbol {x}^{{ \mathrm{\scriptscriptstyle T} }} \widehat{\boldsymbol {\beta }}$|, where |$\boldsymbol {x}$| denotes the genetic variants and |$\widehat{\boldsymbol {\beta }}$| is the estimate of the regression parameters. Figure S4 presents the Kaplan-Meier estimates of the disease-free probabilities over time for the 3 groups. The 3 curves are clearly separated, indicating the predictive value of the fitted model.
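
A minimal R sketch of this grouping and of the Kaplan-Meier comparison is given below; the data frame dat, its column names, and beta_hat are placeholders for the fitted objects described above.

library(survival)

# Polygenic risk scores and Kaplan-Meier curves by risk-score tertile.
prs       <- drop(as.matrix(dat[, names(beta_hat)]) %*% beta_hat)
dat$group <- cut(prs, quantile(prs, c(0, 1/3, 2/3, 1)),
                 labels = c("Group 1", "Group 2", "Group 3"), include.lowest = TRUE)
km <- survfit(Surv(time, status) ~ group, data = dat)
plot(km, col = 1:3, xlab = "Age (years)", ylab = "Disease-free probability")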

Following Cox (1972) and Kalbfleisch and Prentice (2002), as well as the coxph function in R and phreg in SAS, we checked the proportional hazards assumption by including an interaction between variant 20:8603950:G:C and |$\log t$|. In this case, expanding the data into the counting-process format is infeasible because of the machine's memory, and the analysis would take more than 1 month for the standard method to complete. We randomly chose 10% and 20% of the whole dataset as the first block for the proposed method and set |$M = 5$| and 10 for the DAC methods to ensure that the size of the first block in the proposed method is the same as the block size in the DAC methods. The DAC-Size and DAC-Information methods took 229 and 127 hours under |$M=5$| and 10, respectively, and the DAC-Linearization method took 148 and 67 hours, respectively. By contrast, the proposed method with |$n_{1}=0.1 \times n$| and |$0.2 \times n$| took only 13 and 48 hours, respectively.
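
For reference, the interaction with |$\log t$| can be specified through the tt argument of coxph in the survival package, as sketched below on a single block of data; snp and the other names are placeholders, and the call is illustrative rather than the exact analysis performed here.

library(survival)

# Proportional hazards check for one variant via an interaction with log(t).
fit_tt <- coxph(Surv(time, status) ~ snp + tt(snp),
                data = block1,
                tt = function(x, t, ...) x * log(t))
summary(fit_tt)   # a significant tt(snp) term indicates non-proportional hazards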

We repeated the process 10 times and averaged the 10 sets of estimates to adjust for the randomness introduced by data splitting. Figure S5 in Web Appendix C displays the 10 sets of estimates for the time-dependent covariate. Figure 3 shows that Cox-OSES and DAC-Linearization yielded similar estimates and similar standard errors. Table 1 summarizes the results for the coefficient of the time-dependent covariate. The proposed method and DAC methods agreed on rejecting the proportional hazards assumption.

FIGURE 3

Parameter estimates and standard error estimates for the proposed method and DAC-Linearization methods in the analysis of the UK Biobank data with a time-dependent covariate.

TABLE 1

Estimation results for the regression parameter of the time-dependent covariate by the proposed method and DAC methods.

Method                   Estimate    Std error    P-value
Cox-OSES
  n1 = 10% × n           -0.0719     0.02880      0.013
  n1 = 20% × n           -0.0710     0.02879      0.014
DAC-Linearization
  M = 5                  -0.0690     0.02867      0.016
  M = 10                 -0.0689     0.02867      0.016
DAC-Size
  M = 5                  -0.0688     0.02870      0.017
  M = 10                 -0.0685     0.02874      0.017
DAC-Information
  M = 5                  -0.0691     0.02867      0.016
  M = 10                 -0.0691     0.02867      0.016

Abbreviation: DAC, divide-and-conquer.

We performed only 1 analysis. The gains in computational efficiency offered by the proposed method will be more important in genome-wide association studies requiring a large number of analyses.

5 REMARKS

The proposed method can incorporate more than 2 blocks of data. Suppose that there are K blocks of data indexed by |${\cal J}_{1}, \dots , {\cal J}_{K}$|, with sizes |$n_{1},\ldots , n_{K}$|, respectively. We obtain |$(\widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)}, \widehat{\boldsymbol {I}}_{(1)})$| using the first block of data. For the remaining |$(K-1)$| blocks, we calculate

$$\widehat{\boldsymbol{\beta}}_{(k)} = \widehat{\boldsymbol{\beta}}_{(1)} + \frac{n_1}{n_k}\, \widehat{\boldsymbol{I}}_{(1)}^{-1} \sum_{i \in {\cal J}_k} \widehat{\boldsymbol{S}}(\boldsymbol{O}_i; \widehat{\boldsymbol{\beta}}_{(1)}, \widehat{\Lambda}_{(1)}), \qquad k = 2, \ldots, K.$$
We then define the Cox-OSES estimator as |$\widehat{\boldsymbol {\beta }}_{f} = \sum _{k=1}^{K} n_{k} \widehat{\boldsymbol {\beta }}_{(k)} / n$|⁠. Mathematically, |$\widehat{\boldsymbol {\beta }}_{f}$| is independent of the choice of K, provided that |$n_1$| is the same, and thus Theorem 1 holds for the Cox-OSES estimator with multiple blocks. However, there are practical advantages to using more than 2 blocks. First, when the dataset is so big that the data have to be divided and stored in multiple machines, the boosting procedure can be carried out simultaneously among the |$(K-1)$| blocks, and only the estimates |$(\widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)}, \widehat{\boldsymbol {I}}_{(1)})$| from the first block analysis are communicated to the remaining blocks. Thus, there is great efficiency in both computation and communication. Second, because only estimates are shared among the K blocks, privacy can be preserved when combining data from different sources. Third, the proposed procedure can be used to incorporate new data in real time without revisiting previous data. By treating the new batch of data as the |$(K+1)$|th block, we can obtain |$\widehat{\boldsymbol {\beta }}_{K+1}$| and include it in the calculation of |$\widehat{\boldsymbol {\beta }}_f$|⁠.
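
Under the block-specific formulas above, the combination step reduces to a single update of the first-block estimate, as the following R sketch indicates; score_sums is assumed to be a list holding, for each remaining block, the sum of its estimated efficient scores, and block_sizes the corresponding block sizes.

# Combining K blocks: block 1 supplies (beta1, Iinv1, n1); each remaining block
# returns only the sum of its estimated efficient scores and its size.
combine_blocks <- function(beta1, Iinv1, n1, score_sums, block_sizes) {
  n <- n1 + sum(block_sizes)
  beta1 + (n1 / n) * drop(Iinv1 %*% Reduce(`+`, score_sums))
}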

When applying the proposed method to multiple data blocks, heterogeneity across blocks may emerge. We may account for heterogeneity by adopting the stratified Cox proportional hazards model. Specifically, we fit the stratified Cox model to the first data block, estimate the efficient score functions for each stratum using the remaining data, and then update the initial coefficient estimates accordingly.

There may exist outlying observations in the dataset. The proposed method is not sensitive to outlying observations due to the rank-based nature of the partial likelihood inference.

We can extend the proposed method to marginal and random-effects models for correlated event times. Suppose that there are n independent clusters, with |$m_{i}$| subjects in the ith cluster. To calculate the Cox-OSES estimator, we randomly select |$n_{1}$| clusters into |${\cal J}_{1}$|⁠. For the marginal Cox model, we replace |$\widehat{\boldsymbol {I}}_{(1)}^{-1}$| with the sandwich variance estimator |$\widehat{\boldsymbol {\Sigma }} = \widehat{\boldsymbol {I}}_{(1)}^{-1}\left[\sum _{i \in {\cal J}_{1}}\left\lbrace \sum _{j=1}^{m_{i}} \widehat{\boldsymbol {S}}(\boldsymbol {O}_{ij}; \widehat{\boldsymbol {\beta }}_{(1)}, \widehat{\Lambda }_{(1)})\right\rbrace ^{\otimes 2} \right] \widehat{\boldsymbol {I}}_{(1)}^{-1}$| in the one-step estimation, where |$\boldsymbol {O}_{ij}$| denotes |$\boldsymbol {O}$| for the jth subject of the ith cluster. The covariance matrix of the resulting estimator is estimated by |$n_{1} \widehat{\boldsymbol {\Sigma }} / n$|⁠.
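
A sketch of this sandwich computation in R is given below, assuming S1 holds the estimated efficient scores of the block-1 subjects (computed in the same way as for the second block in the earlier sketch), cluster holds the block-1 cluster identifiers, and Iinv1, n1, and n are available from that sketch; all names are placeholders.

# Sandwich variance for the marginal Cox model with clustered event times.
B         <- crossprod(rowsum(S1, cluster))   # sum over clusters of squared within-cluster score sums
Sigma_hat <- Iinv1 %*% B %*% Iinv1            # replaces Iinv1 in the one-step update
cov_f     <- (n1 / n) * Sigma_hat             # covariance estimate for the resulting estimator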

Fitting random-effects models is more challenging. Direct maximization of the nonparametric likelihood is unstable, and EM algorithms (Zeng and Lin, 2007) provide a stable solution but converge slowly. In such situations, the proposed method is even more beneficial than in the current setting because the computational burden is higher. In general, efficient score functions do not have explicit expressions but can be obtained as the numerical solution to a Fredholm-type equation (Zeng and Lin, 2007).

We can also extend the proposed approach to the situation of high-dimensional covariates. Specifically, we perform variable selection using the first block of data. After debiasing the initial estimates, we perform a one-step update on the coefficients of the selected variables using the remaining data. The main computation lies in the analysis of the first block of data, which is always less demanding than direct analysis of the whole dataset. We anticipate that uncertainty due to variable selection will not affect the asymptotic efficiency of the Cox-OSES estimator.

Kawaguchi et al. (2021) proposed a forward-backward scan algorithm, which reduces the time complexity from quadratic to linear for fitting the proportional subdistribution hazards model to competing risks data with time-independent covariates. However, their algorithm cannot be used to further improve computational efficiency in our setting because the reduction in time complexity by that algorithm results from the same mechanism of cumulative sum as the standard method for fitting the Cox proportional hazards model with time-independent covariates, which has only linear time complexity, as detailed in Section 2.1.

Acknowledgement

The authors thank the editor, associate editor, and 2 referees for their helpful comments.

FUNDING

This research was supported by the National Institutes of Health grants R01 HL149683 and GM124104.

CONFLICT OF INTEREST

None declared.

DATA AVAILABILITY

UK Biobank data are available from http://www.ukbiobank.ac.uk. A formal application to the UK Biobank is required to use the data.

References

Andersen, P. K. and Gill, R. D. (1982). Cox's regression model for counting processes: a large sample study. The Annals of Statistics, 10, 1100–1120.

Bi, W., Fritsche, L. G., Mukherjee, B., Kim, S., and Lee, S. (2020). A fast and accurate method for genome-wide time-to-event data analysis and its application to UK Biobank. The American Journal of Human Genetics, 107, 222–233.

Bickel, P. J., Klaassen, C. A. J., Ritov, Y., and Wellner, J. A. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Baltimore: Johns Hopkins University Press.

Breslow, N. (1972). Discussion of the paper by D. R. Cox. Journal of the Royal Statistical Society, Series B, 34, 216–217.

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K. et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562, 203–209.

Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society, Series B, 34, 187–202.

Cox, D. R. (1975). Partial likelihood. Biometrika, 62, 269–276.

Denny, J. C., Bastarache, L., Ritchie, M. D., Carroll, R. J., Zink, R., Mosley, J. D. et al. (2013). Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nature Biotechnology, 31, 1102–1111.

Dey, R., Zhou, W., Kiiskinen, T., Havulinna, A., Elliott, A., Karjalainen, J. et al. (2022). Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks. Nature Communications, 13, 5437.

Friedman, J. H., Hastie, T., and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33, 1–22.

Kalbfleisch, J. D. and Prentice, R. L. (2002). The Statistical Analysis of Failure Time Data. New York: John Wiley & Sons.

Kawaguchi, E. S., Shen, J. I., Suchard, M. A., and Li, G. (2021). Scalable algorithms for large competing risks data. Journal of Computational and Graphical Statistics, 30, 685–693.

Liu, C., Kraja, A. T., Smith, J. A., Brody, J. A., Franceschini, N., Bis, J. C. et al. (2016). Meta-analysis identifies common and rare variants influencing blood pressure and overlapping with metabolic trait loci. Nature Genetics, 48, 1162–1170.

Stanzick, K. J., Li, Y., Schlosser, P., Gorski, M., Wuttke, M., Thomas, L. F. et al. (2021). Discovery and prioritization of variants and genes for kidney function in ≳1.2 million individuals. Nature Communications, 12, 4350.

The International Schizophrenia Consortium (2009). Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature, 460, 748–752.

Vuckovic, D., Bao, E. L., Akbari, P., Lareau, C. A., Mousas, A., Jiang, T. et al. (2020). The polygenic and monogenic basis of blood traits and diseases. Cell, 182, 1214–1231.

Wang, H. and Leng, C. (2007). Unified lasso estimation by least squares approximation. Journal of the American Statistical Association, 102, 1039–1048.

Wang, W., Lu, S.-E., Cheng, J. Q., Xie, M., and Kostis, J. B. (2022). Multivariate survival analysis in big data: a divide-and-combine approach. Biometrics, 78, 852–866.

Wang, Y., Hong, C., Palmer, N., Di, Q., Schwartz, J., Kohane, I. et al. (2021). A fast divide-and-conquer sparse Cox regression. Biostatistics, 22, 381–401.

Zeng, D. and Lin, D. Y. (2007). Maximum likelihood estimation in semiparametric regression models with censored data. Journal of the Royal Statistical Society, Series B, 69, 507–564.

Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101, 1418–1429.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
