Prediction of Vitamin D Deficiency in Older Adults: The Role of Machine Learning Models Free

Characteristics of participants by categories of serum 25(OH)D concentration

Characteristic^a	25(OH)D, nmol/L
	<25 (n = 91)	<50 (n = 1270)	≥50 (n = 3836)
Age (years)	63.2 ± 9.2	66.6 ± 8.1	65.4 ± 8.8
Female sex	36 (40)	598 (47)	1540 (40)
Ethnicity^b
European/Other	33 (36)	829 (65)	3422 (89)
Maori (%)	7 (8)	105 (8)	178 (5)
Pacific (%)	11 (12)	156 (12)	167 (4)
South Asian (%)	40 (44)	180 (14)	69 (2)
Marital status^b
Married/partnered or widow/widower	64 (70)	984 (77)	3174 (83)
Separated/divorced and living without partner	15 (16)	193 (15)	451 (12)
Never been married/partnered	12 (13)	93 (7)	211 (6)
2013 NZ Deprivation Index quintile^b
1 (least deprived)	15 (16)	355 (28)	1592 (42)
2	11 (12)	241 (19)	730 (19)
3	20 (22)	218 (17)	648 (17)
4	16 (18)	166 (13)	411 (11)
5 (most deprived)	29 (32)	290 (23)	455 (12)
Self-rated general health
Excellent or very good (%)	53 (58)	823 (65)	3039 (79)
Good or fair/poor (%)	38 (42)	447 (35)	797 (21)
Diabetes mellitus (%)	19 (21)	228 (18)	394 (10)
Cardiovascular disease (%)	14 (15)	208 (16)	580 (15)
Urgent treatment for asthma or chronic bronchitis/emphysema in past 12 months	5 (5)	53 (4)	64 (2)
Antihypertensive medication (%)	32 (35)	540 (43)	1360 (35)
Taking vitamin D supplements (%)	2 (2)	43 (3)	370 (10)
Sun exposure (hours/day)	1.3 ± 1.4	1.6 ± 1.4	2.2 ± 1.6
Total physical activity (hours/week)	21.6 ± 7.4	20.2 ± 14.5	22.5 ± 15.1
Vigorous physical activity (hours/week)	0.9 ± 1.8	1.6 ± 2.9	2.5 ± 3.8
Current smoker (%)	6 (7)	108 (9)	212 (6)
No alcohol intake in past 12 months (%)	53 (58)	299 (24)	412 (11)
Body mass index (kg/m²)	29.2 ± 6.5	29.6 ± 6.1	28.1 ± 4.7
Systolic blood pressure (mmHg)	135.6 ± 15.9	138.9 ± 18.3	139.0 ± 18.8
Albumin (g/L)	43.1 ± 2.4	43.5 ± 2.3	43.7 ± 2.2
Creatinine (nmol/L)	94.0 ± 27.4	91.7 ± 19.2	93.1 ± 16.9

Characteristic^a	25(OH)D, nmol/L
	<25 (n = 91)	<50 (n = 1270)	≥50 (n = 3836)
Age (years)	63.2 ± 9.2	66.6 ± 8.1	65.4 ± 8.8
Female sex	36 (40)	598 (47)	1540 (40)
Ethnicity^b
European/Other	33 (36)	829 (65)	3422 (89)
Maori (%)	7 (8)	105 (8)	178 (5)
Pacific (%)	11 (12)	156 (12)	167 (4)
South Asian (%)	40 (44)	180 (14)	69 (2)
Marital status^b
Married/partnered or widow/widower	64 (70)	984 (77)	3174 (83)
Separated/divorced and living without partner	15 (16)	193 (15)	451 (12)
Never been married/partnered	12 (13)	93 (7)	211 (6)
2013 NZ Deprivation Index quintile^b
1 (least deprived)	15 (16)	355 (28)	1592 (42)
2	11 (12)	241 (19)	730 (19)
3	20 (22)	218 (17)	648 (17)
4	16 (18)	166 (13)	411 (11)
5 (most deprived)	29 (32)	290 (23)	455 (12)
Self-rated general health
Excellent or very good (%)	53 (58)	823 (65)	3039 (79)
Good or fair/poor (%)	38 (42)	447 (35)	797 (21)
Diabetes mellitus (%)	19 (21)	228 (18)	394 (10)
Cardiovascular disease (%)	14 (15)	208 (16)	580 (15)
Urgent treatment for asthma or chronic bronchitis/emphysema in past 12 months	5 (5)	53 (4)	64 (2)
Antihypertensive medication (%)	32 (35)	540 (43)	1360 (35)
Taking vitamin D supplements (%)	2 (2)	43 (3)	370 (10)
Sun exposure (hours/day)	1.3 ± 1.4	1.6 ± 1.4	2.2 ± 1.6
Total physical activity (hours/week)	21.6 ± 7.4	20.2 ± 14.5	22.5 ± 15.1
Vigorous physical activity (hours/week)	0.9 ± 1.8	1.6 ± 2.9	2.5 ± 3.8
Current smoker (%)	6 (7)	108 (9)	212 (6)
No alcohol intake in past 12 months (%)	53 (58)	299 (24)	412 (11)
Body mass index (kg/m²)	29.2 ± 6.5	29.6 ± 6.1	28.1 ± 4.7
Systolic blood pressure (mmHg)	135.6 ± 15.9	138.9 ± 18.3	139.0 ± 18.8
Albumin (g/L)	43.1 ± 2.4	43.5 ± 2.3	43.7 ± 2.2
Creatinine (nmol/L)	94.0 ± 27.4	91.7 ± 19.2	93.1 ± 16.9

Abbreviation: 25(OH)D, deseasonalized 25-hydroxyvitamin D.

^aColumn % for categorical variables and mean ± SD for continuous variables.

^bSome percentages do not total 100% because of their rounding.

Table 1.

Characteristics of participants by categories of serum 25(OH)D concentration

Characteristic^a	25(OH)D, nmol/L
	<25 (n = 91)	<50 (n = 1270)	≥50 (n = 3836)
Age (years)	63.2 ± 9.2	66.6 ± 8.1	65.4 ± 8.8
Female sex	36 (40)	598 (47)	1540 (40)
Ethnicity^b
European/Other	33 (36)	829 (65)	3422 (89)
Maori (%)	7 (8)	105 (8)	178 (5)
Pacific (%)	11 (12)	156 (12)	167 (4)
South Asian (%)	40 (44)	180 (14)	69 (2)
Marital status^b
Married/partnered or widow/widower	64 (70)	984 (77)	3174 (83)
Separated/divorced and living without partner	15 (16)	193 (15)	451 (12)
Never been married/partnered	12 (13)	93 (7)	211 (6)
2013 NZ Deprivation Index quintile^b
1 (least deprived)	15 (16)	355 (28)	1592 (42)
2	11 (12)	241 (19)	730 (19)
3	20 (22)	218 (17)	648 (17)
4	16 (18)	166 (13)	411 (11)
5 (most deprived)	29 (32)	290 (23)	455 (12)
Self-rated general health
Excellent or very good (%)	53 (58)	823 (65)	3039 (79)
Good or fair/poor (%)	38 (42)	447 (35)	797 (21)
Diabetes mellitus (%)	19 (21)	228 (18)	394 (10)
Cardiovascular disease (%)	14 (15)	208 (16)	580 (15)
Urgent treatment for asthma or chronic bronchitis/emphysema in past 12 months	5 (5)	53 (4)	64 (2)
Antihypertensive medication (%)	32 (35)	540 (43)	1360 (35)
Taking vitamin D supplements (%)	2 (2)	43 (3)	370 (10)
Sun exposure (hours/day)	1.3 ± 1.4	1.6 ± 1.4	2.2 ± 1.6
Total physical activity (hours/week)	21.6 ± 7.4	20.2 ± 14.5	22.5 ± 15.1
Vigorous physical activity (hours/week)	0.9 ± 1.8	1.6 ± 2.9	2.5 ± 3.8
Current smoker (%)	6 (7)	108 (9)	212 (6)
No alcohol intake in past 12 months (%)	53 (58)	299 (24)	412 (11)
Body mass index (kg/m²)	29.2 ± 6.5	29.6 ± 6.1	28.1 ± 4.7
Systolic blood pressure (mmHg)	135.6 ± 15.9	138.9 ± 18.3	139.0 ± 18.8
Albumin (g/L)	43.1 ± 2.4	43.5 ± 2.3	43.7 ± 2.2
Creatinine (nmol/L)	94.0 ± 27.4	91.7 ± 19.2	93.1 ± 16.9

Characteristic^a	25(OH)D, nmol/L
	<25 (n = 91)	<50 (n = 1270)	≥50 (n = 3836)
Age (years)	63.2 ± 9.2	66.6 ± 8.1	65.4 ± 8.8
Female sex	36 (40)	598 (47)	1540 (40)
Ethnicity^b
European/Other	33 (36)	829 (65)	3422 (89)
Maori (%)	7 (8)	105 (8)	178 (5)
Pacific (%)	11 (12)	156 (12)	167 (4)
South Asian (%)	40 (44)	180 (14)	69 (2)
Marital status^b
Married/partnered or widow/widower	64 (70)	984 (77)	3174 (83)
Separated/divorced and living without partner	15 (16)	193 (15)	451 (12)
Never been married/partnered	12 (13)	93 (7)	211 (6)
2013 NZ Deprivation Index quintile^b
1 (least deprived)	15 (16)	355 (28)	1592 (42)
2	11 (12)	241 (19)	730 (19)
3	20 (22)	218 (17)	648 (17)
4	16 (18)	166 (13)	411 (11)
5 (most deprived)	29 (32)	290 (23)	455 (12)
Self-rated general health
Excellent or very good (%)	53 (58)	823 (65)	3039 (79)
Good or fair/poor (%)	38 (42)	447 (35)	797 (21)
Diabetes mellitus (%)	19 (21)	228 (18)	394 (10)
Cardiovascular disease (%)	14 (15)	208 (16)	580 (15)
Urgent treatment for asthma or chronic bronchitis/emphysema in past 12 months	5 (5)	53 (4)	64 (2)
Antihypertensive medication (%)	32 (35)	540 (43)	1360 (35)
Taking vitamin D supplements (%)	2 (2)	43 (3)	370 (10)
Sun exposure (hours/day)	1.3 ± 1.4	1.6 ± 1.4	2.2 ± 1.6
Total physical activity (hours/week)	21.6 ± 7.4	20.2 ± 14.5	22.5 ± 15.1
Vigorous physical activity (hours/week)	0.9 ± 1.8	1.6 ± 2.9	2.5 ± 3.8
Current smoker (%)	6 (7)	108 (9)	212 (6)
No alcohol intake in past 12 months (%)	53 (58)	299 (24)	412 (11)
Body mass index (kg/m²)	29.2 ± 6.5	29.6 ± 6.1	28.1 ± 4.7
Systolic blood pressure (mmHg)	135.6 ± 15.9	138.9 ± 18.3	139.0 ± 18.8
Albumin (g/L)	43.1 ± 2.4	43.5 ± 2.3	43.7 ± 2.2
Creatinine (nmol/L)	94.0 ± 27.4	91.7 ± 19.2	93.1 ± 16.9

Abbreviation: 25(OH)D, deseasonalized 25-hydroxyvitamin D.

^aColumn % for categorical variables and mean ± SD for continuous variables.

^bSome percentages do not total 100% because of their rounding.

Prediction of 25(OH)D < 50 nmol

Table 2 shows the discriminative performances of models in predicting 25(OH)D <50 nmol/L in the test set. Of the simple models, the 5 ML ones had AUC values similar to the reference model (Figure 1a and Table 2 (24)). The AUC of the reference model was 0.68 (95% CI 0.65-0.71) and those of ML models ranged from 0.67 (3 models) to 0.69 (for random forest and gradient boosted decision tree). Compared with the reference model, nearly all ML models yielded an improvement in discrimination as measured by IDI (maximum IDI = 0.0109; P < .0001). As demonstrated by the decision curve analysis (Figure 1b (24)), over the range of threshold probabilities, the net benefit for the ML models was similar to that for the conventional model.

Table 2.

Discrimination performance in predicting serum 25(OH)D <50 nmol/L and <25 nmol/L in the test set of the reference and machine learning models

Model	25(OH)D < 50 nmol/L				25(OH)D < 25 nmol/L
	AUC (95% CI)	P value	IDI (95% CI)^b	P value	AUC (95% CI)	P value	IDI (95% CI)^b	P value
Simple models
Reference model^a	0.68 (0.65-0.71)	Reference	Reference	Reference	0.71 (0.59-0.83)	Reference	Reference	Reference
Lasso regression	0.67 (0.64-0.70)	.33	10.9 (7.6-14.2)	<.0001	0.76 (0.65-0.87)	.02	25.2 (–14.6-65.0)	.21
Elastic net	0.67 (0.64-0.70)	.34	10.4 (7.1-13.6)	<.0001	0.76 (0.65-0.87)	.02	26.2 (–13.1-65.5)	.19
Random forest	0.69 (0.66-0.72)	.30	6.3 (0.3-12.3)	.04	0.85 (0.77-0.92)	.005	19.1 (–16.5-54.7)	.29
Gradient boosted decision tree	0.69 (0.66-0.72)	.13	5.6 (1.2-10.0)	.01	0.77 (0.66-0.87)	.10	5.1 (–34.9-45.1)	.80
Dense neural net	0.67 (0.64-0.70)	.24	–0.3 (–7.7-7.0)	.93	0.76 (0.64-0.87)	.15	0.4 (–39.1-39.6)	.99
Augmented models
Reference model^a	0.72 (0.69-0.75)	Reference	Reference	Reference	0.81 (0.71-0.91)	Reference	Reference	Reference
Lasso regression	0.73 (0.70-0.75)	.14	9.0 (4.8-13.2)	<.0001	0.90 (0.82-0.96)	.002	8.9 (–20.3-38.1)	.55
Elastic net	0.73 (0.70-0.75)	.12	11.1 (7.0-15.2)	<.0001	0.93 (0.90-0.96)	.002	18.2 (–36.0-72.3)	.51
Random forest	0.72 (0.69-0.74)	.64	2.1 (–6.4-10.6)	.62	0.92 (0.88-0.95)	.009	21.4 (–14.0-56.8)	.24
Gradient boosted decision tree	0.72 (0.70-0.75)	.74	4.8 (–1.5-11.1)	.13	0.90 (0.85-0.95)	.007	13.1 (–34.2-60.3)	.59
Dense neural net	0.73 (0.70-0.75)	.16	4.3 (–2.6-11.2)	.22	0.85 (0.78-0.92)	.22	4.5 (–17.9-26.9)	.69

Model	25(OH)D < 50 nmol/L				25(OH)D < 25 nmol/L
	AUC (95% CI)	P value	IDI (95% CI)^b	P value	AUC (95% CI)	P value	IDI (95% CI)^b	P value
Simple models
Reference model^a	0.68 (0.65-0.71)	Reference	Reference	Reference	0.71 (0.59-0.83)	Reference	Reference	Reference
Lasso regression	0.67 (0.64-0.70)	.33	10.9 (7.6-14.2)	<.0001	0.76 (0.65-0.87)	.02	25.2 (–14.6-65.0)	.21
Elastic net	0.67 (0.64-0.70)	.34	10.4 (7.1-13.6)	<.0001	0.76 (0.65-0.87)	.02	26.2 (–13.1-65.5)	.19
Random forest	0.69 (0.66-0.72)	.30	6.3 (0.3-12.3)	.04	0.85 (0.77-0.92)	.005	19.1 (–16.5-54.7)	.29
Gradient boosted decision tree	0.69 (0.66-0.72)	.13	5.6 (1.2-10.0)	.01	0.77 (0.66-0.87)	.10	5.1 (–34.9-45.1)	.80
Dense neural net	0.67 (0.64-0.70)	.24	–0.3 (–7.7-7.0)	.93	0.76 (0.64-0.87)	.15	0.4 (–39.1-39.6)	.99
Augmented models
Reference model^a	0.72 (0.69-0.75)	Reference	Reference	Reference	0.81 (0.71-0.91)	Reference	Reference	Reference
Lasso regression	0.73 (0.70-0.75)	.14	9.0 (4.8-13.2)	<.0001	0.90 (0.82-0.96)	.002	8.9 (–20.3-38.1)	.55
Elastic net	0.73 (0.70-0.75)	.12	11.1 (7.0-15.2)	<.0001	0.93 (0.90-0.96)	.002	18.2 (–36.0-72.3)	.51
Random forest	0.72 (0.69-0.74)	.64	2.1 (–6.4-10.6)	.62	0.92 (0.88-0.95)	.009	21.4 (–14.0-56.8)	.24
Gradient boosted decision tree	0.72 (0.70-0.75)	.74	4.8 (–1.5-11.1)	.13	0.90 (0.85-0.95)	.007	13.1 (–34.2-60.3)	.59
Dense neural net	0.73 (0.70-0.75)	.16	4.3 (–2.6-11.2)	.22	0.85 (0.78-0.92)	.22	4.5 (–17.9-26.9)	.69

Abbreviations: 25(OH)D = deseasonalized 25-hydroxyvitamin D (nmol/L); AUC = area under ROC curve.

^aLogistic regression.

^b×10^–3.

Table 2.

Discrimination performance in predicting serum 25(OH)D <50 nmol/L and <25 nmol/L in the test set of the reference and machine learning models

Model	25(OH)D < 50 nmol/L				25(OH)D < 25 nmol/L
	AUC (95% CI)	P value	IDI (95% CI)^b	P value	AUC (95% CI)	P value	IDI (95% CI)^b	P value
Simple models
Reference model^a	0.68 (0.65-0.71)	Reference	Reference	Reference	0.71 (0.59-0.83)	Reference	Reference	Reference
Lasso regression	0.67 (0.64-0.70)	.33	10.9 (7.6-14.2)	<.0001	0.76 (0.65-0.87)	.02	25.2 (–14.6-65.0)	.21
Elastic net	0.67 (0.64-0.70)	.34	10.4 (7.1-13.6)	<.0001	0.76 (0.65-0.87)	.02	26.2 (–13.1-65.5)	.19
Random forest	0.69 (0.66-0.72)	.30	6.3 (0.3-12.3)	.04	0.85 (0.77-0.92)	.005	19.1 (–16.5-54.7)	.29
Gradient boosted decision tree	0.69 (0.66-0.72)	.13	5.6 (1.2-10.0)	.01	0.77 (0.66-0.87)	.10	5.1 (–34.9-45.1)	.80
Dense neural net	0.67 (0.64-0.70)	.24	–0.3 (–7.7-7.0)	.93	0.76 (0.64-0.87)	.15	0.4 (–39.1-39.6)	.99
Augmented models
Reference model^a	0.72 (0.69-0.75)	Reference	Reference	Reference	0.81 (0.71-0.91)	Reference	Reference	Reference
Lasso regression	0.73 (0.70-0.75)	.14	9.0 (4.8-13.2)	<.0001	0.90 (0.82-0.96)	.002	8.9 (–20.3-38.1)	.55
Elastic net	0.73 (0.70-0.75)	.12	11.1 (7.0-15.2)	<.0001	0.93 (0.90-0.96)	.002	18.2 (–36.0-72.3)	.51
Random forest	0.72 (0.69-0.74)	.64	2.1 (–6.4-10.6)	.62	0.92 (0.88-0.95)	.009	21.4 (–14.0-56.8)	.24
Gradient boosted decision tree	0.72 (0.70-0.75)	.74	4.8 (–1.5-11.1)	.13	0.90 (0.85-0.95)	.007	13.1 (–34.2-60.3)	.59
Dense neural net	0.73 (0.70-0.75)	.16	4.3 (–2.6-11.2)	.22	0.85 (0.78-0.92)	.22	4.5 (–17.9-26.9)	.69

Model	25(OH)D < 50 nmol/L				25(OH)D < 25 nmol/L
	AUC (95% CI)	P value	IDI (95% CI)^b	P value	AUC (95% CI)	P value	IDI (95% CI)^b	P value
Simple models
Reference model^a	0.68 (0.65-0.71)	Reference	Reference	Reference	0.71 (0.59-0.83)	Reference	Reference	Reference
Lasso regression	0.67 (0.64-0.70)	.33	10.9 (7.6-14.2)	<.0001	0.76 (0.65-0.87)	.02	25.2 (–14.6-65.0)	.21
Elastic net	0.67 (0.64-0.70)	.34	10.4 (7.1-13.6)	<.0001	0.76 (0.65-0.87)	.02	26.2 (–13.1-65.5)	.19
Random forest	0.69 (0.66-0.72)	.30	6.3 (0.3-12.3)	.04	0.85 (0.77-0.92)	.005	19.1 (–16.5-54.7)	.29
Gradient boosted decision tree	0.69 (0.66-0.72)	.13	5.6 (1.2-10.0)	.01	0.77 (0.66-0.87)	.10	5.1 (–34.9-45.1)	.80
Dense neural net	0.67 (0.64-0.70)	.24	–0.3 (–7.7-7.0)	.93	0.76 (0.64-0.87)	.15	0.4 (–39.1-39.6)	.99
Augmented models
Reference model^a	0.72 (0.69-0.75)	Reference	Reference	Reference	0.81 (0.71-0.91)	Reference	Reference	Reference
Lasso regression	0.73 (0.70-0.75)	.14	9.0 (4.8-13.2)	<.0001	0.90 (0.82-0.96)	.002	8.9 (–20.3-38.1)	.55
Elastic net	0.73 (0.70-0.75)	.12	11.1 (7.0-15.2)	<.0001	0.93 (0.90-0.96)	.002	18.2 (–36.0-72.3)	.51
Random forest	0.72 (0.69-0.74)	.64	2.1 (–6.4-10.6)	.62	0.92 (0.88-0.95)	.009	21.4 (–14.0-56.8)	.24
Gradient boosted decision tree	0.72 (0.70-0.75)	.74	4.8 (–1.5-11.1)	.13	0.90 (0.85-0.95)	.007	13.1 (–34.2-60.3)	.59
Dense neural net	0.73 (0.70-0.75)	.16	4.3 (–2.6-11.2)	.22	0.85 (0.78-0.92)	.22	4.5 (–17.9-26.9)	.69

Abbreviations: 25(OH)D = deseasonalized 25-hydroxyvitamin D (nmol/L); AUC = area under ROC curve.

^aLogistic regression.

^b×10^–3.

When we used augmented models instead, all performance indices were higher (Table 2 and Figs. 1A and 1B). AUC values were similar across models, ranging from 0.72 to 0.73. Lasso regression and elastic net regression yielded improvement in discrimination with respect to the reference model, as measured by IDI (0.009 and 0.0111, respectively; both P < .001). Across a wide range of thresholds, the net benefit was similar for all models but higher than for the assumptions that none or all were vitamin D deficient (Fig. 1B).

Figure 1.

Prediction performance of augmented models for detection, in the test set, of serum 25(OH)D (deseasonalized 25-hydroxyvitamin D) <50 nmol/L: (A) ROC curves. (B) Decision curves. For decision curves, the net benefit associated with not testing anyone for vitamin D deficiency and testing all are given by the black horizontal lines (net benefit = 0) and gray angled lines, respectively. Insets are zooms of the curves.

Prediction of 25(OH)D <25 nmol

When we modeled 25(OH)D <25 nmol/L as an outcome, the discrimination performance was higher for all models (Table 2). Of the simple models, the ML ones had AUC point estimates—ranging from 0.76 (for regression-based models and dense neural network) to 0.85 for random forest—that were all higher than that of the conventional model (0.71) (Figure 2a and Table 2 (24)). Net benefit was highest for random forest at threshold probabilities of <4% (up to 0.004 [4 per 1000] higher than the conventional model) and, for probabilities >4%, was highest for elastic net regression (Figure 2b (24)).

In comparison, the augmented models, especially the ML ones, yielded even more accurate predictions (Table 2 and Figs. 2A and 2B). The elastic net model had the highest AUC (0.93), which was higher than that of the reference model (AUC = 0.81; P = .002 for difference). At every threshold, net benefit was highest for ML (at least 1 ML model; particularly for elastic net regression and random forest; Fig. 2B).

Figure 2.

Prediction performance of augmented models for detection, in the test set, of 25(OH)D (deseasonalized 25-hydroxyvitamin D) <25 nmol/L: (A) ROC curves. (B) Decision curves. For decision curves, the net benefit associated with not testing anyone for vitamin D deficiency and testing all are given by the black horizontal lines (net benefit = 0) and gray angled lines, respectively.

Positive Prediction Sample

In the sample that we predicted to have the outcome, mean 25(OH)D was lowest at higher probability thresholds (Figure 3 (24)). For both outcomes, across thresholds, 25(OH)D was lowest for the consensus model, particularly with augmented models. For example, at a threshold of 11% for predicting 25(OH)D <25 with augmented models, mean 25(OH)D was ~31 nmol/L for the consensus model and ~45 nmol/L for the reference model (Figure 3 (24)).

Selected Features of Models

The most important predictors identified by lasso regression, random forest, and gradient boosted decision tree models for each outcome are summarized elsewhere (Figure 4 (24)) for the simple models and Fig. 3 for the augmented models). In all figures, the most important predictor was South Asian ethnicity. For both outcomes, some predictors were consistently of high importance across all 3 augmented models. For the 25(OH)D <50 nmol/L outcome (Fig. 3A), these were sun exposure (second most important in all 3 models), vigorous physical activity (1 of the top 5 features in all 3 models), BMI, and vitamin D supplements. For the 25(OH)D <25 nmol/L outcome (Fig. 3B), these were no alcohol intake in past 12 months (1 of the top 5 features in all 3 models) and creatinine.

Figure 3.

Variable importance of predictors in the augmented models for predicting low serum 25(OH)D (deseasonalized 25-hydroxyvitamin D). (A) <50 nmol/L and. (B) <25 nmol/L. The variable importance is a scaled measure to have a maximum value of 100.

Impact of Feature Selection

We repeated the analyses for Table 2 for the augmented models, but with feature selection: using only the most important predictors identified by Boruta (Fig. 4). For the <50 nmol/L outcome (Fig. 4A), these were all predictors except for 5: systolic BP, hypertension, never been married/partnered, antihypertensive treatment, and diabetes. Compared with using all features (no feature selection), applying feature selection yielded similar AUC values for all models (maximum difference = 0.01), including the reference model.

Figure 4.

Discrimination performance for predicting low serum 25(OH)D (deseasonalized 25-hydroxyvitamin D) in the test set using augmented models with and without selection of input features using Boruta. (A) <50 nmol/L. (B) <25 nmol/L. AUC, area under curve. Error bars represent 95% CI.

For the <25 nmol/L outcome (Fig. 4B), Boruta selected 10 features: South Asian ethnicity, BMI, albumin, female sex, creatinine, Pacific ethnicity, no alcohol intake in past 12 months, diabetes, hypertension, and antihypertensive treatment. The dense neural network model performed better with feature selection than without it (AUC improvement = 0.05). For other models, AUC was mostly similar with and without feature selection.

Discussion

In 5106 community-resident adults, we predicted vitamin D deficiency using a conventional approach (logistic regression) and 5 modern ML techniques: lasso regression, elastic net regression, random forest, gradient boosted decision tree, and dense neural network. Compared with the conventional approach, these ML models demonstrated a similar performance in predicting 25(OH)D <50 nmol/L. By contrast, the ML models predicted 25(OH)D <25 nmol/L with a higher AUC—indicating an improvement in diagnostic accuracy. For this outcome, the utility of the ML models was reinforced by their greater net benefit across a range of threshold probabilities (or clinical preferences to balance false positives and false negatives). To our knowledge, this is the first study to compare ML and logistic regression for predicting vitamin D deficiency.

The improvement in discrimination with ML in predicting 25(OH)D <25 nmol/L is consistent with that reported in 2 studies that predicted 25(OH)D (not deseasonalized) with linear regression as the conventional model (19, 38). One of these studies modeled 25(OH)D as a continuous variable only (38). The other found that, compared with linear regression, ML (support vector regression) yielded a significantly higher AUC value (by 0.14) for detecting 25(OH)D <50 nmol/L based on Diasorin Liaison assay measurement. When analyses were repeated using liquid chromatography–tandem mass spectrometry, which yielded more accurate measures but a lower prevalence (13%) of 25(OH)D values <50 nmol/L, the AUC increase was 0.07 and not statistically significant (19). This increase is near the middle of the range of improvements we observed for predicting 25(OH)D <25 nmol/L with augmented models of between 0.04 (for dense neural network) and 0.12 (for elastic net regression) (Table 2). Our study corroborates the promise of additive benefit from ML suggested by these earlier studies (19, 38) and extends them by demonstrating higher diagnostic accuracy of a wider range of ML algorithms with a different conventional model (logistic regression), set of predictors, and outcome (low deseasonalized 25(OH)D).

The most likely explanation for the improvement in diagnostic accuracy of ML models is that this statistical method proficiently handles high-order interactions between predictors and nonlinear associations with the outcome (17, 18). If this is a key reason, this implies that, in their associations with 25(OH)D, the predictors of vitamin D deficiency may have a complex interrelationship. Some evidence of this is that the regression coefficients or performance of 25(OH)D models differ by age (consistent with the notion that cutaneous vitamin D production decreases with age) (22) and by sex (10). Another explanation is that our ML models utilized rigorous techniques (eg, regularization, cross-validation, and dropout) to mitigate overfitting, which is often problematic in traditional models.

Another novel feature of our models was that the 25(OH)D variable used to define our outcome was deseasonalized. This is important since 25(OH)D varies by season (26). Considering the influence of latitude-related factors (eg, cloud cover, ozone, and altitude) on 25(OH)D production (39), another original aspect of our study was geographical location. Whereas our study was conducted in northern New Zealand (latitude 37° S), which has a subtropical climate, most adult vitamin D deficiency models (conventional or ML) were based on populations in the United States and Northern Europe (7-14).

Two European studies reported that ML (random forest or feed forward artificial neural network) models predicted 25(OH)D <25 nmol/L (not deseasonalized) with AUC values of 0.677 and 0.835 (21, 22). We build on this prior research by demonstrating that, using a different set of predictors in a subtropical climate, deseasonalized 25(OH)D <25 nmol/L can be predicted with higher AUCs of up to 0.91 (with elastic net regression or random forest models), which constitutes high diagnostic accuracy (40).

As our model algorithms can be stored in computer files and transferred to other computers, they could be applied to identify older adults with vitamin D deficiency in different settings. First, they could help select those to be recruited into RCTs of people with vitamin D deficiency (eg, 25(OH)D <25 nmol/L (1, 2)); useful given the previously noted barriers to 25(OH)D testing (costly, time-consuming, and ethical considerations). To ensure that those selected have a high likelihood of actually being vitamin D deficient, the positive predictive value would need to be high. This could be achieved not only by screening in populations with a relatively high prevalence of vitamin D deficiency but by using a consensus model and selecting a high prediction threshold to identify cases (Figure 3 (24)). At a high threshold, the corresponding sensitivity would be low, indicating that, to compensate for missed cases, many patients would need to be screened to select patients for RCT inclusion. However, doing this is made feasible by implementing our simple models, which can be used to make predictions from large patient databases without needing to collect in-person data. This is another novel aspect of our study, given that the models in past studies contain predictors (eg, sun exposure) that are not routinely collected in clinical practice.

A second setting could be in studies aiming to perform a subgroup analysis of participants with vitamin D deficiency or to use the latter as a variable, but where 25(OH)D testing is a barrier (eg, cost prohibitive or otherwise infeasible).

Although our primary goal is to facilitate vitamin D research, a third setting for using the models is to identify patients in clinical practice who could benefit from vitamin D supplementation (7, 27). Clinician-ordered patient 25(OH)D testing has increased in many countries but much of this has been considered unnecessary, with a sizeable fraction of 25(OH)D assays testing negative for vitamin D deficiency, however defined (27). However, our models would add benefit in making the decision to undergo confirmatory 25(OH)D tests. For example, at a threshold probability of 5%, where the net benefit for our augmented random forest model for predicting 25(OH)D <25 nmol/L is ~0.041 higher than the test all (with 25(OH)D assays) patients’ plan (Fig. 2B), the net avoided false positives per 100 patients would be 100 × ~0.041/odds (0.05) = ~78 (35). That is, to avoid 1 unnecessary 25(OH)D test, the prediction model should be applied to ~1.28 patients. Thus, our model would help to avoid unnecessary testing. Through the adoption of more advanced computerized decision support systems for clinical outcomes (41), the opportunities to predict vitamin D status in this setting may be realized.

Our study participants were recruited from the community, which increases the generalizability of our findings. Another study strength is that we used the criterion standard laboratory method for 25(OH)D measurement (liquid chromatography–tandem mass spectrometry). Third, our decision curve analysis, which assesses clinical utility (in contrast to traditional performance measures) (35), enhances novelty as this analysis has not been incorporated in almost all previous assessments of 25(OH)D models that we reviewed (7-9, 11-14, 19-22, 38). As for limitations, our models had imperfect accuracy. We attribute this, at least in part, to the subjectivity of various predictors (eg, physical activity) and that we did not include some predictors observed in other studies, such as fish consumption, which may be high in vitamin D (8, 11), skin color (7, 42), and genetic factors (16), which are likely to have improved model accuracy. Given the influence of geographical location on 25(OH)D (39), our models have uncertain applicability to people in other geographical locations.

In summary, compared with conventional models, ML models predicted 25(OH)D <50 nmol/L with similar accuracy but predicted 25(OH)D <25 nmol/L with higher discrimination and net benefit. Our models provide a rapid, computer-based, and inexpensive method to accurately identify participants to be included in trials of older adults with vitamin D deficiency. Further, if applied in clinical practice, these improvements should yield fewer missed cases and less over-reporting of severe vitamin D deficiency, which would favorably impact on cost-efficiency of vitamin D testing. We encourage further ML studies in different populations, geographical locations and using a wider range of predictors.

Abbreviations

25(OH)D
25-hydroxyvitamin D

AUC
area under the receiver operating characteristic curve

BMI
body mass index

BP
blood pressure

IDI
integrated discrimination improvement

IS
integral of sensitivity

IP
integral of (1 – specificity)

ML
machine learning

NZDep13
2013 New Zealand Deprivation Index

RCT
randomized controlled trial

Acknowledgments

We thank the participants and the ViDA study staff.

Funding

The Health Research Council of New Zealand (HRC; grant 10/400) and Accident Compensation Corporation of New Zealand funded this study. HRC supported J.D.S. with a fellowship (HRC 18/258).

Disclosures

None declared.

Data Availability

No additional data are available. However, the original (de-identified) data that support the findings derived from this analysis can be requested by emailing the corresponding author.

References

1.

Scragg

R

.

Emerging evidence of thresholds for beneficial effects from vitamin D supplementation

.

Nutrients.

2018

;

10

(

5

):

561

.

2.

Martineau

AR

,

Jolliffe

DA

,

Hooper

RL

, et al.

Vitamin D supplementation to prevent acute respiratory tract infections: systematic review and meta-analysis of individual participant data

.

BMJ

2017

:

356

:i6583. Doi: 10.1136/bmj.i6583

3.

Reid

IR

,

Horne

AM

,

Mihov

B

, et al.

Effect of monthly high-dose vitamin D on bone density in community-dwelling older adults substudy of a randomized controlled trial

.

J Intern Med.

2017

;

282

:

452

-

460

.

4.

Macdonald

HM

,

Reid

IR

,

Gamble

GD

,

Fraser

WD

,

Tang

JC

,

Wood

AD

.

25-hydroxyvitamin D threshold for the effects of vitamin D supplements on bone density: secondary analysis of a randomized controlled trial

.

J Bone Miner Res.

2018

;

33

(

8

):

1464

-

1469

.

5.

Sofianopoulou

E

,

Kaptoge

SK

,

Afzal

S

, et al.

Estimating dose-response relationships for vitamin D with coronary heart disease, stroke, and all-cause mortality: observational and Mendelian randomisation analyses

.

Lancet Diabetes Endocrinol.

2021

;

9

(

12

):

837

-

846

.

6.

Camargo

CA

,

Martineau

AR

.

Vitamin D to prevent COVID-19: recommendations for the design of clinical trials

.

FEBS J.

2020

;

287

(

17

):

3689

-

3692

.

7.

Deschasaux

M

,

Souberbielle

JC

,

Andreeva

VA

, et al.

Quick and easy screening for Vitamin D insufficiency in adults a scoring system to be implemented in daily clinical practice

.

Medicine (Baltimore).

2016

;

95

(

7

):

e2783

.

8.

Merlijn

T

,

Swart

KMA

,

Lips

P

, et al.

Prediction of insufficient serum vitamin D status in older women: a validated model

.

Osteoporosis Int

.

2018

;

29

(

7

):

1539

-

1547

.

9.

Sohl

E

,

Heymans

MW

,

De Jongh

RT

, et al.

Prediction of vitamin D deficiency by simple patient characteristics

.

Am J Clin Nutr.

2014

;

99

(

5

):

1089

-

1095

.

10.

Tran

B

,

Armstrong

BK

,

McGeechan

K

, et al.

Predicting vitamin D deficiency in older Australian adults

.

Clin Endocrinol

.

2013

;

79

(

5

):

631

-

640

.

11.

Kuwabara

A

,

Tsugawa

N

,

Mizuno

K

,

Ogasawara

H

,

Watanabe

Y

,

Tanaka

K

.

A simple questionnaire for the prediction of vitamin D deficiency in Japanese adults (Vitamin D Deficiency questionnaire for Japanese: VDDQ-J)

.

J Bone Miner Metab.

2019

;

37

(

5

):

854

-

863

.

12.

Nabak

AC

,

Johnson

RE

,

Keuler

NS

,

Hansen

KE

.

Can a questionnaire predict vitamin D status in postmenopausal women?

Public Health Nutr.

2014

;

17

(

4

):

739

-

746

.

13.

Bolek-Berquist

J

,

Elliott

ME

,

Gangnon

RE

, et al.

Use of a questionnaire to assess vitamin D status in young adults

.

Public Health Nutr.

2009

;

12

(

2

):

236

-

243

.

14.

Lopes

JB

,

Fernandes

GH

,

Takayama

L

,

Figueiredo

CP

,

Pereira

RMR

.

A predictive model of vitamin D insufficiency in older community people: from the São Paulo Aging & Health Study (SPAH)

.

Maturitas.

2014

;

78

(

4

):

335

-

340

.

15.

Narang

RK

,

Gamble

GG

,

Khaw

KT

, et al.

A prediction tool for vitamin D deficiency in New Zealand adults

.

Arch Osteoporos.

2020

;

15

(

1

):

172

.

16.

Touvier

M

,

Deschasaux

M

,

Montourcy

M

, et al.

Determinants of vitamin D status in Caucasian adults: influence of sun exposure, dietary intake, sociodemographic, lifestyle, anthropometric, and genetic factors

.

J Invest Dermatol.

2015

;

135

(

2

):

378

-

388

.

17.

Kuhn

M

,

Johnson

K.

Applied Predictive Modeling

.

Springer

;

2013

.

Google Preview

18.

Weng

SF

,

Reps

J

,

Kai

J

,

Garibaldi

JM

,

Qureshi

N

.

Can machine-learning improve cardiovascular risk prediction using routine clinical data?

PLoS One.

2017

;

12

(

4

):

e0174944

.

19.

Guo

S

,

Lucas

RM

,

Ponsonby

AL

, et al.

A novel approach for prediction of vitamin D status using support vector regression

.

PLoS One.

2013

;

8

(

11

):

e79970

.

20.

Bhan

I

,

Burnett-Bowie

SAM

,

Ye

J

,

Tonelli

M

,

Thadhani

R

.

Clinical measures identify vitamin D deficiency in dialysis

.

Clin J Am Soc Nephrol.

2010

;

5

(

3

):

460

-

467

.

21.

Annweiler

C

,

Kabeshova

A

,

Legeay

M

,

Fantino

B

,

Beauchet

O

.

Derivation and validation of a clinical diagnostic tool for the identification of older community-dwellers with hypovitaminosis D

.

J Am Med Dirs Assoc.

2015

;

16

(

6

):

536.e8

-

536e19

.

22.

O’Sullivan

F

,

Laird

E

,

Kelly

D

, et al.

Ambient UVB dose and sun enjoyment are important predictors of vitamin D status in an older population

.

J Nutr.

2017

;

147

(

5

):

858

-

868

.

23.

Scragg

R

,

Waayer

D

,

Stewart

AW

, et al.

The Vitamin D Assessment (ViDA) Study: design of a randomized controlled trial of vitamin D supplementation for the prevention of cardiovascular disease, acute respiratory infection, falls and non-vertebral fractures

.

J Steroid Biochem Mol Biol.

2016

;

164

:

318

-

325

.

24.

Sluyter

JD

,

Raita

Y

,

Hasegawa

K

,

Reid

IR

,

Scragg

R

,

Camargo

CA.

Supplementary material for: Prediction of vitamin D deficiency in older adults: the role of machine learning models

. Deposited June 7 2022. Doi: 10.17608/k6.auckland.20006513.v1

Google Preview

. https.//www.otago.ac.nz/wellington/departments/publichealth/research/hirp/otago020194.html

25.

Atkinson

J

,

Salmond

C

,

Crampton

P.

NZDep2013 Index of Deprivation

.

Wellington

:

University of Otago

,

2014

. Accessed

24 May, 2018

Google Preview

26.

Sachs

MC

,

Shoben

A

,

Levin

GP

, et al.

Estimating mean annual 25-hydroxyvitamin D concentrations from single measurements: the Multi-Ethnic Study of Atherosclerosis

.

Am J Clin Nutr.

2013

;

97

(

6

):

1243

-

1251

.

27.

Rockwell

M

,

Kraak

V

,

Hulver

M

,

Epling

J

.

Clinical management of low vitamin D: a scoping review of physicians’ practices

.

Nutrients.

2018

;

10

(

4

):

493

.

28.

Aloia

JT

.

2011 report on dietary reference intake for vitamin D: where do we go from here?

J Clin Endocrinol Metab.

2011

;

96

(

10

):

2987

-

2996

.

29.

Win

SS

,

Camargo

CA

,

Khaw

KT

, et al.

Cross-sectional associations of vitamin D status with asthma prevalence, exacerbations, and control in New Zealand adults

.

J Steroid Biochem Mol Biol.

2019

;

188

:

1

-

7

.

30.

Breiman

L

,

Cutler

A

,

Liaw

A

,

Wiener

M

.

randomForest: Breiman and Cutler’s random forests for classification and regression

. Version 4.6-14. Accessed on February 13, 2019. https://cran.r-project.org/web/packages/randomForest/.

2018

.

31.

Kuhn

M

,

Wing

J

,

Weston

S

,

Williams

A

,

Keefer

C

,

Engelhardt

A

, et al.

caret: Classification and regression training. Version 6.0-84

. Accessed on 11 August 11, 2019. https://cran.r-project.org/web/packages/caret/.

2019

.

32.

Allaire

J

,

Chollet

F

,

Tang

Y

,

Van Der Bijl

W

,

Studer

M

,

Keydana

S.

R Interface to

“Keras”

. Version 2.2.4.1. Accessed on 2 October 2, 2019. https://CRAN.R-project.org/package=keras.

2019

.

33.

DeLong

ER

,

DeLong

DM

,

Clarke-Pearson

DL

.

Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach

.

Biometrics.

1988

;

44

(

3

):

837

-

845

.

34.

Pencina

MJ

,

D’Agostino

RB

Sr,

D’Agostino

RB

Jr,

Vasan

RS

.

Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond

.

Stat Med.

2008

;

27

(

2

):

157

-

172

.

35.

Van Calster

B

,

Wynants

L

,

Verbeek

JFM

, et al.

Reporting and interpreting decision curve analysis: a guide for investigators

.

Eur Urol.

2018

;

74

(

6

):

796

-

804

.

36.

Glass

GE

,

C

G

,

Kessler

WH

.

Validating species distribution models with standardized surveys for Ixodid ticks in mainland Florida

.

J Med Entomol.

2021

;

58

(

3

):

1345

-

1351

.

37.

Kursa

MB

,

Rudnicki

WR

.

Feature selection with the Boruta package

.

J Stat Softw.

2010

;

36

(

11

):

1

-

13

.

38.

Bechrouri

S

,

Monir

A

,

Mraoui

H

,

Sebbar

EH

,

Saalaoui

E

,

Choukri

M

.

Performance of statistical models to predict vitamin D levels

. ACM International Conference Proceeding Series;

2019

. Doi: 10.1145/3314074.3314076

39.

Mendes

MM

,

Darling

AL

,

Hart

KH

,

Morse

S

,

Murphy

RJ

,

Lanham-New

SA

.

Impact of high latitude, urban living and ethnicity on 25-hydroxyvitamin D status: a need for multidisciplinary action?

J Steroid Biochem Mol Biol.

2019

;

188

:

95

-

102

.

40.

Akobeng

AK

.

Understanding diagnostic tests 3: receiver operating characteristic curves

.

Acta Paediatr.

2007

;

96

(

5

):

644

-

647

.

41.

Wells

S

,

Riddell

T

,

Kerr

A

, et al.

Cohort Profile: The PREDICT cardiovascular disease cohort in New Zealand primary care (PREDICT-CVD 19)

.

Int J Epidemiol.

2017

;

46

(

1

):

22

.

42.

Cairncross

CT

,

Stonehouse

W

,

Conlon

CA

, et al.

Predictors of vitamin D status in New Zealand preschool children

.

Matern Child Nutr.

2017

;

13

(

3

):

e12340

.