Abstract

Objectives

To identify highly ranked features related to clinicians’ diagnosis of clinically relevant knee OA.

Methods

General practitioners (GPs) and secondary care physicians (SPs) were recruited to evaluate 5–10 years follow-up clinical and radiographic data of knees from the CHECK cohort for the presence of clinically relevant OA. Clinicians were gathered in pairs (each pair consisting of one GP and one SP), and the paired clinicians independently evaluated the same subset of knees. A diagnosis was made for each knee by the GP and the SP, before and after viewing radiographic data. Random forest models, enhanced with nested 5-fold cross-validation, were built to identify the top 10 features related to each diagnosis.

Results

Seventeen clinician pairs evaluated 1106 knees with 139 clinical and 36 radiographic features. GPs diagnosed clinically relevant OA in 42% and 43% of knees, before and after viewing radiographic data, respectively; SPs did so in 43% and 51% of knees, respectively. Models containing the top 10 features had good performance for explaining clinicians’ diagnoses, with areas under the curve ranging from 0.76 to 0.83. Before viewing radiographic data, quantitative symptomatic features (i.e. WOMAC scores) were the most important ones related to the diagnosis of both GPs and SPs; after viewing radiographic data, radiographic features appeared in the top lists for both, but seemed to be more important for SPs than for GPs.

Conclusions

Random forest models presented good performance in explaining clinicians’ diagnoses, which helped to reveal typical features of patients recognized as having clinically relevant knee OA by clinicians from two different care settings.

Rheumatology key messages
  • Which clinical features drive clinicians to make the osteoarthritis diagnosis remains unclear.

  • Here, diagnoses by general practitioners and secondary care physicians were analysed using machine learning approaches.

  • Results illustrated typical vignettes of clinically relevant knee osteoarthritis from two different care settings.

Introduction

As no gold-standard definition has been established for OA [1], diagnosing knee OA is never trivial. Clinical classification/diagnostic criteria, such as the ACR [2], EULAR [3] and National Institute for Health and Care Excellence (NICE) criteria [4], have been proposed to help identify knee OA patients in research and clinical settings. However, the clinical relevance of these criteria is unclear or insufficiently validated [3, 5, 6].

In addition to diagnoses based on predefined criteria, clinicians’ diagnoses are often used as a reference standard, because they usually reflect the treatment decision-making process in daily clinical practice. For instance, observational studies using registry data can identify OA patients according to recorded clinicians’ diagnoses [7–9], and clinical trials can use recorded diagnoses to narrow the population screening spectrum and facilitate participant recruitment [10, 11]. Furthermore, registered clinicians’ diagnoses are sometimes employed to estimate regional OA prevalence and incidence rates, which impacts future public health planning [12–14]. Despite this widespread use, few studies have examined which clinical features drive clinicians to make the OA diagnosis. Moreover, the focus in diagnosis-making could differ between primary and secondary care (given the different specialty knowledge) and between circumstances with and without radiographs [15]; the features should therefore preferably be identified for each situation.

One of the major challenges in distinguishing important features from numerous patient characteristics is establishing a proper analysis framework. Integrating a large number of clinical features into statistical models results in a dimensionality problem, in which the number of features exceeds the model’s capacity [16]. Machine learning approaches coupled with feature selection methods have been shown to perform well in tackling such issues and are capable of identifying important features from high-dimensional data [17].

Here, we performed a post hoc analysis of data from a previous project [15, 18], in which general practitioners (GPs) and secondary care physicians (SPs) were recruited to evaluate patients’ longitudinal medical data and diagnose whether clinically relevant knee OA was present. With the help of machine learning algorithms, the primary aim of this study was to identify the highly-ranked clinical features related to the diagnosis by GPs and SPs, in the situations with and without access to radiographs, respectively.

Methods

Patient data

We obtained patient data from the CHECK cohort, a longitudinal cohort study of patients with knee/hip complaints suspected of early-stage OA who were followed for 10 years. The inclusion and exclusion criteria of CHECK have been described in a previous study [19]. For the present study, patients with knee complaints at baseline and data available from the 5-year to 10-year (T5 to T10) follow-up were included. This study complies with the Declaration of Helsinki; the Medical Ethics Committee of the University Medical Center Utrecht approved the protocol of the CHECK cohort and all patients signed informed consent. The current report follows the MI-CLAIM guideline for machine learning papers (see Checklist in Supplementary Data S1, available at Rheumatology online) [20].

We obtained patient T5, T8 and T10 follow-up data, and categorized these data into two parts: clinical and radiographic data. Clinical data included features of demographics [sex, age, body mass index (BMI), racial background, marital status, menopausal status, educational level, chronic diseases, occupation, smoking status and alcohol usage], medical history (comorbidities, quadriceps tendinitis, intra-articular fracture, Baker’s cyst, ligament or meniscus damage, osteochondritis dissecans, plica syndrome and septic arthritis), symptoms [qualitative items of knee pain and stiffness, quantitative items measured with WOMAC total and subscale scores [21] and the numeric rating scale (NRS) pain score] and physical examinations (knee warmth, bony tenderness, crepitus, range of motion, knee pain on extension and flexion). Radiographic data included standardized grades for tibial attrition (yes/no), medial/lateral joint space narrowing (0–3), femoral/tibial sclerosis (yes/no) and medial/lateral and tibial/femoral/patellar osteophytes (0–3), as well as a grade for the whole tibiofemoral joint according to the Kellgren and Lawrence (KL) grading system (0–4) [22]. The radiographic grades were obtained for each knee by independent reading by trained observers (blinded to clinical data) of weight-bearing posterior-anterior fixed-flexion and lateral knee radiographs [19, 23].

Clinicians

We recruited clinicians who had held a degree in general practice, orthopaedics, rheumatology or sports medicine for >2 years, or who were in training in one of these specialties combined with a PhD in OA research. We then assessed their characteristics by querying their years of experience in OA treatment, the number of OA patients treated per week, and their personal perception of the importance of radiographs in making the OA diagnosis.

Diagnosis of clinically relevant knee OA

We stored clinical and radiographic data in purpose-built in-house software for optimal presentation. Details of the software training and the diagnosis-making process have been described in our previous studies [15, 24]. A brief description of the diagnosis-making process is given below.

Clinicians were divided into pairs; each pair consisted of one GP and one SP and assessed the same subset of knees (from 40–50 patients). First, the software presented only the T5 to T10 clinical data to the clinicians. Each clinician assessed these independently and, for each knee, chose between ‘yes, clinically relevant OA has developed’ and ‘no, clinically relevant OA has not developed’. Next, the software activated access to the T5 to T10 radiographic data. Clinicians again made their assessments independently and answered the same question. At this stage, clinicians had read-only access to the clinical data and their own previous diagnoses. In addition, the actual radiographic films were available, and the software recorded access to the films. After the procedure, each knee had four diagnoses: GPs’ and SPs’ diagnoses before and after viewing radiographic data.

Statistical analysis

We first built a nested 5-fold cross-validation (CV) enhanced machine learning pipeline to test candidate machine learning algorithms and select the best-performing combination for further analysis. In total, we tested eight candidate combinations on a sub-dataset of our full dataset: two feature selectors [recursive feature elimination (RFE) and Relief] crossed with four classifiers [logistic regression, random forest (RF), support vector classifier and XGBoost]. The RFE-RF model was selected for further analysis as it yielded the highest area under the receiver operating characteristic curve (AUC). The final machine learning pipeline is visualized in Fig. 1, with a detailed description in the figure legend.

Figure 1.

A nested 5-fold cross-validation (CV) enhanced machine learning pipeline. First, the full dataset enters the outer loop of 5-fold CV, where it is randomly split into five equally sized subsets (folds); four folds are combined as the training dataset while the remaining fold is used as the testing dataset. Next, the training and testing datasets are pre-processed. The training dataset of the outer loop then enters the inner loop, which consists of another 5-fold CV for developing and testing random forest models in combination with recursive feature elimination. The model with the highest tested AUC is passed to the outer loop, where its performance is assessed in the independent testing dataset. The 5-fold CV outer loop repeats this procedure five times, until every fold has been used as the testing dataset. Five full models are therefore developed, and the mean AUC and average feature ranks are calculated over the five models. Finally, a dataset containing the top 10 features and the outcome enters the same pipeline to build the 10-feature models.
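To make the pipeline concrete, a minimal sketch of such a nested CV set-up in scikit-learn (the Python library used for our analyses) is given below. The variable names, column encoding and grid of retained-feature counts are illustrative assumptions, not our exact analysis code.

```python
# Minimal sketch of the nested 5-fold CV RFE-RF pipeline (illustrative).
# Assumes X is a numerically encoded pandas DataFrame of features and
# y a binary pandas Series holding the clinician's diagnosis.
import numpy as np
from sklearn.model_selection import StratifiedKFold, GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.impute import SimpleImputer
from sklearn.feature_selection import RFE
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score

outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
outer_aucs, fold_models = [], []

for train_idx, test_idx in outer_cv.split(X, y):
    X_tr, X_te = X.iloc[train_idx], X.iloc[test_idx]
    y_tr, y_te = y.iloc[train_idx], y.iloc[test_idx]

    # Pre-processing (imputation) sits inside the pipeline, so it is
    # re-fitted on each training fold and never sees the testing fold.
    pipe = Pipeline([
        ("impute", SimpleImputer(strategy="mean")),
        ("rfe", RFE(RandomForestClassifier(
            n_estimators=1000, max_depth=5, random_state=0))),
    ])

    # Inner 5-fold CV tunes the number of retained features (10-50)
    # and keeps the candidate with the highest inner AUC.
    search = GridSearchCV(
        pipe,
        param_grid={"rfe__n_features_to_select": [10, 20, 30, 40, 50]},
        scoring="roc_auc",
        cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
    )
    search.fit(X_tr, y_tr)

    # The winning inner model is evaluated on the held-out outer fold.
    y_prob = search.predict_proba(X_te)[:, 1]
    outer_aucs.append(roc_auc_score(y_te, y_prob))
    fold_models.append(search.best_estimator_)

print(f"mean outer AUC: {np.mean(outer_aucs):.2f}")
```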

In total, 48% of knees had incomplete data and 13% of features had >10% (up to 14%) missing values, mainly caused by loss to follow-up. To reduce the risk of overfitting, we imputed missing values by simple imputation (using the mean value for continuous variables and the most frequent value for categorical variables) and incorporated the imputation into the 5-fold CV. In addition, we restricted the maximum depth of the forest to 5 and the number of trees to 1000, and allowed the models to include between 10 and 50 variables.
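Because the imputation must be re-estimated within every training fold, such mixed-type simple imputation can be expressed as a single pre-processing step; a sketch of how this could be composed in scikit-learn follows, with hypothetical column names.

```python
# Sketch of fold-wise simple imputation for mixed data types
# (illustrative; column names are hypothetical).
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer

continuous_cols = ["t10_womac_total", "t10_nrs_pain"]    # hypothetical
categorical_cols = ["t10_knee_stiffness", "sex"]         # hypothetical

imputer = ColumnTransformer([
    # Mean imputation for continuous features.
    ("cont", SimpleImputer(strategy="mean"), continuous_cols),
    # Most-frequent (mode) imputation for categorical features.
    ("cat", SimpleImputer(strategy="most_frequent"), categorical_cols),
])

# Used as the first step of the CV pipeline (see the sketch above), the
# imputer is re-fitted on every training fold, so fold statistics never
# leak into the corresponding testing fold.
```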

We applied the pipeline to the data using GPs’ diagnosis based on clinical data as the outcome, with all patient (T5, T8 and T10) clinical features as predictors. We then used GPs’ diagnosis based on clinical and radiographic data as the outcome, with patient (T5, T8 and T10) clinical and radiographic features as predictors. We did the same for the diagnoses of SPs. In this way, we obtained four sets of full models (owing to the nested 5-fold CV, each set consists of five models) for the four diagnoses: modelGP, modelGP+radiographs, modelSP and modelSP+radiographs. We calculated the mean AUC and its 95% CI to assess the model performance of each set. We ranked the features included in the models by the Gini index and calculated the average rank of each feature (among the five models within a set) for each diagnosis. We pre-specified the top 10 features as the highly-ranked ones related to the diagnosis, as all models were allowed to include at least 10 features. Next, we developed models containing the top 10 features only (via the same machine learning pipeline) and calculated the corresponding AUC. To better evaluate the 10-feature models’ performance, we applied commonly used clinical criteria (the EULAR, ACR and NICE criteria) to the T10 follow-up data to identify ‘clinical OA’ knees (see detailed descriptions of these criteria in Supplementary Table S1, available at Rheumatology online). Using the clinician’s diagnosis as the reference standard, we compared the AUC between the 10-feature models and the three criteria by DeLong’s method [25]. Among the top 10 features, we further identified the top five features to highlight the most important ones.
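As an illustration of the rank aggregation, a short sketch is given below, under the assumption that the per-fold Gini importances of the five outer-fold models have been collected into one array; the function name is ours, not from the study code.

```python
# Sketch of averaging Gini-based feature ranks across the five
# outer-fold models (illustrative). Assumes `importances` is a
# 5 x n_features array with one row of Gini importances per fold
# (zeros for features eliminated by RFE) and `feature_names` matches.
import pandas as pd

def top_features(importances, feature_names, k=10):
    # Rank features within each fold (1 = most important) ...
    ranks = pd.DataFrame(importances, columns=feature_names).rank(
        axis=1, ascending=False)
    # ... then average the ranks over the five folds and keep the best k.
    mean_rank = ranks.mean(axis=0).sort_values()
    return mean_rank.head(k).index.tolist()

# The 10 returned features are then fed through the same nested-CV
# pipeline to build the 10-feature models.
```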

To examine the robustness of our findings with respect to missing values, we performed a sensitivity analysis in which we built the same machine learning models in complete-case datasets (knees with missing values in any of the predictors were excluded). We then evaluated the top 10 features and the 10-feature models’ AUC.
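In essence, this complete-case analysis reduces to dropping the knees with any missing predictor before re-running the pipeline; a minimal sketch with hypothetical names:

```python
# Sketch of the complete-case sensitivity analysis (hypothetical names:
# `df` holds one row per knee, `predictor_cols` lists the predictors).
complete = df.dropna(subset=predictor_cols)
X_cc = complete[predictor_cols]
y_cc = complete["diagnosis"]  # hypothetical outcome column
# X_cc and y_cc are then passed through the same nested-CV pipeline.
```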

For multiple comparisons between AUCs, we adjusted P-values by Bonferroni’s method (multiplying the raw P-values by the number of tests), and an adjusted P-value <0.05 was considered statistically significant. All analyses were performed in Python 3.6 (packages scikit-learn, NumPy, pandas and seaborn) and R 4.0 (package pROC).
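For transparency, the Bonferroni adjustment amounts to multiplying each raw P-value by the number of comparisons and capping the result at 1; a minimal sketch with hypothetical P-values:

```python
# Bonferroni adjustment of DeLong-test P-values (values hypothetical).
raw_p = [0.0004, 0.012, 0.20]
n_tests = len(raw_p)
adj_p = [min(1.0, p * n_tests) for p in raw_p]
significant = [p < 0.05 for p in adj_p]  # alpha of 0.05 after adjustment
```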

Results

Patients and clinicians

This study included 716 patients with 1106 symptomatic knees; 79% were female, mean (s.d.) age at T10 was 66 (5) years and mean (s.d.) BMI at T10 was 27 (4) kg/m2. Clinical data comprised 139 features and radiographic data 36 features; all features and their descriptive characteristics are presented in Supplementary Table S2, available at Rheumatology online.

A total of 17 GPs and 17 SPs were recruited to form 17 clinician pairs; among the SPs, seven were orthopaedists, eight rheumatologists and two sports physicians. Clinician characteristics are presented in Table 1. On average, SPs treated more OA patients per week and valued radiographs more highly than GPs.

Table 1.

Characteristics of recruited clinicians

| Characteristic | General practitioner (n = 17) | Secondary care physician (n = 17) |
| --- | --- | --- |
| Experience of treating OA patients, years, mean (s.d.) | 12 (9) | 15 (9) |
| Number of OA patients treated per week, mean (s.d.) | 5 (3) | 27 (30) |
| Importance of radiographs^a, median (range) | 2 (1–4) | 4 (2–4) |
^a Perceived importance of radiography for making the diagnosis of knee OA: 1, not important; 2, of minor importance; 3, somewhat important; 4, very important.


Clinically relevant knee OA

GPs diagnosed clinically relevant OA in 42% and 43% of knees, before and after viewing radiographic data, respectively; SPs did so in 43% and 51% of knees. Both GPs and SPs somewhat modified their diagnoses after viewing radiographic data, but overall they agreed on 70% of diagnoses regardless of whether radiographic data were available (Fig. 2). During the procedure, GPs viewed 45% of the actual radiographic knee films and SPs viewed 75%.

Figure 2.

Clinicians’ diagnoses before and after viewing radiographic data. ‘Consensus’ means the GP and SP made the same diagnosis. Percentages indicate proportions of knees in each category and are calculated against the total number of knees (1106). GP: general practitioner; SP: secondary care physician

Machine learning models and model performance

All the RFE-RF full models contained 50 features and had good performance for explaining clinicians’ diagnoses: modelGP, mean AUC of 0.87 (95% CI, 0.85, 0.89); modelGP+radiographs, 0.84 (95% CI, 0.82, 0.86); modelSP, 0.83 (95% CI, 0.80, 0.86); modelSP+radiographs, 0.79 (95% CI, 0.76, 0.82).

Models containing the top 10 features presented similarly good performance: 10-feature modelGP, mean AUC of 0.83 (95% CI, 0.85, 0.85); 10-feature modelGP+radiographs, 0.82 (95% CI, 0.80, 0.84); 10-feature modelSP, 0.77 (95% CI, 0.75, 0.79); 10-feature modelSP+radiographs, 0.76 (95% CI, 0.73, 0.78). Clinicians’ diagnoses were poorly explained by the three commonly used clinical criteria (AUC ranging from 0.62 to 0.68 for GPs, and from 0.58 to 0.65 for SPs). Mean AUCs of the 10-feature models were all significantly higher than those of the three criteria (Fig. 3).

Figure 3.

Receiver operating characteristic curves for the top 10-feature models and clinical diagnostic/classification criteria against clinicians’ diagnoses. Mean curves were generated based on the five curves of the 5-fold cross-validation (CV). AUC: area under the curve; NICE: National Institute for Health and Care Excellence. ***P < 0.001, comparing mean AUC of the 10-feature models with the EULAR, ACR and NICE criteria

Top features

The top 10 and top five features selected by the RFE-RF models are presented in Table 2. Before viewing radiographic data, patient symptom features, especially the quantitative measures (WOMAC scores), were the most important ones related to the diagnosis of both GPs and SPs. In addition, none of the physical examination items was identified as important for GPs’ diagnosis, whereas T10 joint line tenderness was for SPs’ diagnosis.

Table 2.

Top 10 and top five (bold items) features related to clinicians’ diagnosis of clinically relevant knee OA

| Feature category | GP diagnosis based on clinical data | SP diagnosis based on clinical data | GP diagnosis based on clinical and radiographic data | SP diagnosis based on clinical and radiographic data |
| --- | --- | --- | --- | --- |
| Demographics and medical history | None | None | None | None |
| Symptoms | T10 WOMAC total score | T10 WOMAC total score | T10 WOMAC total score | T8 WOMAC total score |
| | T10 WOMAC function score | T10 WOMAC function score | T10 WOMAC function score | T5 WOMAC total score |
| | T8 WOMAC total score | T8 WOMAC total score | T8 WOMAC total score | T8 WOMAC function score |
| | T8 WOMAC function score | T8 WOMAC function score | T8 WOMAC function score | T5 WOMAC function score |
| | T5 WOMAC total score | T5 WOMAC total score | T5 WOMAC total score | |
| | T10 WOMAC pain score | T10 WOMAC pain score | T10 WOMAC stiffness score | |
| | T5 WOMAC function score | T5 WOMAC function score | T5 WOMAC function score | |
| | T5 WOMAC stiffness score | T5 WOMAC pain score | | |
| | T10 knee stiffness – No | | | |
| | T10 knee stiffness – Yes | | | |
| Physical examination | None | T10 joint line tenderness – Negative^a | T10 knee flexion degree | T10 joint line tenderness – Positive |
| | | T10 joint line tenderness – Positive^a | | T8 knee flexion degree |
| Radiographic features | | | T10 medial joint space narrowing grade | T10 KL grade |
| | | | T5 medial tibial osteophyte grade | T10 medial femoral osteophyte grade |
| | | | | T8 KL grade |
| | | | | T5 KL grade |
^a In the random forest models, a positive joint line tenderness test indicates that the knee is more likely to have OA; a negative test indicates that it is more likely not to have OA.

GP: general practitioner; SP: secondary care physician; T5 (8,10): 5 (8,10)-year follow-up; KL: Kellgren and Lawrence.


After viewing radiographic data, the top five features for GPs’ diagnosis remained the same, but those for SPs’ diagnosis changed moderately, incorporating three radiographic features. Among the top 10 features, two medial compartment structural features (T10 medial joint space narrowing grade and T5 medial tibial osteophyte grade) were related to GPs’ diagnosis, whereas grades for the whole tibiofemoral joint (T5 to T10 KL grades) and the T10 medial femoral osteophyte grade were related to SPs’ diagnosis.

None of the demographic and medical history features were identified as important in any of the RFE-RF models.

Sensitivity analysis

The AUCs of the models built in the complete-case datasets were slightly lower (by about 2%) than those obtained in our main analysis, while the top 10 features were generally similar (Supplementary Tables S3 and S4, available at Rheumatology online). Overall, the sensitivity analysis supports the robustness of our main findings.

Discussion

In this study, we developed RFE-RF models with good performance for explaining GPs’ and SPs’ diagnoses of clinically relevant knee OA. The patient features identified by these models suggest typical characteristics of the patients who would likely receive a diagnosis of clinically relevant knee OA from clinicians.

Although GPs and SPs agreed on most diagnoses (70%), both before and after viewing radiographic data, the patient features related to the diagnoses varied across situations. When radiographs were unavailable, patients with more severe symptoms were more likely to receive the OA diagnosis from both GPs and SPs. Additionally, only SPs seemed to have taken the joint line tenderness examination into account, which might be one of the reasons for the 30% discrepancy in diagnoses between the two types of clinicians. Another reason could be that the threshold of symptom severity for making an OA diagnosis differed between GPs and SPs, as suggested by a real-world report from Jordan et al., in which GPs tended not to diagnose knee OA in patients with mild symptoms only [12]. Unfortunately, no similar study is available for secondary care.

When radiographs were available, radiographic features appeared in the top 10 lists for both GPs and SPs. However, focusing on the top five features only, GPs seemed to still make diagnoses mainly based on patient symptoms, whereas SPs shifted to a combination of symptomatic and structural features. This is consistent with actual clinical practice, in which SPs tend to check radiographic films to assess structural severity and then plan further treatments (e.g. orthopaedic surgeons assess the necessity/suitability of surgery based on radiographs), while GPs are advised not to obtain radiographs for OA diagnosis [4] and are likely to provide symptom-relief treatments.

Patient demographic and medical history features were found to be unimportant for diagnosis in all four 10-feature models. This is inconsistent with previous reports that patients with older age, more comorbidities or obesity were more likely to be diagnosed with knee OA by GPs [12, 14]. Moreover, the EULAR, ACR and NICE criteria, which were developed based on the consensus of clinical and research experts, all treat age as an important indicator for identifying OA knees. A possible reason could be that the CHECK cohort recruited patients above 45 years of age at baseline [19], so all patients in this study (after 5 years of follow-up) had already fulfilled the age requirement of the three criteria. On the other hand, our analysis, which focused on identifying the most important features, could have missed features with weak associations. Hence, the findings should be interpreted as indicating that clinicians rely more on symptoms, physical examinations or radiographic features than on risk factors (e.g. older age, comorbidities and higher BMI) in diagnosing knee OA.

The differences in the top features between GPs and SPs suggest that researchers using registry diagnoses to assess OA disease burden should be aware of the situation under which the diagnosis was made. For instance, diagnoses of GPs (with and without radiographs) mainly reflect patient symptoms, which could provide hints towards the demand for symptom-relief management (e.g. pain medication) in primary care. Global disease burden studies defined OA patients by the combination of knee pain and KL grade (≥2), which seems to reflect disease burden from a perspective similar to that of SPs [26].

In this study, none of the three commonly used clinical criteria adequately captured the knees recognized as having clinically relevant knee OA by clinicians. It should be noted that the ACR criteria were originally developed as classification criteria to be used in research, whereas the EULAR and NICE criteria were developed for diagnosis in clinical settings. It might be ‘unfair’ to test the ACR criteria against clinicians’ diagnoses, but our results suggest that the diagnostic performance of the ACR criteria was similar to that of the NICE criteria. Meanwhile, the knee OA definition in the NICE criteria is exactly the same as that in the Dutch healthcare practice guideline for GPs (https://richtlijnen.nhg.org/standaarden/niet-traumatische-knieklachten), which reveals an inconsistency between guidelines and GPs’ actual clinical practice. This is in line with a previous report that indicated only moderate adherence to practice guidelines by clinicians [27]. On the other hand, the presented RFE-RF models discriminated clinicians’ diagnoses well, which shows the feasibility of applying machine learning models to similar research problems. It may also imply the feasibility of simulating human diagnosis through machine learning models.

The design of this study has several strengths. First, GPs and SPs were paired to (independently) review the same sample of knees, which enabled a robust comparison of the diagnoses and features between the two types of clinicians. Second, clinicians’ diagnoses were made on patient longitudinal data, which resembles the actual situation more closely than cross-sectional data; as suggested by a previous study, GPs would record patients with ‘joint pain’ at the initial consultations and then diagnose OA after 6–7 years of follow-up [12]. Third, nested 5-fold CV was used to improve model stability and allowed all model performances to be evaluated in an independent testing dataset. Fourth, data imputation was incorporated into the 5-fold CV, meaning that the imputation was performed five times, each time on a random sample of the whole dataset. Similar to the merits of multiple imputation, this introduced more uncertainty into the imputed values and thus increased the standard error, yielding a better estimate of the correct value. Fifth, because patient features are likely to be inter-correlated, RFE, which iteratively trains a model while removing the lowest-ranking features, was used in combination with the RF model. Previous studies have demonstrated the robustness of RFE-RF models in data with correlated features [17, 24, 28].

This study has limitations. First, despite the strengths regarding internal validity, the findings should be treated cautiously when implemented externally. For example, clinicians were asked to diagnose clinically relevant OA; this could differ from settings in which the diagnosis of pre-clinical or early-stage OA is included. The CHECK cohort excluded patients with potential differential diagnoses (e.g. rheumatoid arthritis) at baseline, so the findings may not apply to patients with these conditions. Moreover, only Dutch clinicians were recruited, which calls for future studies evaluating the generalizability of our results in other regions. Second, we could have missed some radiographic features (e.g. tibiofemoral alignment) that were not listed in the dataset but could have been captured by clinicians when viewing the actual radiographic films. Third, although RFE-RF models take feature interactions into account [16], trajectories of the features over the period (T5 to T10) might not have been well reflected in the analysis. Patients with worsening symptoms and structural progression might be more likely to be diagnosed with knee OA. As it has been shown that the majority of knees had stable symptoms in the CHECK cohort from T5 to T10 [6], we assumed this issue was relevant in only a minority of patients. For structural progression, we performed an explorative analysis testing the correlation between KL progression (T10−T5 ≥ 1) and diagnoses (after viewing radiographic data); no statistically significant correlation was found for either GPs or SPs. In our final models, features from different time points were included, which should be interpreted as patients with consistently severe symptoms and structural damage being more likely to receive the diagnosis. Fourth, missing values of the features were presented as blank boxes to the clinicians but were imputed in our statistical analysis, which caused discrepancies between the two scenarios. We performed the imputation because the RFE-RF model does not tolerate missing values. To reduce the risk of overfitting, we used simple imputation, which may make the data more homogeneous and increase the type I error (false-positive correlations). However, the sensitivity analysis supported the robustness of our main findings. We interpret this as the missing values having somewhat biased our results (e.g. AUC values), but the extent did not appear large enough to materially influence our main conclusions. Fifth, information leakage could have occurred while selecting the top five features, because they were selected from the top 10 features, which were determined with access to the whole dataset.

In conclusion, the RFE-RF models developed in this study had good performance in explaining clinicians’ diagnoses of clinically relevant knee OA. The severity of patients’ symptoms was the most important feature related to the diagnoses of GPs, and to those of SPs when there was no access to radiographs. Although related to the diagnoses of both, radiographic features seem to be more important for SPs than for GPs. The study findings helped to illustrate typical vignettes of patients recognized as having clinically relevant knee OA by experts from two different care settings.

Supplementary material

Supplementary material is available at Rheumatology online.

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author.

Funding

This work was supported by the Dutch Arthritis Society (Project ID 15–1-301); Q.W. was financed by China Scholarship Council (CSC) (grant number: 201906230308).

Disclosure statement: J.R. and M.K. received research grants from the Dutch Arthritis Society; M.K. reports fees for consultancy (AbbVie, Pfizer, Levicept, GlaxoSmithKline, Merck-Serono, Kiniksa, Flexion, Galapagos, Jansen, CHDR, Novartis, UCB), being a local investigator of an industry-driven trial (AbbVie), fees from Wolters Kluwer (UpToDate) and Springer Verlag (Reumatologie en klinische immunologie), and being a board member of OARSI, president of the Dutch Society for Rheumatology and a member of the EULAR Council.

Acknowledgements

We would like to acknowledge the CREDO experts group (N.E. Aerts-Lankhorst, R. Agricola, A.N. Bastick, R.D.W. van Bentveld, P.J. van den Berg, J. Bijsterbosch, A. de Boer, M. Boers, A.M. Bohnen, A.E.R.C.H. Boonen, P.K Bos, T.A.E.J. Boymans, H.P. Breedveldt-Boer, R.W. Brouwer, J.W. Colaris, J. Damen, G. Elshout, P.J. Emans, W.T.M. Enthoven, E.J.M. Frölke, R. Glijsteen, H.J.C. van der Heide, A.M. Huisman, R.D. van Ingen, M.L. Jacobs, R.P.A. Janssen, P.M. Kevenaar, M.A. van Koningsbrugge, P. Krastman, N.O. Kuchuk, M.L.A. Landsmeer, W.F. Lems, H.M.J. van der Linden, R. van Linschoten, E.A.M. Mahler, B.L. van Meer, D.E. Meuffels, W.H. Noort-van der Laan, J.M. van Ochten, J. van Oldenrijk, G.H.J. Pols, T.M. Piscaer, J.B.M. Rijkels-Otters, N. Riyazi, J.M. Schellingerhout, H.J. Schers, B.W.V. Schouten, G.F. Snijders, W.E. van Spil, S.A.G. Stitzinger, J.J. Tolk, Y.D.M. van Trier, M. Vis, V.M.I Voorbrood, B.C. de Vos, and A. de Vries) for evaluating the medical files and their feedback on the manuscript.

References

1. Hunter DJ, Bierma-Zeinstra S. Osteoarthritis. Lancet 2019;393:1745–59.
2. Altman R, Asch E, Bloch D et al. Development of criteria for the classification and reporting of osteoarthritis. Classification of osteoarthritis of the knee. Diagnostic and Therapeutic Criteria Committee of the American Rheumatism Association. Arthritis Rheum 1986;29:1039–49.
3. Zhang W, Doherty M, Peat G et al. EULAR evidence-based recommendations for the diagnosis of knee osteoarthritis. Ann Rheum Dis 2010;69:483–9.
4. National Clinical Guideline Centre. Osteoarthritis: care and management in adults. UK: NICE, 2014.
5. Skou ST, Koes BW, Gronne DT, Young J, Roos EM. Comparison of three sets of clinical classification criteria for knee osteoarthritis: a cross-sectional study of 13,459 patients treated in primary care. Osteoarthritis Cartilage 2020;28:167–72.
6. Schiphof D, Runhaar J, Waarsing JH et al. The clinical and radiographic course of early knee and hip osteoarthritis over 10 years in CHECK (Cohort Hip and Cohort Knee). Osteoarthritis Cartilage 2019;27:1491–500.
7. Kluzek S, Rubin KH, Sanchez-Santos M et al. Accelerated osteoarthritis in women with polycystic ovary syndrome: a prospective nationwide registry-based cohort study. Arthritis Res Ther 2021;23:225.
8. Misra D, Lu N, Felson D et al. Does knee replacement surgery for osteoarthritis improve survival? The jury is still out. Ann Rheum Dis 2017;76:140–6.
9. Yu D, Jordan KP, Snell KIE et al. Development and validation of prediction models to estimate risk of primary total hip and knee replacements using data from the UK: two prospective open cohorts using the UK Clinical Practice Research Datalink. Ann Rheum Dis 2019;78:91–9.
10. Fraenkel L, Buta E, Suter L et al. Nonsteroidal anti-inflammatory drugs vs cognitive behavioral therapy for arthritis pain: a randomized withdrawal trial. JAMA Intern Med 2020;180:1194–202.
11. Mol MF, Runhaar J, Bos PK et al. Effectiveness of intramuscular gluteal glucocorticoid injection versus intra-articular glucocorticoid injection in knee osteoarthritis: design of a multicenter randomized, 24 weeks comparative parallel-group trial. BMC Musculoskelet Disord 2020;21:225.
12. Jordan KP, Tan V, Edwards JJ et al. Influences on the decision to use an osteoarthritis diagnosis in primary care: a cohort study with linked survey and electronic health record data. Osteoarthritis Cartilage 2016;24:786–93.
13. Turkiewicz A, Petersson IF, Bjork J et al. Current and future impact of osteoarthritis on health care: a population-based study with projections to year 2032. Osteoarthritis Cartilage 2014;22:1826–32.
14. Arslan IG, Damen J, de Wilde M et al. Incidence and prevalence of knee osteoarthritis using codified and narrative data from electronic health records: a population-based study. Arthritis Care Res 2022;74:937–44.
15. Wang Q, Runhaar J, Kloppenburg M et al. The added value of radiographs in diagnosing knee osteoarthritis is similar for general practitioners and secondary care physicians; data from the CHECK early osteoarthritis cohort. J Clin Med 2020;9:3374.
16. Jamshidi A, Pelletier JP, Martel-Pelletier J. Machine-learning-based patient-specific prediction models for knee osteoarthritis. Nat Rev Rheumatol 2019;15:49–60.
17. Lazzarini N, Runhaar J, Bay-Jensen AC et al. A machine learning approach for the identification of new biomarkers for knee osteoarthritis development in overweight and obese women. Osteoarthritis Cartilage 2017;25:2014–21.
18. Runhaar J, Kloppenburg M, Boers M et al. Towards developing diagnostic criteria for early knee osteoarthritis: data from the CHECK study. Rheumatology 2021;60:2448–55.
19. Wesseling J, Boers M, Viergever MA et al. Cohort Profile: Cohort Hip and Cohort Knee (CHECK) study. Int J Epidemiol 2016;45:36–44.
20. Norgeot B, Quer G, Beaulieu-Jones BK et al. Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist. Nat Med 2020;26:1320–4.
21. Bellamy N, Buchanan WW, Goldsmith CH, Campbell J, Stitt LW. Validation study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J Rheumatol 1988;15:1833–40.
22. Kellgren JH, Lawrence JS. Radiological assessment of osteo-arthrosis. Ann Rheum Dis 1957;16:494–502.
23. Macri EM, Runhaar J, Damen J, Oei EH, Bierma-Zeinstra SM. Kellgren/Lawrence grading in cohort studies: methodological update and implications illustrated using data from the CHECK cohort. Arthritis Care Res 2022;74:1179–87.
24. Yang R, Zhang C, Gao R, Zhang L. A novel feature extraction method with feature selection to identify Golgi-resident protein types from imbalanced data. Int J Mol Sci 2016;17:218.
25. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837–45.
26. Safiri S, Kolahi AA, Cross M et al. Prevalence, deaths, and disability-adjusted life years due to musculoskeletal disorders for 195 countries and territories 1990–2017. Arthritis Rheumatol 2021;73:702–14.
27. DeHaan MN, Guzman J, Bayley MT, Bell MJ. Knee osteoarthritis clinical practice guidelines – how are we doing? J Rheumatol 2007;34:2099–105.
28. Jiang H, Deng Y, Chen HS et al. Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinformatics 2004;5:81.

Author notes

See Acknowledgements section for a list of the CREDO Experts Group.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/pages/standard-publication-reuse-rights)
