Patient Versus Clinician Proxy Reliability of the AM-PAC “6-Clicks” Basic Mobility and Daily Activity Short Forms

Patient Characteristics for “6-Clicks” Mobility and Activity Short Forms

Characteristic	“6-Clicks” Mobility N = 70	“6-Clicks” Activity N = 71
Patient completed before service, no. (%)	38 (54.3)	35 (49.3)
Patient completed after service, no. (%)	32 (45.7)	36 (50.7)
Patient age, mean (SD) [range], y	64.1 (14.2) [27–90]	66.0 (15.3) [19–98]
Female, no. (%)	39 (55.7)	33 (46.5)
Evaluation service, no. (%)
Cardiac	17 (24.3)	17 (23.9)
Medical/surgical	20 (28.6)	17 (23.9)
Neurologic	20 (28.6)	20 (28.2)
Orthopedic	13 (18.6)	17 (23.9)

Characteristic	“6-Clicks” Mobility N = 70	“6-Clicks” Activity N = 71
Patient completed before service, no. (%)	38 (54.3)	35 (49.3)
Patient completed after service, no. (%)	32 (45.7)	36 (50.7)
Patient age, mean (SD) [range], y	64.1 (14.2) [27–90]	66.0 (15.3) [19–98]
Female, no. (%)	39 (55.7)	33 (46.5)
Evaluation service, no. (%)
Cardiac	17 (24.3)	17 (23.9)
Medical/surgical	20 (28.6)	17 (23.9)
Neurologic	20 (28.6)	20 (28.2)
Orthopedic	13 (18.6)	17 (23.9)

Table 1

Patient Characteristics for “6-Clicks” Mobility and Activity Short Forms

Characteristic	“6-Clicks” Mobility N = 70	“6-Clicks” Activity N = 71
Patient completed before service, no. (%)	38 (54.3)	35 (49.3)
Patient completed after service, no. (%)	32 (45.7)	36 (50.7)
Patient age, mean (SD) [range], y	64.1 (14.2) [27–90]	66.0 (15.3) [19–98]
Female, no. (%)	39 (55.7)	33 (46.5)
Evaluation service, no. (%)
Cardiac	17 (24.3)	17 (23.9)
Medical/surgical	20 (28.6)	17 (23.9)
Neurologic	20 (28.6)	20 (28.2)
Orthopedic	13 (18.6)	17 (23.9)

Characteristic	“6-Clicks” Mobility N = 70	“6-Clicks” Activity N = 71
Patient completed before service, no. (%)	38 (54.3)	35 (49.3)
Patient completed after service, no. (%)	32 (45.7)	36 (50.7)
Patient age, mean (SD) [range], y	64.1 (14.2) [27–90]	66.0 (15.3) [19–98]
Female, no. (%)	39 (55.7)	33 (46.5)
Evaluation service, no. (%)
Cardiac	17 (24.3)	17 (23.9)
Medical/surgical	20 (28.6)	17 (23.9)
Neurologic	20 (28.6)	20 (28.2)
Orthopedic	13 (18.6)	17 (23.9)

Reliability of “6-Clicks” Mobility

For the “6-Clicks” mobility short form, the mean (SD) total T-score determined by physical therapists for the entire sample (N = 70) was 44.6 (8.7), whereas the mean (SD) total T-score determined by participants was 39.3 (10.1), a mean difference of 5.3 (8.7) T-points, which was statistically significant (P < .001) (Tab. 2).

Table 2

Summary of “6-Clicks” Mobility and Activity Items and Total T-Scores by Patient Versus Therapist and Stratified by Timing^a^,^b

“6-Clicks” Measure	Items and Scores	All Patients			Patient Completed Before Service			Patient Completed After Service
“6-Clicks” Measure	Items and Scores	Therapist Score Mean (SD)	Patient Score (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)
Mobility	Rolling	3.6 (0.6)	2.8 (1.0)	0.8 (1.0)^c	3.8 (0.6)	2.7 (1.1)	1.1 (1.0)^c	3.4 (0.6)	2.9 (0.9)	0.6 (0.8)^c
	Supine to sit	3.4 (0.7)	2.8 (0.9)	0.6 (0.9)^c	3.6 (0.6)	2.7 (1.0)	0.9 (1.0)^c	3.2 (0.8)	2.9 (0.9)	0.3 (0.7)^c
	Moving to chair	3.2 (0.9)	2.8 (1.0)	0.4 (1.0)^c	3.3 (0.9)	2.9 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.0)	0.4 (0.8)^c
	Sit to stand	3.3 (0.8)	3.0 (1.0)	0.3 (0.9)^c	3.4 (0.8)	3.1 (1.0)	0.4 (1.0)^c	3.2 (0.9)	2.9 (1.0)	0.3 (0.8)
	Walking in room	3.2 (0.9)	2.8 (1.1)	0.4 (1.0)^c	3.2 (0.9)	2.8 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.1)	0.4 (0.9)^c
	Climbing stairs	2.6 (1.0)	2.3 (1.2)	0.3 (1.1)^c	2.7 (0.9)	2.4 (1.2)	0.3 (1.2)	2.5 (1.0)	2.3 (1.2)	0.3 (0.9)
	Total T-score	44.6 (8.7)	39.3 (10.1)	5.3 (8.7)^c	45.5 (8.6)	39.3 (10.1)	6.2 (9.4)^c	43.5 (8.9)	39.2 (10.2)	4.3 (7.8)^c
	SEM	3.6 (1.4)	3.2 (1.3)	–	3.7 (1.4)	3.1 (1.3)	–	3.6 (1.5)	3.2 (1.3)	–
Activity	Lower body clothing	2.7 (0.9)	2.9 (0.8)	−0.2 (0.9)	2.6 (0.9)	2.8 (0.8)	−0.2 (1.0)	2.8 (0.9)	3.0 (0.8)	−0.2 (0.8)
	Bathing	2.9 (0.9)	2.8 (1.0)	0.1 (1.1)	2.8 (0.8)	2.7 (0.9)	0.1 (1.2)	3.0 (0.9)	2.9 (1.0)	0.0 (1.0)
	Toileting	3.3 (0.8)	3.3 (0.8)	0.0 (0.80)	3.2 (0.9)	3.2 (0.8)	0.0 (0.9)	3.4 (0.8)	3.3 (0.8)	0.1 (0.7)
	Upper body clothing	3.5 (0.8)	3.1 (0.9)	0.4 (1.0)^c	3.3 (0.8)	3.2 (0.9)	0.1 (1.1)	3.7 (0.7)	3.1 (0.9)	0.6 (0.8)^c
	Grooming	3.7 (0.6)	3.6 (0.6)	0.1 (0.7)	3.6 (0.6)	3.6 (0.7)	0.1 (0.8)	3.8 (0.4)	3.7 (0.5)	0.2 (0.7)
	Eating	3.9 (0.3)	3.6 (0.7)	0.3 (0.7)^c	3.9 (0.4)	3.7 (0.7)	0.3 (0.6)^c	4.0 (0.2)	3.6 (0.7)	0.3 (0.8)^c
	Total T-score	44.7 (8.1)	43.0 (7.8)	1.7 (8.4)	43.3 (8.0)	42.2 (7.1)	1.1 (8.8)	46.1 (8.0)	43.9 (8.5)	2.2 (8.0)
	SEM	3.6 (1.6)	3.3 (1.3)	–	3.4 (1.5)	3.1 (1.1)	–	3.8 (1.7)	3.6 (1.6)

“6-Clicks” Measure	Items and Scores	All Patients			Patient Completed Before Service			Patient Completed After Service
“6-Clicks” Measure	Items and Scores	Therapist Score Mean (SD)	Patient Score (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)
Mobility	Rolling	3.6 (0.6)	2.8 (1.0)	0.8 (1.0)^c	3.8 (0.6)	2.7 (1.1)	1.1 (1.0)^c	3.4 (0.6)	2.9 (0.9)	0.6 (0.8)^c
	Supine to sit	3.4 (0.7)	2.8 (0.9)	0.6 (0.9)^c	3.6 (0.6)	2.7 (1.0)	0.9 (1.0)^c	3.2 (0.8)	2.9 (0.9)	0.3 (0.7)^c
	Moving to chair	3.2 (0.9)	2.8 (1.0)	0.4 (1.0)^c	3.3 (0.9)	2.9 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.0)	0.4 (0.8)^c
	Sit to stand	3.3 (0.8)	3.0 (1.0)	0.3 (0.9)^c	3.4 (0.8)	3.1 (1.0)	0.4 (1.0)^c	3.2 (0.9)	2.9 (1.0)	0.3 (0.8)
	Walking in room	3.2 (0.9)	2.8 (1.1)	0.4 (1.0)^c	3.2 (0.9)	2.8 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.1)	0.4 (0.9)^c
	Climbing stairs	2.6 (1.0)	2.3 (1.2)	0.3 (1.1)^c	2.7 (0.9)	2.4 (1.2)	0.3 (1.2)	2.5 (1.0)	2.3 (1.2)	0.3 (0.9)
	Total T-score	44.6 (8.7)	39.3 (10.1)	5.3 (8.7)^c	45.5 (8.6)	39.3 (10.1)	6.2 (9.4)^c	43.5 (8.9)	39.2 (10.2)	4.3 (7.8)^c
	SEM	3.6 (1.4)	3.2 (1.3)	–	3.7 (1.4)	3.1 (1.3)	–	3.6 (1.5)	3.2 (1.3)	–
Activity	Lower body clothing	2.7 (0.9)	2.9 (0.8)	−0.2 (0.9)	2.6 (0.9)	2.8 (0.8)	−0.2 (1.0)	2.8 (0.9)	3.0 (0.8)	−0.2 (0.8)
	Bathing	2.9 (0.9)	2.8 (1.0)	0.1 (1.1)	2.8 (0.8)	2.7 (0.9)	0.1 (1.2)	3.0 (0.9)	2.9 (1.0)	0.0 (1.0)
	Toileting	3.3 (0.8)	3.3 (0.8)	0.0 (0.80)	3.2 (0.9)	3.2 (0.8)	0.0 (0.9)	3.4 (0.8)	3.3 (0.8)	0.1 (0.7)
	Upper body clothing	3.5 (0.8)	3.1 (0.9)	0.4 (1.0)^c	3.3 (0.8)	3.2 (0.9)	0.1 (1.1)	3.7 (0.7)	3.1 (0.9)	0.6 (0.8)^c
	Grooming	3.7 (0.6)	3.6 (0.6)	0.1 (0.7)	3.6 (0.6)	3.6 (0.7)	0.1 (0.8)	3.8 (0.4)	3.7 (0.5)	0.2 (0.7)
	Eating	3.9 (0.3)	3.6 (0.7)	0.3 (0.7)^c	3.9 (0.4)	3.7 (0.7)	0.3 (0.6)^c	4.0 (0.2)	3.6 (0.7)	0.3 (0.8)^c
	Total T-score	44.7 (8.1)	43.0 (7.8)	1.7 (8.4)	43.3 (8.0)	42.2 (7.1)	1.1 (8.8)	46.1 (8.0)	43.9 (8.5)	2.2 (8.0)
	SEM	3.6 (1.6)	3.3 (1.3)	–	3.4 (1.5)	3.1 (1.1)	–	3.8 (1.7)	3.6 (1.6)

^a

SEM = standard error of measurement

^b

Higher scores indicate better mobility or activity. T-score and SEM are derived using conversion tables in the AM-PAC Short Forms Manual 2.0.²⁶

^c

P < .05 based on paired t test or Wilcoxon signed-rank test.

Table 2

Summary of “6-Clicks” Mobility and Activity Items and Total T-Scores by Patient Versus Therapist and Stratified by Timing^a^,^b

“6-Clicks” Measure	Items and Scores	All Patients			Patient Completed Before Service			Patient Completed After Service
“6-Clicks” Measure	Items and Scores	Therapist Score Mean (SD)	Patient Score (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)
Mobility	Rolling	3.6 (0.6)	2.8 (1.0)	0.8 (1.0)^c	3.8 (0.6)	2.7 (1.1)	1.1 (1.0)^c	3.4 (0.6)	2.9 (0.9)	0.6 (0.8)^c
	Supine to sit	3.4 (0.7)	2.8 (0.9)	0.6 (0.9)^c	3.6 (0.6)	2.7 (1.0)	0.9 (1.0)^c	3.2 (0.8)	2.9 (0.9)	0.3 (0.7)^c
	Moving to chair	3.2 (0.9)	2.8 (1.0)	0.4 (1.0)^c	3.3 (0.9)	2.9 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.0)	0.4 (0.8)^c
	Sit to stand	3.3 (0.8)	3.0 (1.0)	0.3 (0.9)^c	3.4 (0.8)	3.1 (1.0)	0.4 (1.0)^c	3.2 (0.9)	2.9 (1.0)	0.3 (0.8)
	Walking in room	3.2 (0.9)	2.8 (1.1)	0.4 (1.0)^c	3.2 (0.9)	2.8 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.1)	0.4 (0.9)^c
	Climbing stairs	2.6 (1.0)	2.3 (1.2)	0.3 (1.1)^c	2.7 (0.9)	2.4 (1.2)	0.3 (1.2)	2.5 (1.0)	2.3 (1.2)	0.3 (0.9)
	Total T-score	44.6 (8.7)	39.3 (10.1)	5.3 (8.7)^c	45.5 (8.6)	39.3 (10.1)	6.2 (9.4)^c	43.5 (8.9)	39.2 (10.2)	4.3 (7.8)^c
	SEM	3.6 (1.4)	3.2 (1.3)	–	3.7 (1.4)	3.1 (1.3)	–	3.6 (1.5)	3.2 (1.3)	–
Activity	Lower body clothing	2.7 (0.9)	2.9 (0.8)	−0.2 (0.9)	2.6 (0.9)	2.8 (0.8)	−0.2 (1.0)	2.8 (0.9)	3.0 (0.8)	−0.2 (0.8)
	Bathing	2.9 (0.9)	2.8 (1.0)	0.1 (1.1)	2.8 (0.8)	2.7 (0.9)	0.1 (1.2)	3.0 (0.9)	2.9 (1.0)	0.0 (1.0)
	Toileting	3.3 (0.8)	3.3 (0.8)	0.0 (0.80)	3.2 (0.9)	3.2 (0.8)	0.0 (0.9)	3.4 (0.8)	3.3 (0.8)	0.1 (0.7)
	Upper body clothing	3.5 (0.8)	3.1 (0.9)	0.4 (1.0)^c	3.3 (0.8)	3.2 (0.9)	0.1 (1.1)	3.7 (0.7)	3.1 (0.9)	0.6 (0.8)^c
	Grooming	3.7 (0.6)	3.6 (0.6)	0.1 (0.7)	3.6 (0.6)	3.6 (0.7)	0.1 (0.8)	3.8 (0.4)	3.7 (0.5)	0.2 (0.7)
	Eating	3.9 (0.3)	3.6 (0.7)	0.3 (0.7)^c	3.9 (0.4)	3.7 (0.7)	0.3 (0.6)^c	4.0 (0.2)	3.6 (0.7)	0.3 (0.8)^c
	Total T-score	44.7 (8.1)	43.0 (7.8)	1.7 (8.4)	43.3 (8.0)	42.2 (7.1)	1.1 (8.8)	46.1 (8.0)	43.9 (8.5)	2.2 (8.0)
	SEM	3.6 (1.6)	3.3 (1.3)	–	3.4 (1.5)	3.1 (1.1)	–	3.8 (1.7)	3.6 (1.6)

“6-Clicks” Measure	Items and Scores	All Patients			Patient Completed Before Service			Patient Completed After Service
“6-Clicks” Measure	Items and Scores	Therapist Score Mean (SD)	Patient Score (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)	Therapist Score Mean (SD)	Patient Score Mean (SD)	Mean Difference (SD)
Mobility	Rolling	3.6 (0.6)	2.8 (1.0)	0.8 (1.0)^c	3.8 (0.6)	2.7 (1.1)	1.1 (1.0)^c	3.4 (0.6)	2.9 (0.9)	0.6 (0.8)^c
	Supine to sit	3.4 (0.7)	2.8 (0.9)	0.6 (0.9)^c	3.6 (0.6)	2.7 (1.0)	0.9 (1.0)^c	3.2 (0.8)	2.9 (0.9)	0.3 (0.7)^c
	Moving to chair	3.2 (0.9)	2.8 (1.0)	0.4 (1.0)^c	3.3 (0.9)	2.9 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.0)	0.4 (0.8)^c
	Sit to stand	3.3 (0.8)	3.0 (1.0)	0.3 (0.9)^c	3.4 (0.8)	3.1 (1.0)	0.4 (1.0)^c	3.2 (0.9)	2.9 (1.0)	0.3 (0.8)
	Walking in room	3.2 (0.9)	2.8 (1.1)	0.4 (1.0)^c	3.2 (0.9)	2.8 (1.0)	0.4 (1.1)^c	3.1 (0.9)	2.7 (1.1)	0.4 (0.9)^c
	Climbing stairs	2.6 (1.0)	2.3 (1.2)	0.3 (1.1)^c	2.7 (0.9)	2.4 (1.2)	0.3 (1.2)	2.5 (1.0)	2.3 (1.2)	0.3 (0.9)
	Total T-score	44.6 (8.7)	39.3 (10.1)	5.3 (8.7)^c	45.5 (8.6)	39.3 (10.1)	6.2 (9.4)^c	43.5 (8.9)	39.2 (10.2)	4.3 (7.8)^c
	SEM	3.6 (1.4)	3.2 (1.3)	–	3.7 (1.4)	3.1 (1.3)	–	3.6 (1.5)	3.2 (1.3)	–
Activity	Lower body clothing	2.7 (0.9)	2.9 (0.8)	−0.2 (0.9)	2.6 (0.9)	2.8 (0.8)	−0.2 (1.0)	2.8 (0.9)	3.0 (0.8)	−0.2 (0.8)
	Bathing	2.9 (0.9)	2.8 (1.0)	0.1 (1.1)	2.8 (0.8)	2.7 (0.9)	0.1 (1.2)	3.0 (0.9)	2.9 (1.0)	0.0 (1.0)
	Toileting	3.3 (0.8)	3.3 (0.8)	0.0 (0.80)	3.2 (0.9)	3.2 (0.8)	0.0 (0.9)	3.4 (0.8)	3.3 (0.8)	0.1 (0.7)
	Upper body clothing	3.5 (0.8)	3.1 (0.9)	0.4 (1.0)^c	3.3 (0.8)	3.2 (0.9)	0.1 (1.1)	3.7 (0.7)	3.1 (0.9)	0.6 (0.8)^c
	Grooming	3.7 (0.6)	3.6 (0.6)	0.1 (0.7)	3.6 (0.6)	3.6 (0.7)	0.1 (0.8)	3.8 (0.4)	3.7 (0.5)	0.2 (0.7)
	Eating	3.9 (0.3)	3.6 (0.7)	0.3 (0.7)^c	3.9 (0.4)	3.7 (0.7)	0.3 (0.6)^c	4.0 (0.2)	3.6 (0.7)	0.3 (0.8)^c
	Total T-score	44.7 (8.1)	43.0 (7.8)	1.7 (8.4)	43.3 (8.0)	42.2 (7.1)	1.1 (8.8)	46.1 (8.0)	43.9 (8.5)	2.2 (8.0)
	SEM	3.6 (1.6)	3.3 (1.3)	–	3.4 (1.5)	3.1 (1.1)	–	3.8 (1.7)	3.6 (1.6)

^a

SEM = standard error of measurement

^b

Higher scores indicate better mobility or activity. T-score and SEM are derived using conversion tables in the AM-PAC Short Forms Manual 2.0.²⁶

^c

P < .05 based on paired t test or Wilcoxon signed-rank test.

The mean (SD) difference in total T-scores between participant and therapist was also statistically significant whether participants’ self-assessment was completed before the therapy evaluation (6.2 [9.4]; N = 38) or after (4.3 [7.8]; N = 32). Of the 6 items scored, the mean difference in item-level scores differed significantly between participants and therapists for 5 and 4 items in the before- and after-evaluation groups, respectively.

In the full sample, the ICC for the “6-Clicks” mobility total T-score was 0.57 (95% CI = 0.42–0.69), indicative of moderate reliability. However, reliability was higher when the participant completed their self-assessment after the therapy evaluation (ICC = 0.67, 95% CI = 0.47–0.80) compared with before the therapy evaluation (ICC = 0.50, 95% CI = 0.26–0.67) (Tab. 3). The Bland–Altman plot (Figure 1) indicates that agreement was most variable for total scores in the mid-range.

Table 3

Reliability Between Patient and Therapist “6-Clicks” Total T-Scores and Stratified by Timing^a

	Total Sample	Patients Completed Before Service	Patients Completed After Service
Mobility scores
ICC (95% CI)	0.57 (0.42–0.69)	0.50 (0.26–0.67)	0.67 (0.47–0.80)
Activity scores
ICC (95% CI)	0.45 (0.28–0.59)	0.34 (0.06–0.56)	0.52 (0.29–0.70)

	Total Sample	Patients Completed Before Service	Patients Completed After Service
Mobility scores
ICC (95% CI)	0.57 (0.42–0.69)	0.50 (0.26–0.67)	0.67 (0.47–0.80)
Activity scores
ICC (95% CI)	0.45 (0.28–0.59)	0.34 (0.06–0.56)	0.52 (0.29–0.70)

^a

ICC = intraclass correlation coefficient ([2,1], 2-way random effects, consistency, single rater).

Table 3

Open in new tab Download slide

Reliability Between Patient and Therapist “6-Clicks” Total T-Scores and Stratified by Timing^a

	Total Sample	Patients Completed Before Service	Patients Completed After Service
Mobility scores
ICC (95% CI)	0.57 (0.42–0.69)	0.50 (0.26–0.67)	0.67 (0.47–0.80)
Activity scores
ICC (95% CI)	0.45 (0.28–0.59)	0.34 (0.06–0.56)	0.52 (0.29–0.70)

	Total Sample	Patients Completed Before Service	Patients Completed After Service
Mobility scores
ICC (95% CI)	0.57 (0.42–0.69)	0.50 (0.26–0.67)	0.67 (0.47–0.80)
Activity scores
ICC (95% CI)	0.45 (0.28–0.59)	0.34 (0.06–0.56)	0.52 (0.29–0.70)

^a

ICC = intraclass correlation coefficient ([2,1], 2-way random effects, consistency, single rater).

Figure 1

Bland–Altman plot of “6-Clicks” mobility total T-score agreement between patient and physical therapist. x axis = average physical therapist and patient T-score; y axis = difference between physical therapist and patient T-score. Horizontal reference lines indicate the 95% limits of agreement.

For each individual “6-Clicks” mobility item, the quadratic weighted kappa (κ) ranged from 0.18 to 0.39 in the full sample. As with the total score, agreement tended to be higher at the item level in the after-evaluation group (range of κ = 0.24 to κ = 0.53) compared with the before-evaluation group (range of κ = 0.14 to κ = 0.28) (Tab. 4).

Table 4

Agreement Between Patient and Therapist Mobility and Activity Items and Stratified by Timing^a

“6-Clicks” Measure	Items	Total Sample κ (95% CI)	Patient Completed Before Service κ (95% CI)	Patient Completed After Service κ (95% CI)
Mobility	Rolling	0.18 (0.08 to 0.28)	0.14 (0.03 to 0.26)	0.24 (0.06 to 0.41)
	Supine to sit	0.25 (0.12 to 0.39)	0.14 (0.00 to 0.29)	0.45 (0.23 to 0.67)
	Moving to chair	0.33 (0.18 to 0.47)	0.23 (0.01 to 0.46)	0.41 (0.22 to 0.60)
	Sit to stand	0.39 (0.22 to 0.55)	0.28 (0.01 to 0.56)	0.49 (0.30 to 0.68)
	Walking in room	0.33 (0.17 to 0.49)	0.21 (−0.02 to 0.44)	0.46 (0.24 to 0.67)
	Climbing stairs	0.39 (0.24 to 0.54)	0.26 (0.06 to 0.47)	0.53 (0.33 to 0.72)
Activity	Lower body clothing	0.38 (0.21 to 0.55)	0.26 (0.02 to 0.50)	0.48 (0.25 to 0.71)
	Bathing	0.26 (0.04 to 0.47)	−0.11 (−0.43 to 0.22)	0.51 (0.28 to 0.73)
	Toileting	0.53 (0.37 to 0.69)	0.44 (0.17 to 0.70)	0.64 (0.46 to 0.81)
	Upper body clothing	0.28 (0.05 to 0.50)	0.21 (−0.12 to 0.53)	0.36 (0.07 to 0.66)
	Grooming	0.20 (−0.07 to 0.47)	0.24 (−0.14 to 0.62)	0.11 (−0.20 to 0.42)
	Eating	0.19 (−0.16 to 0.54)	0.40 (−0.10 to 0.90)	−0.03 (−0.09 to 0.03)

“6-Clicks” Measure	Items	Total Sample κ (95% CI)	Patient Completed Before Service κ (95% CI)	Patient Completed After Service κ (95% CI)
Mobility	Rolling	0.18 (0.08 to 0.28)	0.14 (0.03 to 0.26)	0.24 (0.06 to 0.41)
	Supine to sit	0.25 (0.12 to 0.39)	0.14 (0.00 to 0.29)	0.45 (0.23 to 0.67)
	Moving to chair	0.33 (0.18 to 0.47)	0.23 (0.01 to 0.46)	0.41 (0.22 to 0.60)
	Sit to stand	0.39 (0.22 to 0.55)	0.28 (0.01 to 0.56)	0.49 (0.30 to 0.68)
	Walking in room	0.33 (0.17 to 0.49)	0.21 (−0.02 to 0.44)	0.46 (0.24 to 0.67)
	Climbing stairs	0.39 (0.24 to 0.54)	0.26 (0.06 to 0.47)	0.53 (0.33 to 0.72)
Activity	Lower body clothing	0.38 (0.21 to 0.55)	0.26 (0.02 to 0.50)	0.48 (0.25 to 0.71)
	Bathing	0.26 (0.04 to 0.47)	−0.11 (−0.43 to 0.22)	0.51 (0.28 to 0.73)
	Toileting	0.53 (0.37 to 0.69)	0.44 (0.17 to 0.70)	0.64 (0.46 to 0.81)
	Upper body clothing	0.28 (0.05 to 0.50)	0.21 (−0.12 to 0.53)	0.36 (0.07 to 0.66)
	Grooming	0.20 (−0.07 to 0.47)	0.24 (−0.14 to 0.62)	0.11 (−0.20 to 0.42)
	Eating	0.19 (−0.16 to 0.54)	0.40 (−0.10 to 0.90)	−0.03 (−0.09 to 0.03)

^a

κ = Quadratic weighted kappa.

Table 4

Open in new tab Download slide

Agreement Between Patient and Therapist Mobility and Activity Items and Stratified by Timing^a

“6-Clicks” Measure	Items	Total Sample κ (95% CI)	Patient Completed Before Service κ (95% CI)	Patient Completed After Service κ (95% CI)
Mobility	Rolling	0.18 (0.08 to 0.28)	0.14 (0.03 to 0.26)	0.24 (0.06 to 0.41)
	Supine to sit	0.25 (0.12 to 0.39)	0.14 (0.00 to 0.29)	0.45 (0.23 to 0.67)
	Moving to chair	0.33 (0.18 to 0.47)	0.23 (0.01 to 0.46)	0.41 (0.22 to 0.60)
	Sit to stand	0.39 (0.22 to 0.55)	0.28 (0.01 to 0.56)	0.49 (0.30 to 0.68)
	Walking in room	0.33 (0.17 to 0.49)	0.21 (−0.02 to 0.44)	0.46 (0.24 to 0.67)
	Climbing stairs	0.39 (0.24 to 0.54)	0.26 (0.06 to 0.47)	0.53 (0.33 to 0.72)
Activity	Lower body clothing	0.38 (0.21 to 0.55)	0.26 (0.02 to 0.50)	0.48 (0.25 to 0.71)
	Bathing	0.26 (0.04 to 0.47)	−0.11 (−0.43 to 0.22)	0.51 (0.28 to 0.73)
	Toileting	0.53 (0.37 to 0.69)	0.44 (0.17 to 0.70)	0.64 (0.46 to 0.81)
	Upper body clothing	0.28 (0.05 to 0.50)	0.21 (−0.12 to 0.53)	0.36 (0.07 to 0.66)
	Grooming	0.20 (−0.07 to 0.47)	0.24 (−0.14 to 0.62)	0.11 (−0.20 to 0.42)
	Eating	0.19 (−0.16 to 0.54)	0.40 (−0.10 to 0.90)	−0.03 (−0.09 to 0.03)

“6-Clicks” Measure	Items	Total Sample κ (95% CI)	Patient Completed Before Service κ (95% CI)	Patient Completed After Service κ (95% CI)
Mobility	Rolling	0.18 (0.08 to 0.28)	0.14 (0.03 to 0.26)	0.24 (0.06 to 0.41)
	Supine to sit	0.25 (0.12 to 0.39)	0.14 (0.00 to 0.29)	0.45 (0.23 to 0.67)
	Moving to chair	0.33 (0.18 to 0.47)	0.23 (0.01 to 0.46)	0.41 (0.22 to 0.60)
	Sit to stand	0.39 (0.22 to 0.55)	0.28 (0.01 to 0.56)	0.49 (0.30 to 0.68)
	Walking in room	0.33 (0.17 to 0.49)	0.21 (−0.02 to 0.44)	0.46 (0.24 to 0.67)
	Climbing stairs	0.39 (0.24 to 0.54)	0.26 (0.06 to 0.47)	0.53 (0.33 to 0.72)
Activity	Lower body clothing	0.38 (0.21 to 0.55)	0.26 (0.02 to 0.50)	0.48 (0.25 to 0.71)
	Bathing	0.26 (0.04 to 0.47)	−0.11 (−0.43 to 0.22)	0.51 (0.28 to 0.73)
	Toileting	0.53 (0.37 to 0.69)	0.44 (0.17 to 0.70)	0.64 (0.46 to 0.81)
	Upper body clothing	0.28 (0.05 to 0.50)	0.21 (−0.12 to 0.53)	0.36 (0.07 to 0.66)
	Grooming	0.20 (−0.07 to 0.47)	0.24 (−0.14 to 0.62)	0.11 (−0.20 to 0.42)
	Eating	0.19 (−0.16 to 0.54)	0.40 (−0.10 to 0.90)	−0.03 (−0.09 to 0.03)

^a

κ = Quadratic weighted kappa.

Reliability of “6-Clicks” Activity

For the “6-Clicks” activity short form, the mean (SD) total T-score determined by the occupational therapist for the entire sample (N = 71) was 44.7 (8.1), whereas the mean (SD) total T-score determined by participants was 43.0 (7.8), a mean difference of 1.7 (8.4) points, which was not statistically significant (P = .10) (Tab. 2). Similarly, no significant difference was observed for the mean [SD] difference in total T-scores between participant and therapist when participants’ self-assessment was completed before the therapy evaluation (1.1 [8.8]; N = 35) or after (2.2 [8.0]; N = 36). Of the 6 items scored, the mean difference in item-level scores differed significantly between participants and therapists for 1 and 2 items in the before- and after-evaluation groups, respectively.

For the “6-Clicks” activity short form, the overall ICC for the total score in the full sample was 0.45 (95% CI = 0.28–0.59), indicative of moderate reliability. However, reliability between participant and occupational therapist scores increased when the participant completed their self-assessment after the therapy evaluation (ICC = 0.52, 95% CI = 0.29–0.70) compared with before the therapy evaluation (ICC = 0.34, 95% CI = 0.06–0.56) (Tab. 3). The agreement in scores for the “6-Clicks” activity was most variable in the mid-range to highest of the total T-scores (Fig. 2).

Figure 2

Bland–Altman plot of “6-Clicks” activity total T-score agreement between patient and occupational therapist. x axis = average occupational therapist and patient T-score; y axis = difference between occupational therapist and patient T-score. Horizontal reference lines indicate the 95% limits of agreement.

For the total sample, agreement for each individual “6-Clicks” activity item ranged from κ = 0.19 to κ = 0.53. For the before-evaluation group, item-level agreement ranged from κ = −0.11 to κ = 0.44, and for the after-evaluation group it ranged from κ = −0.03 to κ = 0.64 (Tab. 4).

Discussion

In this prospective study of the interrater reliability of the AM-PAC “6-Clicks” basic mobility and daily activity short forms between participants and therapists, we found moderate reliability. Reliability and agreement were higher between participants and physical therapists using the mobility assessment than between participants and occupational therapists using the activity assessment. For both tools, participants’ and therapists’ scores agreed more when participants completed their self-assessment following the therapy evaluation compared with before.

Despite lower reliability for the activity short form compared with the mobility short form, we did observe smaller mean differences in scores between participants and occupational therapists using the activity short form than between participants and physical therapists using the mobility short form. The reason for this discrepancy may be the greater variability in mobility scores than activity scores. ICCs are lower if between-participant variability is low. Mobility scores were observed across the continuum of the mobility scale, whereas activity scores were mostly observed near the top of the score range. The standard error of measurement is larger at either end of the continuum and has greater precision in the middle of the score range.²⁰ Because activity scores were more likely to be higher, their measurement error was also higher, which could be 1 explanation for the lower ICC demonstrated for activity versus mobility scores.

The level of reliability observed in our study, although moderate, is lower than observed for previous studies of the reliability of the AM-PAC when scored by a clinical proxy. Haley et al,²⁵ using what was at the time of their study the full item banks for the AM-PAC basic mobility (21 items) and daily activity (29 items) functional domains, estimated ICCs of 0.91 and 0.82, respectively, for assessments completed by clinicians and 31 participants in inpatient rehabilitation or transitional care settings. Jette et al,²⁴ in a study of the computer adaptive test version of the AM-PAC that included 67 participants with stroke who were admitted to inpatient rehabilitation facilities, estimated ICCs of 0.72 and 0.63 for the mobility and activity domains, respectively. The ICCs we estimate in this study (0.57 for the mobility domain and 0.45 for the activity domain) may be lower for several reasons. First, we calculated reliability using 2-way random effects (ICC[2,1]), which generalizes the 2 raters. The other studies do not specify the ICC formula, which could substantially affect the magnitude of reliability. If we had chosen other formulas to calculate ICC, our reliability would increase to 0.73 for mobility and 0.62 for activity scores, like those reported in the study from Jette et al.²⁴ Second, we specifically used the “6-Clicks” short forms of the AM-PAC. These short forms each assess 6 distinct tasks, whereas the previous studies drew from more robust item banks to assess function and derive total functional scores. The ability to assess function using a broader bank of items may positively impact the likelihood of score reliability from 2 different raters. Third, ours is the first study, to our knowledge, to assess the proxy reliability of AM-PAC items scored in the acute care hospital, where illness and injury acuity is greater than for those samples with whom proxy reliability had been assessed in prior studies. This greater acuity likely contributes to greater uncertainty (particularly on the part of patients) in the perception of independence with functional tasks, which is underscored by the fact that participants’ scores were consistently lower than therapists’ scores. Lastly, the clinical heterogeneity of our overall sample, although purposeful to be representative of the standard utilization of the “6-Clicks” in practice, may have influenced the estimated level of reliability because this is likely to differ across clinical populations.

Two additional findings from our study highlight that uncertainty about perceived independence with functional tasks influences scoring. First, in the full sample and in the before- and after-evaluation subsamples, agreement was greater for scores from the mobility short form compared with the activity short form. The tasks scored on the mobility short form (eg, getting out of bed and walking) are often performed by individuals in the hospital, whereas the tasks scored on the activity short form (eg, getting dressed and grooming) are not performed as often by most patients. Having the opportunity to attempt the tasks that are scored likely influences perceptions of independence and subsequent scoring responses. Second, as a related observation, agreement between participants and therapists was higher when the participant completed their self-assessment after the therapy evaluation. Of note, the timing of participants’ self-assessment was randomized to before or after the evaluation, but therapists completed their scoring after the evaluation for all participants, which is consistent with standard practice. For participants who completed their assessments after evaluation, both the therapist and participant were able to observe participants’ performance with many of the tasks that are scored on the short forms, which likely contributed to more agreement.

Although moderate reliability may be acceptable, we expected higher reliability between participant and therapist scores than was observed in the study, particularly in the after-evaluation groups. In addition to the perceptions of independence potentially influencing scoring responses, reliability may be limited because of inconsistent understanding of the response meanings. Whereas the therapists in this study have all used the “6-Clicks” short forms with each of their patients and so have been trained to conceptualize the meaning of each of the levels of physical assistance assessed by the tools (eg, “a lot” of assistance means the patient can contribute only 50% effort or less toward the task), patients in this study were not equally trained. Thus, when asked, “How much help do you currently need to [complete a particular task]?” different patients may have considered an otherwise equivalent amount of assistance as, for example, “a lot” versus “a little.”

Importantly, the AM-PAC “6-Clicks” short forms were validated as therapist-measured tools.²⁰ With only moderate reliability between therapists and patients, clinicians should continue to score the “6-Clicks” in standard practice to appropriately apply the evidence-informed decision-making processes that have become associated with its score (eg, discharge disposition recommendations^6–10^,³² and setting mobility goals⁵^,³³). Further, whereas the AM-PAC was designed to measure patients’ functional status across the continuum of care (ie, observing functional status changes from the hospital to post-acute care settings to the community), we would caution against doing so without careful consideration to the mode of measurement. Measurement error was higher for the therapists than the participants, probably due to the therapists’ scores being indicative of higher functioning than indicated by the participant self-report. Thus, any change captured by AM-PAC scores at various time points and/or in different settings, if scored at one point by a clinician and at another by a patient, is likely to include variable measurement error in addition to biases introduced by having therapist versus patient respondents.

Limitations

Our study has important limitations to note. Individuals who were not appropriately alert and oriented were excluded. Many individuals in the hospital have cognitive impairments and/or delirium, which would affect their ability to complete a self-assessment of function. Excluding such persons limits the generalizability of our findings, even for those within the hospital setting, to only those whose cognition is grossly intact. Similarly, we did not include a formal assessment of cognition for participants who were included in the study so were unable to analyze how cognitive status may have influenced scores. Additionally, although the distribution of scores in our sample is representative of the general patient population in our hospital, the limited number of scores at the lower end of the “6-Clicks” score range for both the mobility and activity short forms may have adversely affected the estimation of reliability.

We demonstrate in this study that the interrater reliability of the AM-PAC “6-Clicks” mobility and activity short forms between therapists and patients is moderate, higher for the mobility short form than the activity short form. Taken with the previous evidence that these tools are valid for the assessment of functional status for patients in the hospital when scored by a clinician and that their interrater reliability between clinicians is substantial, our findings suggest that they are best used as clinician-scored instruments. Due to the variability demonstrated in this study, caution should be exercised when using repeated measures of the AM-PAC if the scores come from patients versus clinicians at separate time points. As the AM-PAC “6-Clicks” short forms continue to inform clinical decisions and to address research questions, particularly in the acute care setting, it is important to keep these considerations in mind.

Author Contributions

Concept/idea/research design: J.K. Johnson, B. Lapin, I. Katzan, M. Stilphen

Writing: J.K. Johnson, B. Lapin, F. Bethoux, A. Skolaris

Data collection: J.K. Johnson

Data analysis: B. Lapin

Project management: J.K. Johnson

Providing participants: J.K. Johnson

Providing facilities/equipment: M. Stilphen

Consultation (including review of manuscript before submitting): I. Katzan, M. Stilphen

Funding

There are no funders to report for this study.

Ethics Approval

This study was approved by the Cleveland Clinic Institutional Review Board (#19-1612).

Disclosures

The authors completed the ICMJE Form for Disclosure of Potential Conflicts of Interest and reported no conflicts of interest.

References

1.

Brown

CJ

,

Friedkin

RJ

,

Inouye

SK

.

Prevalence and outcomes of low mobility in hospitalized older patients

.

J Am Geriatr Soc

.

2004

;

52

:

1263

–

1270

.

2.

Wald

HL

,

Ramaswamy

R

,

Perskin

MH

, et al.

The case for mobility assessment in hospitalized older adults: American Geriatrics Society white paper executive summary

.

J Am Geriatr Soc

.

2019

;

67

:

11

–

16

.

3.

Probasco

JC

,

Lavezza

A

,

Cassell

A

, et al.

Choosing wisely together: physical and occupational therapy consultation for acute neurology inpatients

.

Neurohospitalist

.

2018

;

8

:

53

–

59

.

4.

Hoyer

EH

,

Young

DL

,

Klein

LM

, et al.

Toward a common language for measuring patient mobility in the hospital: reliability and construct validity of interprofessional mobility measures

.

Phys Ther

.

2018

;

98

:

133

–

142

.

5.

Workman

CA

,

Davies

CC

,

Ogle

KC

,

Arthur

C

,

Tussey

K

.

Evaluation of a multisite nurse-led mobility plan

.

JONA J Nurs Adm

.

2020

;

50

:

649

–

654

.

Crossref

6.

Jette

DU

,

Stilphen

M

,

Ranganathan

VK

,

Passek

SD

,

Frost

FS

,

Jette

AM

.

AM-PAC “6-clicks” functional assessment scores predict acute care hospital discharge destination

.

Phys Ther

.

2014

;

94

:

1252

–

1261

.

7.

Menendez

ME

,

Schumacher

CS

,

Ring

D

,

Freiberg

AA

,

Rubash

HE

,

Kwon

YM

.

Does “6-clicks” day 1 postoperative mobility score predict discharge disposition after total hip and knee arthroplasties?

J Arthroplast

.

2016

;

31

:

1916

–

1920

.

Crossref

8.

Covert

S

,

Johnson

JK

,

Stilphen

M

,

Passek

S

,

Thompson

NR

,

Katzan

I

.

Use of the activity measure for post-acute care “6 clicks” basic mobility inpatient short form and National Institutes of Health stroke scale to predict hospital discharge disposition after stroke

.

Phys Ther

.

2020

;

100

:

1423

–

1433

.

9.

Pfoh

ER

,

Hamilton

A

,

Hu

B

,

Stilphen

M

,

Rothberg

MB

.

The six-clicks mobility measure: a useful tool for predicting discharge disposition

.

Arch Phys Med Rehabil

.

2020

;

101

:

1199

–

1203

.

10.

Hoyer

EH

,

Young

DL

,

Friedman

LA

, et al.

Routine inpatient mobility assessment and hospital discharge planning

.

JAMA Intern Med

.

2019

;

179

:

118

–

120

.

11.

Fritz

S

,

Lusardi

M

.

White paper: “walking speed: the sixth vital sign”

.

J Geriatr Phys Ther.

2009

;

32

:

46

–

49

.

12.

Hoyer

EH

,

Needham

DM

,

Atanelov

L

,

Knox

B

,

Friedman

M

,

Brotman

DJ

.

Association of impaired functional status at hospital discharge and subsequent rehospitalization

.

J Hosp Med

.

2014

;

9

:

277

–

282

.

13.

Berian

JR

,

Mohanty

S

,

Ko

CY

,

Rosenthal

RA

,

Robinson

TN

.

Association of loss of independence with readmission and death after discharge in older patients after surgical procedures

.

JAMA Surg

.

2016

;

151

:

e161689

.

14.

Tonkikh

O

,

Shadmi

E

,

Flaks-Manov

N

,

Hoshen

M

,

Balicer

RD

,

Zisberg

A

.

Functional status before and during acute hospitalization and readmission risk identification

.

J Hosp Med.

2016

;

11

:

636

–

641

.

15.

Soley-Bori

M

,

Soria-Saucedo

R

,

Ryan

CM

, et al.

Functional status and hospital readmissions using the medical expenditure panel survey

.

J Gen Intern Med

.

2015

;

30

:

965

–

972

.

16.

Johnson

JK

,

Fritz

JM

,

Brooke

BS

, et al.

The association between patients’ physical function in the hospital and their outcomes in skilled nursing facilities

.

Phys Ther J

.

2019

;

19

:

5

–

16

.

17.

Johnson

JK

,

Fritz

JM

,

Brooke

BS

, et al.

Physical function in the hospital is associated with patient-centered outcomes in an inpatient rehabilitation facility

.

Phys Ther

.

2020

;

100

:

1237

–

1248

.

18.

Peetz

AB

,

Brat

GA

,

Rydingsward

J

, et al.

Functional status, age, and long-term survival after trauma

.

Surgery

.

2016

;

160

:

762

–

770

.

19.

Rydingsward

JE

,

Horkan

CM

,

Mogensen

KM

,

Quraishi

SA

,

Amrein

K

,

Christopher

KB

.

Functional status in ICU survivors and out of hospital outcomes: a cohort study

.

Crit Care Med

.

2016

;

44

:

869

–

879

.

20.

Jette

DU

,

Stilphen

M

,

Ranganathan

VK

,

Passek

SD

,

Frost

FS

,

Jette

AM

.

Validity of the AM-PAC “6-clicks” inpatient daily activity and basic mobility short forms

.

Phys Ther

.

2014

;

94

:

379

–

391

.

21.

Haley

SM

,

Coster

WJ

,

Andres

PL

, et al.

Activity outcome measurement for postacute care

.

Med Care

.

2004

;

42

:

I49

–

I61

.

22.

Haley

SM

,

Andres

PL

,

Coster

WJ

,

Kosinski

M

,

Ni

P

,

Jette

AM

.

Short-form activity measure for post-acute care

.

Arch Phys Med Rehabil

.

2004

;

85

:

649

–

660

.

23.

Jette

DU

,

Stilphen

M

,

Ranganathan

VK

,

Passek

S

,

Frost

FS

,

Jette

AM

.

Interrater reliability of AM-PAC “6-clicks” basic mobility and daily activity short forms

.

Phys Ther

.

2015

;

95

:

758

–

766

.

24.

Jette

AM

,

Ni

P

,

Rasch

EK

, et al.

Evaluation of patient and proxy responses on the activity measure for postacute care

.

Stroke

.

2012

;

43

:

824

–

829

.

25.

Haley

SM

,

Ni

P

,

Coster

WJ

,

Black-Schaffer

R

,

Siebens

H

,

Tao

W

.

Agreement in functional assessment: graphic approaches to displaying respondent effects

.

Am J Phys Med Rehabil

.

2006

;

85

:

747

–

755

.

26.

Jette

AM

,

Haley

SM

,

Coster

WJ

,

Ni

P

.

Activity measure for post-acute care short forms 2.0 instruction manual

.

2016

;

81

:

83

.

27.

Harris

PA

,

Taylor

R

,

Minor

BL

, et al.

The REDCap consortium: building an international community of software platform partners

.

J Biomed Inform

.

2019

;

95

:103208.

28.

Harris

PA

,

Taylor

R

,

Thielke

R

,

Payne

J

,

Gonzalez

N

,

Conde

JG

.

Research electronic data capture (REDCap)-a metadata-driven methodology and workflow process for providing translational research informatics support

.

J Biomed Inform

.

2009

;

42

:

377

–

381

.

29.

Shrout

PE

,

Fleiss

JL

.

Intraclass correlations: uses in assessing rater reliability

.

Psychol Bull

.

1979

;

86

:

420

–

428

.

30.

Koo

TK

,

Li

MY

.

A guideline of selecting and reporting intraclass correlation coefficients for reliability research

.

J Chiropr Med

.

2016

;

15

:

155

–

163

.

31.

Landis

JR

,

Koch

GG

.

The measurement of observer agreement for categorical data

.

Biometrics

.

1977

;

33

:

174

.