Responsiveness of PROMIS and Patient Health Questionnaire (PHQ) Depression Scales in three clinical trials

Kroenke, Kurt; Stump, Timothy E.; Chen, Chen X.; Kean, Jacob; Damush, Teresa M.; Bair, Matthew J.; Krebs, Erin E.; Monahan, Patrick O.

doi:10.1186/s12955-021-01674-3

Health and Quality of Life Outcomes

Table 2 Responsiveness of depression measures by prospective global rating of change for mood

From: Responsiveness of PROMIS and Patient Health Questionnaire (PHQ) Depression Scales in three clinical trials

Depression change	Mean SRM	CAMEO				SPACE				SSM
Depression change	Mean SRM	Score change*	SRM^†	(95% CI)	P^‡	Score change*	SRM^†	(95% CI)	P^‡	Score change*	SRM^†	(95% CI)	P^‡
PROMIS 4-item					.008				< .0001				< .0001
Better	.54	3.58	.48	(.23, .75)	.019	5.12	.59	(.38, .81)	< .0001	4.81	.56	(.32, .82)	.005
Same	.10	− 0.17	− .02	(− .27, .25)	–	− 0.14	− .02	(− .21, .17)	–	1.41	.27	(.10, .44)	–
Worse	− .27	− 1.09	− .12	(− .55, .32)	.87	− 3.38	− .55	(− .95, − .19)	.122	− 1.55	− .16		.072
PROMIS 6-item					.008				< .0001				< .0001
Better	.58	4.13	.54	(.29, .80)	.039	5.85	.64	(.42, .87)	< .0001	5.17	.55	(.33, .79)	.003
Same	.13	0.72	.13	(− .14, .41)	–	0.16	.03	(− .16, .21)	–	1.35	.23	(.06, .41)	–
Worse	− .28	− 1.03	− .11	(− .54, .33)	.61	− 2.95	− .52	(− .94, − .12)	.186	− 1.94	− .21	(− .53, .12)	.055
PROMIS 8-item					.007				< .0001				< .0001
Better	.59	4.22	.56	(.31, .83)	.037	5.93	.65	(.43, .88)	< .0001	5.16	.55	(.33, .79)	.002
Same	.14	0.76	.13	(− .13, .39)	–	0.19	.03	(− .15, .21)	–	1.39	.26	(.09, .44)	–
Worse	− .27	− 1.00	− .11	(− .54, .34)	.61	− 3.12	− .53	(− .95, − .14)	.141	− 1.56	− .17	(− .48, .16)	.082
PROMIS SF					.009				< .0001				< .0001
Better	.60	4.22	.56	(.31, .84)	.077	6.51	.70	(.47, .95)	< .0001	5.26	.54	(.31, .79)	.003
Same	.16	1.10	.17	(− .09, .44)	–	0.45	.08	(− .11, .26)	–	1.33	.22	(.05, .39)	–
Worse	− .33	− 1.35	− .14	(− .58, .30)	.40	− 3.57	− .65	(− 1.2, − .19)	.060	− 1.94	− .20	(− .51, .12)	.068
PROMIS average
Better	.58		.54				.65				.55
Same	.13		.10				.03				.25
Worse	− .29		− .12				− .56				− .19
PHQ-9					.002				< .0001				.001
Better	.63	3.56	.71	(.43, 1.0)	.026	2.52	.67	(.48, .87)	.002	3.23	.51	(.27, .76)	.014
Same	.24	1.07	.25	(− .02, .56)	–	0.69	.20	(.02, .40)	–	1.13	.26	(.09, .43)	–
Worse	− .18	− 0.73	− .13	(− .60, .31)	.34	− 1.45	− .33	(− .84, .10)	.039	− 0.30	− .07	(− .42, .26)	.28
PHQ-2					.163				< .0001				.007
Better	.53	0.78	.50	(.25, .75)	.44	0.93	.71	(.53, .89)	< .0001	0.81	.39	(.14, .66)	.045
Same	.16	0.39	.25	(.− .01, .51)	–	0.12	.11	(− .08, .33)	–	0.19	.13	(− .04, .30)	–
Worse	− .24	0.00	− .01	(− .49, .41)	.63	− 0.48	− .33	(− .97, .14)	.143	− 0.24	− .12	(− .42, .22)	.38
SF-36 Mental					< .0001
Better		15.18	.71	(.50, .94)	< .0001
Same		− 0.22	− .02	(− .28, .25)	–
Worse		− 9.55	− .69	(− 1.5, − .17)	.076

Total N (better, same, worse) with baseline and follow-up data in CAMEO = 135 (55, 58, 22); in SPACE = 223 (87, 114, 22); and in SSM = 239 (70, 131, 38)
*Score change = baseline—follow-up (positive score indicates improvement, and negative score indicates worsening)
^†SRM = (baseline − follow-up)/SD change score;
^‡Bolded p-values are from omnibus ANOVA tests comparing change scores among the three groups. Other p values were derived from pairwise comparisons of change scores between better vs. same, same vs. worse, and better vs. worse, and were adjusted for multiple comparisons using the Tukey–Kramer procedure. Since all better vs. worse pairwise comparisons were significant when the omnibus test was significant, only better vs. same and same vs. worse p-values are reported in this table

Back to article page

ISSN: 1477-7525

Contact us

Submission enquiries: journalsubmissions@springernature.com