Psychometric properties of morning joint stiffness duration and severity measures in patients with moderately to severely active rheumatoid arthritis
Health and Quality of Life Outcomes volume 15, Article number: 239 (2017)
To assess the measurement properties of two single-item patient-reported outcome (PRO) measures that assessed the length of time (in minutes) and severity of morning joint stiffness (MJS) experienced each day.
Data from two Phase 3, randomized placebo-controlled (and active-controlled [RA-BEAM]), clinical studies assessing the safety and efficacy of baricitinib in adults with moderately to severely active rheumatoid arthritis (RA) were used to evaluate the psychometric properties of the Duration of MJS and Severity of MJS PROs.
Test-retest reliability of Duration of MJS and Severity of MJS was supported through large intraclass correlation coefficients among stable patients (coefficient range for both studies: 0.88 to 0.93). In support of construct validity, moderate correlations were evidenced between Duration of MJS and other related patient- and clinician-reported assessments of RA symptoms and patient functioning, whereas moderate-to-strong correlations were evidenced between these same patient- and clinician-reported assessments and Severity of MJS. Statistically significant differences between the median and mean values of Duration of MJS and Severity of MJS for differing categories of RA disease severity supported known-groups validity. Finally, large and statistically significant differences in change scores from Day 1 to Week 12 for patients defined as responders versus non-responders using the American College of Rheumatology 20 criteria supported the responsiveness of both PROs.
Duration of MJS and Severity of MJS PROs demonstrated reliability, validity, and responsiveness in adults with moderately to severely active RA, supporting the measurement of these key symptoms in clinical trials.
Rheumatoid arthritis (RA) is a systemic, inflammatory, autoimmune disease. The expression and outcomes of the disease vary, ranging from mild, limited disease to severe disease that is associated with progressive joint destruction, significantly compromised health-related quality of life (HRQOL), and reduced survival. Some clinical symptoms of RA, such as morning joint stiffness (MJS), may follow a circadian pattern and the rhythm of pro-inflammatory cytokines, such as interleukin-6 (IL-6). The increase in nocturnal anti-inflammatory cortisol seen in patients with RA is generally insufficient to suppress the ongoing joint inflammation, often resulting in joint stiffness in the morning [1,2,3].
Prior research illustrates the importance of early morning stiffness or difficulty moving joints for patients with RA. In a study of 916 patients with recent onset of RA (disease duration ≤24 months), Westhoff et al.  reported that for many patients, morning stiffness impacts their HRQOL, functional capability, and ability to continue to work. These findings are supported by recent survey research conducted by Mattila et al.  in patients with RA (duration >6 months) who experience impairment of morning function at least 3 times per week. Of the 534 working respondents, 47% indicated morning stiffness affected their work performance, and 33% indicated it led to a late arrival at work. Of the 224 retired participants, 159 (71%) stopped working earlier than their expected retirement age, with 64% of these participants giving RA-related morning stiffness as a reason for their retirement . Indeed, MJS is an important and clinically meaningful element of disease activity .
Patient-reported outcome (PRO) measures of symptoms like MJS are important tools to aid clinicians in treating patients with RA, facilitate doctor-patient communication to improve the quality of patient care, and contribute to better patient outcomes . Despite the importance of MJS to patients with RA [4, 6], the assessment of MJS is not currently a measured component in several recommended endpoints in clinical trials, such as the American College of Rheumatology (ACR), Disease Activity Score modified to include the 28 diarthrodial joint count (DAS28), Clinical Disease Activity Index (CDAI), or Simplified Disease Activity Index (SDAI).
To address this need, two daily electronic PRO diary items were created to assess both the duration and severity of MJS from the patient’s perspective. The content validity of these two items were supported through a targeted literature review, interviews with health-care providers, and qualitative concept elicitation and cognitive debriefing interviews with patients with RA . These interviews confirmed the relevance of duration and severity of MJS as symptoms of RA, as well as the appropriateness of terminology used to assess these symptoms.
This study reports the assessment of the psychometric properties (i.e., reliability, validity, and responsiveness) of two PRO items administered daily, the Duration of MJS and Severity of MJS, in patients with moderately to severely active RA who participated in two Phase 3 clinical trials for baricitinib RA-BEAM and RA-BUILD.
RA-BEAM (N = 1305) was a randomized, double-blind, double-dummy, placebo- and active-controlled, parallel-arm, 52-week study designed to assess improvements in disease activity, structural preservation, and PROs including physical function, safety, and tolerability with oral baricitinib 4-mg once daily in patients with RA who had inadequate responses to methotrexate. Full details regarding the primary efficacy and safety outcomes of this study have been reported previously . Briefly, patients were aged ≥18 years with active RA (≥6/68 tender and ≥6/66 swollen joints; serum high-sensitivity C-reactive protein [hsCRP] ≥6 mg/L). Comparisons were made to placebo and to adalimumab, a tumor necrosis factor (TNF)-α inhibitor and a standard-of-care biologic disease-modifying antirheumatic drug (DMARD) in this setting.
RA-BUILD (N = 684) was a randomized, double-blind, placebo-controlled, parallel-group 24-week study designed to assess improvements in disease activity, structural preservation, and PROs including physical function, safety, and, tolerability with oral baricitinib 2-mg and 4-mg once daily in patients with RA who were refractory to or intolerant of conventional synthetic DMARDs (csDMARDs). Full details regarding the primary efficacy and safety outcomes of this study have been reported previously . Briefly, patients were aged ≥18 years with active rheumatoid arthritis (≥6/68 tender and ≥6/66 swollen joints; hsCRP ≥3.6 mg/L [upper limit of normal 3.0 mg/L]) and an insufficient response (despite prior therapy) or intolerance to ≥1 csDMARDs. Comparisons were made to placebo.
For both studies, visits occurred on Weeks 0, 1, 2, 4, 8, 12, 14, 16, 20, and 24, and for RA-BEAM, the visits continued to Weeks 28, 32, 40, and 52. Both studies were conducted with informed consent, under institutional review board approval, and in accordance with the Declaration of Helsinki (ClinicalTrials.gov number NCT01710358 [RA-BEAM] and NCT01721057 [RA-BUILD]).
Instruments used in the psychometric analyses
Patient-reported outcomes (PROs)
Duration of morning joint stiffness, severity of morning joint stiffness, severity of worst tiredness, and severity of worst joint pain
Duration of MJS is a single-item PRO designed to capture information on self-reported length of time, in minutes, that a patient’s MJS lasted each day. Specifically, patients were asked: “Please indicate how long your morning joint stiffness lasted today,” and responded with the number of hours and minutes. Durations recorded as >12 h (720 min) were censored at 720 min.
Severity of MJS, Severity of Worst Tiredness, and Severity of Worst Joint Pain are all single-item PROs designed to capture the severity of MJS, worst tiredness, and worst joint pain experienced that day, respectively. Patient’s were asked to complete each at the end of their day. Specifically, for Severity of MJS, patients were asked: “Please rate the overall level of morning joint stiffness you had from the time you woke up today.” All three of these PROs are anchored at 0 and 10, where 0 represents “no joint stiffness,” “no tiredness,” or “no joint pain,” and 10 represents “joint stiffness as bad as you can imagine,” “tiredness as bad as you can imagine,” or “joint pain as bad as you can imagine,” respectively.
For RA-BEAM and RA-BUILD, all four PROs were assessed using an electronic diary on a daily basis through Week 12. The Day 1 assessment was the first assessment at the end of the patient’s day following the randomization visit (Week 0, Visit 2). The Week 1 assessment refers to the weekly average values from Days 2 to 8. Assessments at Weeks 2, 4, 8, and 12 refer to weekly average values of the 7 days prior to Weeks 2, 4, 8, and 12 visits, respectively. Recognizing that late-shift workers, individuals who work outside of the hours of 9 am until 5 pm, could not complete the electronic diary (at home) at the end of Day 1, the Day 2 assessment (if available) was used to impute missing Day 1 values so that more patients could be included in the psychometric analyses utilizing the Day 1 data.
Medical Outcomes Study 36-item Short Form Health Survey version 2 Acute (SF-36)
The SF-36 is a generic, 36-item PRO that measures general health status. The SF-36 includes eight domains of health status evaluated over the previous week: physical function, role limitations–physical, bodily pain, general health perceptions, vitality, social function, role limitations–emotional, and mental health. Two component scores, the Physical Component Score (PCS) and the Mental Component Score (MCS), are derived based on the 8 domain scores . Domain and component scores are derived using established formulas , with higher scores indicating better health status or functioning. Acceptable psychometric properties of this instrument have been demonstrated elsewhere [12, 13].
Health Assessment Questionnaire-Disability Index (HAQ-DI)
The HAQ-DI assesses patients’ physical function or disability over the past week. The HAQ-DI contains 24 questions that query the degree of difficulty a person has in accomplishing tasks in 8 functional areas (dressing, arising, eating, walking, hygiene, reaching, gripping, and activities). Responses in each functional area are scored from 0, indicating “no difficulty,” to 3, indicating “inability to perform a task” in that area. The HAQ-DI total score, ranging from 0 to 3 (higher values indicate worse functioning), is obtained by summing the highest score within each functional area and dividing by the number of functional areas answered . The reliability and validity of this instrument have been documented previously .
Quick Inventory of Depressive Symptomatology Self-Rated-16 (QIDS-SR16)
The QIDS-SR16 is a 16-item PRO intended to assess the existence and severity of symptoms of depression as listed in the American Psychiatric Association’s Diagnostic and Statistical Manual of Mental Disorders, 4th Edition (DSM-IV) . Patients were asked to consider each statement as it relates to the way they have felt for the past 7 days. There is a unique 4-point ordinal scale for each item, with scores ranging from 0 to 3 reflecting increasing depressive symptoms as the item score increases. The instrument measures 9 core symptom domains that are used to define a depressive episode: sad mood; concentration; self criticism; suicidal ideation; interest; energy/fatigue; sleep disturbance; decrease or increase in appetite or weight; and psychomotor agitation or retardation. The QIDS-SR16 total score is derived as the sum of the scores across the 9 scale domains. The psychometric properties, including reliability, validity, and sensitivity to change of this instrument have been demonstrated elsewhere .
Patient’s assessment of pain
Patient’s current pain was assessed at each study visit with the use of a 0- to 100-mm visual analogue scale (VAS), with higher scores indicating more severe pain.
Patient's Global Assessment of Disease Activity (PtGA)
The PtGA, assessed as their current disease activity, was assessed at each study visit and is recorded on a 0- to 100-mm VAS, with higher scores indicating more active RA.
Physician's Global Assessment of Disease Activity (PhGA)
The PhGA was assessed at each study visit and is recorded on a 0-to 100-mm VAS, with higher scores indicating or more active RA.
Clinical signs and symptoms measures
American College of Rheumatology 20 (ACR20)
An ACR20 response (i.e., a binary variable indicating achieving or not achieving a response) was measured at each study visit and is defined as at least a 20% improvement from baseline in both tender joint count (TJC) (0 to 68) and swollen joint count (SJC) (0 to 66), and in at least 3 of the following 5 assessments: patient’s assessment of pain, PtGA, PhGA, HAQ-DI, and high-sensitivity C-reactive protein.
Clinical Disease Activity Index (CDAI)
The CDAI is a tool for measurement of disease activity in RA that integrates measures of physical examination, patient self-assessment, and evaluator assessment . The CDAI was assessed at each study visit and is calculated by adding together scores from the following assessments: number of swollen joints (0 to 28), number of tender joints (0 to 28), PtGA on a VAS (0 to 10 cm), PhGA on a VAS (0 to 10 cm). Total scores are calculated using established formulas . Thresholds have been established for the CDAI (remission: 0.0 to ≤2.8; low disease activity: >2.8 to ≤10; moderate disease activity: >10 to ≤22; high disease activity: >22 to ≤76) .
Disease Activity Score (28 joints) (DAS28)
The DAS28 is a composite score that is based on a 28-joint count (both TJC 0 to 28 and SJC 0 to 28), hsCRP or erythrocyte sedimentation rate (ESR), and PtGA and was measured at each study visit. Total scores are calculated using established formulas . Patients can be categorized into 4 groups (remission: <2.6; low disease activity: ≥2.6 to ≤3.2; moderate disease activity: >3.2 to ≤5.1; high disease activity: >5.1).
The distribution of scores for the Duration of MJS and Severity of MJS PROs was assessed using descriptive statistics at Day 1, including mean (SD), median, range, and ceiling/floor effects.
Test-retest reliability was assessed in stable patients during the interval between Week 1 and 2 and again between Week 4 and 8. Stable patients were defined as those with ≤5 point difference  on the 0 to 100 PtGA between each assessment period. Intraclass correlation coefficients (ICCs) were calculated between the initial (Week 1 or 4) and retest (Week 2 or 8) scores to evaluate test-retest reliability. An ICC of ≥0.70 was considered good agreement .
Convergent and discriminant validity (construct validity)
Construct validity was assessed by Pearson correlations at Day 1 and Week 12 between the scores of Duration of MJS and Severity of MJS, and the scores of other clinical/PRO endpoints: Severity of Worst Tiredness, Severity of Worst Joint Pain, SF-36 domain and component scores, HAQ-DI, QIDS-SR16, patient’s assessment of pain, PtGA, TJC28, SJC28, PhGA, and hsCRP. Cohen’s conventions were used to interpret the absolute value of the correlation results, where a correlation >0.5 is large, 0.3 to 0.5 is moderate, 0.1 to <0.3 is small, and <0.1 is insubstantial .
It was hypothesized that moderate or large correlations supporting convergent validity would be demonstrated between Duration of MJS and Severity of MJS and the PRO instruments measuring concepts related to RA symptoms (SF-36 PCS, SF-36 Bodily Pain), their impact on functioning (SF-36 Social Functioning, SF-36 Vitality, SF-36 Physical Functioning, HAQ-DI), and clinician-reported/laboratory assessments of disease activity (TJC28, SJC28, PhGA, and hsCRP). Discriminant validity was assessed by Pearson correlations at Day 1 and at Week 12 between Duration of MJS, Severity of MJS, and PROs measuring distally related concepts (SF-36 MCS, SF-36 Role Emotional, QIDS-SR16) where small correlations were hypothesized.
Known-groups validity was evaluated using the Kruskal-Wallis test to distinguish median Duration of MJS and an analysis of variance (ANOVA) model to distinguish mean Severity of MJS between subgroups defined by the DAS28-ESR thresholds (<2.6; ≥2.6 and ≤3.2; >3.2 and ≤5.1; >5.1) measured at Day 1 and Week 4, and CDAI (0.0 to ≤2.8; >2.8 to ≤10; >10 to ≤22; and >22 to ≤76) measured at Day 1 and Week 4. The Scheffé adjustment was used for multiple comparisons. Subgroups were combined in instances of small sample sizes (i.e., <5% of the total sample size for the subgroup).
Due to anticipated skewness of Duration of MJS PRO, responsiveness was evaluated using a nonparametric randomization-based analysis of covariance (ANCOVA) methodology  to assess significant differences in median change in Duration of MJS from Day 1 to Week 12 between ACR20 responders and nonresponders at Week 12, controlling for Duration of MJS at Day 1. Responsiveness was evaluated using an ANCOVA methodology to assess significant differences in mean change in Severity of MJS. Responsiveness was also assessed using disease activity as measured by DAS28-hsCRP at Week 12, using the following subgroups: DAS28-hsCRP <2.6, DAS28-hsCRP ≥2.6 and DAS28-hsCRP ≤3.2, and DAS28-hsCRP >3.2. An overall statistically significant difference (p < 0.05) with statistically significant subgroup comparisons was hypothesized.
SAS® statistical software Version 9,4 (SAS Institute Inc., Cary, NC, USA) was used to conduct all analyses.
Baseline demographics for the total modified intent-to-treat population, patients with Day 1 diary scores, and patients with Week 12 diary scores are provided in Table 1. Baseline and Week 12 scores for Duration of MJS PRO, Severity of MJS PRO, and other assessments are presented in Table 2, while score distributions for Duration of MJS PRO and Severity of MJS PRO are found in Additional file 1: Table S1. As can be seen in Tables 1 and 2, there was a large amount of missing data at the Day 1 assessment period. This missing data was due to multiple reasons as shown in Additional file 1: Table S2, such as the diary device alarm not sounding until the following day or the diary device being given to the patient after Day 1. Sensitivity analyses with the imputation for missing data at Day 1 (n = 1041 for RA-BEAM and n = 563 for RA-BUILD, respectively) demonstrated similar comparisons to the results presented in this document.
For RA-BEAM and RA-BUILD, the ICCs for weekly mean Duration of MJS and Severity of MJS ranged from 0.88 to 0.91 from Week 1 to 2 (RA-BEAM n = 412; RA-BUILD n = 185) and from 0.90 to 0.93 from Week 4 to Week 8 (RA-BEAM n = 417; RA-BUILD n = 215). These values provide evidence for substantial agreement between assessment periods among stable patients.
Convergent and discriminant validity
Results supporting convergent validity of Duration of MJS and Severity of MJS are found in Table 3. For Duration of MJS, small-to-moderate associations were found between Duration of MJS and other assessments of RA symptoms, assessments of function, and clinician-reported assessments at Day 1 and Week 12. At Day 1 and Week 12, respectively, Duration of MJS was most strongly associated with Severity of MJS (RA-BEAM r = 0.36 and 0.45; RA-BUILD r = 0.35 and 0.52) and Severity of Worst Joint Pain (RA-BEAM r = 0.29 and 0.41; RA-BUILD r = 0.30 and 0.47), followed by the patient’s assessment of pain (RA-BEAM r = 0.24 and 0.38; RA-BUILD r = 0.23 and 0.39) and PtGA (RA-BEAM r = 0.19 and 0.36; RA_BUILD r = 0.28 and 0.37).
For Severity of MJS, moderate-to-large associations were found between Severity of MJS and other assessments of RA symptoms, assessments of function, and clinician-reported assessments at Day 1 and Week 12. At Day 1 and Week 12, respectively, Severity of MJS was most strongly associated with Severity of Worst Joint Pain (RA-BEAM r = 0.79 and 0.89; RA-BUILD r = 0.75 and 0.89), patient’s assessment of pain (RA-BEAM r = 0.59 and 0.78; RA-BUILD r = 0.54 and 0.72), and Severity of Worst Tiredness (RA-BEAM r = 0.66 and 0.79; RA-BUILD r = 0.52 and 0.77).
In contrast, smaller correlations between Duration and Severity of MJS with SF-36 MCS, SF-36 Role Emotional, and the QIDS-SR16 at Day 1 (r = −0.07 to 0.34) and Week 12 (r = −0.39 to 0.34) were observed, indicating that the SF-36 and QIDS-SR16 assessments measure more distally related constructs.
These findings demonstrate adequate convergent and discriminant validity, as well as the uniqueness of Duration of MJS and Severity of MJS compared to other PROs and clinical measures of RA in patients with moderately to severely active RA.
Due to small sample sizes in the lower DAS28-ESR subgroups at Day 1 (i.e., <5% of the sample in each score category) in RA-BEAM and RA-BUILD, patients were categorized into only 2 subgroups: ≤5.1 and >5.1 (Table 4). At Day 1 in both studies, patients with higher DAS28-ESR scores reported a significantly longer Duration of MJS measured in minutes (RA-BEAM: p = 0.028; RA-BUILD: p = 0.033) and significantly greater Severity of MJS (RA-BEAM: p = 0.001; RA-BUILD: p = 0.001) than those patients with lower DAS28-ESR scores. Similar findings were evident for both studies at Week 4.
Similar to the approach taken for the DAS28-ESR analyses, due to small sample sizes in the lower CDAI subgroups, patients were categorized into 2 subgroups based on the CDAI at Day 1 (0.0 to ≤22.0 and >22.0 to ≤76.0) and 3 subgroups at Week 4 (0.0 to ≤10.0, >10.0 to ≤22.0, and >22.0 to ≤76.0). At Day 1, patients with higher CDAI scores experienced a significantly longer Duration of MJS in RA-BUILD (median = 60.0 vs. 120.0, p = 0.016) and greater Severity of MJS in both RA-BEAM and RA-BUILD (RA-BEAM: mean = 4.7 vs. 5.9, p = 0.001; RA-BUILD: 3.7 vs. 5.9, p = 0.001) than those patients with a lower CDAI score (Table 4). However, at Week 4, the Duration of MJS was significantly longer and the Severity of MJS was significantly greater in patients with higher CDAI scores than in patients with low CDAI scores in both studies (all p = 0.001) (Table 5). All values increased linearly with higher CDAI scores, where all post-hoc comparisons using Scheffé adjustment were statistically significant (p = 0.001), except for the comparison of median Duration of MJS values between CDAI categories of 0.0 to ≤10.0 versus >10.0 to ≤22.0 in RA-BUILD. These findings indicate that the Duration of MJS and Severity of MJS PROs are able to distinguish between known groups based on disease severity.
As hypothesized, the responsiveness of the Duration of MJS and Severity of MJS PROs was supported by large and statistically significant differences in median and mean change in Duration of MJS and Severity of MJS, respectively, from Day 1 to Week 12 between patients defined as responders or nonresponders on the basis of the ACR20 at Week 12 (Table 6). For example, for Duration of MJS in RA-BEAM, median change from Day 1 to Week 12 was −35.1 for responders in comparison to −7.0 for nonresponders (p = 0.001). Similarly, for Severity of MJS in RA-BEAM, mean change was −3.2 compared to −1.1 for responders and nonresponders, respectively (p = 0.001).
Comparable findings were seen when using DAS28-hsCRP as an anchor, as pairwise comparisons assessing significant differences in median and mean change between DAS28-hsCRP groups of <2.6 versus ≥3.2 (p = 0.001 for all comparisons), and ≥2.6 and <3.2 versus ≥3.2 were statistically significant (p < 0.01 for all comparisons) (Table 6). However, the change scores comparisons between <2.6 versus ≥2.6 and <3.2 were not statistically significant for Duration of MJS (RA-BEAM p = 0.809; RA-BUILD p = 0.221) or Severity of MJS (RA-BEAM p = 0.083; RA-BUILD p = 0.264).
The psychometric properties, including reliability, validity, and responsiveness, of Duration of MJS and Severity of MJS PROs were supported by the results from RA-BEAM and RA-BUILD. Specifically, test-retest reliability analyses revealed high levels of agreement in Duration of MJS and Severity of MJS values among patients considered stable across two assessment periods. In support of construct validity, a priori hypotheses of the relationships between Duration of MJS and Severity of MJS with other related PROs and clinical outcome measures were supported at Day 1 and Week 12 assessment periods. Known-groups validity was similarly supported as median and mean values of Duration of MJS and Severity of MJS were significantly different based on known-groups using the DAS28-ESR and CDAI. Finally, both measures were responsive to change from Day 1 to Week 12 when defining responders using the ACR20. Sensitivity to change was also supported when using the DAS28-hsCRP as an anchor, except for comparisons between the lowest disease activity groups.
However, although these symptoms have been demonstrated to be important to patients with RA, an assessment of these symptoms was excluded from the the core set of disease activity measures. Specifically, rheumatologists have frequently used morning stiffness as an indicator that patient medication changes are needed . However, in 1993, the core set of disease activity measures for use in RA clinical trials was revised. Studies with therapies available at the time (e.g., csDMARDs, auranofin, nonsteroidal anti-inflammatory drugs [NSAIDs]) suggested that MJS was not sensitive to change with treatment compared to control, and this lack of ability to discriminate treatment from control contributed to this symptom’s exclusion from the core set of measures .
Subsequent to its removal from the ACR core set in 1993, morning stiffness has been found to be an important determining clinical status indicator driving changes to DMARD therapy . Additional work by Boers et al.  further demonstrated that both duration and severity of morning stiffness are related to other measures of disease activity. The findings from the present study provide support that both the Duration of MJS and Severity of MJS PROs are not only associated with other measures of disease activity in RA but rather are measures that assess distinct symptom experiences by patients. Furthermore, these measures are sensitive to detect change over time in adults with moderately to severely active RA. However, future research is needed in order to determine the relationship of these instruments to patient outcomes over a long term period, and the usability of such instruments in a clinical setting. Given the advent of electronic PRO diaries, these instruments could be used in a clinical setting and collected daily, thus enhancing the dialogue between patients and care providers. Further, using applications on mobile phones, these instruments can be easily completed by patients to track and collect their daily symptoms.
Despite the strong findings related to the reliability, validity, and responsiveness of the Duration of MJS and Severity of MJS PROs, a key limitation to this investigation is the missing data at the Day 1 assessment. As noted above, the missing data were due to a variety of reasons (e.g., device alarm not sounding until the following day). However, sensitivity analyses were conducted after imputing missing Day 1 Duration of MJS and Severity of MJS PRO scores, and results for the evaluations of reliability, validity, and responsiveness of these two PROs remained the same. In addition, the evaluation of responsiveness was limited as the studies’ baseline assessment period consisted of only data at Day 1 rather than a full 7 days of data as was available at Week 12. Finally, all analyses were pooled across countries due to small sample sizes (<5 participants) within certain participating countries. Thus, future research is needed to assess the psychometric properties of both instruments within each country.
The findings from the current study demonstrate that the Duration of MJS and Severity of MJS items are unique PROs fit for purpose of measuring two key symptoms of MJS in adult patients with moderately to severely active RA in clinical trials.
American College of Rheumatology
20% improvement in American College of Rheumatology criteria
Analysis of covariance
Analysis of variance
Clinical Disease Activity Index
Conventional synthetic DMARD
Disease Activity Score modified to include the 28 diarthroidal joint count
Disease-modifying antirheumatic drug
American Psychiatric Association’s Diagnostic and Statistical Manual of Mental Disorders, 4th Edition
Erythrocyte sedimentation rate
Health Assessment Questionnaire-Disability Index
Health-related quality of life
High-sensitivity C-reactive protein
Intraclass correlation coefficient
Mental Component Score
Morning Joint Stiffness
Nonsteroidal anti-inflammatory drug
Physical Component Score
Physician’s Global Assessment of Disease Activity
Patient’s Global Assessment of Disease Activity
- QIDS-SR16 :
Quick Inventory of Depressive Symptomatology Self-Rated-16
Simplified Disease Activity Index
Medical Outcomes Study 36-Item Short Form Health Survey Version 2 Acute
Swollen joint count
Tender joint count
Tumor necrosis factor
Visual analogue scale
Straub RH, Cutolo M. Circadian rhythms in rheumatoid arthritis: implications for pathophysiology and therapeutic management. Arthritis Rheum. 2007;56:399–408.
Cutolo M, Otsa K, Aakre O, Sulli A. Nocturnal hormones and clinical rhythms in rheumatoid arthritis. Ann N Y Acad Sci. 2005;1051:372–81.
Gibbs JE, Ray DW. The role of the circadian clock in rheumatoid arthritis. Arthritis Res Ther. 2013;15:205.
Westhoff G, Buttgereit F, Gromnica-Ihle E, Zink A. Morning stiffness and its influence on early retirement in patients with recent onset rheumatoid arthritis. Rheumatology (Oxford). 2008;47:980–4.
Mattila K, Buttgereit F, Tuominen R. Impact of morning stiffness on working behaviour and performance in people with rheumatoid arthritis. Rheumatol Int. 2014;34:1751–8.
Sierakowski S, Cutolo M. Morning symptoms in rheumatoid arthritis: a defining characteristic and marker of active disease. Scand J Rheumatol Suppl. 2011;125:1–5.
Sokka T. Morning stiffness and other patient-reported outcomes of rheumatoid arthritis in clinical practice. Scand J Rheumatol Suppl. 2011;125:23–7.
DeLozier AM, Gaich CL, Vernon MK, von Maltzahn R. Content validity evaluation of a new diary developed to evaluate symptoms important to patients with moderate-to-severe rheumatoid arthritis. Philadelphia: ISPOR 20th Annual International Meeting; May 16–20; 2015.
Taylor PC, Keystone EC, van der Heijde D, Weinblatt ME, Del Carmen ML, Reyes Gonzaga J, Yakushin S, Ishii T, Emoto K, Beattie S, et al. Baricitinib versus placebo or Adalimumab in rheumatoid arthritis. N Engl J Med. 2017;376:652–62.
Dougados M, van der Heijde D, Chen YC, Greenwald M, Drescher E, Liu J, Beattie S, Witt S, de la Torre I, Gaich C, et al. Baricitinib in patients with inadequate response or intolerance to conventional synthetic DMARDs: results from the RA-BUILD study. Ann Rheum Dis. 2017;76:88–95.
Ware JE Jr, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30:473–83.
McHorney CA, Ware JE Jr, Lu JF, Sherbourne CD. The MOS 36-item short-form health survey (SF-36): III. Tests of data quality, scaling assumptions, and reliability across diverse patient groups. Med Care. 1994;32:40–66.
Tsai C, Bayliss MS, Ware JE. SF-36 health survey annotated bibliography: second edition (1988–1996). Boston: Health Assessment Lab, New England Medical Center; 1997.
Fries JF, Spitz PW, Young DY. The dimensions of health outcomes: the health assessment questionnaire, disability and pain scales. J Rheumatol. 1982;9:789–93.
Cuperus N, Mahler EA, Vliet Vlieland TP, Hoogeboom TJ, van den Ende CH. Measurement properties of the health assessment questionnaire disability index for generalized osteoarthritis. Rheumatology (Oxford). 2015;54:821–6.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (DSM-IV). 4th ed. Washington: American Psychiatric Association; 1994.
Rush AJ, Trivedi MH, Ibrahim HM, Carmody TJ, Arnow B, Klein DN, Markowitz JC, Ninan PT, Kornstein S, Manber R, et al. The 16-item quick inventory of depressive Symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry. 2003;54:573–83.
Aletaha D, Smolen J. The simplified disease activity index (SDAI) and the clinical disease activity index (CDAI): a review of their usefulness and validity in rheumatoid arthritis. Clin Exp Rheumatol. 2005;23:S100–8.
Felson DT, Smolen JS, Wells G, Zhang B, van Tuyl LH, Funovits J, Aletaha D, Allaart CF, Bathon J, Bombardieri S, et al. American College of Rheumatology/European league against rheumatism provisional definition of remission in rheumatoid arthritis for clinical trials. Ann Rheum Dis. 2011;70:404–13.
Fransen J, Stucki G, van Riel PLCM. Rheumatoid arthritis measures: disease activity score (DAS), disease activity Score-28 (DAS28), rapid assessment of disease activity in rheumatology (RADAR), and rheumatoid arthritis disease activity index (RADAI). Arthritis Care Res. 2003;49:S214–24.
DeLoach LJ, Higgins MS, Caplan AB, Stiff JL. The visual analog scale in the immediate postoperative period: intrasubject variability and correlation with a numeric scale. Anesth Analg. 1998;86:102–6.
Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York: McGraw-Hill; 1994.
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale: Lawrence Erlbaum Associates; 1988.
Koch GG, Tangen CM, Jung JW, Amara IA. Issues for covariance analysis of dichotomous and ordered categorical data from randomized clinical trials and non-parametric strategies for addressing them. Stat Med. 1998;17:1863–92.
Kirwan JR, Chaput de Saintonge DM, Joyce CR, Currey HL. Clinical judgment in rheumatoid arthritis. III. British rheumatologists’ judgments of ‘change in response to therapy’. Ann Rheum Dis. 1984;43:686–94.
Felson DT, Anderson JJ, Boers M, Bombardier C, Chernoff M, Fried B, Furst D, Goldsmith C, Kieszak S, Lightfoot R, et al. The American College of Rheumatology preliminary core set of disease activity measures for rheumatoid arthritis clinical trials. The committee on outcome measures in rheumatoid arthritis clinical trials. Arthritis Rheum. 1993;36:729–40.
Soubrier M, Zerkak D, Gossec L, Ayral X, Roux C, Dougados M. Which variables best predict change in rheumatoid arthritis therapy in daily clinical practice? J Rheumatol. 2006;33:1243–6.
Boers M, Buttgereit F, Saag K, Alten R, Grahn A, Storey D, Rice P, Kirwan J. What is the relationship between morning symptoms and measures of disease activity in patents with rheumatoid arthritis. Arthrit Care Res. 2015;67:1202–9.
This study was funded by Eli Lilly and Company and Incyte.
Availability of data and materials
Lilly provides access to the individual patient data from studies on approved medicines and indications as defined by the sponsor specific information on ClinicalStudyDataRequest.com.
This access is provided in a timely fashion after the primary publication is accepted. Researchers need to have an approved research proposal submitted through clinicalstudydatarequest.com. Access to the data will be provided in a secure data sharing environment after signing a data sharing agreement.
Ethics approval and consent to participate
The study was approved by the Copernicus Group IRB in Research Triangle Park, NC (IRB Tracking #: PAR1–09-143). Ethics approval was also obtained for all 41 sites. All patients provided written informed consent.
Consent for publication
EDB and KWW were employees of Evidera, and received financial support from Eli Lilly in connection with the development of the manuscript. AMD, CL, CLG, TR, and RH are all full-time employees of Eli Lilly and Company. TR is a minor share holder of Eli Lilly and Company.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Bacci, E.D., DeLozier, A.M., Lin, CY. et al. Psychometric properties of morning joint stiffness duration and severity measures in patients with moderately to severely active rheumatoid arthritis. Health Qual Life Outcomes 15, 239 (2017). https://doi.org/10.1186/s12955-017-0813-7