Skip to main content

Dynamic weight-bearing assessment of pain in knee osteoarthritis: construct validity, responsiveness, and interpretability in a research setting

Abstract

Background

The Osteoarthritis Research Society International (OARSI) has suggested to asses pain after specific activities consistently in clinical trials on knee OA. The Dynamic weight-bearing Assessment of Pain (DAP) assesses pain during activity (30 s of performing repeated deep knee-bends from a standing position). The purpose of this study is to evaluate the construct validity, responsiveness, and interpretability of the DAP for knee osteoarthritis (OA).

Methods

One-hundred participants with knee OA were tested twice each with the DAP, the Knee injury and Osteoarthritis Outcome Score (KOOS), six-minute-walk-test (6MWT), and 6-min-walk-test with subsequent pain rating (6MWTpain), and once with a transition questionnaire (TRANS-Q) for the patient-reported change in pain after 12 weeks of exercise. Construct validity (baseline-scores) and responsiveness (change-scores) were estimated by Spearman Correlation Coefficients. We hypothesized that no correlations would be excellent (<0.7) (divergent validity), except for the 6MWTpain (convergent validity). The TRANS-Q was used for interpreting the DAP change-scores in terms of responsiveness and Minimal Important Change (MIC).

Results

Divergent validity with the KOOS subscales (r = −0.31 to–0.45) and the 6MWT (r = −0.25) was supported. Convergent validity with the 6MWTpain was not supported (r = 0.54). The DAP change-scores corresponded to patient-reported change in pain (TRANS-Q), while correlations with change-scores on the other instruments were <0.35. The MIC was 2.4 DAP points.

Conclusions

The DAP possesses divergent validity compared to other instruments for knee OA, supporting the potential for this new way of assessing pain directly during activity. Importantly, the DAP change-scores correspond to patient-reported changes in pain, showing responsiveness. A change of 2.4 or more can be interpreted as clinically relevant. The DAP is a promising alternative to using ‘pain on walking’ as a clinical trial inclusion criterion/outcome.

Background

The Dynamic weight-bearing Assessment of Pain (DAP) for knee osteoarthritis (OA) is a simple test designed to assess pain during performance of a specified activity in patients with knee OA [1, 2]. It is a simple performance test requiring repeated deep knee bends from a standing position for 30 s, with subsequent rating of pain on a 0–10 numeric rating scale (NRS), where 0 indicates ‘no pain’ and 10 ‘worst pain imaginable.’ The DAP follows current quality standards [3] to scientifically qualify a common clinical practice: asking patients about their pain during performance of a test. An elaborated description of the test has been published previously [2].

Application of the DAP is in line with The Osteoarthritis Research Society International (OARSI) suggestion for clinical trials on knee OA; "Pain can be assessed after specific activities (e.g., a walk test). If a pain assessment occurs after an activity then the study team should ensure consistency throughout the trial with the type and duration of activity as well as the timing of the pain assessment after the activity" [4]. This consistency is best obtained through a standardized test such as the DAP.

The DAP was developed based on input both from patients (with knee OA) and health care professionals. Through focus groups, ‘weight-bearing deep knee bends’ was identified as a clinically important pain-provoking activity that would translate to a feasible instrument for ‘pain during activity’ by adding patient reports of pain [1]. The first version of the DAP had two scores: the pain rating, and the knee bend count. A study on reproducibility showed excellent properties of the pain score in a population of patients with knee OA (Intra-rater ICC = 0.93 [95 % CI 0.83 to 0.97], corresponding to a Standard Error of Measurement (SEM = of 0.70, and a Smallest Detectable Change (SDC) of 1.95; inter-rater ICC 0.91, [95 % CI 0.78 to 0.96], SEM 0.86, SDC 2.39), whereas the knee bend count did not reach acceptable level [2]. Consequently, the knee bend score was omitted and thus the second version of the DAP now consists of a single pain score. Simultaneously we clarified the construct assessed with the DAP to be ‘pain during activity’ instead of ‘the interaction between pain and function.’ The level of reliability is similar to that of pain ratings subsequent to other performance tests in patients with hip pain [5] and knee OA [6]. However, the validity, responsiveness, or interpretability of combined performance tests and pain ratings have not been evaluated.

We anticipate that the DAP assesses pain during performance of a specific weight-bearing activity in patients with knee OA — a construct that adds a new perspective to existing instruments. Our objectives were to evaluate the construct (divergent and convergent) validity, responsiveness, and interpretability of the DAP instrument.

Methods

Participants

This study was nested within an assessor- and participant-blinded randomized controlled trial comparing corticosteroid injection with placebo given 2 weeks prior to 12 weeks of supervised exercise in people with knee OA [7]. Inclusion criteria for the trial were: age 40 and over, symptomatic and radiologically verified diagnosed knee OA, ‘pain while walking on a flat surface’ of at least 4 on a 0–10 NRS, and a body mass index of 20 or more, but less than 35 kg/m2. Exclusion criteria included use of intra-articular corticosteroids in the knee or participation in physiotherapeutic exercise for knee OA within the last 3 months, or severe concomitant diseases. All participants in this study gave informed consent before enrolling in the hosting trial. Each participant received a copy of the consent.

Instrumentation

Dynamic weight-bearing Assessment of Pain (DAP)

The DAP [2] is a simple performance test with an integrated pain score, designed to provide useful information for monitoring treatment progress and evaluating treatment effects in knee OA. The patient is asked to perform as many standing knee-bends as possible within 30 s. For each knee bend, the knees should reach close to 90° of flexion and full extension. This movement is supervised by the rater. The test score is self-reported pain intensity during knee bends on a 0–10 NRS reported immediately after the 30 s of knee-bends as the worst pain during the test. Thus, the pain intensity score is an assessment of pain during performance of a specific weight-bearing activity. The DAP takes 1–2 min to perform, including verbal instructions. Administering the test does not require any equipment beside a stopwatch/watch. The DAP was applied at baseline and at the end-of-treatment visit.

Knee injury and Osteoarthritis Outcome Score (KOOS)

The KOOS [8] is a questionnaire for assessing patient-reported symptoms. KOOS consists of 5 subscales assessing different constructs: Symptoms (7 items), Pain (9 items), Function in daily living (16 items), Function in sports and recreation (5 items), and knee-related Quality of Life (QoL) (4 items). Responses are given using Likert boxes, and each question is assigned a score from 0–4. A normalized score is calculated for each subscale ranging from 0 (extreme symptoms) to 100 (no symptoms). Reliability of KOOS in an OA population has been reported as acceptable [9]. The KOOS questionnaire was applied at baseline and at the end-of-treatment visit.

Six-minute-walk-test (6MWT)

The 6MWT [10] is a walking test measuring the total distance walked in 6 min. The distance is a surrogate measure of functional capacity and cardiovascular function, originally used for patients with heart and lung diseases. The 6MWT has shown acceptable test-retest reliability and responsiveness in knee OA populations [11, 12]; it was applied at baseline and at the end-of-treatment visit.

Pain after 6MWT (6MWTpain)

Pain rating on a 0–10 NRS immediately after the 6MWT (6MWTpain) is not a standardized test. However, similar ratings of pain subsequent to a performance test have been applied in other studies [5, 6]. The 6MWTpain was included to compare the DAP with this other assessment of pain during a performance test. The 6MWTpain was applied at baseline and at the end-of-treatment visit.

Transition questionnaire (TRANS-Q)

Transition ratings, or Global Perceived Effect scales, are recommended as a core outcome measure in chronic pain trials [13] and have been used as an external criterion to determine responsiveness [14] or Minimal Important Change (MIC) [3] of other measurement instruments. A transition questionnaire (TRANS-Q), modified from Jaschke et al. [15], was used for asking the participants about their experienced change in pain after the intervention with the question: “Did your knee pain change since you entered this project?” Response options were: “It is unchanged,” “It is better,” and “It is worse.” The ‘unchanged’ response is given a score of 0, and no further questions are asked. The responses “It is better” and “It is worse” bring up a seven-point scale, with scores spanning from −7 (worst) to +7 (best), respectively. For the purpose of this study, a clinically important change in pain was defined as a TRANS-Q score of at least 2 (+2: a little better; −2: a little worse). No change was defined as a TRANS-Q score of 0 (no change) or +1/-1 (Almost the same, hardly any better/worse at all). The transition questionnaire was implemented in the hosting trial after the trial commenced and therefore applicable to only a subset of the trial participants. The transition questionnaire was administered only at the end-of-treatment visit.

Kellgren & Lawrence

The Kellgren & Lawrence grading scale is used to assess radiographic severity of knee OA based on radiographic features; osteophytes, periarticular ossicles, narrowing of joint cartilage, sclerotic tissue, altered shape of bone ends. The scores are: 0 (no x-ray changes of OA), 1 (doubtful presence of OA), 2 (minimal presence of OA), 3 (moderate presence of OA), and 4 (severe presence of OA) [16]. X-rays were taken at baseline and at the end-of-treatment visit.

Ahlbäck classification

The Ahlbäck classification of radiographic knee OA of the tibiofemoral joint also assesses radiographic severity of OA and has five grades: 1 (joint space narrowing, <3 mm), 2 (joint space obliteration), 3 (minor bone attrition, 0-5 mm), 4 (moderate bone attrition, 5-10 mm), 5 (severe bone attrition, >10 mm) [17]. X-rays were taken at baseline and at the end-of-treatment visit.

Statistical analysis

Analyses involving hypothesis testing for validity and responsiveness (i.e., the validity of a change score), and determination of the Minimal Important Change (MIC) to interpret change scores of the DAP were conducted adhering to the COnsensus-based Standards for the selection of healthMeasurement INstruments (COSMIN) methodology [3, 18].

For validation studies, a minimum sample size of 50 is recommended, but larger samples are preferred [3]. There is currently no consensus on standards for determining sample size in MIC studies. The statistical analyses were performed using SAS statistical software (version 9.3; SAS Institute Inc., Cary, NC, USA) and follow the COSMIN standards [3]. As no gold standard exists for the construct ‘pain during activity,’ validity and responsiveness were evaluated through hypothesis testing. The construct validity of the DAP was evaluated by Spearman Correlation Coefficients with the other outcome instruments using baseline scores. Likewise, the responsiveness of the DAP was estimated by Spearman Correlation Coefficients with the other outcome instruments using change scores (baseline to end-of-treatment). There is no consensus about the magnitude of correlations required for acceptable convergent or divergent validity [3, 19], indicating that similar or different constructs, respectively, are assessed by the two instruments being compared. As the DAP was expected to assess a composite construct containing aspects of the constructs assessed by the other instruments, some correlation was expected. Thus, relatively high correlation criteria were applied, for both validity and responsiveness in this study; r >0.7 for convergence and r <0.7 for divergence, based on the common application of 0.7 as cutoff [20]. Correlations below 0.2 were disregarded, as this is the critical point for a two-tailed 0.05 level of significance in an n = 100 sample (the sample of the hosting trial) [7].

Both the DAP and the 6MWTpain were expected to assess a construct of pain during activity (albeit two different activities). Thus, for both baseline and change scores, the DAP score correlations with the 6MWTpain score were hypothesized to be convergent. The 6MWT reflects a construct of physical capacity/cardiovascular function, whereas the subscales of the KOOS assess symptoms, pain, function in daily living, function in sports and recreation, and knee-related quality of life. Thus, for both baseline and change scores, the DAP score correlations with the 6MWT and the KOOS subscales scores were hypothesized to show divergence.

Responsiveness was further evaluated by patient-reported change (TRANS-Q), hypothesizing that the group that had experienced a change in pain would have a greater mean change in DAP scores than the group reporting no change in pain. Patient-reported change of pain (TRANS-Q) were also used as an external criterion to interpret the DAP change scores in terms of the MIC, which we defined as the optimal cutoff point on a Receiver Operating Characteristic (ROC) curve (i.e., the value for which the sum of misclassifications ([1 – sensitivity] + [1 – specificity]) is smallest. [3] The 95 % limit cutoff point is calculated as mean change + 1.645 * SD change of the group of participants who reported no change [3].

Results

As illustrated in Fig. 1, out of 100 participants included in the hosting trial, 93 participants who were still in the study by end of treatment were included in the present analysis of validity and responsiveness; though for the 6MWTpain data for further 13 participants were missing. A convenience subsample of 41 participants from the hosting trial was included in the analysis for responsiveness and interpretability using data from the TRANS-Q. Baseline characteristics for the 93 participants included in the validity study are presented in Table 1.

Fig. 1
figure 1

Numbers of participants enrolled in hosting trial (n = 100), included in the validity study (n = 93), completing the 6MWTpain (n = 80) and completing the TRANS-Q (n = 41)

Table 1 Baseline characteristics (n = 93)

Validity

Our hypothesis about divergent validity was confirmed with correlations <0.7 between baseline-scores of the DAP and the 6MWT (r = −0.25), and the KOOS subscales (r-values ranging from −0.31 to −0.45) (Table 2). For baseline-scores of the DAP and the 6MWTpain, the correlation (r = 0.54) did not reach the criterion for convergence (r > 0.7), thus failing to support our hypothesis about convergent validity.

Table 2 Correlation matrix (construct validity; baseline scores and responsiveness; change scores)

Responsiveness

Our hypothesis about divergence was confirmed with correlations <0.7 between change-scores of the DAP and the 6MWT (r = −0.28) and the KOOS subscales (r-values ranging from −0.10 to–0.31) (Table 2). However, out hypothesis about convergence between the DAP and the 6MWTpain change-scores was not supported (r = 0.35).

Responsiveness of the DAP was supported, evaluated by patient-reported change in pain; the participants who had experienced a change (TRANS-Q rating of 2 or more) had higher DAP change scores (median change: −2.0 points) than the group reporting no change in pain (median change: −0.5 points; Table 3).

Table 3 Numbers of participants within each TRANS-Q response category and the corresponding DAP change median, range, mean, SD, and 95 % Confidence limits

Interpretability

The MIC was established to be 1.5 DAP-points using patient-reported change in pain (TRANS-Q of 2 or more) as gold standard, corresponding to a sensitivity of 0.57 and a specificity of 1.00 (Table 4 and Fig. 2). The 95 % limit cutoff point is 2.3 (calculated as 0.0 + 1.65*1.4), corresponding to the upper confidence limit in the group who experienced no change (Table 4). As the previously established SMD is 1.94 for intra-rater test, and 2.39 for inter-rater test [2], the MIC is changed to 2.4 DAP-points.

Table 4 Sensitivity and specificity for DAP change scores, using perceived change in pain as gold standard
Fig. 2
figure 2

Receiver operating characteristic (ROC) curve for the various cutoff points for DAP change, using perceived change in pain as gold standard

Discussion

These results suggest that the DAP has adequate properties for measuring a construct of pain during activity. This construct is not covered by widely used instruments for knee OA: the questionnaire KOOS and the performance test 6MWT, as confirmed by divergent validity. The correlation coefficient of 0.54 between the DAP and the 6MWTpain suggests that the constructs assessed with these two instruments are somewhat similar, although not fully convergent. The responsiveness of the DAP is adequate, shown by the different median DAP change scores in the group that reported change in pain by TRANS-Q and the group that reported no change, respectively. An improvement of 1.5 or more DAP points can be interpreted as clinically relevant, relating nicely to previous findings where the minimal clinically important difference in chronic musculoskeletal pain equals a reduction of 15.0 % on the 0–10 NRS [21]. However, as the smallest detectable change of the DAP was previously estimated to be 1.95 for intra-rater and 2.39 for inter-rater [2], the more conservatively estimated MIC of 2.4 (corresponding to the upper confidence limit of the mean DAP change in the group experiencing no change) should be used to inform clinical decisions for individual patients.

Ideally, evaluating the construct validity of the DAP should involve a comparison with a gold standard — another instrument that is proven to reliably assess pain during activity. As no such gold standard exists, we focused on the divergent aspect of construct validity, hypothesizing that the instruments focusing solely on pain (KOOSpain) or function (KOOSfunction, 6MWT) would not correlate highly (prespecified as r < 0.7) with the DAP. Furthermore we added a pain rating to an existing performance test (6MWTpain), expecting this addition to reflect a similar construct of pain during activity. A high correlation (prespecified as r > 0.7) with the 6MWTpain would thus support convergent validity. Although our assumption of divergent validity held true, the DAP and the 6MWTpain had lower correlations than expected, not supporting convergent validity. This finding may be explained by the differences between the DAP and 6MWTpain. Importantly, the 6MWTpain is not a standardized instrument and is not designed to incorporate a pain score. Further, the activities of knee bends versus 6 min of walking may not be equally pain provoking in a population of patients with knee OA. There are differences in the activities – knee bending and walking. During the performance of the two tests there were different responses from the patients; walking seemed to be more painful in the beginning, but then fading out, while the pain increased with more knee bends. This supports that the DAP assess a construct of weight-bearing activity that is relatively more pain provoking than 6 min of walking. The DAP is a simple test with a short duration and a focus on the pain intensity during performance of a specified activity. In the 6MWTpain, the distance walked is the main score, whereas the pain score can be considered a ‘nice-to-know’ add-on. Even though the exact construct assessed by the DAP cannot be described based on these results, it is suggested that the DAP assess a different construct than other commonly used instruments, but has the closest relation to the 6MWpain.

The correlation among change scores of the DAP and the 6MWTpain was low, potentially diluting the DAP responsiveness either due to poor qualities of the DAP or inappropriate selection of comparable instruments (lack of gold standard). However, the correspondence to patient-reported change in pain (TRANS-Q) may be seen as an even stronger indicator of actual change than other instruments show. Thus, we interpret the responsiveness of the DAP as adequate.

With a median DAP score of 4 (range 0–9) at baseline, most of the participants had relatively low pain levels, as also reflected by the other instruments. Nine participants reported to have no pain (DAP score 0 at baseline). No participants experienced worsening of their pain as assessed by TRANS-Q, and only four participants reported ‘no change’. This cause a limitation to the study, as only the MIC for improvement can be established, not for deterioration. Another limitation is constituted from our choice of anchor. Had the anchor been more specific, e.g. asking about pain during a weight-bearing activity, or more general, e.g. asking about wellbeing, the MIC may have been different. Likewise the TRANS-Q only gives information about the size of experienced change, not whether it is important. Thus, the choice of which change that is considered important was in this case not based on patient-reports. However, the final MIC of 2.4 is a conservative estimate, in this population corresponding to ‘a good deal better’ or more, which presumably reflects an important improvement.

It is a limitation to the DAP that not everyone experience pain during the test. This may reflect day-to-day variability of knee pain in patients with knee OA, or that pain occurs in various situations for the individual patient (at rest, walking, standing, kneeling etc.). Another possible explanation for the low pain level could be that the study population was eligible for pharmacological treatment and rehabilitation, thus not very severely affected. On the other hand, all participants did have both symptomatic and radiographic signs of knee OA, pointing at advanced knee OA. Including more severely affected patients (e.g., those eligible for surgery) could influence the results. However, DAP scores of 0 is not considered a ceiling effect because when there is no pain, there is no need for further discrimination on this parameter [3]. The DAP did not show signs of a floor effect either, as no participants reported ‘worst pain imaginable’ (DAP score 10). Whether this absence of floor and ceiling effects would hold true in a more severely affected population has not been established.

So far, the DAP has been tested by physiotherapists in a research setting only in a population of people with knee OA and mild to moderate symptoms, who mostly benefitted from the exercise intervention in a clinical trial. Testing beyond these limitations is ongoing.

Conclusions

In conclusion, the DAP possesses divergent validity compared to currently used instruments for the assessment of knee OA, supporting the potential for this new way of assessing pain directly during activity. Importantly, the DAP change-scores correspond to patient-reported changes in pain, which proves adequate responsiveness. A change of 2.4 or more can be interpreted as clinically relevant. Hence, the DAP is a promising alternative to using ‘pain on walking’ as an inclusion criterion or as a clinical trial outcome.

Abbreviations

6MWT, the 6-min-walk-test; 6MWTpain, the 6-min-walk-test with subsequent pain rating; BMI, body mass index; COSMIN, the COnsensus-based Standards for the selection of healthMeasurement INstruments; DAP, the dynamic weight-bearing assessment of pain; ICC, intraclass correlation coefficient; KL, Kellgren Lawrence; KOOS, the knee injury and osteoarthritis outcome score; MIC, minimal important change; NRS, numeric rating scale; OA, osteoarthritis; OARSI, The Osteoarthritis Research Society International; QoL, quality of life; ROC, receiver operating characteristic; SDC, smallest detectable change; SEM, standard error of measurement; TRANS-Q, transition questionnaire

References

  1. Klokker L, Osborne R, Waehrens EE, Norgaard O, Bandak E, Bliddal H et al. The concept of physical limitations in knee osteoarthritis: as viewed by patients and health professionals. Qual Life Res. 2015;24(10):2423–32.

  2. Klokker L, Christensen R, Osborne R, Ginnerup E, Waehrens EE, Bliddal H et al. Dynamic weight-bearing assessment of pain in knee osteoarthritis: a reliability and agreement study. Qual Life Res. 2015;24(12):2985–92.

  3. de Vet HC, Terwee CB, Mokkink LB, Knol DL. Measurement in Medicine A Practical Guide. Cambridge, UK: Cambridge University Press; 2011.

  4. McAlindon TE, Driban JB, Henrotin Y, Hunter DJ, Jiang GL, Skou ST, et al. OARSI Clinical Trials Recommendations: Design, conduct, and reporting of clinical trials for knee osteoarthritis. Osteoarthritis Cartilage. 2015;23:747–60.

    Article  CAS  PubMed  Google Scholar 

  5. White SG, McNair P, Laslett M, Hing W. Do patients undergoing physical testing report pain intensity reliably? Arthritis Care Res (Hoboken). 2015;67:873–9.

    Article  Google Scholar 

  6. Cliborne AV, Wainner RS, Rhon DI, Judd CD, Fee TT, Matekel RL, et al. Clinical hip tests and a functional squat test in patients with knee osteoarthritis: reliability, prevalence of positive test findings, and short-term response to hip mobilization. J Orthop Sports Phys Ther. 2004;34:676–85.

    Article  PubMed  Google Scholar 

  7. Henriksen M, Christensen R, Klokker L, Bartholdy C, Bandak E, Ellegaard K, et al. Evaluation of the benefit of corticosteroid injection before exercise therapy in patients with osteoarthritis of the knee: a randomized clinical trial. JAMA Intern Med. 2015;175:923–30.

    Article  PubMed  Google Scholar 

  8. Roos EM, Roos HP, Lohmander LS, Ekdahl C, Beynnon BD. Knee Injury and Osteoarthritis Outcome Score (KOOS)--development of a self-administered outcome measure. J Orthop Sports Phys Ther. 1998;28:88–96.

    Article  CAS  PubMed  Google Scholar 

  9. Collins NJ, Misra D, Felson DT, Crossley KM, Roos EM. Measures of knee function: International Knee Documentation Committee (IKDC) Subjective Knee Evaluation Form, Knee Injury and Osteoarthritis Outcome Score (KOOS), Knee Injury and Osteoarthritis Outcome Score Physical Function Short Form (KOOS-PS), Knee Outcome Survey Activities of Daily Living Scale (KOS-ADL), Lysholm Knee Scoring Scale, Oxford Knee Score (OKS), Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Activity Rating Scale (ARS), and Tegner Activity Score (TAS). Arthritis Care Res (Hoboken). 2011;63 Suppl 11:S208–28.

    Article  Google Scholar 

  10. Enright PL. The six-minute walk test. Respir Care. 2003;48:783–5.

    PubMed  Google Scholar 

  11. Kennedy DM, Stratford PW, Wessel J, Gollish JD, Penney D. Assessing stability and change of four performance measures: a longitudinal study evaluating outcome following total hip and knee arthroplasty. BMC Musculoskelet Disord. 2005;6:3.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Naylor JM, Hayen A, Davidson E, Hackett D, Harris IA, Kamalasena G, et al. Minimal detectable change for mobility and patient-reported tools in people with osteoarthritis awaiting arthroplasty. BMC Musculoskelet Disord. 2014;15:235.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Dworkin RH, Turk DC, Farrar JT, Haythornthwaite JA, Jensen MP, Katz NP, et al. Core outcome measures for chronic pain clinical trials: IMMPACT recommendations. Pain. 2005;113:9–19.

    Article  PubMed  Google Scholar 

  14. van der Roer N, Ostelo RW, Bekkering GE, van Tulder MW, de Vet HC. Minimal clinically important change for pain intensity, functional status, and general health status in patients with nonspecific low back pain. Spine (Phila Pa 1976). 2006;31:578–82.

    Article  Google Scholar 

  15. Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10:407–15.

    Article  CAS  PubMed  Google Scholar 

  16. KELLGREN JH, LAWRENCE JS. Radiological assessment of osteo-arthrosis. Ann Rheum Dis. 1957;16:494–502.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Davies AP, Calder DA, Marshall T, Glasgow MM. Plain radiography in the degenerate knee. A case for change. J Bone Joint Surg Br. 1999;81:632–5.

    Article  CAS  PubMed  Google Scholar 

  18. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737–45.

    Article  PubMed  Google Scholar 

  19. Swinkels RA, Bouter LM, Oostendorp RA, Swinkels-Meewisse IJ, Dijkstra PU, de Vet HC. Construct validity of instruments measuring impairments in body structures and function in rheumatic disorders: which constructs are selected for validation? A systematic review. Clin Exp Rheumatol. 2006;24:93–102.

    CAS  PubMed  Google Scholar 

  20. Hinkle DE, Wiersma W, Jurs SG. Applied statistics for the behavioral sciences. 5th ed. Boston: Houghton Mifflin; 2003.

    Google Scholar 

  21. Salaffi F, Stancati A, Silvestri CA, Ciapetti A, Grassi W. Minimal clinically important changes in chronic musculoskeletal pain intensity measured on a numerical rating scale. Eur J Pain. 2004;8:283–91.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank the participants in this study. The Parker Institute is grateful for the support from the Oak Foundation.

Funding

This study was supported by The Oak Foundation, The Danish Physiotherapy Association, The Danish Rheumatism Association, and Capital Region of Denmark. Financial support was provided from The Parker Institute. The sponsors and funders of the study had no role in the study design, data collection and analysis, interpretation or reporting of this work, or the decision to submit the work for publication.

Authors’ contributions

LK, EW and MH conceived and designed the study. HB provided patients and CB, EB and LK acquired the data. LK, RC, EW and MH contributed to the analysis and interpretation of data. LK drafted the article, and all co-authors have commented and approved the final version.

Competing interests

None of the authors have any financial or personal relationships with other people or organizations that could potentially influence this work or the conclusions.

Ethics approval and consent to participate

This study was nested within an assessor- and participant-blinded randomized controlled trial [7]. All participants in this study gave informed consent before enrolling in the hosting trial. Each participant received a copy of the consent.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Louise Klokker.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Klokker, L., Christensen, R., Wæhrens, E.E. et al. Dynamic weight-bearing assessment of pain in knee osteoarthritis: construct validity, responsiveness, and interpretability in a research setting. Health Qual Life Outcomes 14, 91 (2016). https://doi.org/10.1186/s12955-016-0495-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12955-016-0495-6

Keywords