Health status in routine clinical practice: validity of the clinical COPD questionnaire at the individual patient level
© Kocks et al; licensee BioMed Central Ltd. 2010
Received: 3 September 2010
Accepted: 16 November 2010
Published: 16 November 2010
There is a growing interest to use health status or disease control questionnaires in routine clinical practice. However, the validity of most questionnaires is established using techniques developed for group level validation. This study examines a new method, using patient interviews, to validate a short health status questionnaire, the Clinical COPD Questionnaire (CCQ), at the individual patient level.
Patients with COPD who visited an outpatient clinic completed the CCQ before the consultation, and the specialist physician completed it after the consultation. After the consultation all patients had a semi-structured in-depth interview. The patients' CCQ scores were compared with those of the treating clinician, and with mean scores from 5 clinicians from a pool of 20 who scored the CCQ after reading the transcript of the in-depth interviews only. Agreement was assessed using Lin's concordance correlation coefficient (CCC), and Blant and Altman plots. Interviews with patients with low agreement were reviewed for possible explanations.
A total of 44 COPD patients (32 male, mean age 66 years, FEV1 45% of predicted) participated. Agreement between the patients' CCQ scores and those of the treating clinicians (CCC = 0.87) and the mean score of the reviewing clinicians (CCC = 0.86) was very high. No systematic error was detected. No explanation for individuals with low agreement was found.
The validity of the CCQ on the individual patient level, as assessed by these methods, is good. Individual health status assessment with the CCQ is therefore sufficiently accurate to be used in routine clinical practice.
Health status measurement by questionnaires can be used in routine clinical practice to enhance communication, monitor disease progression and response to treatment, screen for undetected disability, improve patient satisfaction, and assess disease severity [1, 2]. Questionnaires available for use in routine clinical practice must be short, and easy to administer, score and interpret; in addition, guidelines for their interpretation should be available [3, 4]. Questionnaires should also be reliable and validated for the patient who completes the questionnaire.
Methods to develop and validate health status or quality of life questionnaires are well established. These validation processes focus on their use in clinical trials in groups of patients. However, despite their increasing use in everyday practice, we found only one proposed guideline for the validation of questionnaires in the individual patient. In 1995 McHorney and Tarlov suggested a number of measurement standards for individual patient application of questionnaires, such as high internal consistency reliability (above 0.9) and a small standard error of measurement, besides usual qualities such as construct validity and sensitivity to clinical change . Although these proposed standards are mainly based on current knowledge and 'common sense', practically no questionnaires have been validated for individual health status assessment according to these standards .
Reliability levels of 0.90-0.95 are difficult to meet for many existing questionnaires. Secondly, since reliability is related to questionnaire length and measuring a unidimensional construct, newly developed questionnaires aiming to achieve these levels of reliability should be long and unidimensional (i.e. they measure only one aspect of the disease). However, clinicians might be more interested in being informed about several aspects of the disease (e.g. emotions, functional state and symptoms) and may prefer short questionnaires. Therefore, it may be interesting to ignore these suggested guidelines and assess whether a questionnaire is valid according to the dynamics of routine clinical practice.
To address this question we published a proposal for a new methodology . In this methodology the patient's health status in daily life (as measured by an in-depth interview) is used as the gold standard, and the outcome of the in-depth interview is compared with the patient's score on the questionnaire completed in the clinic before the in-depth interview took place. We applied this new methodology in patients with chronic obstructive pulmonary disease (COPD), a condition that has a large impact on health status , even in mild disease .
One of the health status questionnaires used in COPD in clinical trials and clinical practice is the Clinical COPD Questionnaire (CCQ) . This is a short 10-item questionnaire with answers based on a 7-point Likert scale. The final score is calculated by simply summing the item scores and dividing them by the number of items. The CCQ has three domains: symptoms (4 items), mental health (2 items) and functional status (4 items). The CCQ has shown to be reliable health status measure, is responsive to treatment and is stable over time if no changes occur[8, 9].
This article describes the validation, at the individual patient level, of the CCQ.
Patients with physician diagnosed COPD, and confirmed by lung function measurement, visiting an outpatient clinic were invited to participate in the study. Patients were excluded if they had suffered a myocardial infarction within 3 months prior to enrolment. All patients gave written informed consent.
Lung function was taken from the patient's charts, including height and weight. Exercise capacity was assessed by the 6-min walking distance test performed according to the ATS criteria . Pulse oxygenation and BORG scores for dyspnoea  were measured before and after the walking test. Health status was measured using the CCQ. Dyspnoea during exercise was measured with the MRC dyspnoea score. The BODE score (a multidimensional index) was calculated.
The CCQ is a 10-item health status questionnaire measuring symptoms, functional status and mental status in patients with COPD. The questionnaire is self-administered, and can be completed in 2 min. The CCQ has a high internal consistency reliability (0.91 ) and a small standard error of measurement (0.21 ) The minimal clinically important difference (MCID) was calculated using three different methods and is set at 0.4 points .
Patients completed the CCQ prior to the routine consultation with their pulmonary clinician. Directly after the consultation, the pulmonary clinician (without knowledge of the patient's scores) completed the CCQ as he thought the patient should have completed the CCQ. After the consultation, patients performed the 6-min walking distance test.
One of the investigators (SLS or BdV), who did not know the patient, held a semi-structured in-depth individual interview with the patients on the day of the consultation. Patients were asked to comment on every separate concept of the questionnaire. They were asked what thoughts they had during completion of the individual questions, and were asked to give examples of their symptoms and disabilities in daily life.
Group of reviewing clinicians
All interviews were recorded and fully transcribed. All references to scores on individual items of the questionnaire (in numbers or words) were covered by black bars to blind these results for the reviewing clinicians.
Twenty sets were created containing: i) patient characteristics: gender, age, marital status, forced expiratory volume in one second (FEV1) %predicted, body mass index, 6-min walking distance, oxygenation at start of the 6-min walking distance, and the number of pack years; ii) the transcribed and blinded interview; and iii) a blank CCQ.
Each set of interviews contained 11 randomly assigned interviews. The order in which interviews were in the packaged set was randomised to prevent fatigue of the reviewers and learning effects in the interviews performed later in sequence.
These sets were sent to 20 pulmonary physicians and general practitioners who have a special interest in pulmonary diseases. The clinicians were instructed to complete the CCQ of a patient the way they thought the patient should have rated the CCQ, based on the patient characteristics and interview.
This method resulted in each patient/interview being reviewed and scored by five separate clinicians.
The agreement between patient CCQ scores and the treating physician and reviewing clinicians scores was presented in Blant and Altman plots. The Shapiro-Wilk normality test was used to assess normality.
The pairwise agreement (concordance) between patient's score, treating clinician's score and the mean of the scores of the five reviewing clinicians, was studied by two coefficients: the intraclass correlation coefficient (ICC) and Lin's concordance correlation coefficient (CCC). Both range from 0 = no agreement, to +1 = perfect agreement.
Lin et al. have proposed a unified approach for assessing agreement for continuous and categorical data . For the pairwise agreement used in our study, the unified estimate reduces to the original CCC proposed by Lin . The CCC contains a measurement of precision and of accuracy for a better understanding of the sources of disagreement.
where and are the mean values of the measures at 2 times, by 2 raters, or by 2 methods. Lin further proposes two absolute indices, the Total Deviation Index (TDI) and the Coverage Probability (CP), which are independent of the total data range. The MCID is used for the Coverage Probability.
The cut-off points described for rating agreement based on the ICC are ≤ 0.4 as poor to fair, 0.41-0.6 as moderate, 0.61-0.8 as good, and 0.81-1.0 as excellent. Because the CCC and ICC measure the same construct, the cut-off points can be assumed to be similar.
A significance level of 0.05 was considered statistically significant.
All analyses (excluding the CCC) were performed using SPSS for windows version 14. The CCC was calculated using SAS version 9.1 for Windows and the macro available at http://tigger.uic.edu/~hedayat/
Characteristics of the study population.
No primary school
FEV1 % predicted
CCQ Total score
Agreement between patient, treating clinician and reviewing clinicians.
Patient & Treating clinician
95% Conf. Limit
Patient & Mean reviewing clinicians
95% Conf. Limit
Treating clinician & Mean reviewing clinicians
95% Conf. Limit
There were no differences in patient characteristics between patients with a score difference smaller than the MCID, and those larger than the MCID between patient and reviewing clinicians. No recurrent themes emerged from the interviews to explain low agreement.
This study uses a new method to assess the individual validity of a health status questionnaire. The method was applied in the management of COPD, with the Clinical COPD Questionnaire (CCQ) health status questionnaire. This study shows that there is a very good agreement in CCQ outcomes between the individual patient score and 20 reviewing clinicians who did not know the patient but scored the CCQ based on an in-depth interview. In combination with the previously known high reliability and stability, this confirms the validity of the CCQ at the individual patient level.
This new method to assess the validity of health status questionnaires in clinical practice is feasible for a short questionnaire, but requires much effort due to the patient interviews and the subsequent review by clinicians. However, by using transcripts of interviews with each individual patient, this method provides accurate and transparent information about the actual performance of a patient who is asked to complete a questionnaire in daily clinical practice. The use of qualitative methods to assess the individual accuracy of a questionnaire in routine practice provides more insight into individual validity than pure statistical methods assessing the internal consistency and stability of a questionnaire.
The validity of the CCQ at the group level has already been assessed [8, 9, 16, 17]. In two of these studies internal stability and consistency was very high, thus meeting the requirements for individual use of the questionnaire [8, 9]. However, in a recent study this high level was not met , possibly due to the different study population and methods used in that study. Nevertheless the high concordance between the results of the approach according to the standards as proposed by McHorney and Tarlov  and our new method using patient interviews, confirms the acceptability of this new method.
The high CCC indicates that there was no systematic error in measuring. The Blant and Altman plots also confirm this finding. The absence of a systematic error is in contrast to previous findings of a difference in patient-proxy ratings of quality of life  and differences in patient-clinician ratings . Proxies and clinicians tend to rate the quality of life worse than the patients [18, 20]. The domains of the questionnaires that cover emotions tend to differ more between proxies and patients than the domains measuring symptoms. In the current studies we also see that the mental state domain shows the least concordance; however, there is no systematic under- or over-estimation compared to the patient's score.
For the Bland and Altman plots we chose a difference in scores of 0.4 (the MCID), as cut-off point for agreement. Over 60% of the 44 patient-reviewer scores differed less than the MCID. Of the patients-reviewer scores, 90% (the TDI0.9) were within 0.7 points. We chose the MCID because this difference in score would (according to the definition) potentially cause a clinician to change the management: "the smallest difference in a score in the domain of interest which patients perceive as beneficial and which would mandate in the absence of troublesome side-effects and excessive costs a change in the patient's management" . Compared to others, we chose a very strict cut-off point. In Wilson's method for proxy ratings, a moderate difference is used . For the CCQ, this moderate difference would be around 1.3 , which is far more than the 0.7 points in which 90% of scores were in the current study.
A limitation of this study is that this new method of assessing the validity of a health status questionnaire on the individual patient level could be improved by additional information about the individual patient's scoring stability and responsiveness to changes. The stability of scoring of the studied population could be assessed using the test-retest method. Although the test-retest reliability of the CCQ was very high in two studies [8, 9], and so was the ability to measure treatment effects [17, 22] we did not confirm this in the present group of patients.
The current study could not identify patient factors that were associated with low agreement between patient and reviewers. A possible explanation for low agreement in some individuals might be that most patients completed the questionnaire for the first time. During the interview, patients sometimes answered "now I'm re-thinking about this, my score would have been...". Although some patients with a low agreement gave the impression of lower intelligence, this could not be substantiated by a lower educational level.
In the present study we found a better agreement in scores between patients and the treating clinicians than others . In contrast to other patient-proxy agreement studies, only two clinicians participated in the recruitment of the patients and the scoring of the CCQ, as the main research question was the patient-reviewer agreement. These two clinicians previously used the CCQ in their practice or in pulmonary rehabilitation programs. One clinician stated that he changed his history taking during the study, because he was unable to answer specific questions on multiple occasions, especially about the mental state domain. The experience in measuring health status and the change in history taking might contribute to the high agreement between the scores of the patient and treating clinician.
In conclusion, this study shows that this new method to assess the individual validity of a questionnaire by using patient interviews is feasible, and confirms results from previous studies using statistical methods. Secondly, there seems to be a good validity of the CCQ on the individual patient level as established with this new methods. The CCQ can therefore be used in routine clinical practice to assess the health status of patients with COPD.
TvdrM had developed, with others, the CCQ
The authors thank Hans Berg, Jan-Willem van den Berg, Richard Dekhuizen, Rob Douma, Pier Eppinga, Huub van Gijsel, Nick ten Hacken, Huib Kerstjens, Ernst Lammers, Cees van Minnen, Dirk Nijmeijer, Jan Rauws, Roland Riemersma, Martin Ruiter, Steven Rutgers, Dirk-Jan Slebos, Peter Vennik, Frank Visser, Johan Wempe and Steward Wills for reviewing and rating the patient data. The authors also thank Wim Krijnen for his comments on the statistical analyses.
- Delbanco TL: Enriching the doctor-patient relationship by inviting the patient's perspective. Ann Intern Med 1992,116(5):414–418.PubMedView ArticleGoogle Scholar
- Detmar SB, Muller MJ, Schornagel JH, Wever LD, Aaronson NK: Health-related quality-of-life assessments and patient-physician communication: a randomized controlled trial. JAMA 2002,288(23):3027–3034. 10.1001/jama.288.23.3027PubMedView ArticleGoogle Scholar
- McHorney CA, Tarlov AR: Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res 1995,4(4):293–307. 10.1007/BF01593882PubMedView ArticleGoogle Scholar
- Greenhalgh J, Long AF, Flynn R: The use of patient reported outcome measures in routine clinical practice: lack of impact or lack of theory? Soc Sci Med 2005,60(4):833–843. 10.1016/j.socscimed.2004.06.022PubMedView ArticleGoogle Scholar
- van der Molen T, Kocks JWH: Do health-status measures play a role in improving treatment in chronic obstructive pulmonary disease? Expert Opin Pharmacother 2006,7(1):57–61. 10.1517/146565126.96.36.199PubMedView ArticleGoogle Scholar
- Jones PW: Health status measurement in chronic obstructive pulmonary disease. Thorax 2001,56(11):880–887. 10.1136/thorax.56.11.880PubMed CentralPubMedView ArticleGoogle Scholar
- Ferrer M, Alonso J, Morera J, Marrades RM, Khalaf A, Aguar MC, Plaza V, Prieto L, Anto JM: Chronic obstructive pulmonary disease stage and health-related quality of life. The Quality of Life of Chronic Obstructive Pulmonary Disease Study Group. Ann Intern Med 1997,127(12):1072–1079.PubMedView ArticleGoogle Scholar
- van der Molen T, Willemse BW, Schokker S, Ten Hacken NH, Postma DS, Juniper EF: Development, validity and responsiveness of the Clinical COPD Questionnaire. Health Qual Life Outcomes 2003,1(1):13. 10.1186/1477-7525-1-13PubMed CentralPubMedView ArticleGoogle Scholar
- Damato S, Bonatti C, Frigo V, Pappagallo S, Raccanelli R, Rampoldi C, Rodi F: Validation of the Clinical COPD questionnaire in Italian language. Health Qual Life Outcomes 2005,3(1):9. 10.1186/1477-7525-3-9PubMed CentralPubMedView ArticleGoogle Scholar
- Anonymous: ATS statement: guidelines for the six-minute walk test. Am J Respir Crit Care Med 2002,166(1):111–117. 1073–449View ArticleGoogle Scholar
- Borg GA: Psychophysical bases of perceived exertion. Med Sci Sports Exerc 1982,14(5):377–381.PubMedView ArticleGoogle Scholar
- Celli BR, Cote CG, Marin JM, Casanova C, Montes de Oca M, Mendez RA, Pinto Plata V, Cabral HJ: The body-mass index, airflow obstruction, dyspnea, and exercise capacity index in chronic obstructive pulmonary disease. N Engl J Med 2004,350(10):1005–1012. 10.1056/NEJMoa021322PubMedView ArticleGoogle Scholar
- Kocks JWH, Tuinenga MG, Uil SM, van den Berg JWK, Stahl E, van der Molen T: Health status measurement in COPD: the minimal clinically important difference of the clinical COPD questionnaire. Respir Res 2006, 7: 62. 1465–993 10.1186/1465-9921-7-62PubMed CentralPubMedView ArticleGoogle Scholar
- Lin L, Hedayat AS, Wu W: A unified approach for assessing agreement for continuous and categorical data. J Biopharm Stat 2007,17(4):629–652. 10.1080/10543400701376498PubMedView ArticleGoogle Scholar
- Lin LI: A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989,45(1):255–268. 0006–341 10.2307/2532051PubMedView ArticleGoogle Scholar
- Stallberg B, Nokela M, Ehrs PO, Hjemdal P, Jonsson EW: Validation of the clinical COPD Questionnaire (CCQ) in primary care. Health Qual Life Outcomes 2009, 7: 26. 10.1186/1477-7525-7-26PubMed CentralPubMedView ArticleGoogle Scholar
- Reda AA, Kotz D, Kocks JW, Wesseling G, van Schayck CP: Reliability and validity of the clinical COPD questionniare and chronic respiratory questionnaire. Respir Med 2010,104(11):1675–1682. 10.1016/j.rmed.2010.04.023PubMedView ArticleGoogle Scholar
- Wilson KA, Dowling AJ, Abdolell M, Tannock IF: Perception of quality of life by patients, partners and treating physicians. Qual Life Res 2000,9(9):1041–1052. 10.1023/A:1016647407161PubMedView ArticleGoogle Scholar
- Sneeuw KC, Sprangers MA, Aaronson NK: The role of health care providers and significant others in evaluating the quality of life of patients with chronic disease. J Clin Epidemiol 2002,55(11):1130–1143. 10.1016/S0895-4356(02)00479-1PubMedView ArticleGoogle Scholar
- Sprangers MA, Aaronson NK: The role of health care providers and significant others in evaluating the quality of life of patients with chronic disease: a review. J Clin Epidemiol 1992,45(7):743–760. 10.1016/0895-4356(92)90052-OPubMedView ArticleGoogle Scholar
- Jaeschke R, Singer J, Guyatt GH: Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials 1989,10(4):407–415. 10.1016/0197-2456(89)90005-6PubMedView ArticleGoogle Scholar
- Kocks JWH, Kerstjens HAM, van den Berg JWK, van der Molen T: The course of health status during an exacerbation of COPD in hospitalized patients versus out-patient treated patients. Prim Care Respir J 2010,19(2):A8-A9.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.