Comparing the EQ-5D 3L and 5L: measurement properties and association with chronic conditions and multimorbidity in the general population
© Agborsangaya et al.; licensee BioMed Central Ltd. 2014
Received: 19 February 2014
Accepted: 9 May 2014
Published: 16 May 2014
Studies comparing the measurement properties of EQ-5D 3L (3L) and EQ-5D 5L (5L) are limited to specific patient populations with small sample sizes. Using a general population sample, we compared 3L and 5L in terms of their measurement properties and association with number of chronic conditions, including multimorbidity – the concurrent occurrence of two or more chronic conditions.
Data were available from two consecutive cycles of a cross-sectional telephone interview survey using 3L (2010 cycle) and 5L (2012 cycle), in the general population of adults (age ≥ 18 years) in Alberta, Canada. Measurement properties were compared by determining their feasibility, ceiling effect, and discriminatory power (Shannon indices) for 3L and 5L. Linear regression models were fitted to test the associations between multimorbidity and EQ-5D index score.
Data were available for 4946 (2010) and 4752 (2012) survey respondents with information on HRQL. Compared to 3L, 5L showed lower ceiling effect (32.3% versus 42.1%), higher absolute discriminatory power (Shannon index, mean 0.79 versus 0.52) and higher relative discriminatory power (Shannon Evenness index, mean 0.09 versus 0.06 for 3L). Despite these differences, similar relationships of lower HRQL with greater multimorbidity were observed for the 3L (ß = −0.13, 95% CI −0.15; −0.11) and 5L (ß = −0.12, 95% CI −0.13; −0.11).
Using a general population sample, the EQ-5D 5L showed better measurement properties than the EQ-5D 3L. Nonetheless, clinically important differences in HRQL associated with multimorbidity were similar in magnitude using both versions of EQ-5D.
For over two decades, EuroQol’s EQ-5D has been used as a generic instrument to measure and evaluate health status [1–3]. The older version, the EQ-5D 3L (3L), describes general health based on five distinct dimensions: mobility, self-care, usual activities, pain/discomfort and anxiety/depression. Each dimension has 3 Levels (indicating no problem, some or moderate problem and extreme problem). Due to its limited ability to delineate minor but important clinical differences in health status as well as the presence of a ceiling effect, the EuroQol group recently developed a new version, the EQ-5D 5L (5L) [4–7]. The 5L version differs from 3L in that for each dimension, there are 5 levels (no problem, slight problem, moderate problem, severe problem and extreme problem).
Previous research comparing the 3L and 5L versions of the EQ-5D indicate that additional levels of the 5L potentially increase discriminatory power and reduce ceiling effect among patients with chronic conditions, including chronic hepatic disease  or cancer patients . Although few studies have compared 3L and 5L, most of them are based on patient populations or studies with relatively small sample sizes [6–9]. In one study among eighty-two participants , the authors compared 3L and 5L and noted a significant improvement in the discriminatory power of the 5L. More research is needed to obtain a comparative assessment of both versions of the EQ-5D in a larger sample and their applicability in the general population.
The present study aims to compare the measurement properties of 3L and 5L in terms of their feasibility, ceiling effect and discriminatory power (Shannon indices) in a general population sample. In a further step, we compared the association between multimorbidity, the concurrent occurrence of 2 or more chronic conditions, and health-related quality of life (HRQL) using both versions of the EQ-5D.
Study setting and population
The study sample constitutes respondents to two consecutive survey cycles of the Health Quality Council of Alberta (HQCA) Patient Experience and Satisfaction Survey  for 2010 and 2012. In this cross-sectional survey, adult Albertans aged 18 years or older, representative of the adult general population, self-reported their experiences and satisfaction with the quality of health services received in the past twelve months. The survey comprised of a telephone-based questionnaire that was administered by Random-Digit Dialing (RDD). In the sampling design, densely populated regions were under-sampled and sparsely populated regions were oversampled to ensure a reasonably large sample for analysis and reporting of data for each health region. This study uses combined regional samples to represent the population of Alberta. In doing so, sampling weights were applied to adjust for under- and over-sampling.
Chronic conditions and socio-demographic factors
Respondents were asked about their health status in the past 12 months, including if they had “any of the following chronic conditions”; diabetes, chronic obstructive pulmonary disorder, asthma, hypertension, high cholesterol, sleep apnea, congestive heart failure, depression or anxiety, chronic pain, arthritis, heart disease, stroke (or related), cancer, gastro-intestinal tract and kidney diseases. Thus, this analysis includes up to fifteen chronic conditions. Respondents were also categorized as having multimorbidity if they had any two or more of these conditions. The survey respondents also gave information on their socio-demographic characteristics such as age, sex, educational attainment and household income. Unlike in 2012, the 2010 survey cycle included a skip pattern for listing chronic conditions that queried respondents to indicate if they have “any chronic condition” prior to the list of chronic conditions.
The 2010 survey cycle included EQ-5D 3L and has been described elsewhere [3, 11], whereas the 2012 cycle entailed the newer version, EQ-5D 5L. Considering that the descriptive system of the EQ-5D comprises of five domains, each with three possible levels for 3L and five levels for 5L, a combination of the characteristic levels produces 243 (35) possible health states for 3L (ranging from 11111 to 33333) and 3125 (55) health states for 5L (ranging from 11111 to 55555). 11111 represents the best possible health state whereas 33333 (for 3L) or 55555 (for 5L) represent the worst health states.
The EQ-5D index scores are utilities derived from the respondents profile and ranges from 0 to 1; 0 meaning death and 1 complete health . Values less than 0 indicate health states worse than death. Since the EQ-5D index score is a weighted summary score of five items representing different dimensions of health, changes in the EQ-5D index score may arise from different patterns of impairment across individual dimensions. We were particularly interested in (a) the frequency of reported problems in the 3L and 5L and (b) their EQ-5D single index score. The EQ-5D index scores from 5L were derived as described elsewhere [12, 13], using a Crosswalk Index Value Calculator based on US national scoring algorithms. A difference of 0.03 (3%) was considered to be clinically important . Given that index scores for 5L are generated by mapping responses on to 3L, we hypothesize that significant differences in association will not be observed.
Both indices are descriptive measures of the discriminatory ability of an instrument and are needed for useful interpretation of the measurement power of a scale. We expect that the H’ for 5L should be higher than for 3L, and that J’ should be about the same or slightly higher for 5L, indicating the usefulness of the extra levels in the 5L.
We tested feasibility by computing the percentage of respondents with missing responses. That is, we calculated the percentage of respondents not answering each dimension. The percentages were then compared for both the 3L and 5L.
The ceiling of 3L and 5L were defined as the proportion of respondents that scored no problem on all five dimensions (persons with 11111 score). Based on the assumption that the majority of respondents would report at least slight/some problem on at least one of the five dimensions, we hypothesis that the ceiling effect will be lower for 5L compared to 3L version.
Descriptive statistics were performed to determine the sample characteristics for each survey cycle. We presented the proportion of reported problems in all five dimensions for each cycle (without merging) to provide information on the advantage of the extra levels in each dimension. Multivariable linear regression models were then fitted to study associations between chronic conditions and EQ-5D index scores. Three separate models were fitted for 1) specific chronic conditions, 2) for number of chronic conditions and 3) for presence of multimorbidity as a binary categorical variable. The multivariable models were adjusted for respondents’ socio-demographic characteristics such as age, sex, and household income. Because educational level tends to correlate with income, we determined a priori not to adjust for educational level to avoid multicollinearity. A 2-sided P < 0.05 was considered statistically significant. All analyses were undertaken using STATA version 11 (StataCorp LP. 2009). The Health Research Ethics Board (HREB) at the University of Alberta approved the data collection protocols and survey instruments.
Socio-demographic characteristics of respondents
Socio-demographic characteristics of the survey respondents
2010 Survey (N = 4946)
2012 Survey (N = 4752)
Mean age (SD), years
46.6 (16.5) years
47.7 (17.1) years
Secondary or less
Household income (CAD), %
30,000 – 59,999
60,000 – 99,999
Number of chronic conditions
EQ-5D profile and morbidity status
EQ-5D profile (3L and 5L) of the survey respondents with multimorbidity
Unable to walk about
Unable to wash or dress
Unable to do usual activity
Specific chronic conditions and EQ-5D index score
Associations between specific chronic conditions and EQ index score
3L index score Coef. (95% CI)1
5L index score Coef. (95% CI)2
−0.05 (−0.08, −0.02)
−0.04 (−0.06, −0.02)
−0.10 (−0.16, −0.04)
−0.08 (−0.11, −0.05)
−0.06 (−0.09, −0.03)
−0.06 (−0.07, −0.04)
High blood pressure
−0.06 (−0.08, −0.04)
−0.06 (−0.07, −0.04)
−0.05 (−0.08, −0.03)
−0.04 (−0.06, −0.03)
−0.13 (−0.17, −0.09)
−0.09 (−0.11, −0.06)
Congestive heart failure
−0.12 (−0.21, −0.03)
−0.12 (−0.19, −0.05)
Depression or anxiety
−0.19 (−0.21, −0.16)
−0.14 (−0.15, −0.13)
−0.19 (−0.21, −0.17)
−0.17 (−0.18, −0.15)
−0.12 (−0.14, −0.10)
−0.10 (−0.12, −0.09)
−0.06 (−0.10, −0.02)
−0.06 (−0.09, −0.03)
Stroke (or related)
−0.11 (−0.19, −0.02)
−0.12 (−0.17, −0.06)
−0.06 (−0.11, −0.01)
−0.05 (−0.08, −0.02)
−0.07 (−0.20, 0.05)
−0.14 (−0.19, −0.09)
−0.09 (−0.13, −0.06)
−0.10 (−0.13, −0.08)
Multimorbidity and EQ-5D index score
Associations between multimorbidity and EQ index score
Number of chronic conditions
EQ-5D 3L index score Coef. (95% CI)1
EQ-5D 5L index score Coef. (95% CI)2
−0.07 (−0.09, −0.06)
−0.06 (−0.07, −0.05)
−0.12 (−0.14, −0.10)
−0.10 (−0.12, −0.09)
−0.14 (−0.17, −0.11)
−0.12 (−0.14, −0.11)
−0.16 (−0.20, −0.11)
−0.18 (−0.21, −0.15)
−0.23 (−0.27, −0.19)
−0.22 (−0.23, −0.20)
−0.13 (−0.15, −0.11)
−0.12 (−0.13, −0.11)
Comparing measurement properties of EQ-5D 3L and EQ-5D 5L
Missing values ranged from 0.1% for Self Care to 0.8% for Anxiety/Depression for the 3L version and 0.1% for Self Care to 0.6% for Anxiety/Depression for the 5L version. The proportion of respondents with at least one missing value in all dimensions was 1.3% for the 3L version and 1.1% for the 5L version, indicating good feasibility for both versions of the instrument.
There were a total of 384 unique health states observed using the 5L (12.2% of 3125 possible) and 116 (47.7% of 243 possible) using the 3L version. A ceiling effect was observed among a greater proportion of respondents with the 3L, 2082/4946 (43.1%) compared to 5L, 1536/4752 (32.3%), with an absolute difference of 9.8%. Among the most common chronic conditions, the highest difference was noted for high blood pressure (20.6% versus 14.3%) and high cholesterol (20.1% versus 14.8%) and similar for arthritis (6.9% vs. 6.8%) and anxiety/depression (3.0% for both).
The present study compares the measurement properties of EuroQol’s EQ-5D 3L and the newer 5L in terms of their feasibility, ceiling effect and discriminatory power (Shannon indices) as well as the association between clinical characteristics and EQ index score. In our comparison of 3L and 5L, both were comparable in terms of feasibility, but 5L had lower ceiling effect and higher discriminatory power. Overall, multimorbidity was associated with clinically important reduction in index scores using both versions of the instrument, although it was notable that the decrements associated with specific conditions were different between the 3L and 5L version.
This population-based study utilizes large samples from two consecutive survey cycles that are representative of the general population. Chronic pain was associated with the highest clinically important difference in HRQL using both versions of the EQ-5D instrument, consistent with previous findings [3, 20, 21]. However, the magnitude of this difference tended to be larger using the 3L compared to the 5L for some chronic conditions. The difference for individual conditions such as chronic pain and anxiety or depression may be due to different weights for specific dimensions associated with symptoms of these chronic conditions. That is, these conditions have the biggest weight on level 3 of the 3L. Likely related to the symptom of pain, the score difference associated with arthritis was also larger for the 3L than 5L.
On the other hand, the associations between chronic conditions and index scores were similar for most chronic conditions, including multimorbidity using both versions of the EQ-5D. This similarity may be due to the fact that health profiles of 5L are mapped unto the 3L to derive its index score using an interim scoring “EQ-5D 5L Crosswalk Index Value Calculator” . That is, the utilities for 5L are derived by first cross-matching its profiles to those of 3L. It is therefore expected that studied associations using index scores from both versions will not be significantly different. On the other hand, similar findings may in fact indicate the lack of difference in 3L and 5L using their derived utility scores. As a unique scoring system is being awaited for 5L, further studies will be required to show if differences occur in association between clinical characteristics and HRQL for both versions of the instrument.
The observation that 5L had lower ceiling effect than the 3L has been reported in other studies using patient populations [7–9]. The finding supports the original intent of increasing the levels in 5L, to capture differences in health states that are otherwise not captured by 3L. Moreover, the 5L version of the EQ-5D descriptive system had a higher absolute discriminatory power than the 3L version in all five dimensions. Also, the relative discriminatory power (Evenness index) was slightly better in the 5L than the 3L version. This measure indicates the evenness of spread of responses across levels of the instrument by adjusting for the number of levels. Thus, higher evenness scores in all 5L dimensions indicate that the extra levels in the descriptive system were used efficiently. Our study findings indicate that the measurement properties of 5L are better than 3L in a general population sample. Further longitudinal analysis is needed to compare the sensitivity of both versions of the EQ-5D and their ability to detect change over time.
This study has a few limitations that are worth mentioning. Although the source population was the same in both survey cycles, the study samples for instrument comparison are not the same. The average number of chronic conditions was higher in the 2012 compared to 2010 survey cycle. This may be due to inherent differences in the study samples or under-reporting in the 2010 cycle, especially because of a skip pattern in the order of questions for identifying chronic conditions in that cycle. It is unclear to what extent this difference has on the comparison. It is unlikely to affect our comparison of specific conditions that have a higher prevalence, but may limit comparisons of total chronic conditions in the population. Furthermore, the results are consistent with findings from previous studies that are based on the same samples to validate both versions of the instrument [6–8]. Respondents’ chronic conditions were self-reported. Because chronic conditions can be quite subtle, it may be confusing to differentiate between symptoms or minor ailments with more severe disease states. Moreover, some conditions such as chronic pain are subjective and may be difficult to define without assistance from a clinician or the use of standardized scales. The severity of chronic conditions, an important predictor of HRQL , was not accounted for in the present study. On the other hand, a unique property of our study is the large sample size derived from the general population, heightening the external validity of our study findings. Also, the study captured common chronic conditions in the general population, including the core chronic conditions recommended for inclusion in multimorbidity indices .
In this study, we found that the EQ-5D 5L showed better measurement properties, with lower ceiling effect and better discriminatory power than the 3L version. Furthermore, while the association between overall multimorbidity and index scores is comparable using both versions of the EQ-5D, the different versions suggest notable differences in HRQL burden for individual chronic conditions.
- Rabin R, de Charro F: EQ-5D: a measure of health status from the EuroQol Group. Ann Med 2001, 33: 337–343. 10.3109/07853890109002087View ArticlePubMedGoogle Scholar
- Fortin M, Bravo G, Hudon C, Lapointe L, Almirall J, Dubois M-F, Vanasse A: Relationship between multimorbidity and health-related quality of life of patients in primary care. Qual Life Res 2006, 15: 83–91. 10.1007/s11136-005-8661-zView ArticlePubMedGoogle Scholar
- Agborsangaya CB, Lau D, Lahtinen M, Cooke T, Johnson JA: Health-related quality of life and healthcare utilization in multimorbidity: results of a cross-sectional survey. Qual Life Res 2013, 22: 791–799. 10.1007/s11136-012-0214-7View ArticlePubMedGoogle Scholar
- Pickard AS, De Leon MC, Kohlmann T, Cella D, Rosenbloom S: Psychometric comparison of the standard EQ-5D to a 5 level version in cancer patients. Med Care 2007, 45: 259–263. 10.1097/01.mlr.0000254515.63841.81View ArticlePubMedGoogle Scholar
- Pickard AS, Kohlmann T, Janssen MF, Bonsel G, Rosenbloom S, Cella D: Evaluating equivalency between response systems: application of the Rasch model to a 3-level and 5-level EQ-5D. Med Care 2007, 45: 812–819. 10.1097/MLR.0b013e31805371aaView ArticlePubMedGoogle Scholar
- Janssen MF, Birnie E, Haagsma JA, Bonsel GJ: Comparing the standard EQ-5D three-level system with a five-level version. Value Health 2008, 11: 275–284. 10.1111/j.1524-4733.2007.00230.xView ArticlePubMedGoogle Scholar
- Janssen MF, Pickard AS, Golicki D, Gudex C, Niewada M, Scalone L, Swinburn P, Busschbach J: Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L across eight patient groups: a multi-country study. Qual Life Res 2013, 22(7):1717–1727. 10.1007/s11136-012-0322-4PubMed CentralView ArticlePubMedGoogle Scholar
- Scalone L, Ciampichini R, Fagiuoli S, Gardini I, Fusco F, Gaeta L, Del Prete A, Cesana G, Mantovani LG: Comparing the performance of the standard EQ-5D 3L with the new version EQ-5D 5L in patients with chronic hepatic diseases. Qual Life Res 2013, 22: 1707–1716. 10.1007/s11136-012-0318-0View ArticlePubMedGoogle Scholar
- Kim SH, Kim HJ, Lee SI, Jo MW: Comparing the psychometric properties of the EQ-5D 3L and EQ-5D 5L in cancer patients in Korea. Qual Life Res 2012, 21: 1065–1073. 10.1007/s11136-011-0018-1View ArticlePubMedGoogle Scholar
- HQCA: Satisfaction and Experience with Health Care Services: A Survey of Albertans. Calgary: Health Quality Council of Alberta; 2010.Google Scholar
- Agborsangaya CB, Lau D, Lahtinen M, Cooke T, Johnson JA: Multimorbidity prevalence and patterns across socioeconomic determinants: a cross-sectional survey. BMC Public Health 2012, 12: 201. 10.1186/1471-2458-12-201PubMed CentralView ArticlePubMedGoogle Scholar
- Shaw JW, Johnson JA, Coons SJ: US valuation of the EQ-5D health states: development and testing of the D1 valuation model. Med Care 2005, 43: 203–220. 10.1097/00005650-200503000-00003View ArticlePubMedGoogle Scholar
- van Hout B, Janssen MF, Feng YS, Kohlmann T, Busschbach J, Golicki D, Lloyd A, Scalone L, Kind P, Pickard AS: Interim scoring for the EQ-5D 5L: mapping the EQ-5D 5L to EQ-5D 3L value sets. Value Health 2012, 15: 708–715. 10.1016/j.jval.2012.02.008View ArticlePubMedGoogle Scholar
- Kaplan RM: The minimally clinically important difference in generic utility-based measures. COPD 2005, 2: 91–97. 10.1081/COPD-200052090View ArticlePubMedGoogle Scholar
- Shannon CE: The mathematical theory of communication. 1963. MD Comput 1997, 14: 306–317.PubMedGoogle Scholar
- Rao GS, Hamid Z, Rao JS: The information content of DNA and evolution. J Theor Biol 1979, 81: 803–807. 10.1016/0022-5193(79)90282-0View ArticlePubMedGoogle Scholar
- Dahl FA, Østerås N: Quantifying information content in survey data by entropy. Entropy 2010, 2: 161–163.View ArticleGoogle Scholar
- Janssen MF, Birnie E, Bonsel GJ: Quantification of the level descriptors for the standard EQ-5D three-level system and a five-level version according to two methods. Qual Life Res 2008, 17: 463–473. 10.1007/s11136-008-9318-5PubMed CentralView ArticlePubMedGoogle Scholar
- Polinder S, Haagsma JA, Bonsel G, Essink-Bot ML, Toet H, van Beeck EF: The measurement of long-term health-related quality of life after injury: comparison of EQ-5D and the health utilities index. Inj Prev 2010, 16: 147–153. 10.1136/ip.2009.022418View ArticlePubMedGoogle Scholar
- Lame IE, Peters ML, Vlaeyen JW, Kleef M, Patijn J: Quality of life in chronic pain is more associated with beliefs about pain, than with pain intensity. Eur J Pain 2005, 9: 15–24. 10.1016/j.ejpain.2004.02.006View ArticlePubMedGoogle Scholar
- Horng YS, Hwang YH, Wu HC, Liang HW, Mhe YJ, Twu FC, Wang JD: Predicting health-related quality of life in patients with low back pain. Spine 2005, 30: 551–555. 10.1097/01.brs.0000154623.20778.f0View ArticlePubMedGoogle Scholar
- Diederichs C, Berger K, Bartels DB: The measurement of multiple chronic diseases–a systematic review on existing multimorbidity indices. J Gerontol A Biol Sci Med Sci 2011, 66: 301–311.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.