A head-to-head comparison of the EQ-5D-3L index scores derived from the two EQ-5D-3L value sets for China

Zhang, Ruo-Yu; Wang, Wei; Zhou, Hui-Jun; Xuan, Jian-Wei; Luo, Nan; Wang, Pei

doi:10.1186/s12955-022-01988-w

Research
Open access
Published: 19 May 2022

A head-to-head comparison of the EQ-5D-3L index scores derived from the two EQ-5D-3L value sets for China

Ruo-Yu Zhang¹^na1,
Wei Wang²^na1,
Hui-Jun Zhou³,
Jian-Wei Xuan⁴,
Nan Luo⁵ &
…
Pei Wang^2,6

Health and Quality of Life Outcomes volume 20, Article number: 80 (2022) Cite this article

1904 Accesses
3 Citations
Metrics details

Abstract

Objective

Two EQ-5D-3L (3L) value sets (developed in 2014 and 2018) co-exist in China. The study examined the level of agreement between index scores for all the 243 health states derived from them at both absolute and relative levels and compared the responsiveness of the two indices.

Methods

Intraclass correlations coefficient (ICC) and Bland–Altman plot were adopted to assess the degree of agreement between the two indices at the absolute level. Health gains for 29,403 possible transitions between pairs of 3L health states were calculated to assess the agreement at the relative level. Their responsiveness for the transitions was assessed using Cohen effect size.

Results

The mean (SD) value was 0.427 (0.206) and 0.649 (0.189) for the 3L₂₀₁₄ and 3L₂₀₁₈ index scores, respectively. Although the ICC value showed good agreement (i.e., 0.896), 88.9% (216/243) of the points were beyond the minimum important difference limit according to the Bland–Altman plot. The mean health gains for the 29,403 health transitions was 0.234 (3L₂₀₁₄ index score) and 0.216 (3L₂₀₁₈ index score). The two indices predicted consistent transitions in 23,720 (80.7%) of 29,403 pairs. For the consistent pairs, Cohen effective size value was 1.05 (3L₂₀₁₄ index score) or 1.06 (3L₂₀₁₈ index score); and the 3L₂₀₁₄ index score only yielded 0.007 more utility gains. However, the results based on the two measures varied substantially according to the direction and magnitude of health change.

Conclusion

The 3L₂₀₁₄ and 3L₂₀₁₈ index scores are not interchangeable. The choice between them is likely to influence QALYs estimations.

Introduction

The EQ-5D-3L (3L) is one of the most widely used utility instruments in measuring health-related quality of life (HRQoL) [1,2,3,4] for use in quality-adjusted life years (QALYs) calculation. It has a classification system consisting of five dimensions: mobility (MO), self-care (SC), usual activities (UA), pain/discomfort (PD), anxiety/depression (AD), with three functioning levels (no problems, some problems, and extreme problems) in each dimension. The system thus defined 243 (3⁵) possible health states [5], and each of them can be coded into a five-digit number ranging from “11111” to “33333” (e.g., 12321 means no problems in mobility, some problems in self-care, extreme problems in usual activities, some problems in pain/discomfort and no problems in anxiety/depression). A single utility index score can be assigned to each health state by using a value set, which was developed in a valuation study based on general population’s health preferences. Since health preferences differ across populations [6, 7], a number of 3L value sets have been derived in different countries/regions [8]. Some countries (e.g., Korea, USA, and China) even developed two value sets due to respective reasons [9,10,11,12,13,14]. Taking China for example, compared to the first value set developed in 2014 (i.e., 3L₂₀₁₄ value set) using a sample comprising residents mainly form urban areas, the second value set developed in 2018 (i.e., 3L₂₀₁₈ value set) adopted a more representative sample of residents from both rural and urban areas (Table 1).

Table 1 Comparison of valuation method and characteristics of the two EQ-5D-3L value sets for China

Full size table

Despite the availability of the EQ-5D-5L (5L, a new version of 3L) index score with improved psychometric properties [15,16,17,18], the 3L index score is still with great usefulness due to the considerations of consistency and continuity in decision making process [19]. Indeed, the National Health Service Survey in China continually used the 3L to measure the HRQoL of Chinese residents even after the publication of the 5L value set for China in 2017 [20]. Moreover, the 3L can also be used to generate the 5L index score based on the 5L information and a crosswalk function [21], thus utilizing the advantages of 5L descriptive system.

Similarly, the 3L₂₀₁₄ value set is still more frequently used than the 3L₂₀₁₈ value set, albeit with its disadvantage in the sampling method. According to Web of Science, the former has been cited in 139 articles by April 16, 2021, 62 of which cited it after the availability of the latter. In contrast, the 3L₂₀₁₈ value set has been cited only eighteen times since its publication [22, 23]. Given the noticeable differences in coefficients of scoring algorithms for the two value sets (Table 1), it is unlikely that the two value sets would yield identical utility index scores for the same health state. However, it remains unclear to what extent the use of different utility scores generated from the two value sets would affect results of QALYs computation, which mainly depends on the difference in utility scores rather than the absolute utility scores. Moreover, it is not known whether the difference in the utility scores is clinically important as well. Our previous study has compared the two 3L indices in diabetes patients, and found that they had different discriminative power and the choice between them may impact the QALYs estimation [24]. Another study has also compared them in patients with gastric cancer and healthy controls, and showed that the 3L₂₀₁₄ index score had better ability to distinguish the patients from controls [25]. A study published in Chinese also compared the two 3L value sets in measuring the HRQoL of Tibet residents and concluded they could not be used interchangeably [26]. Nevertheless, all the previous studies were based on either a single disease group or a special group, it is not known whether the findings could be generalized to general populations or other patients in China.

Hence, the study aimed to: (1) examine the level of agreement at both absolute and relative levels of all the 243 index scores derived from the two 3L value sets for China; and (2) compare the responsiveness of two indices (i.e. to capture the real changes in health states over time).

Methods

The two 3L indices generated from the two 3L value sets for China

The two 3L value sets were developed using different sampling methods, valuation protocols [27, 28], modeling methods, leading to distinct algorithms for calculating the 3L index scores (Table 1). For example, the utility score for health state “23221” is 0.466 (i.e., 1-0.039-0.099-0.208-0.074-0.092-0.022) according to the 2014 algorithm or 0.568 (i.e., 1-0.077-0.291-0.037-0.027) according to the 2018 algorithm. In the study, both algorithms were used to generate the two index scores of all the 243 3L health states for analysis. There are three main differences between them. First, for the 3L₂₀₁₄ value set, respondents were selected from urban areas through a quota sampling; while for the 3L₂₀₁₈ value set, a more representative sample of respondents were obtained from both rural and urban areas by using a random sampling method. Second, the 3L₂₀₁₄ and 3L₂₀₁₈ value sets were developed using the Paris protocol and the Measurement and Valuation of Health (MVH) protocol, respectively, whereby the former protocol is an improvement of the latter. Third, the time-trade off (TTO) technique for the 3L₂₀₁₄ value set was based on the ‘death’ state to elicit health utility scores, but not for the 3L₂₀₁₈ value set. Those differences led to distinct algorithms for calculating the 3L index scores (Table 1).

Statistical analysis

We assessed the distributions of the two indices (i.e., 3L₂₀₁₄ index score and 3L₂₀₁₈ index score) using the Shapiro–Wilk test. T-test or Wilcoxon rank-sum test were then used to compare their mean values wherever appropriate.

A two-way mixed intraclass correlation coefficient (ICC) [29] and Bland–Altman plot [30] were adopted to assess the degree of agreement between the two indices at absolute level. The agreement was considered good when the ICC value was higher than 0.7. The Bland–Altman plot was used to visualize and assess the level of agreement across different utility segments, whereby the Y-axis depicts the differences in score between the two indices, and the X-axis represents their mean values. A limit of 0.074, that is the minimally important difference (MID) of the 3L index score, [31] was used to determine whether the magnitude of the difference would be clinically important.

To examine the agreement of the two 3L index scores at relative level, we simulated all the possible health states transitions that may occur over time. All the 243 health states were paired to form 29,403 (C²₂₄₃) health state combinations, each of which was used to simulate a pair of health states before and after treatment. It was assumed that the health states with higher index scores were as the states after treatment (post-treatment), and the lower were as the health states before treatment (pre-treatment) [32]. Hence, the health gains of our simulated treatment were always positive. However, the index score of the same health state may vary when changing from one value set to the other, thus a health state labeled as pre-treatment when using the 3L₂₀₁₄ value set may represent post-treatment instead when using the 3L₂₀₁₈ value set in the same pair, or vice versa. This was what we considered as an “inconsistent” pair of health states [33], whereby the choice of index scores would have a substantial impact on health outcomes, i.e. one may generate a positive health gain, while the other may result in health losses.

On the contrary, for a “consistent” pair, the health state representing pre-treatment remained unchanged regardless of using either the 3L₂₀₁₄ or 3L₂₀₁₈ value set. Given the magnitude of health gains may vary from one value set to another, the consistent group was further divided into four subgroups according to the perceived direction and magnitude of the change before and after treatment: (1) major improvement (i.e. at least one dimension in the health transition is increased from level 3 to level 1 or level 2, and no dimension is decreased); (2) minor improvement (i.e. at least one dimension in the health transition is increased from level 2 to level 1, and no dimension is increased from level 3 to 1 or 2, nor is the level of any dimension decreased); (3) mixed response with minor deterioration (i.e. at least one dimension is decreased from level 1 to 2 and no dimension is decreased from level 1 or 2 to 3); (4) mixed response with major deterioration (i.e. at least one dimension is decreased from level 1 or 2 to 3) [33]. It should be noted that, if the level of one dimension deteriorates yet the level of the others improves in a health transition, it would be considered as a mixed response with some deterioration and thus assigned to either subgroup 3 or 4. We then compared the health gains yielded from the two 3L indices for all the transitions, consistent transitions, and each subgroup of the consistent transitions.

Moreover, in order to help understand how a single-level change in severity of descriptive systems would result in different utility change between the two value sets, we computed changes in utility values between pairs of adjacent health states for each value set. We called them “adjacent” when two health states are exactly the same except for one dimension where the severity level differs by one only [15, 34,35,36]. For example, health states “21111” and “11111” were considered adjacent.

We also compared the responsiveness of the two 3L indices within the consistent group by using Cohen effect size [37]^. It is commonly used to measure the effect size of a treatment, and is independent of the sample size which is unlike the significance test. It is calculated as the difference in the mean scores between post-treatment and pre-treatment divided by the standard deviation of the pre-treatment. The effect size was categorized as small (0.2–0.5), moderate (> 0.5–0.8), or large (> 0.8) [37]. Given that the hypothetical treatment was fixed in our simulation, the effect size would reflect the ability of an index score to discern changes in two known health states. The higher the effect size, the more responsive the index score is. We calculated and compared Cohen effect size for all the consistent pairs and each subgroup of the pairs. Microsoft Excel and Stata and SAS were used for statistical analysis.

Results

The two 3L indices were both normally distributed according to the Shapiro–Wilk test (Fig. 1). Overall, the 3L₂₀₁₄ value set generated systematically lower index scores compared with those yielded from the 3L₂₀₁₈ value set. The mean (SD) value of all the index scores was 0.427 (0.206) for the former and 0.649 (0.189) for the latter, with the difference in mean being 0.222 (p < 0.001) (Table 2); the 3L₂₀₁₄ value set also had lower scores for 239 out of 243 health states. Meanwhile, the difference and variance between the two index scores were not invariant but generally increased with the increasing in health-state severity (Fig. 2). For example, the index score of the second-best health state was 0.887 (for state “11211”) and 0.973 (for state “11121”); while the minimum index score was −0.149 and 0.170 (for the worst state “33333”) according to the 3L₂₀₁₄ or 3L₂₀₁₈ value set, respectively. Although the overall agreement between the two kinds of index scores was good (ICC = 0.896), 88.9% (216/243) of the points were beyond the MID limit according to Bland–Altman plot (Fig. 3).

Table 2 Comparison of the two EQ-5D-3L index scores at absolute and relative levels

Full size table

On the other hand, the difference between the two indices was not so obvious for the 29,403 health transitions: the mean differences (SD) were 0.234 (0.173) and 0.216 (0.158) for the 3L₂₀₁₄ and 3L₂₀₁₈ index scores, respectively. Similarly, in 23,720 (80.7%) of 29,403 transitions, the two indices generated consistent results for health gains before and after a simulated treatment, with the difference in mean health gains for the transitions being only 0.007 (p < 0.001) (Table2). Among the consistent transitions, the number of pairs for each subgroup was 6752 (major improvement), 781(minor improvement), 4515 (mixed response with minor deterioration), and 11,672 (mixed response with major deterioration).

In the subgroups of major/minor improvement, the 3L₂₀₁₄ index score yielded greater magnitude of health gains at 0.411/0.151(vs. 0.310/0.072 from the 3L₂₀₁₈ index score). However, it generated similar or lower health gains compared to the 3L₂₀₁₈ index score in the subgroups of “mixed response with minor deterioration” (health gains: 0.246 for both index scores) and “mixed response with major deterioration” (health gains: 0.069 vs. 0.118) (Table 3)_. The absolute change in utility values between any two adjacent states computed using the 3L₂₀₁₄ value set was larger than that using the 3L₂₀₁₈ value set, expect for pairs that involve a change between level 2 and 3 in either “mobility” or “self-care” dimension (Table 4). Essentially, it reflected the fact that differences between coefficients of the same dimension in the scoring algorithm vary from one value set to another.

Table 3 Responsiveness of the two EQ-5D index scores in simulated transitions between EQ-5D-3L health states

Full size table

Table 4 Differences in utility change of adjacent health states between two value sets

Full size table

The two indices also showed a similar level of sensitivity to change for all the consistent changes, with Cohen effect size values at 1.05 and 1.06, respectively. Nevertheless, the value varied substantially across the subgroups. In the subgroups of major/minor improvement, the 3L₂₀₁₄ index score demonstrated higher values than the 3L₂₀₁₈ index score (Cohen effect size: 2.38 vs. 1.72/0.88 vs. 0.45). While in the subgroup of mixed response with major deterioration, the result was reversed (Cohen effect size:0.37 vs. 0.66); in the subgroup of mixed response with minor deterioration, the two index scores demonstrated similar responsiveness with Cohen effect sizes at 1.48 vs. 1.42. (Table 3).

Discussion

In the study, we compared the agreement of all the two 3L index scores generated from the two 3L value sets for China. We found that the 3L₂₀₁₄ index score was systematically lower than the 3L₂₀₁₈ index score at absolute level, but their differences at relative level varied in terms of the direction and magnitude of the health change.

It is not surprising that the 3L₂₀₁₄ index score was much lower given the 3L₂₀₁₄ algorithm has larger values in 8 out of 10 parameters and two more terms (i.e., constant and N3) further pulling down the scores (Table 1). The difference and variance between the two index scores were also increased with the increasing in health-state severity. Regarding the former, the difference in level-3 (L3) parameters between the two algorithms is in general larger than the difference in level-2 (L2) parameters. This, plus the use of N3 term, lead to the increased difference. The latter could be ascribed to the fact that the 3L₂₀₁₈ algorithm has two L3 parameters with larger values (i.e., MO3 and SC3) than those of the 3L₂₀₁₄ algorithm. As a result, for health states including the problems, the difference between the index scores may be reduced rather than increased, resulting in larger variance for all health states including L3 problems. Difference in algorithm parameters may be attributed to several factors such as the valuation protocol, modeling method, as well as the sample used [13, 14]. The sample for the 3L₂₀₁₈ algorithm including the rural population, who may be more likely to live with economic hardships over years. Hence, they may be able to endure more pain and suffering, leading to a relatively higher estimate in utility values for health problems than the better-off residents. In addition, the 3L₂₀₁₈ value set used an open-ended TTO question. The developers of the 3L₂₀₁₈ value set believed that due to cultural reasons, death is a taboo in China, especially in rural areas. When using the TTO method, the interviewers did not tell the respondents to imagine die immediately after living in a hypothetical health state for a period of time. Therefore, the respondents may make variant assumptions about the length of life and health states of the continued lives, which may have led to an overestimation of the TTO.

The two indices generated consistent results for the majority (80.7%) of health transitions. For the transitions involving improvement only, the results would always be consistent regardless the differences in scoring algorithms. On the other hand, the inconsistent results would be presented for the transitions including both improvement and deterioration in different dimensions. Compared to the 3L₂₀₁₄ algorithm, the parameter coefficients of the 3L₂₀₁₈ algorithm display greater variance. Its parameter value for L2 and L3 problems of the 3L₂₀₁₈ algorithm varied from 0.027 (PD2) to 0.077 (MO2), and 0.041 (PD3) to 0.291(SC3); while such the parameters for the 3L₂₀₁₄ algorithm ranged from 0.074 (UA2) to 0.099 (MO2) and 0.205(AD3) to 0.246 (MO3). For example, a health transition resulted from health state “11131”to “11113” would be considered as health gain and health loss according to the 3L₂₀₁₄ (0.031) algorithm and 3L₂₀₁₈ (−0.136) algorithm, respectively.

With regard to all the consistent health transitions, both the index scores showed similar health gains and responsiveness, but they varied considerably across the four subgroups. The health gains and responsiveness of the 3L₂₀₁₄ index score were found to be better or greater than those of the 3L₂₀₁₈ index score in the “major improvement” and “minor improvement” subgroups, which suggested that the use of the 3L₂₀₁₄ algorithm would tend to result in larger QALY gains for the two subgroups. On the other hand, in the subgroups of “mixed response with minor deterioration” and “mixed response with major deterioration”, the two index scores generated similar or even reversed results. For the subgroups 1 & 2, the 3L₂₀₁₄ algorithm overall has larger parameter values, indicating the health gain from a transition from extreme/some problems to no problems is much greater according to it. Similarly, the magnitude of difference between L2 and L3 parameters is also generally larger for the 3L₂₀₁₄ algorithm, leading to comparable conclusions for the transitions from extreme problems to some problems. This point became clearer when we compared changes in utility values of two adjacent health states between the two value sets, as shown in Table 4. For the subgroups 3 & 4, the 3L₂₀₁₄ algorithm has relatively similar parameter values across the five L2 and the five L3 parameters. Hence, for a health transition involving both improvement and deterioration, the magnitude of health gain from the improvement in a certain dimension may be offset to a large extent by the deterioration from another dimension according to the 3L₂₀₁₄ algorithm. The resulting health gains and responsiveness were therefore not larger or better than those based on the 3L₂₀₁₈ algorithm in the subgroups.

It should be bear in mind that in reality the frequencies of the 243 health states and 29,403 transitions would be distributed disproportionately. For example, the state “11111” has been the most frequently observed in a number of studies in China, which may lead to different conclusions. [24] When measuring individuals who are expected to be either stable or gain improvement in all the 5 dimensions of 3L from an intervention, the 3L₂₀₁₄ value set may be a more preferable choice. But in other scenarios, the choice becomes less straightforward and thus it is recommended to apply both value sets in data analyses as part of a robustness check. Also, the absolute utility score could also influence the QALY calculation to some extent. Hence, more empirical studies are warranted to further assess the impact in various settings in China.We also acknowledge a new 3L value set for China’s rural population developed by Liu et al. has been available recently [38]. They also found that the utilty scores generated from the value set were generally lower than those of the two 3L value sets used in the current analysis. We did not include the value set as we have finished the analysis and paper writing before its publication. Nevetherless, the differences among the three kinds of 3L utilities may necessitate the valuation of 5L health states from both rural and urban respondents since the current 5L value set for China is based on urban respondents only.

Conclusion

Our results suggested a substantial difference between the 3L₂₀₁₄ and 3L₂₀₁₈ index scores at absolute level; while their differences at relative level differed according to the type of health change. Our findings suggested that choosing which value set to generate 3L index score is very likely to influence QALYs estimate in China.

Availability of data and material

The data that support the findings of this study are available on request from the corresponding author.

Abbreviations

3L:: EQ-5D-3L
5L:: EQ-5D-5L
AD:: Anxiety/depression
HRQoL:: Health-related quality of life
ICC:: Intraclass correlations coefficient
MID:: Minimally important difference
MO:: Mobility
MVH:: Measurement and valuation of health protocol
PD:: Pain/discomfort
QALYs:: Quality-adjusted life years
SC:: Self-care
TTO:: Time-trade off
UA:: Usual activities

References

Wang H, Kindig DA, Mullahy J. Variation in Chinese population health related quality of life: results from a EuroQol study in Beijing, China. Qual Life Res. 2005;14(1):119–32.
Article Google Scholar
Fang H, Farooq U, Wang D, Yu F, Younus MI, Guo X. Reliability and validity of the EQ-5D-3L for Kashin-Beck disease in China. Springerplus. 2016;5(1):1924.
Article Google Scholar
Wang HM, Patrick DL, Edwards TC, Skalicky AM, Zeng HY, Gu WW. Validation of the EQ-5D in a general population sample in urban China. Qual Life Res. 2012;21(1):155–60.
Article Google Scholar
Payakachat N, Ali MM, Tilford JM. Can the EQ-5D detect meaningful change? A systematic review. Pharmacoeconomics. 2015;33(11):1137–54.
Article Google Scholar
Brooks R. EuroQol: the current state of play. Health Policy. 1996;37(1):53–72.
Article CAS Google Scholar
Johnson JA, Luo N, Shaw JW, Kind P, Coons SJ. Valuations of EQ-5D health states: Are the United States and United Kingdom different? Med Care. 2005;43(3):221–8.
Article Google Scholar
Badia X, Roset M, Herdman M, Kind P. A comparison of United Kingdom and Spanish general population time trade-off values for EQ-5D health states. Med Decis Mak. 2001;21(1):7–16.
Article CAS Google Scholar
EuroQol. (2019). EQ-5D-3L Valuation. EQ-5D. May 6, 2020. Available at: https://euroqol.org/eq-5d-instruments/eq-5d-3l-about/valuation/.
Lee YK, Nam HS, Chuang LH, Kim KY, Yang HK, Kwon IS, et al. South Korean time trade-off values for EQ-5D health states: modeling with observed values for 101 health states. Value Health. 2009;12(8):1187–93.
Article Google Scholar
Jo MW, Yun SC, Lee SI. Estimating quality weights for EQ-5D health states with the time trade-off method in South Korea. Value Health. 2008;11(7):1186–9.
Article Google Scholar
Shaw JW, Johnson JA, Coons SJ. US valuation of the EQ-5D health states: development and testing of the D1 valuation model. Med Care. 2005;43(3):203–20.
Article Google Scholar
Shaw JW, Pickard AS, Yu S, Chen S, Iannacchione VG, Johnson JA, et al. A median model for predicting United States population-based EQ-5D health state preferences. Value Health. 2010;13(2):278–88.
Article Google Scholar
Liu GG, Wu H, Li M, Gao C, Luo N. Chinese time trade-off values for EQ-5D health states. Value Health. 2014;17(5):597–604.
Article Google Scholar
Zhuo L, Xu L, Ye J, Sun S, Zhang Y, Burstrom K, et al. Time trade-off value set for EQ-5D-3L Based on a nationally representative Chinese population survey. Value Health. 2018;21(11):1330–7.
Article Google Scholar
Mulhern B, Feng Y, Shah K, Janssen MF, Herdman M, van Hout B, et al. Comparing the UK EQ-5D-3L and English EQ-5D-5L Value Sets. Pharmacoeconomics. 2018;36(6):699–713.
Article Google Scholar
Janssen MF, Bonsel GJ, Luo N. Is EQ-5D-5L better than EQ-5D-3L? A head-to-head comparison of descriptive systems and value sets from seven countries. Pharmacoeconomics. 2018;36(6):675–97.
Article Google Scholar
Ferreira LN, Ferreira PL, Ribeiro FP, Pereira LN. Comparing the performance of the EQ-5D-3L and the EQ-5D-5L in young Portuguese adults. Health Qual Life Outcomes. 2016;14:89.
Article Google Scholar
Jin X, Al SF, Ohinmaa A, Marshall DA, Smith C, Johnson JA. The EQ-5D-5L Is superior to the -3L version in measuring health-related quality of life in patients awaiting THA or TKA. Clin Orthop Relat Res. 2019;477(7):1632–44.
Article Google Scholar
NICE. Position statement on use of the EQ-5D-5L valuation set for England (updated October 2019).2019. Available from: https://www.nice.org.uk/Media/Default/About/what-we-do/NICE-guidance/NICE-technology-appraisal-guidance/eq5d5l_nice_position_statement.pdf
Luo N, Liu G, Li M, Guan H, Jin X, Rand-Hendriksen K. Estimating an EQ-5D-5L value set for China. Value Health. 2017;20(4):662–9.
Article Google Scholar
van Hout B, Janssen MF, Feng YS, Kohlmann T, Busschbach J, Golicki D, et al. Interim scoring for the EQ-5D-5L: mapping the EQ-5D-5L to EQ-5D-3L value sets. Value Health. 2012;15(5):708–15.
Article Google Scholar
Web of Science. July 12, 2020. Available at: http://apps.webofknowledge.com/CitingArticles.do?product=UA&SID=5CpPTRScqIV6dWuWHLt&search_mode=CitingArticles&parentProduct=UA&parentQid=5&parentDoc=7&REFID=474383296&betterCount=112&logEventUT=WOS:000341084700014&excludeEventConfig=ExcludeIfFromNonInterProduct.
Web of Science. July 12, 2020. Available at: http://apps.webofknowledge.com/CitingArticles.do?product=WOS&REFID=571924828&SID=5CpPTRScqIV6dWuWHLt&search_mode=CitingArticles&parentProduct=UA&parentQid=15&parentDoc=37&excludeEventConfig=ExcludeIfFromFullRecPage.
Pan CW, Zhang RY, Luo N, He JY, Liu RJ, Ying XH, et al. How the EQ-5D utilities are derived matters in Chinese diabetes patients: a comparison based on different EQ-5D scoring functions for China. Qual Life Res. 2020;29(11):3087–94.
Article Google Scholar
Xia R, Zeng H, Liu Q, Liu S, Zhang Z, Liu Y, et al. Health-related quality of life and health utility score of patients with gastric cancer: a multi-center cross-sectional survey in China. Eur J Cancer Care (Engl). 2020;29(6): e13283.
Article Google Scholar
Zhao S, Shun-Ping L, Da-Wa Z, et al. Comparison of two Chinese value Sets of EQ-5D-3L scale: based on the application of urban and rural residents in Tibet. Chin Health Econ. 2019;38(12):9–12.
Google Scholar
Dolan P. Modeling valuations for EuroQol health states. Med Care. 1997;35(11):1095–108.
Article CAS Google Scholar
Kind P. A revised protocol for the valuation of health states defined by the EQ-5D-3L classification system: learning the lessons from the MVH study. York: Centre for Health Economics, University of York; 2009.
Google Scholar
Machin D, Fayers PM. Quality of life: the assessment, analysis, and reporting of patient-reported outcomes. 3rd ed. Chichester, UK: John Wiley; 2016.
Google Scholar
Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10.
Article CAS Google Scholar
Walters SJ, Brazier JE. Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D. Qual Life Res. 2005;14(6):1523–32.
Article Google Scholar
Nan L, Johnson JA, Shaw JW, Coons SJ. A comparison of EQ-5D index scores derived from the US and UK population-based scoring functions. Med Decis Mak. 2007;27(3):321–6.
Article Google Scholar
Kiadaliri AA. A comparison of Iran and UK EQ-5D-3L value sets based on visual analogue scale. Int J Health Policy Manag. 2017;6(5):267–72.
Article Google Scholar
Pan T, Mulhern B, Viney R, Norman R, Hanmer J, Devlin N. A Comparison of PROPr and EQ-5D-5L value sets. Pharmacoeconomics. 2022;40(3):297–307.
Article Google Scholar
Rencz F, Brodszky V, Gulácsi L, Golicki D, Ruzsa G, Pickard AS, et al. Parallel valuation of the EQ-5D-3L and EQ-5D-5L by time trade-off in Hungary. Value Health. 2020;23(9):1235–45.
Article Google Scholar
Law EH, Pickard AS, Xie F, Walton SM, Lee TA, Schwartz A. Parallel valuation: a direct comparison of EQ-5D-3L and EQ-5D-5L societal value sets. Med Decis Mak. 2018;38(8):968–82.
Article Google Scholar
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum; 1988.
Google Scholar
Liu GG, Guan H, Jin X, Zhang H, Vortherms SA, Wu H. Rural population’s preferences matter: a value set for the EQ-5D-3L health states for China’s rural population. Health Qual Life Outcomes. 2022;20(1):14.
Article CAS Google Scholar

Download references

Funding

The study was funded by the Shanghai Leading Academic Discipline Project of Public Health (Grant Number: GWV-10.1-XK14).

Author information

Ruo-Yu Zhang and Wei Wang contributed equally to this work.

Authors and Affiliations

Shanghai Centennial Scientific Co., Ltd, Shanghai, China
Ruo-Yu Zhang
School of Public Health, Fudan University, 130 Dong An Road, Shanghai, 200032, China
Wei Wang & Pei Wang
Business School, University of Shanghai for Science and Technology, Shanghai, China
Hui-Jun Zhou
Health Economic Research Institute, Sun Yat-Sen University, Guangzhou, China
Jian-Wei Xuan
Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
Nan Luo
Key Lab of Health Technology Assessment, National Health Commission of the People’s Republic of China (Fudan University), Shanghai, China
Pei Wang

Authors

Ruo-Yu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hui-Jun Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Wei Xuan
View author publications
You can also search for this author in PubMed Google Scholar
Nan Luo
View author publications
You can also search for this author in PubMed Google Scholar
Pei Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

PW designed and supervised the study. R-YZ conducted the data analysis. R-YZ, WW and PW wrote the manuscript. All authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Pei Wang.

Ethics declarations

Ethical approval and consent to participate

Not applicable.

Consent for publication

All the authors have reviewed the final manuscript and consented for publication.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Zhang, RY., Wang, W., Zhou, HJ. et al. A head-to-head comparison of the EQ-5D-3L index scores derived from the two EQ-5D-3L value sets for China. Health Qual Life Outcomes 20, 80 (2022). https://doi.org/10.1186/s12955-022-01988-w

Download citation

Received: 11 January 2022
Accepted: 01 May 2022
Published: 19 May 2022
DOI: https://doi.org/10.1186/s12955-022-01988-w

A head-to-head comparison of the EQ-5D-3L index scores derived from the two EQ-5D-3L value sets for China