Skip to main content

Validation and comparison of EuroQoL-5 dimension (EQ-5D) and Short Form-6 dimension (SF-6D) among stable angina patients



Several preference-based health-related quality of life (HRQoL) instruments have been published and widely used in different populations. However no consensus has emerged regarding the most appropriate instrument in therapeutic area of stable angina. This study compared and validated the psychometric properties of two generic preference-based instruments, the EQ-5D and SF-6D, among Chinese stable angina patients.


Convergent validity of the EQ-5D and SF-6D was examined with eight a priori hypotheses from stable angina patients in conjunction with Seattle Angina Questionnaire (SAQ). Responsiveness was compared using the effect size (ES), relative efficiency (RE) and receiver operating characteristic (ROC) curves. Agreement between the EQ-5D and SF-6D was tested using intra-class correlation coefficient (ICC) and Bland-Altman plot. Factors affecting utility difference were explored with multiple linear regression analysis.


In 411 patients (mean age 68.08 ± 11.35), mean utility scores (SD) were 0.78 (0.15) for the EQ-5D and 0.68 (0.12) for the SF-6D. Validity was demonstrated by the moderate to strong correlation coefficients (Range: 0.368-0.594, P< 0.001) for five of the eight hypotheses in both the EQ-5D and SF-6D. There were no serious floor effects for the EQ-5D and SF-6D, but ceiling effects for the EQ-5D were large. The areas under ROC of them all exceeded 0.5 (0.660-0.814, P<?0.001). The SF-6D showed a better discriminative capacity (ES: 0.573 to 1.179) between groups with different stable-angina-specific health status than the EQ-5D (ES: 0.426 to 1.126). RE suggested that the SF-6D (RE: 44.8 to 177.8%) was more efficient than the EQ-5D except for physical function. Poor agreement between them was observed with ICC (0.448, P< 0.001) and Bland-Altman plot analysis. Multiple liner regression showed that clinical variables significantly (P< 0.05) influenced differences in utility scores between the EQ-5D and SF-6D.


Both EQ-5D and SF-6D are valid and sensitive preference-based HRQoL instruments in Chinese stable angina patients. The SF-6D may be a more effective tool with lower ceiling effect and greater sensitivity. Further study is needed to compare other properties, such as reliability and longitudinal response.


There is an increasing demand for cost-utility analysis (CUA), which allows decision-makers to compare the value of interventions for different health problems and has been adopted by many countries such as the UK and US [1],[2]. The most commonly used outcome indicator in CUA is the quality adjusted life year (QALY) which is a combination of the time spent in a health state and a utility value representing quality of life for that particular health state [3]. Utility values usually range from 1 (full health) to 0 (death) and direct methods of measuring utilities (e.g. standard gamble or time trade-off) are complex and time-consuming. As an alternative, preference-based instruments are increasingly used in clinical studies and population surveys to generate utility scores [4]. They allow each health status to be described using a simple health status classification system, which can then be used to calculate utility scores with a validated algorithm [5]. Several preference-based instruments including the Quality of Well Being (QWB) [6], Health Utilities Index (HUI) [7], EQ-5D [8], Assessment of Quality of Life (AQoL) [9] and the SF-6D [10] have been published and widely used in different populations.

Given its low respondent burden, the EQ-5D has gained widespread use in clinical studies and population surveys. The EQ-5D has a number of country-specific choice-based preference weights, including weights for the UK, the US, Canada [11], Japan [7] and China [12]. In the UK, National Institute for Health and Clinical Excellence (NICE) currently suggests that the most preferred preference-based instrument is the EQ-5D but recognizes that the EQ-5D may not be appropriate in all circumstances [1]. The SF-6D, which is derived from the 36-item Short Form Health Survey (SF-36), is one of the most widely used generic measures of HRQoL in clinical trials. The major reason for developing SF-6D is to considerably extending the scope for undertaking economic evaluation in health care using existing and future SF-36 data sets [10]. Several studies have compared EQ-5D with SF-6D in different patient groups, including chronic prostatitis [13], chronic heart failure [14], coronary heart disease [15], chronic pain [16], type 2 diabetes [17], inflammatory arthritis [18] and mental health [19]. Fei-Li Zhao et al. found that both EQ-5D and SF-6D are demonstrated to be valid and sensitive HRQoL measures in Chinese chronic prostatitis patients, with SF-6D showing better HRQoL dimension coverage, greater sensitivity, and lower ceiling effect [13]. While, Marko Obradovic et al. found that EQ-5D scores were lower than SF-6D scores in patients with chronic pain, with EQ-5D showing higher construct validity and responsiveness [16]. In general, the two measures are not equivalent and the validity and comparative responsiveness of the EQ-5D and SF-6D differ depending on the population [13]-[19]. The choice of instrument to measure HRQoL may have potential implications for decision-making [18]. Evidence comparing the performance of these instruments is needed to inform the selection of the most appropriate instrument. In addition, the evidence requires cumulative results from different settings and types of study [20].

Stable angina, the cardinal symptom of coronary artery disease (CAD), is a major debilitating health condition with common chronic symptoms of intermittent, reversible chest pain or discomfort [21]. In China, approximately 7.7 thousand per million people have CAD and about half of them suffer from angina [22],[23]. Stable angina has a major negative impact on health-related quality of life (HRQoL), including poor general health status, pain, impaired role functioning, activity restriction, inability to self-manage, and psychological distress [24]. HRQoL measurement among patients with stable angina is thus important for evaluation of new health technologies and resource allocation decisions. Cardiac trials commonly include the collection of different disease-specific and generic measures of health status, such as the Seattle Angina Questionnaire (SAQ) [25], Angina Pectoris Quality of Life Questionnaire (APQLQ) [26], SF-36 [27], and the Nottingham Health Profile (NHP) [28]. However, these instruments can't be used to elicit utility values for calculating QALYs, which is a fundamental component in CUA as mentioned above. As the management of stable angina patients could potentially involve substantial resource consumption [29], providing preference-based measures that can be incorporated into economic evaluation is particularly important. Establishing practicality and validity of these measures is required before their application [30]. To the best of our knowledge, no preference-based instrument has been validated among stable angina patients. Therefore, the objective of this study was to evaluate the validity and sensitivity of the EQ-5D and SF-6D on stable angina patients and further to evaluate and compare the performance of these two instruments.


Study design and patient recruitment

A survey was conducted in two cities of China, Tianjin (northern China) and Chengdu (southern China), from July to December, 2011. Stable angina patients were recruited in two tertiary hospitals in Tianjin and two community health service centers (CHS) in Chengdu as chronic illness is managed in communities in Chengdu, but not in Tianjin.

Patients were included in the study if they were 18 years or above and had been clinically diagnosed with stable angina by their attending physicians based on clinical symptoms, examinations of coronary angiography, dual source Computer Tomography (CT), and history of CAD. Additional criterion included typical angina symptoms with a report of at least one episode of chest pain in the previous 3 months. Patients were excluded from participating if they had experienced acute myocardial infarction or coronary revascularization such as coronary artery bypass grafting surgery and percutaneous intervention in the previous 6 months. Patients were also excluded if they had any active exacerbation of gastrointestinal (GI) problems, such as an ulcer, or if they were unable to differentiate between their GI symptoms and angina pain. These criteria were used to help increase the likelihood that patients' chest pain was cardiac in nature rather than non-cardiac.

The study protocol was approved by the Institutional Review Board (IRB) of Tianjin University, and written informed consent concerning the conduct of the survey was obtained from each subject before participating in the study. Patients were interviewed by a trained interviewer with a standardized questionnaire. The questionnaire contained a set of socio-demographic, disease duration, comorbid conditions (hypertension, diabetes mellitus, and hyperlipidemia), and life style questions followed by the instruments of the SAQ, EQ-5D, EQ-VAS, and SF-6D. The patient-reported outcomes including EQ-5D and SF-6D measures were completed by the patients themselves. The procedure and questionnaire used were identical between the two cities.


The EQ-5D is a brief, multi-attribute, generic, preference-based HRQoL instrument. Its descriptive system covers five dimensions including mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Each dimension has three response levels (no problem, some problems, and severe problems). The EQ-5D descriptive system generates 243 health states, each of which was assigned a utility score ranging from ?0.59 to 1.00 (full health). The utility scoring algorithm adopted in this study was developed using time trade-off (TTO) based preference scores from a China general population [12]. The EQ-5D also includes a 20-cm vertical VAS, with 0 and 100 representing worst and best imaginable health states, respectively. The simplified Chinese version of the EQ-5D/VAS was verified in Chinese population [31],[32].

The SF-6D is derived from the SF-36 and covers six dimensions including physical functioning, role limitation, social functioning, pain, mental functioning, and vitality. Each dimension has four to six response levels. Totally the SF-6D system defines 18,000 health states with a utility score ranging from 0.29 to 1.00 [10]. The SF-6D utility scoring algorithm used in this study was derived from a representative sample of the UK general population using the Standard Gambling (SG) method, since no Chinese preferences were available [10]. The Chinese version of the SF-6D was translated by Lam et al. in Hong Kong, which was proven to be feasible, acceptable, reliable, and valid in a Chinese population [33].

The SAQ is a disease-specific instrument for patients with angina with 19-item self-administered questions on five dimensions including exertional capacity scale (ECS), anginal stability scale (ASS), anginal frequency scale (AFS), treatment satisfaction scale (TSS), and the disease perception scale (DPS) [25]. The SAQ is scored by assigning each response an ordinal value, beginning with 1 for the response that implies the lowest level of functioning to 5 that implies the highest level of functioning, and summing across items for each of the 5 dimensions. Scale scores for each dimension are then transformed to a 0 to 100 range by subtracting the lowest possible, dividing by the range of the scales, and multiplying by 100. As each scale monitors a unique dimension, no summary score is generated. The Chinese SAQ has been shown to be a valid, responsive and reliable instrument [34].

Data analyses

Descriptive statistics

Descriptive statistics were performed to characterize the sample and the scores of the EQ-5D/VAS, SF-6D, and SAQ. Continuous variables are presented as mean, standard deviation (SD) and categorical variables are shown in the number and proportion of the sample within each group.

Construct validation

Convergent validity of the EQ-5D and SF-6D was assessed by examining their association with the SAQ and EQ-VAS at the domain and scale level. Based on the literature and clinical experience, eight a priori hypotheses were generated where moderate-to-strong correlations were expected, namely: 1) the EQ-5D and SF-6D utility scores with SAQ physical limitation; 2) the EQ-5D and SF-6D utility scores with SAQ angina stability; 3) the EQ-5D and SF-6D utility scores with SAQ angina frequency; 4) the EQ-5D and SF-6D utility scores with SAQ treatment satisfaction; 5) the EQ-5D and SF-6D utility scores with SAQ disease perception; 6) the EQ-5D and SF-6D utility scores with the EQ-VAS; 7) the EQ-5D pain/discomfort and SF-6D pain with SAQ angina frequency; 8) the EQ-5D performing usual activities and the SF-6D physical function with the SAQ physical limitation. The correlation was estimated with Spearman's rank correlation coefficient, with p?>?0.5 considered strong correlation, 0.35 to 0.5 considered moderate correlation, and 0.2 to 0.34 weak correlation [35].

The ‘known-group’ method was used to examine the discriminative validity of the EQ-5D and SF-6D based on its ability to discriminate among patients with different subgroups [13],[36]. Patients were grouped according to socioeconomic status, duration of CAD, presence of other medical conditions and the EQ-VAS. We classified the EQ-VAS scores into four groups, namely<65 (bad), 65 to 79 (fair), 80 to 89 (good), and 90 to 100 (excellent) [37]. Subjects with poorer health status were hypothesized to have lower utility scores for these two instruments. Nonparametric Mann–Whitney U tests were performed to identify statistically significant effects of dichotomous variables on utility scores, while Kruskal-Wallis H tests for polychromous variables.

Discriminative capacity of the EQ-5D and SF-6D

Ceiling and floor effects (proportion of respondents with the best and worst possible theoretical scores, respectively) were calculated for the EQ-5D and SF-6D. Ceiling and floor effects were considered small if >15% of patients occupy the best or worst health states, respectively, and serious if >15% of patients occupy these states [18].

The discriminative capacity of the EQ-5D and SF-6D instruments to detect clinically relevant differences among stable angina patients were compared using the effect size (ES), relative efficiency (RE) statistics, and receiver operating characteristic (ROC) curves. The ROC curve procedure provides a useful method of evaluating the performance of measures against external indicators of health status. The utility measure that generates the largest area under the ROC curve is regarded as the most sensitive at detecting differences in the external indicator. A measure with perfect discrimination would generate an area under the curve (AUC) score of 1.0, whilst a measure with less discriminatory power would generate an AUC score of less than 0.5 [30]. In this analysis, the performance of the EQ-5D and SF-6D was evaluated against the five dimensional scales of the SAQ as an external indicator of health status. Scores for each scale were divided into two groups (>=?50 and<50) indicating better cardiac functioning and worse functioning [38]. ES was used to define the discriminative capacity, and was computed as the difference between the mean of the two groups mentioned above, divided by the pooled standard deviation. The pooled standard deviation was estimated from the corrected standard errors and the weighted number of individuals in the groups [39]. General guidelines define an effect size of 0.2 as small, 0.5 as moderate, and 0.8 as large [40]. This classification was used to interpret differences in the discriminative capacity of the instruments studied. RE statistic is defined as the ratio of the square of the t-statistic of the comparator instrument (assumed to be the SF-6D utility score) over the square of the t-statistic of the reference instrument (assumed to be the EQ-5D utility score). The coefficient higher than 1.0 indicates that the SF-6D is more sensitive than the EQ-5D at detecting differences in external indicators of health with the given sample size, whilst the coefficient lower than 1.0 indicates less sensitivity to detect differences [41].

Level of agreement between the EQ-5D and SF-6D

The degree of agreement between utility scores of the EQ-5D and SF-6D was assessed by the intra-class correlation coefficient (ICC) and the Bland-Altman plot. The ICC was computed with the random-effects linear regression model. Coefficients above 0.7 suggest a strong agreement [42]. The paired comparison between the EQ-5D and SF-6D utility scores was made with Wilcoxon's signed rank test. In the Bland-Altman plot, the average of the two measurements was plotted on the x-axis, and the difference between the two measurements on the y-axis, where the SF-6D was the subtrahend. The deviation of the difference from 0, which implies total agreement, indicates the degree of agreement for each subject on the plot [43].

Factors affecting utility difference between the EQ-5D and SF-6D

The factors involved in the variation of the utility difference between the EQ-5D and SF-6D were explored with multiple liner regression (MLR). The utility difference between the EQ-5D and SF-6D was entered as the dependent variable and individual characteristics including age, gender, education, working status, income, BMI, comorbid conditions, disease duration, SAQ scores, and the EQ-VAS for global health status were treated as independent variables.

All data were entered into a database using EpiData (Epidata version 3.1, Epidata Association, Odense, Denmark) and analyzed using STATA 10.0 (STATA Corp LP, Texas, USA).


Characteristics of patients

We obtained 411 valid answers from 423 participants with a response rate of 97.16% (Table 1). Half of the patients were women (50.36%), the mean age was 68.08 (11.35) years, and almost 25% had less than six years of schooling. 77.86% of the patients were retired. A high percentage of respondents reported comorbidities including hypertension (56.69%), diabetes (25.30%), and hyperlipidemia (21.17%). Except for angina stability, the mean scores of other SAQ subscales were higher than 50, indicating better functioning. The mean (SD) scores were 0.78 (0.15) for the EQ-5D, 0.68 (0.12) for the SF-6D and 71.23 (12.35) for the EQ-VAS.

Table 1 Sociodemographics and characteristics of Chinese patients with stable angina (N=?411)

Construct validation

Convergent validity was demonstrated by the moderate to strong correlation coefficients (range: 0.368-0.594, P< 0.001) for five of eight a priori hypotheses in both the EQ-5D and the SF-6D (Table 2). Correlations between the utility scores from these two instruments with the scores for SAQ angina stability were weak, while the correlations between utility and the SAQ physicals limitation, SAQ disease perception, and the EQ-VAS scores were relatively strong. Meanwhile, the SAQ physical limitation score correlated strongly with the EQ-5D usual activities and the SF-6D physical function.

Table 2 Correlations between EQ-5D or SF-6D and SAQ or EQ-VAS

Table 3 presents the univariate analyses for the SF-6D and EQ-5D utility scores within subgroups. Hypothesis for known-group discriminative validity was confirmed by the differences in utility scores among groups with different health status measured by EQ-VAS. Moreover, both measures discriminated between female and male. Another significant difference in the SF-6D was observed among patients with different education levels, whereas in the EQ-5D, significant difference was observed for the presence of acute medical conditions.

Table 3 Univariate analyses for SF-6D and EQ-5D utility scores within subgroups

Ceiling/floor effects for the EQ-5D and SF-6D

There was ceiling effect for the EQ-5D utility score (15.57%) and no patient scored at the ceiling of the SF-6D. However, serious ceiling effects existed in all domains of the EQ-5D, and the largest ceiling effect were observed for mobility (84.18%) and self-care (86.62%) domains. High ceiling effects were also observed in the social function domain (29.20%) and role limitation (26.52%) of the SF-6D. No patient scored at the floor of the EQ-5D utility, and 0.24% scored at the floor of the SF-6D utility. Floor effects were negligible on most domains, except for role limitation (21.17%) and vitality (17.03%) from the SF-6D.

The distribution of responses who reported limitations on the SF-6D dimensions was 15.57% among individuals who reported no limitations on all the EQ-5D dimensions. In this group, a majority of individuals were classified as level 1 for the SF-6D dimension of role limitation (68.75%) and social function (67.19%). Nevertheless, 96.87% of the respondents were classified as level 2 or higher in the physical functioning dimension, 87.50% of the respondents in the vitality dimension, 82.81% of the respondents in the mental health dimension, and 65.62% of the respondents in the pain dimension.

Sensitivity of the EQ-5D and SF-6D

Table 4 presents effect sizes (ES), relative efficiency (RE) statistics, and area scores under the receiver operating characteristic curves (AUC) for the EQ-5D and SF-6D utility scores between groups based on the dichotomous health status variables. Differences between the five groups for the SF-6D utility scores were large, with ES ranging from 0.573 to 1.179. Most effect sizes on the EQ-5D were moderate or large (ranging from 0. 426 to 1.126).

Table 4 Efficiency of the EQ-5D and SF-6D to detect clinically relevant difference

Statistically significant differences (P<?0.001) were found for all between-group comparisons on both the EQ-5D and SF-6D utility scores. RE statistic calculation showed that the EQ-5D was found to be 24.2% more efficient at detecting differences between groups with physical limitations. While when subjects were categorized in terms of angina stability and angina frequency, the SF-6D was 44.8% and 81.4% more efficient than the EQ-5D. When subjects were categorized in terms of treatment satisfaction and disease perception, the SF-6D was 146.3% and 177.8% more efficient than the EQ-5D.

The AUC scores generated by the ROC curves provided a further indication of the sensitivity of the two instruments. The AUC scores of both instruments above 0.5 with statistical significance suggested that the instruments were able to detect the difference between patients with better and worse functioning in the five domains of the SAQ. Except for angina frequency, the SF-6D generated higher AUC scores than the EQ-5D, indicating greater discriminatory power.

Level of agreement between the EQ-5D and SF-6D

The degree of agreement between the scores of EQ-5D and SF-6D was assessed by the Bland-Altman plot and by computing an intra-class correlation coefficient (ICC). Poor agreement between the EQ-5D and SF-6D utility scores was observed with a low ICC of 0.448. Wilcoxon's signed rank test showed that the difference was significant (P< 0.001). Bland-Altman analysis indicated lack of agreement between the two measures with the mean difference of 0.106 (Figure 1). The analysis indicated that the 95% limits of agreement between the EQ-5D and SF-6D ranged from -0.123 to 0.335 and over 95% points lay within those limits. A systematic variation was observed, with higher SF-6D at lower mean utility, and lower SF-6D at higher mean utility scores.

Figure 1

Bland-Altman plot of difference in utility scores between EQ-5D and SF-6D.

Factors affecting utility difference between the EQ-5D and SF-6D

Table 5 presents the results of the multiple linear regression analyses with the difference between the EQ-5D and SF-6D as the dependent variable. The dependent variable is normally distributed and the multiple linear regression analysis did not obviously break the standard assumptions of linear regression analysis. The values of the VIF (Variance Inflation Factor) are generally below 2 and always below 4, so there is no indication of high multicollinearity [44]. The results found that presence of acute medical conditions significantly influence the difference of the EQ-5D and SF-6D; however, the magnitude of the influence was not large (coefficient 0.043, P=?0.008). Similar results were observed for the hypertension (coefficient 0.022, P=?0.065). A large magnitude of the influence was observed for inpatient variable (coefficient 0.072, P=?0.001). A very small magnitude of the influence was observed for the EQ-VAS variable (coefficient 0.001, P=?0.040).

Table 5 Multiple linear regression analyses for utility difference between the EQ-5D and SF-6D


The evidence of validity and sensitivity of the EQ-5D and SF-6D in Chinese patients with stable angina was provided in this study, which demonstrates that the EQ-5D and SF-6D are valid and sensitive preference-based HRQoL instruments in this patient group. However, the performance of the two instruments was not identical. Our results provide useful information for the choice of preference-based HRQoL instruments for stable angina patients. To our knowledge, this is the first comparison study for the EQ-5D and SF-6D among stable angina patients.

In this study, patients from Tianjin and Chengdu were selected as our study sample. Previous evidence has suggested that patient location does not affect the validity of the results [13]. Therefore, samples from the two cities were merged to increase the statistical power and representativeness of study results. Convergent validity was demonstrated by the moderate to strong correlation coefficients with SAQ, a validated instrument for angina, in our study. The correlations between the utilities of the two instruments and two domains of SAQ, physical limitation and disease perception, were relatively strong. This is consistent with the finding that illness perception is correlated with poorer quality of life for cardiac patients [45]. As for ‘known group’ discriminative validity, both the EQ-5D and SF-6D utility scores decreased with poorer health status indicated by the EQ-VAS. Moreover, both measures showed that female patients have lower utility scores than male patients, as previously noted [46]. Furthermore, the results also indicate that increased utility scores are associated with higher education level, but statistical significance is only achieved in the SF-6D. This is consistent with previous studies indicating that lower socioeconomic status is correlated with poorer outcomes in patients with chronic diseases, including cardiac patients [47],[48].

Consistent with previous studies, ceiling effects existed in the EQ-5D [13],[20],[49]. A total of 64 individuals (15.57%) reported no limitations on all the EQ-5D dimensions, while no patients were classified in full health on the SF-6D. Based on the SF-6D responses, individuals reporting full health on the EQ-5D may still have problems on physical function, vitality and mental health dimensions. This disparity can be attributed to the descriptive system of the SF-6D, in which more response levels for each domain are provided and patients might be more likely to find the best description for their status. In fact, a five-level version of the EQ-5D is under development [50]. Preliminary studies indicated that prototype five-level versions could improve the properties of the three-level in terms of reduced ceiling effects, increased reliability, and improved ability to discriminate between different levels of health [51]. In addition, most of the patients had better cardiac functioning indicated by high SAQ scores, which can also partially explain the strong ceiling effect. Effect sizes (ES), relative efficiency (RE) statistics and AUC scores were used to test the discriminative capacity of the EQ-5D and SF-6D. Both instruments were able to detect the differences between patients with different disease severity as measured by the SAQ. It is shown that the SF-6D had greater discriminatory power to detect clinically relevant difference of stable angina patients. This may be partially explained by the serious ceiling effect of the EQ-5D. Previous studies showed that the EQ-5D would be more suitable for measuring the health of more morbidity while the SF-6D may have a limitation in severe patients [18],[49].

The mean EQ-5D score exceeded the mean SF-6D score by 0.106 with significant difference, exceeding minimally important differences (MIDs) of both measures [18]. The magnitude of difference is higher than the differences reported in other disease groups or general population [18],[20]. Interestingly, previous comparative studies have estimated that the mean EQ-5D score was higher than the SF-6D when the mean EQ-5D score exceeded 0.740, which was consistent with our results [15],[52],[53]. Conversely, mean EQ-5D utility was less than mean SF-6D utility when the mean EQ-5D score was less than 0.740 [19],[54]. The ICC analyses and Bland-Altman plot revealed the inconsistency of these instruments. There were other differences between the two instruments which may explain the different performance. The recall period of both instruments is different is that ‘today’ for the EQ-5D/VAS versus ‘the last four weeks’ for the SF-6D. Another difference is the descriptive systems and the valuations attached to the health states. The SF-6D includes broader aspects of HRQoL and has more response level for each domain. In the EQ-5D, health status is valued using the time trade-off (TTO) method, whereas the SF-6D assigns value to health states using the standard gamble (SG) [55]. Also, in the specific China scoring algorithm of the EQ-5D, if any dimension is at level 3, a N3 term will be included. The existence of N3 term in the China scoring algorithm could be one of the reasons for the discrepancy between EQ-5D and SF-6D. According to our results, the SF-6D is shown to be more appropriate choice among stable angina patients because of its higher sensitivity and lower ceiling effect.

Our study had several limitations. The first was that as a cross-section study, we did not examine the longitudinal response and reliability of the EQ-5D and SF-6D, for which are important psychometric characteristics of instruments. Secondly, the relatively small sample size of severe stable angina patients might aggregate the ceiling effect. Further studies with a larger sample size are warranted. Third, similar to some previous studies, there were no objective groups in our known-group analysis. All comparisons are relative because there was no ‘gold standard’ objective measure to compare the measures with.


The EQ-5D and SF-6D are demonstrated to be valid and sensitive preference-based HRQoL instruments in Chinese stable angina patients. The SF-6D may be a superior to the EQ-5D, with a lower ceiling effect and greater sensitivity. Further study is needed to compare other properties, such as reliability and longitudinal response.


  1. 1.

    Earnshaw J, Lewis G: NICE guide to the methods of technology appraisal: pharmaceutical industry perspective. Pharmacoeconomics 2008, 26(9):725–727. 10.2165/00019053-200826090-00002

    Article  PubMed  Google Scholar 

  2. 2.

    Sullivan SD, Lyles A, Luce B, Grigar J: AMCP guidance for submission of clinical and economic evaluation data to support formulary listing in U.S. health plans and pharmacy benefits. J Manag Care Pharm 2001, 7: 272–282.

    Google Scholar 

  3. 3.

    Drummond MF, Sculpher MJ, Torrance GW, O'Brien BJ, Stoddart GL: Methods for the Economic Evaluation of Health Care Programmes. Oxford University Press, New York; 2005.

    Google Scholar 

  4. 4.

    Kopec JA, Willison KD: A comparative review of four preference-weighted measures of health-related quality of life. J Clin Epidemiol 2003, 56(4):317–325. 10.1016/S0895-4356(02)00609-1

    Article  PubMed  Google Scholar 

  5. 5.

    Neumann PJ, Goldie SJ, Weinstein MC: Preference-based measures in economic evaluation in health care. Annu Rev Public Health 2000, 21: 587–611. 10.1146/annurev.publhealth.21.1.587

    CAS  Article  PubMed  Google Scholar 

  6. 6.

    Kaplan RM, Bush JW, Berry CC: Health status: types of validity and the index of wellbeing. Health Serv Res 1976, 11(4):478–507.

    PubMed Central  CAS  PubMed  Google Scholar 

  7. 7.

    Torrance GW, Furlong W, Feeny D, Boyle M: Multi-attribute preference functions. Health utilities index. Pharmacoeconomics 1995, 7(6):503–520. 10.2165/00019053-199507060-00005

    CAS  Article  PubMed  Google Scholar 

  8. 8.

    Rabin R, De Charro F: EQ-5D: a measure of health status from the EuroQol Group. Ann Med 2001, 33(5):337–343. 10.3109/07853890109002087

    CAS  Article  PubMed  Google Scholar 

  9. 9.

    Hawthorne G, Richardson J, Day NA: A comparison of the assessment of quality of life (AQoL) with four other generic utility instruments. Ann Med 2001, 33(5):358–370. 10.3109/07853890109002090

    CAS  Article  PubMed  Google Scholar 

  10. 10.

    Brazier J, Roberts J, Deverill M: The estimation of a preference-based measure of health from the SF-36. J Health Econ 2002, 21(2):271–292. 10.1016/S0167-6296(01)00130-8

    Article  PubMed  Google Scholar 

  11. 11.

    Bansback N, Tsuchiya A, Brazier J, Anis A: Canadian valuation of EQ-5D health states: preliminary value set and considerations for future valuation studies. PLoS One 2012, 7(2):e31115. 10.1371/journal.pone.0031115

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  12. 12.

    Liu GG, Wu H, Li M, Gao C, Luo N: Chinese time trade-off values for EQ-5D health states. Value Health 2014, 17(5):579–604. 10.1016/j.jval.2014.05.007

    Article  Google Scholar 

  13. 13.

    Zhao FL, Yue M, Yang H, Wang T, Wu JH, Li SC: Validation and comparison of EuroQol and short form 6D in chronic prostatitis patients. Value Health 2010, 13(5):649–656. 10.1111/j.1524-4733.2010.00728.x

    Article  PubMed  Google Scholar 

  14. 14.

    Kontodimopoulos N, Argiriou M, Theakos N, Niakas D: The impact of disease severity on EQ-5D and SF-6D utility discrepancies in chronic heart failure. Eur J Health Econ 2011, 12(4):383–391. 10.1007/s10198-010-0252-4

    Article  PubMed  Google Scholar 

  15. 15.

    Van Stel HF, Buskens E: Comparison of the SF-6D and the EQ-5D in patients with coronary heart disease. Health Qual Life Outcomes 2006, 4: 20. 10.1186/1477-7525-4-20

    PubMed Central  Article  PubMed  Google Scholar 

  16. 16.

    Obradovic M, Lal A, Liedgens H: Validity and responsiveness of EuroQol-5 dimension (EQ-5D) versus Short Form-6 dimension (SF-6D) questionnaire in chronic pain. Health Qual Life Outcomes 2013, 11: 110. 10.1186/1477-7525-11-110

    PubMed Central  Article  PubMed  Google Scholar 

  17. 17.

    Mulhern B, Meadows K: The construct validity and responsiveness of the EQ-5D, SF-6D and Diabetes Health Profile-18 in type 2 diabetes. Health Qual Life Outcomes 2014, 12: 42. 10.1186/1477-7525-12-42

    PubMed Central  Article  PubMed  Google Scholar 

  18. 18.

    Harrison MJ, Davies LM, Bansback NJ, McCoy MJ, Verstappen SM, Watson K, Symmons DP: The comparative responsiveness of the EQ-5D and SF-6D to change in patients with inflammatory arthritis. Qual Life Res 2009, 18(9):1195–1205. 10.1007/s11136-009-9539-2

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  19. 19.

    Lamers LM, Bouwmans CA, van Straten A, Donker MC, Hakkaart L: Comparison of EQ-5D and SF-6D utilities in mental health patients. Health Econ 2006, 15(11):1229–1236. 10.1002/hec.1125

    CAS  Article  PubMed  Google Scholar 

  20. 20.

    Cunillera O, Tresserras R, Rajmil L, Vilagut G, Brugulat P, Herdman M, Mompart A, Medina A, Pardo Y, Alonso J, Brazier J, Ferrer M: Discriminative capacity of the EQ-5D, SF-6D, and SF-12 as measures of health status in population health survey. Qual Life Res 2010, 19(6):853–864. 10.1007/s11136-010-9639-z

    Article  PubMed  Google Scholar 

  21. 21.

    Guidelines on the Management of Stable Angina Pectoris. 2006.

  22. 22.

    An Analysis Report of National Health Services Survey in China. China Union Medical University Press, Beijing; 2008.

  23. 23.

    Kannel WB, Feinleib M: Natural history of angina pectoris in the Framingham study: Prognosis and survival. Am J Cardiol 1972, 29(2):154–163. 10.1016/0002-9149(72)90624-8

    CAS  Article  PubMed  Google Scholar 

  24. 24.

    Gandjour A, Lauterbach KW: Review of quality-of-life evaluations in patients with angina pectoris. Pharmacoeconomics 1999, 16(2):141–152. 10.2165/00019053-199916020-00003

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Spertus JA, Winder JA, Dewhurst TA, Deyo RA, Prodzinski J, McDonell M, Fihn SD: Development and evaluation of the Seattle Angina Questionnaire: a new functional status measure for coronary artery disease. J Am Coll Cardiol 1995, 25(2):333–341. 10.1016/0735-1097(94)00397-9

    CAS  Article  PubMed  Google Scholar 

  26. 26.

    Wilson A, Wiklund I, Lahti T, Wahl M: A summary index for the assessment of quality of life in angina pectoris. J Clin Epidemiol 1991, 44(9):981–988. 10.1016/0895-4356(91)90069-L

    CAS  Article  PubMed  Google Scholar 

  27. 27.

    Ware JE, Snow KK, Kosinski M, Gandek B: SF-36 Health Survey Manual and Interpretation Guide. New England Medical Center, The Health Institute, Boston, MA; 1993.

    Google Scholar 

  28. 28.

    Hunt SM, McKenna SP, McEwen J, Backett EM, Williams J, Papp E: A quantitative approach to perceived health status: a validation study. J Epidemiol Community Health 1980, 34(4):281–286. 10.1136/jech.34.4.281

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  29. 29.

    McGillion MH, Croxford R, Watt-Watson J, Lefort S, Stevens B, Coyte P: Cost of illness for chronic stable angina patients enrolled in a self-management education trial. Can J Cardiol 2008, 24(10):759–764. 10.1016/S0828-282X(08)70680-9

    PubMed Central  Article  PubMed  Google Scholar 

  30. 30.

    Streiner DL, Norman GR: Health Measurement Scales: A Practical Guide to Their Development and Use. Oxford University Press, New York; 2008.

    Google Scholar 

  31. 31.

    Wang H, Kindig DA, Mullahy J: Variation in Chinese population health related quality of life: results from a EuroQol study in Beijing, China. Qual Life Res 2005, 14(1):119–132. 10.1007/s11136-004-0612-6

    Article  PubMed  Google Scholar 

  32. 32.

    Shi JF, Kang DJ, Qi SZ, Wu HY, Liu YC, Sun LJ, Li L, Yang Y, Li Q, Feng XX, Zhang LQ, Li J, Li XL, Yang Y, Niyazi M, Xu AD, Liu JH, Xiao Q, Li LK, Wang XZ, Qiao YL: Impact of genital warts on health related quality of life in men and women in mainland China: a multicenter hospital-based cross-sectional study. BMC Public Health 2012, 12: 153. 10.1186/1471-2458-12-153

    PubMed Central  Article  PubMed  Google Scholar 

  33. 33.

    Lam CL, Brazier J, McGhee SM: Valuation of the SF-6D health states is feasible, acceptable, reliable, and valid in a Chinese population. Value Health 2008, 11(2):295–303. 10.1111/j.1524-4733.2007.00233.x

    Article  PubMed  Google Scholar 

  34. 34.

    Liu XT, Kong SP, Liao ZY, Sike L: Asessment study on physical function and the quality of life for CHD patients with SAQ. Chin Behav Sci 1997, 6: 49–51. [in Chinese]

    Google Scholar 

  35. 35.

    Spilker B: Quality of Life and Pharmacoeconomics in Clinical Trials. Lippincott-Raven Publishers, Philadelphia; 1996.

    Google Scholar 

  36. 36.

    Jin H, Wang B, Gao Q, Chao J, Wang S, Tian L, Liu P: Comparison between EQ-5D and SF-6D utility in rural residents of Jiangsu Province, china. PLoS One 2012, 7(7):e41550. 10.1371/journal.pone.0041550

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  37. 37.

    Barton GR, Sach TH, Avery AJ, Jenkinson C, Doherty M, Whynes DK, Muir KR: A comparison of the performance of the EQ-5D and SF-6D for individuals aged>or= 45 years. Health Econ 2008, 17(7):815–832. 10.1002/hec.1298

    Article  PubMed  Google Scholar 

  38. 38.

    Berra K, Fletcher B, Miller NH: Chronic stable angina: Addressing the needs of patients through risk reduction, education and support. Clin Invest Med 2008, 31(6):E391-E399.

    PubMed  Google Scholar 

  39. 39.

    Martins WP, Zanardi JV: Subgroup analysis and statistical power. Eur J Obstet Gynecol Reprod Biol 2011, 159(1):244. (Author reply 245) 10.1016/j.ejogrb.2011.07.046

    Article  PubMed  Google Scholar 

  40. 40.

    Kazis LE, Anderson JJ, Meenan RF: Effect sizes for interpreting changes in health status. Med Care 1989, 27(Suppl 3):S178-S189. 10.1097/00005650-198903001-00015

    CAS  Article  PubMed  Google Scholar 

  41. 41.

    Fayers P, Machin D: Quality of Life: Assessment, Analysis, and Interpretation. John Wiley & Sons, Chichester; 2000.

    Google Scholar 

  42. 42.

    Rabe-Hesketh S, Everitt : A Handbook of Statistical Analyses Using Stata. CRC Press/Chapman & Hall, Boca Raton; 2006.

    Google Scholar 

  43. 43.

    Bland JM, Altman DG: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986, 1(8476):307–310. 10.1016/S0140-6736(86)90837-8

    CAS  Article  PubMed  Google Scholar 

  44. 44.

    Mason CH, Perreault WD Jr: Collinearity, power, and interpretation of multiple regression analysis. J Mark Res 1991, 28: 268–280. 10.2307/3172863

    Article  Google Scholar 

  45. 45.

    Le Grande MR, Elliott PC, Worcester MU, Murphy BM, Goble AJ, Kugathasan V, Sinha K: Identifying illness perception schemata and their association with depression and quality of life in cardiac patients. Psychol Health Med 2012, 17(6):709–722. 10.1080/13548506.2012.661865

    Article  PubMed  Google Scholar 

  46. 46.

    Kimble LP, McGuire DB, Dunbar SB, Fazio S, De A, Weintraub WS, Strickland OS: Gender differences in pain characteristics of chronic stable angina and perceived physical limitation in patients with coronary artery disease. Pain 2003, 101(1-2):45–53. 10.1016/S0304-3959(02)00319-6

    PubMed Central  Article  PubMed  Google Scholar 

  47. 47.

    Sykes DH, Hanley M, Boyle DM, Higginson JD, Wilson C: Socioeconomic status, social environment, depression and postdischarge adjustment of the cardiac patient. J Psychosom Res 1999, 46(1):83–98. 10.1016/S0022-3999(98)00069-5

    CAS  Article  PubMed  Google Scholar 

  48. 48.

    Kington RS, Smith JP: Socioeconomic status and racial and ethnic differences in functional status associated with chronic diseases. Am J Public Health 1997, 87(5):805–810. 10.2105/AJPH.87.5.805

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  49. 49.

    Bharmal M, Thomas J 3rd: Comparing the EQ-5D and the SF-6D descriptive systems to assess their ceiling effects in the US general population. Value Health 2006, 9(4):262–271. 10.1111/j.1524-4733.2006.00108.x

    Article  PubMed  Google Scholar 

  50. 50.

    Herdman M, Gudex C, Lloyd A, Janssen M, Kind P, Parkin D, Bonsel G, Badia X: Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5-L). Qual Life Res 2011, 20(10):1727–1736. 10.1007/s11136-011-9903-x

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  51. 51.

    Janssen MF, Pickard AS, Golicki D, Gudex C, Niewada M, Scalone L, Swinburn P, Busschbach J: Measurement properties of the EQ-5D-5-L compared to the EQ-5D-3-L across eight patient groups: a multi-country study. Qual Life Res 2013, 22(7):1717–1727. 10.1007/s11136-012-0322-4

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  52. 52.

    Brazier J, Roberts J, Tsuchiya A, Busschbach J: A comparison of the EQ-5D and SF-6D across seven patient groups. Health Econ 2004, 13(9):873–884. 10.1002/hec.866

    Article  PubMed  Google Scholar 

  53. 53.

    Petrou S, Hockley C: An investigation into the empirical validity of the EQ-5D and SF-6D based on hypothetical preferences in a general population. Health Econ 2005, 14(11):1169–1189. 10.1002/hec.1006

    Article  PubMed  Google Scholar 

  54. 54.

    Longworth L, Bryan S: An empirical comparison of EQ-5D and SF-6D in liver transplant patients. Health Econ 2003, 12: 1061–1067. 10.1002/hec.787

    Article  PubMed  Google Scholar 

  55. 55.

    Green C, Brazier J, Deverill M: Valuing health-related quality of life. A review of health state valuation techniques. Pharmacoeconomics 2000, 17(2):151–165. 10.2165/00019053-200017020-00004

    CAS  Article  PubMed  Google Scholar 

Download references


This study was supported by Innovation Research Fund of Tianjin University, China. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information



Corresponding author

Correspondence to Jing Wu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JW, YH, FZ, HS participated in the design of the study and JZ, ZC, YH collected the data. JW and YH performed the statistical analysis, and all the other authors coordinated the manuscript development. All authors reviewed, contributed and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wu, J., Han, Y., Zhao, F. et al. Validation and comparison of EuroQoL-5 dimension (EQ-5D) and Short Form-6 dimension (SF-6D) among stable angina patients. Health Qual Life Outcomes 12, 156 (2014).

Download citation


  • Quality of life
  • Stable angina
  • EQ-5D
  • SF-6D
  • Utility
  • China