A comparison of health utility scores calculated using United Kingdom and Canadian preference weights in persons with alzheimer’s disease and their caregivers
Health and Quality of Life Outcomes volume 14, Article number: 105 (2016)
The use of the EQ-5D to asses the economic benefits of health technologies has led to questions about the cross-population transferability of preference weights to calculate health utility scores. The aim of this study is to investigate whether the use of UK and Canadian preference weights will lead to the calculation of different health utility scores in a sample of persons with Alzheimer’s disease (AD) and their primary informal caregivers.
We recruited 216 patient-caregiver dyads from nine geriatric and memory clinics across Canada. Participants used the EQ-5D-3L to rate their health-related quality-of-life (HRQoL). EQ-5D-3L responses were transformed into health utility scores using UK and Canadian preference weights. The levels of agreement between the two sets of scores were assessed using intraclass correlation coefficients (ICCs). Bland-Altman plots depicted individual-level differences between the two sets of scores. Differences in health utility scores were tested using the Wilcoxon signed rank sum test. A generalized linear model with a gamma distribution was used to examine whether participants’ socio-demographic characteristics were associated with their health utility scores.
The distributions of health utility scores derived from both the UK and Canadian preference weights were skewed to the left. The intraclass correlation coefficient was 0.94 (95 % CI: 0.92, 0.95) for persons with AD and 0.92 (95 % CI: 0.88, 0.94) for the caregivers. The Canadian weights yielded slightly higher median health utility scores than the UK weights for caregivers (median difference: 0.009; 95 % confidence interval: 0.007, 0.013). This finding persisted after stratifying by disease severity. Few socio-demographic characteristics were associated with the two sets of health utility scores.
Health utility scores exhibited small and clinically unimportant differences when calculated with UK versus Canadian preference weights in persons with AD and their caregivers. The original UK and Canadian population samples used to obtain the preference weights valued health states similarly.
Alzheimer’s disease (AD) is a chronic neurodegerative condition that accounts for 60 to 70 % of all cases of dementia. Cognitive impairment, functional decline, and behavior and mood problems are the core features of AD . AD and other dementias are the seventh leading cause of mortality and disability and the fourth leading cause of disease burden in high-income countries .
Health-related quality of life (HRQoL) is an individual’s dynamic perception of the impact of a health state upon physical, emotional, and cognitive function, social role performance, well-being, and life satisfaction . HRQoL is an important means of assessing the impact of AD treatments because available therapies mitigate the symptoms of cognitive decline, but do not alter the progression of the disease .
The EQ-5D-3L is one of the most frequently used generic instruments to measure HRQoL [5–7]. Algorithms (preference weights) can be used to convert EQ-5D-3L responses into health utility scores (range: 0 [equivalent to death] to 1 [equivalent to full health]), which are employed in cost-utility analyses to calculate quality-adjusted life-years (QALYs). The original preference weights for the EQ-5D-3L were derived from the general UK population using the time trade-off (TTO) method . Researchers generated a Canadian set of preference weights for the EQ-5D-3L using the TTO method and a sample of 1145 participants who belonged to a market research panel . In the UK and Canadian studies, the researchers chose different sub-sets of health states from the 243 total possible health states on the EQ-5D-3L. These sub-sets were further divided into smaller groups for each participant to value using the TTO method. Regression analyses were employed to develop a set of beta coefficients that would serve as the preference weights to convert EQ-5D-3L responses into health utility scores.
This study investigated whether the use of UK and Canadian preference weights would lead to the computation of different health utility scores in a sample of persons with AD and their primary informal caregivers. The topic is important because studies based in populations without domestic sets of preference weights will often draw upon the preference weights of other populations, regardless of whether the other populations’ weights are transferable. Unless transferability is assessed, researchers cannot be certain whether another population’s weights will provide unbiased health utility scores in their population of interest.
This issue is important within the context of AD because health utility scores are essential components of cost-utility analyses. These analyses can influence reimbursement decisions for AD pharmacotherapies, as evidenced in 2006 when the results of a cost-utility analysis prompted the United Kingdom’s National Health Service to delist coverage of cholinetserase inhibitors for persons with mild AD. The impact of cost-utility analyses on treatment decision-making highlights the importance of ensuring the unbiased nature of the underlying health utility scores.
Recent work in Canada has echoed our sentiments about the use of adequate preference weights in cost-utility analyses . Lien et al. point out that differences in country-specific preference weights could lead to differences in cost-utility results . Similar concerns have also been raised in Italy . In the context of Canada’s publicly-funded healthcare system, decisions regarding the efficient allocation of limited resources require support from unbiased analyses of cost-utility data. The issue extends beyond Canada to include any jurisidiction without a locally or domestically available set of preference weights. An examination of this issue may raise awareness among regulatory agencies that do not mandate the use of local or domestic preference weights.
To round out our objectives, we also explored whether socio-demographic factors might be associated with the health utility scores calculated in this study.
Subjects and data collection
Data were collected between November 2008 and August 2011 . Two hundred sixteen persons with AD and their primary informal caregivers were recruited from nine memory or geriatric clinics across Canada. Eligible participants had a diagnosis of AD, as defined by the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text revision criteria  or the National Institute of Neurological and Communicative Disorders and Stroke/Alzheimer’s Disease and Related Disorders Association criteria (NINCDS-ADRDA) . We included persons with mild or moderate AD to ensure that participants would be cognitively capable of answering the study questions. The physicians who ran the recruiting clinics assessed disease severity using the Functional Assessment Staging in Alzheimer’s Disease Scale . Participants’ primary informal caregivers also had to agree to participate in the study. All participants had to speak English or French.
We conducted separate one-on-one interviews with each participant. The interviews included socio-demographic questions (i.e., age, gender, education level, occupation and household annual income [Canadian dollars]) and the EQ-5D-3L. Each participant rated her or his own HRQoL. Caregivers did not provide proxy HRQoL ratings for persons with AD.
Prior to commencing each interview, participants read an information package about the study and they could ask the interviewer questions. The interviews began after participants signed an informed consent form. The study received ethics approval from the Hamilton Health Sciences/McMaster Health Sciences Research Ethics Board (project number 08-179) and from the local research ethics boards governing each of the nine recruitment sites.
We calculated health utility scores using the EQ-5D-3L responses and the UK  and Canadian preference weights . We did not compute health utility scores for participants who failed to answer one or more of the EQ-5D-3L questions. Participants with missing health utility scores were excluded from statistical analyses involving these scores.
Socio-demographic characteristics were summarized using medians and interquartile ranges for continuous variables, and frequencies for categorical variables. We used 1000 bootstrap samples to calculate bias corrected and adjusted 95 % confidence intervals (CIs) for all median health utility scores. We assessed the statistical significance of the median differences between the UK and Canadian health utility scores using the Wilcoxon signed rank sum test. Hodges-Lehmann’s methods for paired groups were employed to calculate the median differences and the 95 % confidence intervals for the median differences.
The overall agreement between the UK and Canadian health utility scores was assessed with the intraclass correlation coefficient (ICC), specifically the ICC(3,1) . Bland-Altman plots  were created to graphically depict the difference in each participant’s health utility score (the score based on UK weights subtracted by the score based on Canadian weights).
We tested the association between socio-demographic factors (i.e., age, gender, education level, occupation, and annual household income) and each set of health utility scores using a generalized linear model (GLM) with a gamma distribution and bootstrap 95 % CIs for the regression coefficients. This type of GLM was shown to optimally fit the data compared to a generalized additive model, quantile regression by means of residual plots, and analysis of variance. We employed the Akaike Information Criterion (AIC) to choose the optimal model from among these different approaches. Based on the literature , we considered a change in health utility score of 0.074 to be a minimum clinically important difference (MCID).
Most analyses were carried out using R v3.2.0 (R Foundation for Statistical Computing, Vienna, Austria). To calculate the ICC (3,1), we used a two-way mixed-effects analysis of variance model and absolute agreement in SPSS v19 (IBM Corp., Armonk, NY).
A total of 216 persons with AD and their primary informal caregivers were included in the study (Table 1). The median age was 80 years for persons with AD and 69 years for caregivers. One-hundred five persons with AD (48.6 %) were female and the majority (n = 143, 66.2 %) of caregivers were female. Most persons with AD (n = 112, 51.9 %) did not exceed a high school education and all except one were retired. Most caregivers (n = 147, 68.1 %) had a post-secondary education and 140 (64.8 %) were retired. Most (n = 175, 81.0 %) persons with AD were diagnosed with mild AD and the rest were diagnosed with moderate AD. Over 50 % of participants in both groups reported no problems in all five EQ-5D-3 L dimensions (Table 2).
For persons with AD and caregivers, the distributions of health utility scores derived from the UK and Canadian preference weights were left-skewed (Figs. 1 and 2). Half of the persons with AD had health utility scores above 0.85 (UK weights) or 0.84 (Canadian weights). Similarly, half of the caregivers had scores above 0.80 (UK weights) or 0.83 (Canadian weights). We could not compute health utility scores for three persons with AD because they did not answer all of the questions on the EQ-5D-3L.
The difference between the two sets of health utility scores was not statistically significant in persons with AD (p = 0.63) (Table 3). However, the difference was statistically significant in the caregivers (median difference: 0.009 [95 % CI: 0.007, 0.013]), with the median score being higher (higher meaning better health) when calculated with the Canadian versus UK preference weights. Within each set of preference weights (UK versus UK, Canadian versus Canadian), the difference in health utility scores between persons with AD and caregivers was not statistically significant.
The Canadian utilities were higher than the UK utilities in the caregiver group when the caregivers were stratified according to the disease severity (mild, moderate) of the persons under their care (Table 4). Median differences were 0.009 (95 % CI: 0.007, 0.013) in the mild subgroup and 0.013 (95 % CI: 0.007, 0.028) in the moderate subgroup. For persons with AD, the UK and Canadian health utility scores did not differ significantly.
The overall agreement between health utility scores using UK and Canadian preference weights was high. The ICC (3,1)s were 0.94 (95 % CI: 0.92, 0.95) for persons with AD and 0.92 (95 % CI: 0.88, 0.94) for caregivers. According to the Bland-Altman plots, 95 % of the >individual differences in health utility scores (UK – Canadian) fell within a range of -0.12 to 0.12 for persons with AD and within -0.16 to 0.12 for caregivers (Fig. 3). For persons with AD, only 15 (7 %) of the individual differences in score exceeded the MCID of 0.074; for caregivers, the number was also 15 (9 %).
With respect to socio-demographic characteristics, only some education categories were associated with the two sets of health utility scores (Table 5). The scores for persons with AD who had a post-graduate education were an average of 0.1 points lower (UK weights) or 0.059 points lower (Canadian weights) than the scores for persons who had a high school or less education. The scores for caregivers with a post-graduate education were an average of 0.116 points lower (UK weights) or 0.04 points lower (Canadian weights) than the scores for caregivers with a high school or less education. Caregivers with a university education also had average scores that were 0.131 points lower than caregivers with a high school or less education. No other socio-demographic variables were statistically significantly associated with either set of health utility scores. Three of the statistically significant associations exceeded the MCID of 0.074.
In this study of persons with AD and their primary informal caregivers, health utility scores derived from UK and Canadian preference weights exhibited slight differences from one another. Based on the MCID, these differences were not large enough to be considered clinically important. We could not find other studies that compared health utility scores calculated with UK and Canadian preference weights.
Evidence suggests health utility scores can be similar for people across countries with comparable socio-demographic characteristics (i.e., UK, Holland, and Germany  Spain and Germany ). Given the socio-demographic parallels between the UK and Canadian populations, the samples used to derive the UK and Canadian preference weights [8, 9] may have simply valued their health states similarly to one another. Thus, the health utility scores derived from each set of preference weights in this study did not differ appreciably from one another.
On the other hand, not every population with similar socio-demographic characteristics will place comparable value on the same health states. Representative samples of the United States (US) and UK populations valued 42 EQ-5D-3L health states and the adjusted mean difference in health utility scores was 0.10 points higher in the US population . An earlier study involving the same sample as in the present study compared health utilities calculated with US and Canadian preference weights . This comparison showed that Americans and Canadians agreed on the types of health states that should be considered ‘good’ or ‘bad’, but Canadians tended to place a lower value than Americans on most of these health states. When calculated with Canadian versus American preference weights, the mean health utility score was 0.06 points lower (95 % CI: -0.07, -0.06) in persons with AD and 0.05 points lower (95 % CI: -0.06, -0.04) in caregivers.
The findings reported in the literature raise a caution for researchers who wish to calculate health utility scores for study samples drawn from populations for which preference weights do not exist. The default practice in these situations has been to use preference weights derived from similar populations, but this approach could lead to over- or under-estimates of health utility scores. In the absence of preference weights for the population of interest, one can never be certain whether another population’s weights will provide unbiased results.
The caution about transferability of weights also applies to the EQ-5D-5L, which measures the same five health dimensions as the EQ-5D-3L, but expands the number of response options from three to five . Recent work involving the EQ-5D-5L suggests that one set of preference weights may not capture inter-regional differences in a single country’s population if the population is spread over a large geographic area .
In our study, only a small number of socio-demographic characteristics affected the UK and Canadian health utility scores. Perhaps disease-specific factors, rather than socio-demographic characteristics, could better explain the scores in AD samples. In persons with AD, some studies have shown that health utility scores are affected by levels of depression and functional ability [24, 25]. Other work has suggested that a more complex series of socio-demographic and disease factors combine to modify the relation between disease severity and health utility scores . For caregivers, health utility scores may be influenced by the extent to which patients are dependent on care, the perceived burden of being a caregiver, and the time involved in providing care [27, 28]. Researchers should consider these additional variables when they design studies to explain health utility scores in AD samples.
Our study found that most caregivers’ health utility scores were lower than the scores of persons with AD. The burden of caring for a person with AD may adversely affect a caregivers’ HRQoL . Meanwhile, the effects of cognitive impairment might prevent persons with AD from perceiving the full impact of disease on their lives, therefore leading to high ratings of HRQoL [30, 31].
Few studies could be found that assessed the HRQoL of AD caregivers. One study conducted on the Canary Islands reported that the frequency of having at least some problems on each EQ-5D-3L dimension was greater in 237 AD caregivers compared to the islands’ general population . However, the authors did not convert EQ-5D-3L responses into health utility scores. The caregiver utility scores in our study were somewhat lower than the scores reported in two studies (i.e., 0.87 , 0.88 ) of general adult populations in Canada, and about the same as the scores reported in a third Canadian study (i.e., 0.85 female, 0.81 male) .
Readers should take certain issues into account when interpreting the results of this study. The participants were recruited from geriatric or memory clinics, so they are unlikely to be representative of the average person with AD or the average caregiver. The participants with AD may be a healthier subset of all patients and the caregivers may be more informed about AD.
Health utility scores exhibited some small yet clinically unimportant differences when calculated with UK versus Canadian preference weights in a sample of persons with AD and their caregivers. The UK and Canadian populations used to obtain the preference weights valued health states similarly to one another.
AD, Alzheimer’s disease; AIC, Akaike Information Criterion; CI, confidence interval; GLM, generalized linear model; HRQoL, health-related quality-of-life; ICC, intraclass correlation coefficient; MCID, minimum clinically important difference; NINCDS-ADRDA, National Institute of Neurological and Communicative Disorders and Stroke/Alzheimer’s Disease and Related Disorders Association; QALY, quality-adjusted life-year; TTO, time trade-off; UK, United Kingdom
Kalaria RN, Maestre GE, Arizaga R, Friedland RP, Galasko D, Hall K, Luchsinger JA, Ogunniyi A, Perry EK, Potocnik F. Alzheimer’s disease and vascular dementia in developing countries: prevalence, management, and risk factors. Lancet Neurol. 2008;7:812–26.
Lopez AD, Mathers CD, Ezzati M, Jamison DT, Murray CJ. Global and regional burden of disease and risk factors, 2001: systematic analysis of population health data. Lancet. 2006;367:1747–57.
Bowling A.Health-related quality of life: a discussion of the concept, its use and measurement. In Measuring disease: a review of disease-specific quality of life measurement scales. 2nd edition. Buckingham: Open University Press; 1995. p. 1-19.
Loveman E, Green C, Kirby J, Takeda A, Picot J, Payne E, Clegg A. The clinical and cost-effectiveness of donepezil, rivastigmine, galantamine and memantine for Alzheimer’s disease. Health Technol Assess. 2006;10:1–160.
Räsänen P, Roine E, Sintonen H, Semberg-Konttinen V, Ryynänen O-P, Roine R. Use of quality-adjusted life years for the estimation of effectiveness of health care: A systematic literature review. Int J Technol Assess Health Care. 2006;22:235–41.
Silberfeld M, Rueda S, Krahn M, Naglie G. Content validity for dementia of three generic preference based health related quality of life instruments. Qual Life Res. 2002;11:71–9.
Naglie G, Tansey C, Irvine J, Ritvo P, Black SE, Freedman M, Silberfeld M, Krahn M. Utility-based Quality of Life Measures in Alzheimer’s Disease. Qual Life Res. 2006;15:631–43.
Dolan P. Modeling valuations for EuroQol health states. Med Care. 1997;35:1095–108.
Bansback N, Tsuchiya A, Brazier J, Anis A. Canadian valuation of EQ-5D health states: preliminary value set and considerations for future valuation studies. PLoS ONE. 2012;7:e31115.
Lien K, Tam VC, Ko YJ, Mittmann N, Cheung MC, Chan KKW. Impact of country-specific EQ-5D-3L tariffs on the economic value of systemic therapies used in the treatment of metastatic pancreatic cancer. Curr Oncol. 2015;22:e443–52.
Mozzi A, Meregaglia M, Lazzaro C, Tornatore V, Belfiglio M, Fattore G. A comparison of EuroQol 5-Dimension health-related utilities using Italian, UK, and US preference weights in a patient sample. Clin Outcomes Res. 2016;8:267–74.
Oremus M, Tarride J-E, Clayton N, Raina P, Group CW-t-PS. Health Utility Scores in Alzheimer’s Disease: Differences Based on Calculation With American and Canadian Preference Weights. Value Health. 2014;17:77–83.
Nietzel M, Wakefield J. American Psychiatric Association Diagnostic and Statistical Manual of Mental Disorders. Contemp Psychol. 1996;41:642–51.
McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease Report of the NINCDS‐ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology. 1984;34:939–9.
Sclan SG, Reisberg B. Functional assessment staging (FAST) in Alzheimer’s disease: reliability, validity, and ordinality. Int Psychogeriatr. 1992;4:55–69.
Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86:420.
Bland JM, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;327:307–10.
Walters SJ, Brazier JE. Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D. Qual Life Res. 2005;14:1523–32.
Norman R, Cronin P, Viney R, King M, Street D, Ratcliffe J. International Comparisons in Valuing EQ‐5D Health States: A Review and Analysis. Value Health. 2009;12:11941200.
Bernert S, Fernández A, Haro JM, König HH, Alonso J, Vilagut G, Sevilla-Dedieu C, De Graaf R, Matschinger H, Heider D. Comparison of Different Valuation Methods for Population Health Status Measured by the EQ‐5D in Three European Countries. Value Health. 2009;12:750–8.
Badia X, Roset M, Herdman M, Kind P. A comparison of United Kingdom and Spanish general population time trade-off values for EQ-5D health states. Med Decis Mak. 2001;21:7–16.
Herdman M, Gudex C, Lloyd A, Janssen M, Kind P, Parkin D, Bonsel G, Badia X. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20:1727–36.
Al Sayah F, Bansback N, Bryan S, Ohinmaa A, Poissant L, Pullenayegum E, Xie F, Johnson JA. Determinants of time trade-off valuations for EQ-5D-5L health states: data from the Canadian EQ-5D-5L valuation study. Qual Life Res. 2015. Epublished ahead of print.
Zucchella C, Bartolo M, Bernini S, Picascia M, Sinforiani E. Quality of life in Alzheimer disease: a comparison of patients’ and caregivers’ points of view. Alzheimer Dis Assoc Disord. 2015;29:50–4.
Naglie G, Hogan DB, Krahn M, Beattie BL, Black SE, Macknight C, Freedman M, Patterson C, Borrie M, Bergman H, Byszewski A, Streiner D, Irvine J, Ritvo P, Comrie J, Kowgier M, Tomlinson G. Predictors of patient self-ratings of quality of life in Alzheimer disease: cross-sectional results from the Canadian Alzheimer’s Disease Quality of Life Study. Am J Geriatr Psychiatr. 2011;19:881–90.
Trigg R, Jones RW, Knapp M, King D, Lacey LA, DADE-2 Investigator Groups. The relationship between changes in quality of life outcomes and progression of Alzheimer’s disease: results from the Dependence in AD in England 2 Longitudinal Study. Int J Geriatr Psychiatry. 2015;30:400–8.
Serrano-Aguilar PG, Lopez-Bastida J, Yanes-Lopez V. Impact on health-related quality of life and perceived burden of informal caregivers of individuals with Alzheimer’s disease. Neuroepidemiology. 2006;27:136–42.
Abdollahpour I, Nedjat S, Salimi Y, Noroozian M, Majdzadeh R. Which variable is the strongest adjusted predictor of quality of life in caregivers of patients with dementia? Psychogeriatrics. 2015;15:51–7.
Zarit SH, Orr NK, Zarit JM. The Hidden Victims of Alzheimer’s Disease: Families Under Stress. New York: New York University Press; 1985.
Crespo M, Hornillos C, Gomez MM. Assessing quality of life of nursing home residents with dementia: feasibility and limitations in patients with severe cognitive impairment. Int Psychogeriatr. 2013;25:1687–95.
Woods RT, Nelis SM, Martyr A, Roberts J, Whitaker CJ, Markova I, Roth I, Morris R, Clare L. What contributes to a good quality of life in early dementia? Awareness and the QoL-AD: a cross-sectional study. Health Qual Life Outcomes. 2014;12:94.
Agborsangaya CB, Lahtinen M, Cooke T, Johnson JA. Comparing the EQ-5D 3L and 5L: measurement properties and association with chronic conditions and multimorbidity in the general population. Health Qual Life Outcomes. 2014;12:74.
Tarride JE, Oremus M, Pullenayegum E, Clayton N, Raina P. How does the Canadian general public rate moderate Alzheimer’s disease? J Aging Res. 2011;20:682470.
Johnson JA, Pickard AS. Comparison of the EQ-5D and SF-12 health surveys in a general population survey in Alberta, Canada. Med Care. 2000;38:115–21.
Data collection for this study was supported by the Canadian Institutes of Health Research (MOP 86487).
Canadian Willingness-to-pay Study Group: Memorial University (St. John’s, NL): Gerry Mugford, Marshall Godwin; McGill University Health Centre (Montréal, QC): Allen Huang, Yves Bacher; Institut universitaire de gériatrie de Montréal (Montréal, QC): Juan-Manual Villalpando; Queen’s University (Kingston, ON): Sudeep S. Gill; Sunnybrook Health Sciences Centre (Toronto, ON): Krista L. Lanctôt, Nathan Herrmann; McMaster University (Hamilton, ON): David Cowan; University of Western Ontario (London, ON): Robert Petrella; University of Calgary (Calgary, AB): David B. Hogan; University of British Columbia (Vancouver, BC): Philip E. Lee. Dr. Huang is now based at the Ottawa Hospital (Ottawa, ON).
Parminder Raina holds a Tier 1 Canada Research Chair in Geroscience and the Raymond and Margaret Labarge Chair in Research and Knowledge Application for Optimal Aging.
MF drafted the manuscript and conducted the statistical analysis. MO conceptualized the study with input from JET and PR. MO, JET, and PR reviewed the manuscript for important intellectual content. Members of the Canadian Willingness-to-pay Study Group recruited participants for the study. All authors (including members of the Study Group) approved the submitted manuscript.
The authors declare that they have no competing interests.
About this article
Cite this article
Fang, M., Oremus, M., Tarride, JE. et al. A comparison of health utility scores calculated using United Kingdom and Canadian preference weights in persons with alzheimer’s disease and their caregivers. Health Qual Life Outcomes 14, 105 (2016). https://doi.org/10.1186/s12955-016-0510-y