
Impact of Alzheimer’s Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden



There is a lack of validated instruments to measure the level of burden of Alzheimer’s disease (AD) on caregivers. The Impact of Alzheimer’s Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver burden across emotional, physical, social, financial, sleep, and time aspects. The primary objectives of this study were to evaluate the psychometric properties of the IADCQ administered on the Web and to determine the most appropriate scoring algorithm.


A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers.

The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was used to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate the psychometric characteristics of the instrument. Test-retest reliability was assessed to evaluate the instrument’s stability and consistency over time.


Only 2% of the respondents scored at the floor or ceiling, indicating that the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of the IADCQ can be used to measure AD caregiver burden. Scale-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50–0.80). Low to moderate negative correlations were observed between the IADCQ and the scales of the SF-12v2.


The study findings suggest the IADCQ has appropriate psychometric characteristics as a unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.


Alzheimer’s disease (AD) is an age-related, irreversible, progressive brain disorder that results in increasingly impaired memory, thinking, reasoning, and behavior [1],[2]. The prevalence of AD is estimated at 5.4 million in the United States (US) and as high as 24 million globally [3],[4]. There were an estimated 454,000 new cases diagnosed in 2010 [5]. Barring significant medical breakthroughs, prevalence rates are predicted to triple by 2050 [6].

The cognitive impairments from AD significantly impact the patient’s activities of daily living [7]. Caregiving is an inherent part of managing AD, as progressive deterioration in intellectual function and other cognitive skills leads to a decline in the ability to perform activities of daily living (ADLs) [7]. Caring for patients with AD poses a large burden on both families and the healthcare community [5]. Over 35 million people worldwide currently live with Alzheimer’s disease, and this number is expected to double by 2030 and more than triple by 2050 to 115 million. In the 2010 World Alzheimer Report, Alzheimer’s Disease International estimated that the annual societal costs of dementia worldwide were US $604 billion, or 1% of the aggregated worldwide gross domestic product. Alzheimer’s Disease International also predicted almost a doubling in worldwide societal costs from US $604 billion in 2010 to US $1,117 billion by 2030 [6]. In the 2010 World Alzheimer Report, a systematic review of the world literature on the demands of caregiving looked at 10 studies where time spent assisting with basic ADLs was quantified covering 25 countries; 13 studies of time spent in generally supervising the person with dementia covering 25 countries; and 42 studies of time spent assisting with basic ADLs and instrumental activities of daily living (IADLs) combined spanning 30 countries. The report suggested that caregivers spend an average of 2.0 hours daily supporting basic ADLs, 3.6 hours with basic ADLs and IADLs combined, and 2.6 hours supervising the person with dementia. This amounts to an average weekly total of between 14 hours (ADL alone) and 43 hours (ADL, IADL, and supervision) [6]. In 2011, 15.2 million family and other uncompensated caregivers cared for patients with AD and other dementias in the US providing 17.4 billion hours of care valued at more than $210 billion [5]. 
Family caregivers provide 80% of home care to AD patients with the level of caregiver burden related to the extent of the patient’s cognitive impairment and functional abilities [5],[6]. Caregivers provided an average of 21.9 hours of care per week [6].
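
As a quick arithmetic check, the weekly range quoted above from the 2010 World Alzheimer Report can be reproduced from the daily averages. The sketch below assumes that the upper bound adds the supervision hours to the combined ADL and IADL hours; that reading, and the variable names, are illustrative rather than taken from the report.

```python
# Daily averages quoted in the text (hours per day).
basic_adl_hours = 2.0        # basic ADLs only
adl_plus_iadl_hours = 3.6    # basic ADLs and IADLs combined
supervision_hours = 2.6      # general supervision

DAYS_PER_WEEK = 7

# Lower bound: basic ADL support alone.
weekly_low = basic_adl_hours * DAYS_PER_WEEK

# Upper bound (assumed reading): ADL + IADL support plus supervision.
weekly_high = (adl_plus_iadl_hours + supervision_hours) * DAYS_PER_WEEK

print(f"ADL alone: {weekly_low:.1f} hours/week")
print(f"ADL, IADL, and supervision: {weekly_high:.1f} hours/week")
```

Under this reading the bounds come to 14 hours and roughly 43 hours per week, matching the range given in the text.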

Given the high demands on caregivers of people with AD, they may experience negative impacts on physical, psychological, emotional, social, and financial aspects of their life; some positive effects have also been noted [8]-[11]. About a third of family caregivers experience symptoms of depression, and 61% rate the emotional stress of caregiving as high or very high [4]. They may rate their own health as fair or poor and tend to report that serving as a caregiver worsens their health [4]. Caregivers may also experience higher levels of depression and stress hormones, reduced immune function, slow wound healing, and more new cases of hypertension and coronary heart disease compared to non-caregivers [12]. Health care costs for caregivers are estimated to be 8% higher than non-caregivers due to the physical and emotional toll of caregiving [4]. Caregiver healthcare is estimated to cost $8.6 billion; this is in addition to the $210 billion in unpaid caregiver hours [5]. Caregiving also negatively impacts employment status and work productivity. Of the 44% of caregivers who are employed part or full time, 65% reported missing work, going in late, or leaving early, and 20% have taken a leave of absence [5]. The burden of caregiving and resulting changes in employment often lead to withdrawal or isolation from the caregiver’s wider social networks, which may further increase depression and stress [12]-[14].

Although private or public insurance offers partial coverage, providing care often results in out-of-pocket expenditures for the family. In 2008, total per-person payments from all sources for health care and long-term care for Medicare beneficiaries with AD and other dementias were 3 times as great as payments for other Medicare beneficiaries in the same age group [5]. Excluding the contributions of uncompensated caregivers, total payments for care related to AD and dementia care for patients 65 years and older were estimated at $200 billion in 2012 [5].

A caregiver burden instrument specific to AD is necessary to measure the impact of treatment for AD patients on the lives of their caregivers. The ideal instrument would be brief, self-administered, and not overly burdensome to the respondent. A review of the existing caregiver burden instruments in the published literature revealed a range of validated and non-validated instruments measuring various aspects of the caregiver experience, including burden, mood, needs, and quality of life, across a range of conditions. In a systematic review of caregiver burden instruments, Deeken et al. [15] identified 28 self-report questionnaires assessing the burden, needs, and quality of life of informal caregivers. Among the 17 instruments reviewed that specifically focused on caregiver burden, 10 of the instruments were not specific to either AD or dementia (eg, Caregiver Strain Index [16], Burden Assessment Scale [17], Appraisal of Caregiving Scale [18]); several lacked evidence of rigorous psychometric testing or evidence of adequate validity and reliability (eg, Burden Interview [19], Family Burden Scale [20], Objective Burden Questionnaire [21]); and others were too lengthy or difficult to administer in a clinical trial setting (eg, Caregiver Experience Assessment, 105 items [22]; Caregiver Stress Inventory, 43 items [23]). As a result, despite the abundance of caregiver instruments available, none were appropriate for the purposes of measuring caregiver burden during the course of a clinical trial evaluating the impact of an experimental treatment on AD patients. In addition, the Zarit Burden Interview was assessed but found to be too lengthy to be used in an Alzheimer’s disease interventional trial where the caregiver was asked to complete a number of psychosocial measures on behalf of the AD patient [10].

Development of Impact of Alzheimer’s Disease on Caregiver Questionnaire (IADCQ)

In response to the lack of an appropriate AD caregiver instrument for clinical trials, an effort was undertaken to develop a new AD caregiver instrument. A review of the literature on the health-related quality of life (HRQoL) burden that unpaid caregivers face in caring for an individual with AD was first conducted. A Medline search was conducted using Alzheimer, dementia, caregiver burden, and caregiver quality of life as search terms from 1980 to 2010. Nineteen articles were found to be relevant. Eight articles discussed AD caregiver burden in general and 11 existing instruments were identified. However, none of these instruments fulfilled the criteria for assessing AD caregiver burden, for reasons such as inability to implement the instrument in a clinical trial setting, inappropriate questions, excessive length, and lack of a self-administered format.

Based on this review, the initial draft of the Impact of Alzheimer’s Disease on Caregiver Questionnaire (IADCQ), consisting of 9 items, was created. To ensure that this questionnaire adequately captured the key domains that were most relevant to caregivers of AD patients in assessing the burden of caregiving, 3 focus groups of 21 unpaid caregivers (21 females and 2 males) of AD patients were held in Los Angeles, Chicago, and New Orleans. These focus groups were held to better understand the experience of caring for a patient with AD and to conduct a cognitive debriefing of the initial draft of the IADCQ. The focus groups were held to elicit concepts and were conducted with a trained moderator using a semi-structured interview guide. Caregivers described various impacts on their HRQoL due to caregiving which included emotional (worry, frustrated, sad/depressed), social (relationship with friends and family, relationship with person with AD, limit activities), physical (aging, diet, weight), sleep (falling asleep, less sleep, interruption), work (can’t retire, work at home, cut back on hours worked), time (having no time to do personal activities, giving up time to care, making adjustments to schedule), sex life, well-being (lack of freedom/independence, loss of creativity, needing to mature faster, loss of self, personality change), and financial. Caregivers also reported not knowing what to expect and needing to make decisions for the patient. They reported that their self-care was impacted and they felt homebound. Caregivers were asked to evaluate the initial draft of the IADCQ and provide input on the questions, response options, and instructions. Saturation was achieved by the third focus group. Based on the results, a revised 12-item IADCQ instrument with a 7-day recall period was developed. 
This instrument contained the elements most relevant to caregivers of AD patients in assessing the burden of caregiving: emotional, physical, social, time, sleep, and financial impact.

The current study details the next steps in IADCQ development, including a psychometric study of the IADCQ and ascertaining the most appropriate scoring algorithm for the instrument.


Study design

The current study design and psychometric analyses were selected to establish the internal consistency and test-retest reliability of the IADCQ. Alzheimer’s disease caregivers were recruited and entered into a cross-sectional study to collect data appropriate for most of the study goals. Finally, to ascertain test-retest reliability, a subset of those who completed the psychometric study was randomly invited to participate in a second round of data collection four weeks later.

Psychometric analyses were specifically selected and ordered to ensure goals of the study were analyzed accurately. The IADCQ was first analyzed with confirmatory factor analysis (CFA) to assure construct validity; the data were checked for fit with the original conceptual framework from previous qualitative development of the IADCQ. The reliability (eg, internal consistency) was examined by assessing item-level and scale-level statistics. Finally, test-retest reliability was examined with intra-class correlations to determine the strength of the relationship between four-week administrations of the IADCQ. Pearson correlation coefficients were computed for the investigation of convergent validity.

Study participants

A national sample of men and women ≥ 18 years of age who identified themselves as an unpaid caregiver of an AD patient participated in a cross-sectional, nonrandomized, psychometric study. The AD caregivers, who previously indicated their willingness to be contacted for research purposes, were recruited via e-mail from a panel of caregivers in the US managed by a research-panel vendor. Each caregiver previously self-enrolled to participate in research related to caregiving for AD patients. Various approaches were employed to recruit panelists, such as banners, referrals, natural search optimization, affiliate marketing, and targeted e-mails. Inclusion and exclusion criteria (as completed by self-report), outlined in Table 1, were employed to determine the survey candidates’ eligibility. If caregivers were interested and eligible to participate in the study, they read and provided informed consent electronically before completing the demographic questions and the study instrument. Each initial e-mail was submitted with a unique link. When the caregiver noted they were willing to participate, the system created a follow-up email at the correct time with the same caregiver ID in the secure link. The caregivers were compensated for participating in the study. This study was reviewed and approved by the New England Institutional Review Board.

Table 1 Inclusion and exclusion criteria to recruit for survey participants

A total of 200 caregivers completed the online survey. To assess the test-retest reliability of the IADCQ, a subgroup of 50 randomly selected caregivers were asked to repeat the survey four weeks after the initial survey; 100% of these caregivers completed the second administration.


Description of the IADCQ

The Impact of Alzheimer’s Disease on Caregiver Questionnaire (IADCQ) is an instrument used to measure the burden of caregiving and includes items that represent the key concepts and domains of caregiving for an AD patient. The current version of the IADCQ has a 7-day recall period and 12 items. It has a five-point Likert scale with response choices ranging from “not at all” (0) to “extremely” (4). The IADCQ measures the burdens associated with being an AD caregiver across six theorized domains: emotional, physical, social, financial, sleep, and impact on time.

The original IADCQ qualitative research [24] helped to gain a better understanding of the impact of AD patient caring on caregivers’ HRQoL. The current study is the first quantitative evaluation of the IADCQ and was designed firstly to evaluate the psychometric characteristics of a Web-based version of the IADCQ instrument completed by caregivers of patients with AD and secondly, to determine the scoring algorithm.

A comprehensive examination of the psychometric properties of the IADCQ was undertaken in a group of caregivers for AD patients through a Web-based survey. The survey was administered twice: (1) at baseline (using the IADCQ and Short Form-12 Health Survey Version 2 [SF-12v2™]) and (2) four weeks later for a subgroup of participants (using the IADCQ).

Description of the SF-12v2

In addition to the IADCQ, the SF-12v2 was administered in the survey. The SF-12v2 is a generic HRQoL instrument that contains 12 questions representing 8 domains to provide insight into physical and mental functioning [25]. It is a valid measure of physical and mental health often used in large population health surveys or in clinical trials to assess the impact of an intervention on patient HRQoL. It was used to permit a wide array of HRQoL information, and its psychometric properties are well defined and known. In addition, it has accepted responder definitions. In this study, the SF-12v2 was used to measure convergent validity of IADCQ scores as well as to provide an understanding of the overall HRQoL of the sampled AD caregivers.

Data analysis

Descriptive statistics were first examined based on demographics (ie, age, gender, and race) and other characteristics of the AD caregiver participants (ie, caregiver history, employment status, and missing work time). Item-level evaluations were assessed to cover aspects such as completeness of responses, response choices used by participants, distribution of responses, and ordering of item means. The Web-based survey did not permit missing data; therefore, all 200 subjects had complete survey data. Survey items were defined to assess a specified level within a narrow range of the construct for instrument precision, with some participants scoring the lowest possible score (ie, floor effects) and others having the highest possible score (ie, ceiling effects).

The sequence of psychometric analyses in this study was designed to ensure proper understanding of the latent structure before performing the classical psychometric analyses. The IADCQ latent structure was evaluated with CFA to assure that the IADCQ scoring matched with the conceptual framework. We then evaluated the psychometric characteristics of the measures using classical psychometric techniques and examined the instrument properties by assessing item-level and scale-level statistics. All analyses, unless otherwise specified, were conducted using Statistical Analysis Software (SAS) version 9.1.

Latent structure analyses

Confirmatory factor analysis was used to investigate the dimensionality of the IADCQ instrument and to ensure that the scoring approach matches with the latent structure of the IADCQ. Through an earlier unpublished qualitative study, six domains in the IADCQ were identified. Latent analyses were designed to determine whether the hypothesized organization of items to domains was consistent with the empirically tested latent structure of the IADCQ. The rationale for this analysis was to measure the extent to which the scoring system explains the way that caregivers respond to the items in the IADCQ to provide evidence for the structural fidelity of the scoring system fitting with the latent constructs underlying the IADCQ.

The aggregate data of item responses from the IADCQ were submitted to CFA appropriate for categorical data. Specifically, we used a parametric extraction of maximum likelihood but subjected the covariance matrix to bootstraps to correct for the influence of non-normality [26]. Two thousand Bollen-Stine bootstraps were used during the model estimation to control for multivariate non-normality. Five model fit statistics from the CFA were used to gauge the strength of the relationship between the theoretical model and the data: (1) goodness of fit index (GFI; measures the amount of variance and covariance in the data that are reproduced by the tested model); (2) comparative fit index (CFI; specifies the amount of difference between the examined model and the independence model); (3) non-normed fit index (NNFI; conducts the same task as CFI but takes into consideration the number of parameters in a model, an aspect that can inflate CFI); (4) root mean square error of approximation (RMSEA; determines how well the examined model reproduces the saturated model); and (5) standardized root mean square residual (SRMR; similar to RMSEA but specifies the absolute measure of model fit). The model would be considered satisfactory if the five fit indices met or surpassed these thresholds: GFI ≥ 0.90 [27], CFI and NNFI ≥ 0.95 [27], RMSEA ≤ 0.06 [28], and SRMR ≤ 0.08 [29]. Confirmatory factor analysis was conducted using Mplus Version 6.1, a latent variable modeling program [30].

Item-level analyses

For the item-level psychometric evaluation, five sets of analyses were planned after the latent analysis, following the methods described in Cole et al. [31]: (1) equality of item-total correlations for each scale (highest vs. lowest item-total correlation p > 0.05); (2) equality of variances for each item per scale (Hartley’s Fmax < 3.0); (3) sufficient item-total correlations (≥0.40); (4) small alpha-if-item-removed statistics (improvement in alpha ≤0.02); and (5) item-total correlations that were higher for each item’s own scale than for other scales (p < 0.05).
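
Two of the item-level criteria above, sufficient item-total correlations and Hartley’s Fmax, can be illustrated with a minimal sketch. The response matrix is hypothetical, the function names are illustrative, and the item-total correlation is computed in its corrected form (each item against the sum of the remaining items), which is an assumption because the text does not state which form was used.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical responses: rows are respondents, columns are items scored 0-4.
responses = [
    [0, 1, 0, 1],
    [1, 1, 2, 1],
    [2, 2, 2, 3],
    [3, 2, 3, 3],
    [4, 3, 4, 4],
    [2, 3, 1, 2],
]

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def corrected_item_total(rows):
    """Correlate each item with the sum of the remaining items."""
    n_items = len(rows[0])
    correlations = []
    for j in range(n_items):
        item = [row[j] for row in rows]
        rest = [sum(row) - row[j] for row in rows]
        correlations.append(pearson(item, rest))
    return correlations

def hartley_fmax(rows):
    """Ratio of the largest to the smallest item variance (sample variances)."""
    item_vars = [variance(col) for col in zip(*rows)]
    return max(item_vars) / min(item_vars)

item_total = corrected_item_total(responses)
fmax = hartley_fmax(responses)

for j, r in enumerate(item_total, start=1):
    flag = "ok" if r >= 0.40 else "low"
    print(f"item {j}: item-total r = {r:.3f} ({flag})")
print(f"Hartley's Fmax = {fmax:.2f} (criterion: < 3.0)")
```

The thresholds in the printout (0.40 and 3.0) are the ones listed in the criteria above; the toy data were chosen only so that the calculation is easy to follow.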

Scale-level evaluation

After estimating the item-level psychometrics, the scale-level properties of the IADCQ domains were examined in five aspects: (1) scale means and standard deviations; (2) floor and ceiling scores; (3) internal consistency reliability; (4) test-retest reliability; and (5) convergent validity. Along with providing descriptive statistics (ie, mean and standard deviation) for the IADCQ and SF-12v2 scores, we also assessed the overall floor and ceiling effects of the IADCQ for the purpose of assessing precision of the instrument, and the percentages of participants with the floor or ceiling scores were calculated. Floor and ceiling effects were classified when either was achieved by more than 5% of the sample.
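
The floor and ceiling rule described above can be sketched as follows. The list of total scores is hypothetical; it is built only to mirror the counts reported in the Results (two respondents at the minimum and two at the maximum of the 0 to 48 range, out of 200). The function name and return format are illustrative.

```python
def floor_ceiling_summary(totals, min_score=0, max_score=48, threshold_pct=5.0):
    """Percentage of respondents at the lowest and highest possible total,
    flagged as an effect when the percentage exceeds threshold_pct."""
    n = len(totals)
    floor_pct = 100.0 * sum(t == min_score for t in totals) / n
    ceiling_pct = 100.0 * sum(t == max_score for t in totals) / n
    return {
        "floor_pct": floor_pct,
        "ceiling_pct": ceiling_pct,
        "floor_effect": floor_pct > threshold_pct,
        "ceiling_effect": ceiling_pct > threshold_pct,
    }

# Hypothetical totals: 2 at floor, 2 at ceiling, 196 spread over 1..47.
totals = [0, 0, 48, 48] + [1 + (i % 47) for i in range(196)]

summary = floor_ceiling_summary(totals)
print(summary)
```

With these counts each extreme is reached by 1% of the sample, below the 5% criterion, so neither effect is flagged.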

Internal consistency reliability was measured with two techniques: coefficient alpha and average inter-item correlation. For coefficient alpha [32], reliability coefficients of ≥ 0.90 have been suggested for individual-level analyses [29], though an internal consistency of ≥ 0.80 is considered to be sufficient for most cases [32]. Internal consistency was also measured with the average inter-item correlation, which should range from 0.3 (for a general scale) to 0.5 (for a specific scale) [33],[34].
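
Both internal consistency statistics can be computed directly from a respondent-by-item matrix. The sketch below uses the standard formula for coefficient alpha, k/(k − 1) × (1 − Σ item variances / variance of the total score), and the mean of all pairwise Pearson correlations. The data are hypothetical and the function names are illustrative; the study itself used SAS.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance

# Hypothetical responses: rows are respondents, columns are items scored 0-4.
responses = [
    [0, 1, 0, 1],
    [1, 1, 2, 1],
    [2, 2, 2, 3],
    [3, 2, 3, 3],
    [4, 3, 4, 4],
    [2, 3, 1, 2],
]

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def cronbach_alpha(rows):
    """Coefficient alpha from item variances and total-score variance."""
    k = len(rows[0])
    item_vars = [variance(col) for col in zip(*rows)]
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

def average_inter_item_r(rows):
    """Mean Pearson correlation over all distinct item pairs."""
    columns = list(zip(*rows))
    pairs = [pearson(a, b) for a, b in combinations(columns, 2)]
    return mean(pairs)

alpha = cronbach_alpha(responses)
avg_r = average_inter_item_r(responses)
print(f"coefficient alpha = {alpha:.3f}")
print(f"average inter-item correlation = {avg_r:.3f}")
```

Because alpha rises with the number of items, a long scale can reach a high alpha with modest inter-item correlations, which is one reason to report both statistics.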

Test-retest reliability was conducted through a one-way random effects intra-class correlation coefficient (ICC) to evaluate the reliability and stability of the IADCQ and to assess the consistency of the instrument over time. Because of the short time frame in which the instruments were administered, it was expected that the measures of these constructs would either not change or change minimally. Finally, validity of the IADCQ scale was analyzed via Pearson correlations with baseline scores on the SF-12v2 measuring physical and mental health composite scores (PCS and MCS) scales and subscales for various measures of HRQoL (all correlations were expected to be negative given the inverse relationship of healthy scores on the IADCQ and SF-12v2). The correlations between the IADCQ and the SF-12v2 scores can provide an appropriate measure of overall convergent validity.
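
A one-way random effects ICC can be computed from the between-subject and within-subject mean squares of a one-way ANOVA. The sketch below implements the single-measure form, ICC = (MSB − MSW) / (MSB + (k − 1) × MSW); whether the study reported the single-measure or the average-measure form is not stated, so that choice is an assumption. The paired scores are hypothetical.

```python
from statistics import mean

def icc_oneway(scores):
    """Single-measure, one-way random effects ICC.

    scores: one list per subject, each holding k repeated measurements.
    """
    n = len(scores)
    k = len(scores[0])
    grand_mean = mean([x for row in scores for x in row])
    subject_means = [mean(row) for row in scores]

    # Between-subject mean square.
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)

    # Within-subject mean square.
    ss_within = sum(
        (x - m) ** 2 for row, m in zip(scores, subject_means) for x in row
    )
    ms_within = ss_within / (n * (k - 1))

    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

# Hypothetical (baseline, four-week retest) total scores for six caregivers.
paired_totals = [
    [10, 12],
    [20, 18],
    [30, 33],
    [25, 22],
    [15, 15],
    [40, 36],
]

icc = icc_oneway(paired_totals)
print(f"ICC(1,1) = {icc:.3f}")
```

Identical scores at both administrations give an ICC of 1, and the value falls as within-subject differences grow relative to between-subject spread.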



A total of 200 AD caregivers (80 males, 120 females) completed the Web-based survey. Overall, 87% of participants were between the ages of 30 and 69 years; 40% were between 30 and 49 years; and 47% were between 50 and 69 years (Table 2). The majority of participants were white, and a third had been caregivers for < 1 year. There were 42.5% of participants employed full time (ie, ≥ 30 hours per week), followed by participants who were employed part time (ie, < 30 hours per week) because of caregiving responsibilities (13%) or retired (13%). Among the caregivers who were employed, the majority of participants had missed zero to five days from work per month due to caregiving duties. Details of the demographic statistics for the test-retest sample are also provided in Table 2.

Table 2 Demographics of the Web-based survey participants

Confirmatory factor analysis

The initially theorized six-factor model was not supported by CFA. One of the premises of latent modeling is that a single factor should be considered either as the only factor or as an underlying single factor subsuming all other factors; therefore, it is reasonable to examine the goodness of fit of the items under a default one-factor model [35]. The goodness of fit of the single-factor model to the survey data was evaluated using the GFI, CFI [29], NNFI [36], RMSEA [37], and SRMR [26]. The CFA model obtained fit that reached most of the acceptance thresholds, where GFI = 0.934, CFI = 0.944, NNFI = 0.934, and RMSEA = 0.076 (90% confidence interval [CI]: 0.059–0.090). Although the RMSEA (0.076) was higher than the ideal value of 0.06, a strong SRMR finding (with the value of 0.040) suggested that the number of free parameters made it difficult to obtain a favorable RMSEA. The analysis results indicated that a single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of the IADCQ can be used to measure AD caregiver burden.

The finalized CFA model depicts the strength of the relationship between each latent trait and its reflective items, including the standardized path coefficients for the variables on each of the items, as well as the level of correlation on the factor (Figure 1). High standardized factor loadings were observed for all items on the single factor.

Figure 1

Finalized CFA model structure and path coefficients for the 12-item IADCQ. CFA=confirmatory factor analysis; IADCQ=Impact of Alzheimer’s Disease on Caregiver Questionnaire; e=residual variance.

IADCQ scoring

The response mean values and the proportions of participants with floor and ceiling effects by IADCQ item and IADCQ total are presented in Table 3. At the item level, the floor effects (0 = “Not at all”) ranged from 5% (item 12 on Stress) to 34.5% (item 8 on Relationship with AD patient), whereas the ceiling effects (4 = “Extremely”) ranged from 2.5% (item 1 on Physical Health) to 18% (item 12 on Stress). For the IADCQ total score, two participants had a floor effect (with sum score = 0) and another two participants had a ceiling effect (with sum score = 48). A total of 98% of the participants did not have either floor or ceiling effects, indicating that the IADCQ covers an ideal range of burden.

Table 3 IADCQ item mean and percentage with floor and ceiling effects

Classical psychometric evaluations

Because of the unidimensional model structure identified through CFA, the following item-level analyses were conducted (other item-level psychometrics are only appropriate in a multidimensional instrument): item-total correlations, alpha-removed statistics, and item homogeneity. The item-total correlations ranged from 0.523 (item 5 on Worry) to 0.785 (item 12 on Stress), which were considered substantial and satisfactory for the hypothesized scale (Table 4) [35],[38],[39]. When alpha-if-item-removed statistics were reviewed, removal of any one item did not lead to an appreciable improvement in coefficient alpha. Indeed, only item 5 (on Worry) had an improvement of any positive magnitude (0.001), but this improvement was negligible and far below the criterion of 0.02 improvement needed to flag the item as poor [31]. We also noted that the item-total correlation of item 5 (Worry) was significantly different from the average correlation of the scale (z score = 2.7; p = 0.006). Nevertheless, CFA results were not as strong without item 5, providing a psychometric rationale for keeping it in the scale. No other items had item-total correlations that were significantly lower than the rest of the instrument’s average item-total correlation. In addition, the Fmax value of 2.23 for the IADCQ indicates similar variances between the items.

Table 4 Item-level psychometrics

A series of scale-level psychometric evaluations were conducted (Table 5). Internal consistency reliability of the IADCQ revealed appropriate results: coefficient alpha was 0.927 and average inter-item correlation was 0.52. Reliability coefficients for the SF-12v2 scores were similar to published psychometrics for the general population [16]. The IADCQ had a mean scale score of 21.6 and a standard deviation of 10.8, which indicated that the majority of individuals within the Web-based participant population were likely to score along the scale continuum of 10.8 through 32.4. Convergent validity of the IADCQ scale was assessed by Pearson correlation coefficients with the SF-12v2 PCS and MCS scales and subscales. A low to moderate negative correlation was observed between the IADCQ and the scales of SF-12v2 with the Pearson correlation coefficients ranging from −0.58 to −0.20, which indicated a moderate convergent validity. Negative convergent correlations were expected here as higher scores on the IADCQ indicate worse functioning, whereas higher scores on the SF-12 indicate better functioning.
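
The sign of the convergent correlations can be illustrated with a small sketch. The paired scores below are hypothetical (they are not study data), and `pearson` is an illustrative helper; the point is only that when one scale scores burden upward and the other scores health upward, agreement between them appears as a negative coefficient.

```python
from math import sqrt
from statistics import mean

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical paired scores for six caregivers.
iadcq_total = [5, 12, 20, 28, 35, 41]     # higher = more burden
sf12_mental = [58, 52, 50, 41, 38, 30]    # higher = better mental health

r = pearson(iadcq_total, sf12_mental)
print(f"Pearson r = {r:.2f}")
```

These toy data are far more strongly related than the study’s reported range of −0.58 to −0.20; they were chosen so the direction of the relationship is obvious.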

Table 5 Scale-level psychometrics

Intra-class correlation coefficient

Intra-class correlation coefficient (ICC) for the IADCQ scale was estimated to assess the test-retest reliability for the subgroup of 50 AD caregivers who participated in the Web-based survey at both baseline and 4 weeks later. The ICC for the IADCQ scale was 0.68 (95% CI: 0.50–0.80), which indicated a moderate agreement on test-retest reliability.


The objective of this study was to investigate the psychometric characteristics of the IADCQ designed for AD caregivers as well as to determine the most appropriate scoring algorithm for the Web-based IADCQ. Our investigation revealed that the 12-item instrument demonstrated appropriate unidimensional model fit on the CFA, a high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity with the scales of the SF-12v2. As expected, the correlations were negative because higher scores on the IADCQ indicate a worse state, with coefficients of −0.20 for General Health and −0.58 for Mental Health. The CFA model was found to have strong fit on most of the indices. Moreover, most of the factor loadings were in the range of 0.7 to 0.8, indicating that the majority of the variance for question items was explained by the factor. The study findings demonstrate that the IADCQ can be used to measure the burden of AD caregiving and that the concepts measured in the IADCQ represent a cohesive concept of caregiver burden.

In addition to demonstrating psychometrically appropriate measurement characteristics, the results suggest that the IADCQ should be scored as a single scale by summing the scores from all 12 items. The sum score reflects the overall burden of AD caregiving across all theorized areas (ie, emotional, physical, social, financial, sleep, and time), where a higher IADCQ score indicates more burden for an AD caregiver. The IADCQ measures the burden of the AD caregiver; however, we recognize that there may be positive aspects associated with caregiving that are not addressed by our research. Positive emotions are not included in this newly developed measurement. Previous research has found additional factors to be important when discussing caregiving, such as positive emotions from caregiving and resources that may aid caregivers in managing the challenges of caregiving [40]. In particular, Stephan et al. have evaluated this aspect in the Caregiver Reaction Assessment scale [41]. Previous development work with the IADCQ did not consider these factors as the emphasis was on the negative side of caregiver burden. Readers should consider this omission of factors when evaluating the comprehensiveness of the IADCQ for their needs.
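
The scoring rule described above can be sketched in a few lines, given the 12 items and the 0 (“not at all”) to 4 (“extremely”) response scale described earlier. The function name and the validation behavior are illustrative assumptions, not part of the published instrument.

```python
N_ITEMS = 12
MIN_RESPONSE, MAX_RESPONSE = 0, 4

def score_iadcq(responses):
    """Return the IADCQ total as the simple sum of 12 item responses.

    Each response must be an integer from 0 to 4, so the total ranges
    from 0 to 48; higher totals indicate greater caregiver burden.
    """
    if len(responses) != N_ITEMS:
        raise ValueError(f"expected {N_ITEMS} responses, got {len(responses)}")
    for value in responses:
        if not isinstance(value, int) or not MIN_RESPONSE <= value <= MAX_RESPONSE:
            raise ValueError(f"invalid response: {value!r}")
    return sum(responses)

# Hypothetical respondent answering 2 on every item.
example_total = score_iadcq([2] * N_ITEMS)
print(example_total)
```

Rejecting incomplete response sets mirrors the Web survey, which did not permit missing data; how a paper version should handle skipped items is not addressed in the text.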

Unlike other caregiver instruments, the IADCQ has been specifically designed to measure the burden associated with caregiving for AD patients. Issues that AD caregivers in particular tend to face, such as the potential for the AD patient to harm him/herself or others and the relationship between the caregiver and the AD patient, are included in this instrument. This may allow for a better understanding of how caring specifically for a person with Alzheimer’s disease affects the caregiver, an effect that other caregiver instruments may not adequately capture. Additionally, the IADCQ is appropriate for use in a clinical trial setting in that it is self-administered, brief (12 easy-to-complete items), and simple to score and interpret.

The caregiver population surveyed in this study appears to be demographically similar to the AD caregiver population in the US. The 2009 Behavioral Risk Factor Surveillance System survey of caregivers of patients with AD and other dementias found that 70% were female, 56% were ≥ 55 years old, and 44% were employed part or full time [4]. The Alzheimer’s Association reported that 75% of caregivers had been caregivers for ≥ 1 year; of these, 32% had been caregivers for ≥ 5 years [5]. However, because the AD caregivers completed the questionnaire online, the sample may not represent caregivers who do not have access to the Internet or are not computer users.

This study is not without limitations. As this study is the first quantitative evaluation of the IADCQ, its significance should not be overstated. The single-factor model should be further validated with independent samples, such as samples of clinical trial subjects. Indeed, because the originally postulated construct of the IADCQ did not obtain appropriate fit, further validation of the unidimensional model is important. The originally hypothesized model may have failed to fit because of a combination of too few items per factor and a sample size providing only moderate power.

Additionally, the current research is limited to caregivers of patients with AD. Extrapolating these findings to caregivers of dementia patients more broadly is not advised on the basis of the current research alone. Both regulatory [42] and psychometric [43] guidance note that, without proper assessment of the similarity of content validity, it would be inappropriate to presume that findings from the more restrictive cohort on which the research is based (eg, AD) will extrapolate to a larger cohort (eg, dementia). Therefore, we caution against any use of the IADCQ in a broader dementia caregiver population without additional research to establish its validity in that population.


In summary, this research supports the use of the Web-based IADCQ to measure the burden on caregivers of AD patients and justifies a single total-score interpretation. We found good internal consistency and moderate test-retest reliability and convergent validity. Validation of the paper-based administration mode of the IADCQ is another area for future development: further psychometric evaluation is needed because the validity and reliability of an instrument in one administration mode (eg, Web-based survey) cannot be assumed to hold in an alternate mode (eg, paper-based survey) [37]. When survey data are collected through other administration modes, additional psychometric properties of the instrument will need to be assessed by other applicable approaches, such as development of a responder definition.



AD: Alzheimer’s disease

ADL: Activity of daily living

CFA: Confirmatory factor analysis

CFI: Comparative fit index

CI: Confidence interval

GFI: Goodness of fit index

HRQoL: Health-related quality of life

IADCQ: Impact of Alzheimer’s Disease on Caregiver Questionnaire

IADL: Instrumental activity of daily living

ICC: Intra-class correlation coefficient

MCS: Mental health composite score

NNFI: Non-normed fit index

RMSEA: Root mean square error of approximation

PCS: Physical health composite score

SAS: Statistical analysis software

SF-12v2: Short Form-12 Health Survey Version 2

SRMR: Standardized root mean square residual

US: United States


  1. National Institute of Neurological Disorders and Stroke.

  2. Alzheimer’s Disease Fact Sheet.

  3. Reitz C, Brayne C, Mayeux R: Epidemiology of Alzheimer disease. Nat Rev Neurol 2011, 7: 137–152. 10.1038/nrneurol.2011.2

  4. Theis W, Bleiler L: Alzheimer’s Association: Alzheimer’s disease facts and figures. Alzheimers Dement 2011, 7: 208–244. 10.1016/j.jalz.2011.02.004

  5. Alzheimer's Association: Alzheimer's disease facts and figures. Alzheimers Dement 2012, 8: 131–168.

  6. Alzheimer's Disease International: World Alzheimer Report. Journal of Caring 2013.

  7. Marshall GA, Rentz DM, Frey MT, Locascio JJ, Johnson KA, Sperling RA: Alzheimer's Disease Neuroimaging Initiative: Executive function and instrumental activities of daily living in mild cognitive impairment and Alzheimer's disease. Alzheimers Dement 2011, 7: 300–308. 10.1016/j.jalz.2010.04.005

  8. Razani J, Kakos B, Orieta-Barbalace C, Wong JT, Casas R, Lu P, Alessi C, Josephson K: Predicting caregiver burden from daily functional abilities of patients with mild dementia. J Am Geriatr Soc 2007, 55: 1415–1420. 10.1111/j.1532-5415.2007.01307.x

  9. George LK, Gwyther LP: Caregiver well-being: a multidimensional examination of family caregivers of demented adults. Gerontologist 1986, 26: 253–259. 10.1093/geront/26.3.253

  10. Ankri J, Andrieu S, Beaufils B, Grand A, Henrard JC: Beyond the global score of the Zarit Burden Interview: useful dimensions for clinicians. Int J Geriatr Psychiatry 2005, 20: 254–260. 10.1002/gps.1275

  11. Beach SR, Schulz R, Yee JL, Jackson S: Negative and positive health effects of caring for a disabled spouse: longitudinal findings from the caregiver health effects study. Psychol Aging 2000, 15: 259–271. 10.1037/0882-7974.15.2.259

  12. Varela G, Varona L, Anderson K, Sansoni J: Alzheimer's care at home: a focus on caregivers strain. Prof Inferm 2011, 64: 113–117.

  13. The MetLife Study of Working Caregivers and Employer Health Care Costs.

  14. Gruffydd E, Randle J: Alzheimer's disease and the psychosocial burden for caregivers. Community Pract 2006, 79: 15–18.

  15. Deeken JF, Taylor KL, Mangan P, Yabroff KR, Ingham JM: Care for the caregivers: a review of self-report instruments developed to measure the burden, needs, and quality of life of informal caregivers. J Pain Symptom Manage 2003, 26: 922–953. 10.1016/S0885-3924(03)00327-0

  16. Robinson BC: Validation of a Caregiver strain index. J Gerontol 1983, 38: 344–348. 10.1093/geronj/38.3.344

  17. Reinhard SC, Gubman G, Horwitz AV, Minsky S: Burden assessment scale for families of the seriously mentally ill. Eval Program Plann 1994, 17: 261–269. 10.1016/0149-7189(94)90004-3

  18. Oberst MT, Thomas SE, Gass KA, Ward SE: Caregiving demands and appraisal of stress among family caregivers. Cancer Nurs 1989, 12: 209–215. 10.1097/00002820-198908000-00003

  19. Zarit SH, Reever KE, Bach-Peterson J: Relatives of the impaired elderly: correlates of feelings of burden. Gerontologist 1980, 20: 649–655. 10.1093/geront/20.6.649

  20. Test MA, Stein LI: Alternative to mental hospital treatment. III. Social cost. Arch Gen Psychiatry 1980, 37: 409–412. 10.1001/archpsyc.1980.01780170051005

  21. Provencher HL: Objective burden among primary caregivers of persons with chronic schizophrenia. J Psychiatr Ment Health Nurs 1996, 3: 181–187. 10.1111/j.1365-2850.1996.tb00085.x

  22. Schofield HL, Murphy B, Herrman HE, Bloch S, Singh B: Family caregiving: measurement of emotional well-being and various aspects of the caregiving role. Psychol Med 1997, 27: 647–657. 10.1017/S0033291797004820

  23. Pearlin LI, Mullan JT, Semple SJ, Skaff MM: Caregiving and the stress process: an overview of concepts and their measures. Gerontologist 1990, 30: 583–594. 10.1093/geront/30.5.583

  24. Ito D, Stokes J, Piault-Louis E, Bonnet P: Development of a Measure in Burden of Alzheimer’s Caregivers. American Academy of Neurology Annual Meeting; 2010.

  25. Jenkinson C, Layte R, Jenkinson D, Lawrence K, Petersen S, Paice C, Stradling J: A shorter form health survey: can the SF-12 replicate results from the SF-36 in longitudinal studies? J Public Health Med 1997, 19: 179–186. 10.1093/oxfordjournals.pubmed.a024606

  26. Bentler PM: Comparative fit indexes in structural models. Psychol Bull 1990, 107: 238–246. 10.1037/0033-2909.107.2.238

  27. Enders CK: Applying the Bollen-Stine bootstrap for goodness-of-fit measures to structural equation models with missing data. Multivariate Behav Res 2002, 37: 359–377. 10.1207/S15327906MBR3703_3

  28. Nunnally JC, Bernstein IH: Psychometric Theory. 3rd edition. McGraw-Hill, New York; 1994.

  29. Hu L-t, Bentler PM: Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Structural Equation Modeling 1999, 6: 1–55. 10.1080/10705519909540118

  30. Muthén LK, Muthén BO: Mplus®: Statistical Analysis with Latent Variables User's Guide. Muthén & Muthén, Los Angeles CA; 2010.

  31. Cole JC, Lin P, Rupnow MF: Validation of the Migraine-Specific Quality of Life Questionnaire version 2.1 (MSQ v. 2.1) for patients undergoing prophylactic migraine treatment. Qual Life Res 2007, 16: 1231–1237. 10.1007/s11136-007-9217-1

  32. Cronbach L: Coefficient alpha and the internal structure of tests. Psychometrika 1951, 16: 297–334. 10.1007/BF02310555

  33. Anastasi A, Urbina S: Psychological Testing. 7th edition. Pearson, Upper Saddle River; 1998.

  34. Fiske DW: Some hypotheses concerning test adequacy. Educ Psychol Meas 1966, 26: 69–88.

  35. Tyler TA, Fiske DW: Homogeneity indices and test length. Educ Psychol Meas 1968, 28: 767–777. 10.1177/001316446802800306

  36. Tanaka JS, Huba GJ: Structures of psychological distress: testing confirmatory hierarchical models. J Consult Clin Psychol 1984, 52: 719–721. 10.1037/0022-006X.52.4.719

  37. Marsh HW, Hau K-T, Wen Z: In search of golden rules: comment on hypothesis-testing approaches to setting cutoff values for fit indexes and dangers in overgeneralizing Hu and Bentler's (1999) findings. Structural Equation Modeling 2004, 11: 320–341. 10.1207/s15328007sem1103_2

  38. Steiger JH, Lind JC: Statistically-based models tests for the number of common factors. In Paper presented at the Psychometric Society Meeting. Iowa City, IA 1980.

  39. Ware JE Jr, Gandek B: Methods for testing data quality, scaling assumptions, and reliability: the IQOLA Project approach. International Quality of Life Assessment. J Clin Epidemiol 1998, 51: 945–952. 10.1016/S0895-4356(98)00085-7

  40. Zarit S: Positive aspects of caregiving: more than looking on the bright side. Ageing Ment Health 2012, 6: 673–674. 10.1080/13607863.2012.692768

  41. Stephan A, Mayer H, Renom Guiteras A, Meyer G: Validity, reliability, and feasibility of the German version of the Caregiver Reaction Assessment scale (G-CRA): a validation study. Int Psychogeriatr 2013, 10: 1621–1628. 10.1017/S1041610213001178

  42. Food and Drug Administration - Department of Health and Human Services:Guidance for Industry - Patient-Reported Outcome Measures: Used in Medical Product Development to Support Labeling Claims. Food and Drug Administration, Rockville; 2009.

  43. Haynes SN, Richard DCS, Kubany ES: Content validity in psychological assessment: a functional approach to concepts and methods. Psychol Assess 1995, 7: 238–247. 10.1037/1040-3590.7.3.238



The authors wish to acknowledge Patrick Bonnet, PharmD, MS, of GE Healthcare for his contributions to the development of the initial draft of the IADCQ and Karen Spach, PhD, of Covance Market Access for her editorial contribution to this manuscript.

Author information


Corresponding author

Correspondence to Jason C Cole.

Additional information

Competing interests

Research was supported by Baxter Healthcare Corporation (Baxter). Diane Ito and Josephine Li-McLeod are employees of Baxter. Jason C. Cole, Yaozhu J. Chen, Rebecca Cheng, and Jennifer Bolognese served as consultants to Baxter.

Authors’ contributions

DI contributed to the conception and design of the study, interpretation of results, and development and critical review of the manuscript. JL-M contributed to the conception and design of the study and provided critical review of the manuscript. JCC contributed to the conception and design of the study, interpretation of results, and development and critical review of the manuscript. YJC contributed to the interpretation of results and development and critical review of the manuscript. RC contributed to the conception and design of the study, interpretation of results, and development and critical review of the manuscript. JB contributed to the development and critical review of the manuscript. All authors read and approved the final manuscript.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Cole, J.C., Ito, D., Chen, Y.J. et al. Impact of Alzheimer’s Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden. Health Qual Life Outcomes 12, 114 (2014).
