- Open Access
Use of rasch methodology to develop a short version of the Health Related Quality of life for Eating Disorders questionnaire: a prospective study
Health and Quality of Life Outcomesvolume 8, Article number: 29 (2010)
To confirm the internal structure of the Health Related Quality of Life for Eating Disorders version 2 questionnaire (HeRQoLEDv2) and create and validate a shortened version (HeRQoLED-S).
324 patients with eating disorders were assessed at baseline and one year later (75.6% of whom responded). We performed a confirmatory factor analysis of the HeRQoLEDv2 using baseline data, and then a Rasch analysis to shorten the questionnaire. Data obtained at year one was used to confirm the structure of the HeRQoLED short form and evaluate its validity and reliability.
Two latent second-order factors -- social maladjustment and mental health and functionality -- fit the data for the HeRQoLEDv2. Rasch analysis was computed separately for the two latent second-order factors and shortened the HeRQoLEDv2 to 20 items. Infit and outfit indices were acceptable, with the confirmatory factor analysis of the HeRQoLED short form giving a root mean square error of approximation of 0.07, a non-normed fit index and a comparative fit index exceeding 0.90. The validity was also supported by the correlation with the convergent measures: the social maladjustment factor correlated 0.82 with the dieting concern factor of the Eating Attitudes Test-26 and the mental health and functionality factor correlated -0.69 with the mental summary component of the Short Form-12. Cronbach alphas exceeded 0.89.
Two main factors, social maladjustment and mental health and functionality, explain the majority of HeRQoLEDv2 scores. The shortened version maintains good psychometric properties, though it must be validated in independent samples.
Eating disorders (ED) affect millions of people worldwide. Since the earliest publications focusing on quality of life among individuals with an ED [1–8] it has been shown that they have a high degree of impairment in various areas of health-related quality of life (HRQoL). Most of these early studies used generic tools to assess the impact of an ED on physical, mental, and social factors . However, these generic tools did not include specific questions probing how the ED affected these factors which, in most cases, limited the interpretation of the results .
The first HRQoL instruments specific to individuals with an ED were published almost simultaneously in 2006 and 2007 [10–14]. We developed one of these, the Health Related Quality of Life for Eating Disorders version 2 (HeRQoLEDv2) questionnaire [13, 14], a tool with good validity and reliability. One limitation of this 55-question instrument is that it requires a considerable amount of time to complete. We subsequently decided to develop a shorter version. Some techniques for shrinking the size of questionnaires arise from item response theory (IRT) [15–17], with Rasch analysis being a useful approach. The rationale that makes Rasch models useful as a method to shorten the size of a questionnaire is that they can be employed to assess the unidimensionality of questionnaires, and remove items that disrupt this unidimensionality, identify degrees of trait severity and remove those items that overlap in severity level . In addition, it does not require large samples sizes for adequate parameter estimation .
The objectives of the current study were to confirm a hypothesized internal structure of the HeRQoLEDv2, create a shortened version of this questionnaire (HeRQoLED-Short form), and then confirm the structure of the shortened version and examine its validity and reliability. We hypothesized that the first-order factors of the HeRQoLEDv2 could represent two second-order latent traits: "social maladjustment" and "mental health and functionality." We tested this hypothesis in the present study.
Our detailed selection criteria have been described elsewhere [13, 14]. Briefly, the population consisted of ED patients being treated by four collaborating psychiatrists, experts in ED, working in three different mental health services in the province of Bizkaia, Spain. Diagnosis of an ED was performed by psychiatrists attending the patient if the patient met the diagnostic criteria for an ED established by the Diagnostic and Statistical Manual of Mental Disorders-IV .
Patients were excluded from the study if they had any serious multiorganic or psychotic disorder that could prevent adequate completion of the materials. To be included in the study, a patient had to participate in the investigation in an informed and voluntary way. The tenets of the Declaration of Helsinki were followed, and the study gained approval from the hospital's ethics committee.
Three questionnaires -- the HeRQoLEDv2, the 12-item Short Form Health Survey (SF-12), and the Spanish version of the Eating Attitudes Test-26 (EAT-26) -- were mailed to each patient's home address soon after recruitment, which we define as time 1 (T1). Those who did not respond in a timely fashion were sent reminders after 15 days and 30 days. The same questionnaires were mailed to patients one year later, which we define as time 2 (T2). As before, those who did not respond in a timely fashion were sent reminders after 15 days and 30 days.
Data from the T1 sample were used to perform confirmatory factor analysis (CFA) of the HeRQoLEDv2 followed by Rasch analysis. The T2 data were used to perform the CFA, validity, and reliability analyses of the shortened version.
Sociodemographic data were collected from each participant. In addition, each participant completed three self-administered instruments related to HRQoL and ED:
The HeRQoLEDv2 [13, 14] is comprised of 55 items and covering nine domains: symptoms, restrictive behaviors, body image, mental health, emotional role, physical role, personality traits, social relations, and binges. The scores in each domain are converted into a range from 0 to 100, with higher scores indicating a worse perception of HRQoL.
The SF-12 [21, 22] is a short generic survey of health status that can be summarized in two subscales: the physical component summary and the mental component summary. Values range from 0 to 100, with higher values indicating better health perception.
The Spanish version of the EAT-26  was used as a measure of general eating disorder pathology. This test is composed of three factors -- dieting concern, bulimia and food preoccupation, and oral control -- and a total score. Its overall values range from 0 to 78, with higher scores indicating greater ED symptomatology.
Confirmatory factor analysis of the HeRQoLEDv2
The HeRQoLEDv2 had previously been submitted to an exploratory factor analysis to elucidate the way in which items relate to each other and with the hypothesized factors. Following this validity study , we are now able to take a step further and hypothesize an internal structure of the HeRQoLEDv2 items and submit that structure to a confirmatory factor analysis. We excluded binges and symptoms domains from the model because binges domain was an independent domain and the symptoms domain is a list of symptoms rather than a proper measurement scale. A second-order CFA composed of a measurement model and a structural model was performed. We hypothesized a measurement model consisting of seven first-order factors: restrictive behaviors (6 items), body image (8 items), social relations (5 items), mental health (9 items), emotional role (4 items), physical role (4 items), and personality traits (4 items). These seven first order factors could be associated to two second-order latent traits: "social maladjustment" and "mental health and functionality". Based on both the content of the items from the following three first order factors "restrictive behaviours", "body image" and "social relations" and based on the literature, we believed that these three factors shared a common aspect: the impact of having an ED on the socio-cultural life. This impact is manifested in the way of feeding oneself, favouring the increase of restrictive behaviours and of feelings of body dissatisfaction . Also a recent study showed that families of individuals with ED perceived serious difficulties in their interpersonal relationship with the affected one .
We also hypothesized that the mental health and functionality of individuals with an ED would affect their scores in the first-order domains of "physical role", "emotional role", "mental health", and "personality traits". The mental health and functionality of ED individuals tend to be represented by a combination of high perfectionist traits, low self-efficacy feelings, stress due to feeling overweight and depressive symptoms [26–28]. All of these traits and feelings are part of the content of the selected first order domains.
We further hypothesized that "social maladjustment" and "mental health and functionality" factors would be correlated given that an individual's mental state is likely to affect his or her social adjustment and vice versa.
Several different fit indices are applicable to these analyses [29, 30]. We used the chi-square test divided by degrees of freedom, the results of which had to be less than 2.0 to be acceptable ; the root mean square error of approximation, where values of 0.08 or less are acceptable ; and the non-normed fit index and comparative fit index, both of which had to be equal to or greater than 0.90 to be satisfactory .
Only items that showed factor loadings ≥ 0.40 in the corresponding factor were accepted . The Lagrange multiplier test, which identifies paths or covariances that should possibly be added to the model to improve the fit, was used when the model needed modification.
CFAs were performed with the CALIS procedure of the SAS program (version 8.0) .
The Rasch method was applied to the original version of the HeRQoLED as a means to develop the Health Related Quality of life for Eating Disorders - Short Form (HeRQoLED-S). The Rasch model presumes that a single trait drives item responses , so that a person's response to an item that measures a single trait is accounted for by his/her level (amount) on that trait, and not by other factors . The Rasch model assumes that the probability of a given patient responding affirmatively an item is a logistic function of the relative distance between the item location parameter (the difficulty of the item) and the respondent (the ability of the patient), and only a function of that difference . Items along the logit scale are ordered according to its difficulty level; the most difficult ones are at the top and the easiest ones, at the bottom . In our study, items which reflect the highest impact on HRQoL are placed at the top of the continuum and those which reflect the lowest impact are placed at the bottom. We used the polytomous Rasch rating scale model because our response scales are ordinal with six response options. A joint maximum-likelihood estimation process was used to estimate the parameters .
Prior to all further analyses, the functioning of rating scale categories was examined for each of the two domains of the HeRQoLED short form. The rating scale categorizations presented to respondents are intended to elicit from those respondents unambiguous, ordinal indications of the locations of those respondents along the latent trait of interest . Therefore the probability of selecting an item response category indicative of better health status should increase as the underlying level of health of the respondent increases . Linacre  suggests the following criteria to assess adequate functioning of rating scale categories: (1) More than 10 observations per category (or the findings may be unstable, i.e., non-replicable); (2) A smooth distribution of category frequencies. The frequency distribution is not jagged; (3) Clearly advancing average measures; (4) Average measures near their expected values; (5) Observational fit of the observations with their categories: Outfit mean-squares near 1.0. Values much above 1.0 are much more problematic than values much below 1.0.
Because the condition of unidimensionality is a requirement for using Rasch analysis, we applied the Rasch analysis separately to both social maladjustment and mental health and functionality factors. Unidimensionality was assessed through a principal components analysis (PCA) of the residuals extracted from the Rasch model . A violation of unidimensionality was considered if in addition to the first factor there were other factors with eigenvalues greater than 3 . Apart of the PCA, unidimensionality was assessed through examination of fit statistics. We used two indices of fit, namely the mean square information-weighted statistic (infit) and the outlier-sensitive statistic (outfit). Values between 0.7 and 1.3 for both indices indicate a good fit .
We evaluated how well the HeRQoLED - short version differentiates individuals in the measured domains on the basis of the person separation statistic  and how well it differentiates items based on the item separation index, which indicates the ability to define a distinct hierarchy of items along the measured variable. A value ≥ 2.0 for this statistic is comparable to a reliability of 0.80 and is acceptable. Correlation of items with the total scale score served to evaluate whether the items correlated in a similar way with the construct being measured .
"Item bias" or "differential item functioning" (DIF) occurs when items exhibit different difficulties for different person groups. For a given level of a trait, the probability of endorsing a specified item response should be independent of group membership . For the DIF analysis, we examined whether diagnosis subtype (anorexia nervosa, bulimia nervosa, or eating disorder not otherwise specified) may exert influence on item calibrations in subsamples. DIF analyses were performed independently for the "Social maladjustment scale" and for the "Mental health and functionality scale". Welch t gives the DIF significance as a Welch's (Student's) t-statistic. The t-test is a two-sided test for the difference between two means (i.e., the estimates) based on the standard error of the means (i.e., the standard error of the estimates). The null hypothesis is that the two estimates are the same except for measurement error. To establish a noticeable DIF between subsamples, the difference in difficulty of the item between the two groups (DIF contrast) should be at least 0.5 logits. In addition, the Welch t should be statistically significant, P < .05 .
Residual correlations between items within a scale were examined for local dependency. Correlations > 0.5 between item residuals can indicate that responses to one item may be determined by those to another .
Rasch analyses were repeated until we obtained a version that met the criteria, which was named the Health Related Quality of Life for Eating Disorders-Short Form (HeRQoLED-S). Item content was examined for the misfitting items before removal from the scale. Two of the authors of the present study (JAP and CLH) are experts in the field of eating disorders. They jointly decided whether to retain or delete an item based on the clinical importance of the content. Winsteps version 3.37 was used for the Rasch analysis .
Confirmatory factor analysis of the HeRQoLED-S
A CFA was applied to the shortened version. The hypothesized structural and measurement models were the same as those of the long version. The only difference was that fewer items were assigned to each first-order factor. The same fit indices were also used to assess the goodness of fit.
Validity and reliability of the HeRQoLED-S
Based on content similarity between subscales of different questionnaires, we hypothesised the following correlations for the analysis of concurrent validity: The social maladjustment factor would correlate positively and moderately, by means of the Pearson correlation coefficient, with the dieting concern factor of the EAT-26. The mental health and functionality factor, in turn, was hypothesized to correlate negatively and moderately with the mental component summary of the SF-12. The Cronbach alpha index of reliability was calculated for each factor; values above 0.70 were acceptable .
A total of 394 ED patients were approached for the study. Of them, 324 ED patients completed the first set of questionnaires (T1). All patients were receiving treatment for their ED at T1 but they differed in ED subtype, severity and time in treatment. We did not filter patients in these regards; therefore we expect that these patients represent the entire spectrum of ED severity. All were asked to complete the same tests again after one year. Of these, 245 patients (75.6%) responded. Most participants were women (96.3% at T1 and 95.1% at T2), with a mean age of 27 years, SD (8.76) at T1. From the baseline sample, 21% patients had been diagnosed with anorexia nervosa, 15% with bulimia nervosa, and 64% with eating disorders not otherwise specified.
Confirmatory Factor Analysis of the HeRQoLEDv2
For the CFA, only data from the 262 participants who completed the HeRQoLEDv2 at T1 with no answers missing were used. The hypothesized model described in the Introduction provided satisfactory fit indices after few adjustments. Following the Lagrange multiplier test, two pairs of errors, one belonging to the body image domain and the other to the social relations domain, were allowed to covary. Additionally, the Lagrange multiplier test suggested setting a new causal relationship between the personality traits item "Have you had lack of confidence in your own capabilities?" and the mental health domain item "Have you felt yourself worthless?". This new relation is meaningful given that lack of confidence in one's capabilities may lead an individual with an ED to feelings of worthlessness when facing problems. After these adjustments, the goodness of fit indices for the model were satisfactory (χ2 (df = 729) = 1464.67, P < .0001; χ2/df = 2.01; RMSEA = 0.06; NNFI = 0.90 and CFI = 0.90).
Figure 1 shows the path diagram of the model with the estimated parameter values included.
Rasch analysis to obtain the shortened version
Data from all 324 ED patients who responded at T1 were used for the Rasch Rating Scale analysis.
Originally, the social maladjustment domain was composed of 19 items. Nine of them were removed because they showed inadequate fit indices (infit or outfit) or because they overlapped the same level of difficulty as other items. Experts in ED evaluated the importance of the item content before removing the item. The shortened social maladjustment domain consisted of 10 items separated by 0.10 or more logit values. Table 1 shows the characteristics of the measurement level, standard error, infit, outfit, and item total correlations. The level of difficulty is represented by the trait level (δ), where high values indicate greater difficulty with social adjustment.
Four items of the social maladjustment domain did not comply with all the requirements for adequate functioning of rating scale categories. Specifically, fewer than 10 participants had endorsed the response category "Almost always" in the item RB12 "Do you fast for a day although you feel hungry". We combined adjacent categories "almost always" with respondents of "Always" to obtain a robust structure of high frequency categories. This combination reproduced satisfactory results with an outfit index of 1.3. Items RB15 "Do you avoid eating with others?" and BI27 "Do you worry about the possibility of gaining weight?" showed large outfits in one of their response categories. Response category "Always" of item RB15 presented an outfit index of 2.1. After combining respondents of adjacent categories "always" and "almost always", the outfit index reduced to 1.5. For the item BI27 the category response "never" presented an outfit index of 2.6. After combining this response category with the adjacent category of "almost never" the outfit value reduced to 1.4. The fourth problematic item is SOCR54 "Do you think that your eating habits negatively affect your family relationship?" which presented a large outfit (OUTF MNSQ = 1.9) for response category 3 "several times" but not for the remaining of the response categories. Combining adjacent categories was not a good approach since the resulting merged response category would count with an excessive number of respondents. We decided to leave the item as it was.
Unidimensionality was supported since the PCA of the residuals did not give additional factors with eigenvalues exceeding 3.00. Furthermore, the fit indices ranged from 0.77 to 1.30. All the item total correlations were high and homogeneous (see Table 1). Differential item functioning was observed only in one item, BI27 "Do you worry about the possibility of gaining weight?" with a difference slightly higher than 0.5 (DIF contrast = 0.66; p < 0.05) between the anorexia and the bulimia subgroups. For patients with anorexia nervosa, this item was slightly more difficult than for patients with bulimia. Intercorrelations between residuals were all below 0.50 (range -.30 to .47).
The final shortened scale of social maladjustment included 10 items. The item locations for the HeRQoLED-S are shown in Figure 2 (left-hand side). The person separation index (2.46) and the item separation index (12.48) exceeded the required value of 2.0, thereby indicating a reliability above 0.80. The total score was transformed to range from 0 to 100 (mean score: 48.8; SD: 23.2).
The mental health and functionality scale originally included 21 items. After performing iterative Rasch analyses and item content analysis, 11 of them were removed because they overlapped or misfit and were not clinically essential. Seven of the 10 remaining items in the scale were separated by 0.10 logit units and 3 of which were separated by 0.04 logit units (Table 2; Figure 2, right). The 3 overlapping items (Figure 2, right-hand side) were retained because they were considered clinically meaningful based on expert opinion and had adequate fit indices.
Unidimensionality was supported since the PCA of the residuals did not lead to additional factors with eigenvalues exceeding 3.00. Furthermore, the fit indices ranged from 0.72 to 1.27. The item total correlations were all high and homogeneous, ranging from 0.61 to 0.78.
Only two items of the mental health and functionality domain did not comply with the requirements for adequate functioning of rating scale categories. Specifically, the category response "Always" of item PR48 "Do you have to stop performing some tasks as a result of your physical problem?" presented an outfit index of 2.2. Therefore, we decided to combine this response option with the adjacent category "Almost always". After this combination, the outfit reduced to 1.4. The category response "Never" from the item MH36 "Do you feel happy?" was only reported by 1 participant. Thus, we decided to combine it with the adjacent category response "Almost never" to enlarge the sample. After this combination, the outfit index was -1.58.
Figure 2 (right side) shows the item and person locations along the logit scale. Positive values indicate high levels of mental health disease and dysfunction, whereas negative values indicate low levels of mental health disease and dysfunction.
The person separation index (2.5) and the item separation index (9.7) for this sample also exceeded the required value of 2.00, indicating a reliability of the scale above 0.80. The raw score in this domain was also transformed to range from 0 to 100 (mean = 48; SD = 20.3). Statistically significant DIF contrasts were not observed for any item of the scale.
Intercorrelations between residuals were below 0.50 (range -.29 to .41), except for two items ("Do you have to stop performing some tasks as a result of your physical problem?" and "Do you find it difficult to maintain the attention as a result of your physical problem?") which slightly surpassed this threshold (r = 0.51).
In summary, after applying the Rasch rating scale analysis to the original 40 items (after excluding items from binges and symptoms domains) of the HeRQoLEDv2 we obtained a shortened version of 20 questions divided in 2 factors, 'social maladjustment' and 'mental health and functionality'. This HeRQoLED short version provides separate scores for each factor. Calculating the score in both long and short versions requires summing the response options selected in the factor's items, standardizing the score to range from 0 to 100. In case of missing values we applied the mean imputation method.
Confirmatory Factor Analysis of the HeRQoLED-S
Data from the 207 patients who returned questionnaires at T2 without missing answers were used for the CFA of the HeRQoLED-S. The hypothesized model was similar to that of the long version but included only the 20 items accepted after the Rasch analysis. The Lagrange multiplier test was again used. The first pair of errors intercorrelated belonged to two items from the body image domain, and the second to the personality traits domain.
The factorial structure that resulted after allowing for these covariances between errors proved satisfactory since it resulted in acceptable fit indices (x2 (df = 160) = 305.96, P < .0001; x2/df ratio = 1.9; RMSEA = 0.07; NNFI = 0.93 and CFI = 0.94) and significant factor loadings (Figure 3).
Concurrent validity and reliability of the HeRQoLED-S
A data set for the HeRQoLED-S was created using the responses of all 245 patients who completed questionnaires at T2. As hypothesized (Table 3) the social maladjustment factor correlated more strongly with the dieting concern factor of the EAT-26 (r = 0.82, p < 0.001) than with the remaining factors. The mental health and functionality factor of the HeRQoLED-S also correlated higher with the mental summary component of the SF-12 (r = -0.69, p < 0.0001) than with the other factors. The Cronbach alpha was 0.91 for the social maladjustment domain and 0.90 for the mental health and functionality domain.
This study confirmed the internal structure of a newly developed questionnaire for eating disorders, the 55-question HeRQoLEDv2. We also applied CFA and Rasch analysis to develop a shorter 20-question version, which maintained satisfactory psychometric qualities, and we validated the internal structure of the shortened questionnaire.
Of the three other disease-specific instruments created to date for measuring HRQoL in patients with an ED, only the EDQOL questionnaire  has been subjected to a CFA. In that study, the investigators confirmed the structure of a second-order factor, presumed to be the HRQoL construct that explained the relationships between four latent first-order factors. In the current study, CFA of the HeRQoLEDv2 revealed two correlated second-order factors that explained the relationships between seven first-order factors. In theorizing our model, we did not hypothesize an orthogonal structure a priori because we assumed that the HRQoL measurement construct included the intercorrelation of physical, mental, and social factors affected by EDs and treatment [44, 45].
Validation of the HeRQoLEDv2 using CFA provides the questionnaire with greater construct validity than in the version we previously developed . To perform the CFA, we recruited 262 patients with ED. Although one could argue that this sample size is small considering the length of the questionnaire, it must be noted that it is difficult to recruit patients with ED, so recruiting this amount of participants can be considered as strength of the study more than a limitation. Among the potential statistical drawbacks derived from the sample size are the increase in sampling error, instability, and reduced reliability of factor analysis solutions .
A second aim of this study was to use modern analytical techniques to create a shorter version of the HeRQoLEDv2. Various strategies are available for the reduction of questionnaires . We chose to apply the Rasch method, as this technique produces a scale that calibrates items based on their range of difficulty for the target population.
The 20-item HeRQoLED-S that emerged from the Rasch method provided adjustment levels (infit and outfit), unidimensionality, and local independence sufficient to be considered adequate. A slight DIF was observed in only one item. We decided not to remove the item from the questionnaire since it was clinically relevant and presented satisfactory levels of functioning in the other parameters (fit statistics, local dependence, and response scale functioning).
A third aim of the study was to validate the HeRQoLED-S. A CFA applied to the HeRQoLED-S confirmed the goodness of the structure achieved using the Rasch method, as reflected in the obtained fit indices. Other studies have also applied CFA to validate the internal structure of shortened questionnaires . Apart of the hypothesized concurrent correlations between the HeRQoLED-S and specific domains of the EAT-26 and SF-12, the social maladjustment factor of the HeRQoLED-S correlated highly with the second factor of the EAT-26, bulimia and food preoccupation. This latter correlation had not been hypothesized previously. The bulimia and food preoccupation factor contains questions about the control that food exercises over an individual's life and about binges and vomiting. It makes sense that the social maladjustment domain is highly correlated with this factor because individuals who engage in bingeing and vomiting also manifest problems with social adjustment [48, 49]. However, our first hypothesis was to correlate the social maladjustment domain with the dieting concern factor because questions pertaining to it inquire about restrictive behaviors and body image, and are more similar in content to those covered by the social maladjustment domain.
We estimated that the shortened form requires approximately 5 to 7 minutes to complete, which is about one-third the time it takes to complete the original HeRQoLEDv2. This is a considerable reduction in time commitment for participants.
One limitation of the HeRQoLED-S is that its items did not cover the entire range of existing difficulties, and gaps in construct difficulty were detected. Although including more items would have helped cover the different levels of construct difficulty, this was not possible because we were working with a predetermined set of items and selected those that provided the best distribution despite the gaps. In addition, although redundant items were identified for mental health and functionality, they were maintained because the scale generally provided good content validity and good fit indices.
Coste et al.  have recommended that shortened versions of questionnaires be evaluated psychometrically (particularly with regard to construct validity and reliability) using a new and independent sample. Due to financial limitations and difficulties in recruiting another large sample of patients with an ED, the HeRQoLED-S was validated using the same patient sample as in the follow-up study. We believe this was appropriate given that the T2 sample contained a different number of patients and that the one-year interval since the last contact uses to lead to significant changes in ED symptoms, as some other studies have shown [50, 51]. Nevertheless, the same level of validity cannot be obtained from a repeat sample as from a new independent sample. Thus, the shortened HeRQoLED-S must still be validated among different groups of patients with eating disorders.
In conclusion, CFA analysis supports an internal structure of two latent factors of the 55-question HeRQoLEDv2. A short form questionnaire derived from this second order structure, the 20-item HeRQoLED-S, has been developed and validated with modern psychometric techniques that facilitate its use in research and clinical practice. Both versions have demonstrated good reliability and validity. Future applications of HeRQoLEDv2 and HeRQoLED-S using different ED patient samples will yield more evidence about their validity and reliability.
de la Rie SM, Noordenbos G, van Furth EF: Quality of life and eating disorders. Qual Life Res 2005, 14: 1511–1521. 10.1007/s11136-005-0585-0
de la Rie SM, Noordenbos G, Donker M, van Furth EF: The patient's view on quality of life and eating disorders. Int J Eat Disord 2006, 40: 13–20. 10.1002/eat.20338
Doll HA, Petersen S, Stewart-Brown S: Eating disorders and emotional and physical well-being: Associations between student self-reports of eating disorders and quality of life as measured by the SF-36. Qual Life Res 2005, 14: 705–717. 10.1007/s11136-004-0792-0
Keilen M, Treasure T, Schmidt U, Treasure J: Quality of life measurements in eating disorders, angina, and transplant candidates: are they comparable? J R Soc Med 1994, 87: 441–444.
Miller PM: Redefining success in eating disorders. Addict Behav 1996, 21: 745–754. 10.1016/0306-4603(96)00033-0
Padierna A, Quintana J, Arostegui I, González N, Horcajo M: The health-related quality of life in eating disorders. Qual Life Res 2000, 9: 667–674. 10.1023/A:1008973106611
Padierna A, Quintana JM, Arostegui I, Gonzalez N, Horcajo MJ: Changes in health related quality of life among patients treated for eating disorders. Qual Life Res 2002, 11: 545–552. 10.1023/A:1016324527729
Spitzer RL, Kroenke K, Linzer M, Hahn SR, Williams JB, deGruy FVI, Brody D, Davies M: Health-related quality of life in primary care patients with mental disorders. Results from the PRIME-MD 1000 Study. JAMA 1995, 274: 1511–1517. 10.1001/jama.274.19.1511
Hay P, Mond JM: How to 'count the cost' and measure burden? A review of health-related quality of life in people with eating disorders. J Ment Health 2005, 14: 539–552. 10.1080/09638230500400274
Adair C, Marcoux G, Cram B, Ewashen C, Chafe J, Cassin S, Pinzon J, Gusella J, Geller J, Scattolon Y, Fergusson P, Styles L, Brown K: Development and multi-site validation of a new condition-specific quality of life measure of eating disorders. Health Qual Life Outcomes 2007, 5: 23. 10.1186/1477-7525-5-23
Abraham SF, Brown T, Boyd C, Luscombe G, Russell J: Quality of life: eating disorders. Aust N Z J Psychiatry 2006, 40: 150–155.
Engel S, Wittrock D, Crosby RD, Wonderlich S, Mitchell J, Kolotkin RL: Development and psychometric validation of an eating disorder-specific health-related quality of life instrument. Int J Eat Disord 2006, 39: 62–71. 10.1002/eat.20200
Las Hayas C, Quintana JM, Padierna A, Bilbao A, Muñoz P, Madrazo A, Urresti B, Cook E: The new questionnaire Health-Related Quality of Life for Eating Disorders showed good validity and reliability. J Clin Epidemiol 2006, 59: 192–200. 10.1016/j.jclinepi.2005.06.005
Las Hayas C, Quintana J, Padierna A, Bilbao A, Muñoz P, Cook E: Health-Related Quality of Life for Eating Disorders questionnaire version-2 was responsive 1-year after initial assessment. J Clin Epidemiol 2007, 60: 825–833. 10.1016/j.jclinepi.2006.10.004
Beaton DE, Wright JG, Katz J, The Upper Extremity Collaborative Group: Development of the QuickDASH: Comparison of three item-reduction approaches. J Bone Joint Surg Am 2005, 87-A: 1038–1046. 10.2106/JBJS.D.02060
Coste J, Guillemin F, Pouchot J, Fermanian J: Methodological approaches to shortening composite measurement scales. J Clin Epidemiol 1997, 50: 247–252. 10.1016/S0895-4356(96)00363-0
Nijsten T, Unaeze J, Stern R: Refinement and reduction of the Impact of Psoriasis Questionnaire: Classical Test Theory vs. Rasch analysis. Br J Dermatol 2006, 154: 692–700. 10.1111/j.1365-2133.2005.07066.x
Smith A, Wright E, Rush R, Stark D, Velikova G, Selby P: Rasch analysis of the dimensional structure of the Hospital Anxiety and Depression Scale. Psycho-oncology 2006, 15: 817–827. 10.1002/pon.1015
Linacre J: Sample size and item calibration stability. Rasch Meas Trans 1994, 7: 328.
American Psychiatric Association: Diagnostic and Statistical Manual for Mental Disorders. 4th edition. Washington, DC: American Psychiatric Press; 1994.
Gandek B, Ware JE, Aaronson NK, Apolone G, Bjorner JB, Brazier JE, Bullinger M, Kaasa S, Leplege A, Prieto L, Sullivan M: Cross-validation of item selection and scoring for the SF-12 Health Survey in nine countries: results from the IQOLA Project. International Quality of Life Assessment. J Clin Epidemiol 1998, 51: 1171–1178. 10.1016/S0895-4356(98)00109-7
Ware JE, Kosinski M, Keller SD: A 12-Item Short-Form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care 1996, 34: 220–233. 10.1097/00005650-199603000-00003
Castro J, Toro J, Salamero M, Guimera E: The eating attitudes test: validation of the spanish version. Evaluación Psicológica/Psychological Assessment 1991, 7: 175–190.
Stice E: Review of the evidence for a sociocultural model of bulimia nervosa and an exploration of the mechanisms of action. Clin Psychol Rev 1994, 14: 1994. 10.1016/0272-7358(94)90002-7
Hillege S, Beale B, McMaster RE-MA, Hillege S: Impact of eating disorders on family life: Individual parents' stories. J Clin Nurs 2006, 15: 1016–1022. 10.1111/j.1365-2702.2006.01367.x
Bardone-Cone A, Abramson L, Vohs KD, Heatherton TF, Joiner TE Jr: Predicting bulimic symptoms: an interactive model of self-efficacy, perfectionism, and perceived weight status. Behav Res Ther 2006, 44: 27–42. 10.1016/j.brat.2004.09.009
Bardone-Cone AM, Joiner J, Crosby RD, Crow SJ, Klein MH, le Grange D, Mitchell JE, Peterson CB, Wonderlich SA: Examining a psychosocial interactive model of binge eating and vomiting in women with bulimia nervosa and subthreshold bulimia nervosa. Behav Res Ther 2008, 46: 887–894. 10.1016/j.brat.2008.04.003
Cowen P, Anderson I, Fairburn CG: Neurochemical effects of dieting: Relevance to changes in eating and affective disorders. In The biology of feast and famine: Relevance to eating disorders. Edited by: Anderson G, Kennedy S. San Diego, CA: Academic Press; 1992:269–284.
Hatcher L: Developing Measurement Models with Confirmatory Factor Analysis. In A step-by-step approach to using the SAS® System for factor Analysis and Structural Equation Modeling (ed.). Cary, NC: SAS Institute Inc.; 1994:249–342.
Browne M, Cudeck R: Alternative ways of assessing model fit. SMR 1992, 21: 230–258.
SAS QC: SAS/QC User's Guide, Version 8, Volumes 1, 2, and 3. Cary, NC: SAS Institute Inc; 1999.
Cook K, Teal C, Bjorner JB, Cella D, Chang C-H, Crane P, Gibbons L, Hays R, McHorney CA, Ocepek-Welikson K, Raczek A, Teresi J, Reeve B: IRT health outcomes data analysis project: an overview and summary. Qual Life Res 2007, 16: 121–132. 10.1007/s11136-007-9177-5
Reeve BB, Hays RD, Bjorner JB, Cook KF, Crane PK, Teresi JA, Thissen D, Revicki DA, Weiss DJ, Hambleton RK, Liu H, Gershon R, Reise SP, Lai Js, Cella D, on behalf of the PROMIS Cooperative Group: Psychometric Evaluation and Calibration of Health-Related Quality of Life Item Banks: Plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Med Care 2007., 45: 10.1097/01.mlr.0000250483.85507.04
Prieto G, Delgado A: Análisis de un test mediante el modelo de Rasch. Psicothema 2003, 15: 94–100.
Wolfe F: Rasch analysis of the Western Ontario MacMaster Questionnaire (WOMAC) in 2205 patients with osteoarthritis, rheumatoid arthritis, and fibromyalgia. Ann Rheum Dis 1999, 58: 563–568. 10.1136/ard.58.9.563
Wright B, Stone M: Best test design: Rasch measurement. Chicago: MESA Press; 1979.
Linacre J: A User's Guide to WINSTEPS. Chicago, IL: MESA Press; 2009.
Tesio L: Measuring behaviours and perceptions: rasch analysis as a tool for rehabilitation research. J Rehabil Med 2003, 35: 105–115. 10.1080/16501970310010448
Duncan P, Lai S, Bode R, Perera S, De la Rosa J, GAIN Americas Investigators: Stroke Impact Scale-16. A brief assessment of physical function. Neurology 2003, 60: 291–296. 10.1001/archneur.60.2.291
Cole JC, Rabin A, Smith T, Kaufman A: Development and Validation of a Rasch-Derived CES-D Short Form. Psychol Assess 2004, 16: 360–372. 10.1037/1040-3522.214.171.1240
Davidson M, Keating JL, Eyres S: A low back-specific version of the SF-36 physical functioning scale. Spine 2004, 29: 586–594. 10.1097/01.BRS.0000103346.38557.73
Linacre J: Investigating rating scale utility. J Outcome Meas 1999, 3: 103–122.
Cronbach LJ: Coefficient alpha and the internal structure of tests. Psychometrika 1951, 16: 297–334. 10.1007/BF02310555
Fayers PM, Machin D: Quality of Life: assessment, analysis and interpretation. West Sussex PO19 1UD. England: John Wiley & Sons Ltd; 2000.
Revicki DA, Osoba D, Fairclough D, Barofsky I, Berzon R, Leidy NK, Rothman M: Recommendations on health-related quality of life research to support labelling and promotional claims in the United States. Qual Life Res 2000, 9: 887–900. 10.1023/A:1008996223999
MacCallum R, Widaman K, Zhang S, Hong S: Sample size in factor analysis. Psychol Methods 1999, 4: 84–99. 10.1037/1082-989X.4.1.84
Calvete E, Estévez A, López de Arrotabe E, Ruiz P: The Schema Questionnaire - Short Form. Structure and relationship with automatic thoughts and symptoms of affective disorders. Eur J Psychol Assess 2005, 21: 90–99. 10.1027/1015-57126.96.36.199
Beales DL, Dolton R: Eating disordered patients: personality, alexithymia, and implications for primary care. Br J Gen Pract 2008, 50: 21–26.
Rorty M, Yager J, Buckwalter JG, Rossotto E: Social support, social adjustment, and recovery status in bulimia nervosa. Int J Eat Disord 1999, 26: 1–12. 10.1002/(SICI)1098-108X(199907)26:1<1::AID-EAT1>3.0.CO;2-I
Bowers WA, Ansher L: The effectiveness of cognitive behavioral therapy on changing eating disorder symptoms and psychopathology of 32 anorexia nervosa patients at hospital discharge and one year follow-up. Ann Clin Psychitary 2008, 20: 79–86.
Fernandez Aranda F, Bel Villar M, Jimenez Murcia S, Turón-Gil V, Vallejo-Ruiloba J, Garcia Vilches I: Outpatient group psychotherapy for Anorexia nervosa. Anales de Psiquiatria 1997, 13: 236–242.
This study was partially supported by the Instituto de Salud Carlos III (Reference: 00/0115), the Fondo Europeo de Desarrollo Regional (FEDER) and the Department of Education, Universities and Research of the Basque Government. We are very grateful to the individuals with an ED who continue to collaborate with us in our research. We also wish to thank Drs. Begoña Urresti and Arantza Madrazo for their invaluable collaboration in patient recruitment. We are grateful to the Comisión de Investigación from the Hospital de Galdakao for funding the translation and editing of the article.
The authors declare that they have no competing interests.
CLH, JMQ, AP and AB conceived the study. CLH participated in all the phases of the study, design, recruitment of patients, data analysis, interpretation of results, and writing of the manuscript. JMQ supervised the study, participated in the interpretation of results and in the writing of the manuscript. AB designed the study and performed the statistical analysis. AP and PM collaborated actively in patient recruitment, revised the data obtained and reviewed the manuscript writing. All the authors read and approved the final manuscript.