The Herdecke questionnaire on quality of life (HLQ): Validation of factorial structure and development of a short form within a naturopathy treated in-patient collective
Health and Quality of Life Outcomes volume 3, Article number: 40 (2005)
Quality of life (QoL) of patients has become a central evaluation parameter that also acts as an aid for decisions related to treatment strategies particularly for patients with chronic illnesses. In Germany, one of the newer instruments attempting to measure distinct QoL aspects is the "Herdecke Questionnaire for Quality of Life" (HLQ). In this study, we aimed to validate the HLQ with respect to its factorial structure, and to develop a short form. The validation has been carried out in relation to other questionnaires including the SF-36 Health Survey, the Mood-Scale Bf-S, the Giessen Physical Complaints Questionnaire GBB-24 and McGill's Pain Perception Scale SES.
Data for this study derived from a model project on the treatment of patients using naturopathy methods in Blankenstein Hospital, Hattingen. In total, 2,461 patients between the ages of 16 and 92 years (mean age: 58.0 ± 13.4 years) were included in this study. Most of the patients (62%) suffered from rheumatic diseases. Factorial validation of the HLQ, it's reliability and external consistency analysis and the development of a short form were carried out using the SPSS software.
Structural analysis of the HLQ-items pointed to a 6-factor model. The internal consistency of both the long and the short version is excellent (Cronbach's α is 0.935 for the HLQ-L and 0.862 for the HLQ-S). The highest reliability in the HLQ-L was obtained for the "Initiative Power and Interest" scale, the lowest for the 2-item scales "Digestive Well-Being" and the "Physical Complaints". However, the scales found by factor analysis herein were only in part congruent with the original 5-scale model which was approved a multitrait analysis approach. The new instrument shows good correlations with several scales of other relevant QoL instruments. The scales "Initiative Power and Interest", "Social Interaction", "Mental Balance", "Motility", "Physical Complaints", "Digestive Well-Being" sufficiently differentiate the diagnostic groups, particularly between the patients suffering on connective tissue and soft tissue disorders from those with metabolic and nutritional disorders or hypersensitivity reactions.
Both the factorial validation and the development of a consistent short-form of the HLQ are important steps forward for researchers in the field of QoL who wish to use the HLQ as a reliable and valid instrument. The results indicate that the HLQ is a unique QoL-instrument that can be used for both in-patient and out-patient-treatment. However, to improve to profile of the HLQ, there is still the need for strengthening the Questionnaire in the dimensions of physical well-being. This is the subject of a separate ongoing study.
The consideration of "quality of life" (QoL) in clinical studies and various attempts to make this construct measurable to determine therapeutic success is an ongoing process. This is particularly the case in those therapeutic attempts that focus on integrative aspects of disease management that in turn offer holistic care including a variety of therapeutic directions. Here, the QoL has become a central evaluation parameter. It simultaneously acts as an aid for decisions on the choice of treatment strategy for chronically ill patients , which is obviously a challenging therapeutic aim, and is at least as significant as somatic parameters . QoL has therefore become a leading criteria in many outcome studies alongside somatic and economic factors. In the course of this development, the concept of QoL is explicitly listed as outcome parameter in many medical societies' guidelines .
However, there are a variety of opinions regarding the factors that contribute to QoL. According to a WHO-definition, QoL relates to the physical, psychological and social well-being of an individual as laid out by formal health terms . According to this definition, it is necessary to differentiate between a general and a health related QoL . The former relates to aspects that exist independently from any particular disease (e.g. items such as "being spontaneous", or "feeling exhausted"), whereas the later focuses on particular characteristics of specific diseases (e.g. factors such as "walking distance" or "pain" in rheumatic diseases)
Despite the methodological difficulties involved in making QoL measurable, we have seen the development of numerous instruments for measuring disease specific aspects of QoL [6–8] in the recent past. An advantage of disease specific instruments is precise registration regarding strains and limitations of specific diseases rather than those of diseases in general. In addition, the course of clinical diseases can be more easily registered because of the development of disease-related questionnaires ("course of disease sensitivity" of questionnaires). The majority of current recommendations by health economists and clinical pharmacological associations include suggestions regarding the use of disease specific and general QoL questionnaires .
In Germany, one of the newer instruments attempting to measure general QoL with a distinct focus is the "Herdecke Questionnaire for Quality of Life" (HLQ is the German acronym of the phrase "Herdecke Questionnaire for Quality of Life") [10, 11]. Clinical research projects have been reluctant to employ the HLQ although it was evaluated on a sample of healthy subjects, and that some reference values of clinical studies on different diseases do exist, and also despite of the fact that the HLQ has a very comprehensive understanding of the QoL problematic ,. This is mainly because conclusive validation based on a large sample is still missing. To improve this situation, this study aimed to show the characteristics of the HLQ, to describe its external validation using other test instruments, and to develop a short form of the questionnaire.
Data for this study derive from a model project on the treatment of patients using naturopathy methods in Blankenstein Hospital, Hattingen. To investigate the benefits and limits of naturopathic treatment in the field of in-patient care, the Department of Naturopathy was established as a model at the Blankenstein Hospital in Hattingen and was scientifically evaluated by the Chair of Medical Theory and Complementary Medicine of Witten/Herdecke University. This evaluation began on July 1st 1999 and was completed on March 31th 2003. It focused on the following question: "How does a three-week in-patient treatment with naturopathic methods affect the QoL of the patients, regarding a pre-post-comparison and a follow-up carried out after 6 months? Detailed information concerning this model project and its' scientific evaluation can be found in  and .
In total, 2,461 patients between 16 and 92 years (mean age 58.0 ± 13.4 years) were included in this study. The socio-demographic characteristics of the patients are shown in Table 1.
Alongside the HLQ, other standardized questionnaires were used. These included the MOS-SF-36 Health Survey , Zerssen's Mood-Scale Bf-S , the Giessener Physical Complaints Questionnaire GBB-24  and McGill's Pain Perception Scale SES .
The HLQ as referred to in this study uses 39 five-point likert scales ranging from 0 to 4 (agreement/disagreement or often/never). In contrast to the SF-36, the items are not defined by situations related to daily life and household situations (shopping, career situations, physical activity). As a result, the HLQ is very suitable for registering QoL particularly in monitoring the course of a disease or therapeutic intervention . As an evaluation scheme, Schulte et al.  described 5 scales of the 39 item HLQ, unfortunately without any confirmation by factor analysis of the following areas: Physical Well-being (4 items), Vitality (9 items), Mental behavior (10 items), Presence of Personality (9 items), Social Environment (7 items). All scales are expressed in percentage values from 0 = lowest to 100 = highest QoL.
The main question of this study relates to the re-examination of the HLQ by means of a factor and reliability analysis and the explorative evaluation of the factors. External validation was performed by correlating the HLQ scales with those of the external test instruments: MOS-SF-36 Health Survey , Zerssen's Mood-Scale Bf-S , the Giessener Physical Complaints Questionnaire GBB-24  and McGill's Pain Perception Scale SES .
Factor analysis was performed using principal components analysis with Varimax rotation on 35 of the 39 items. The items, #13 (avoided conflicts), #14 (behavior of others was unclear to me), #15 (was glad) and #29 (reduced sexual activity) were omitted following the positive preliminary results on the reliability of the HLQ by Kroez et al. . To determine the internal consistency of the questionnaire, reliability analysis was performed using Cronbach's alpha. Both factor analysis and reliability analysis were performed for the long and the short version of the HLQ.
For the short form, only relevant items with a factorial weight of >0.6 were selected. This method of selection was originally suggested by Grimley  and has successfully been applied elsewhere [22, 23].. Coefficients of determination (R-square) of short and long form scales were calculated to evaluate the proportion of variance of the original HLQ which can be explained by the short form.
Evaluation of responsiveness of the HLQ over a course of time was achieved by analyzing the change of HLQ-total score from the time of admission to the time of discharge by using a dependent t-test and calculation of Cohen's effect size (ES). Cohen's guidelines were used to classify the magnitude of effect sizes: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.
The statistical data evaluation was performed using the SPSS Version 10.0 program packet.
The descriptive statistics of each item, the reliability parameters and the difficulty index are given in Table 2. Considering the high percentage of patients with chronic rheumatic diseases, an item-difficulty index between 0.26 (Item: "I suffered from physical pain") and 0.73 (Items "Family life was a burden" and "I felt over directed") can be regarded as sufficient. This also holds for item-total correlations with values between 0.27 and 0.69 (median: 0.55) for the original HLQ and between 0.32 and 0.61 (median: 0.46) for the short version (HLQ-S), These correlations are considered to be optimal ranges for psychological test instruments.
The results of the structural analysis of the HLQ-items yielded surprising results. The scales found by factor analysis (Table 2) were only partly congruent with the scalesin the original publication . Instead, we found a new and stable 6-factor-model which fits better with the original data than the original 5-scale model derived by Schulte et al., which used a multitrait analysis approach (developed by Hays et al. ). This is underlined by a Kaiser-Meyer-Olkin measure of sampling adequacy of 0.957 and a highly significant Bartlett test of sphericity (p < 0.001). The cumulative variance explained by this model is 54.7%.
Correlation analysis (Table 3) of the earlier HLQ scale with the new scale revealed significant correlations between the scales "Social Environment (SOC)" and "Social Interaction (SOCI)" (r = 0.923 for the HLQ-L and r = 0.860 for the HLQ-S). Unfortunately, such clear correlation between an old HLQ scale with the unique factor of our current analysis was not found with the other scales. However, "physical well-being (PWB)" of the old HLQ correlated well with the new "motility (MOT)" scale (HLQ-L r = 0.929 resp. HLQ-S r = 0.958), while the old "vitality (VIT)" scale correlates with the "mental balance scale (MB)" (HLQ-L r = 0.840 resp. HLQ-S r = 0.681). The old scales "presence of personality (PERS)" and "mental balance (MEB)" are represented well by the new scale "initiative power and interest (IPI)" (See Table 3).
The internal consistency of the instruments (HLQ-L and HLQ-S), both for the total score (Cronbach's α is 0.935 for the HLQ-L and 0.862 for the HLQ-S) as well as for the subscales of the HLQ-L (Cronbach's α between 0.621 and 0.885) can be regarded as being excellent. The highest alpha reliability in the HLQ-L was obtained for the "Initiative Power and Interest" scale, the lowest for the 2-item scales "Digestive Well-Being" (0.621) and "Physical Complaints" (0.692).
The mean difference between the scales of the HLQ-S and the HLQ-L for all patients is between 1.20 ("Initiative Power and Interest ") and 2.24 points ("Social interaction") on a percentage scale. The absolute differences are clustered in groups and are given in Table 4. Although there is a low overall mean difference, absolute differences greater than 10 percent range between 17.9% ("Initiative Power and Interest") and 26.8% ("Social interaction"). However, with correlation coefficients ranging from 0.899 to 0.964, the proportion of variance of the HLQ-L can be explained by the short form ranges between 79% and 93% and thus can be regarded as an adequate proportion for a short version.
The correlation of the HLQ with other test instruments is shown in Table 5. There are acceptable correlations with r> 0.5 between the mental-health associated scales from the HLQ with those of the other instruments, for example, the "mental health"-Scale of the SF-36. In detail, the scales "Initiative Power and Interest", "Social Interaction" and "Mental Balance" of the HLQ correlate well with "mental health" and the "mental component summary", "Social Functionand "Vitality" of the SF-36 and Zerssens Bf-S Mood-Scale. The "motility" scale of the HLQ correlates with "physical function" and "vitality" of the SF-36, with the "severity of exhaustion" of the Giessener Physical Complaints Questionnaire GBB 24, and somewhat weaker with the "role physical", "bodily pain" and "physical component summary" scales of SF-36 and "limp pain" of the GBB 24. The "physical complaints" subscale of the HLQ correlates well with "bodily pain" of the SF-36 and its "physical component summary" scale, and also with the "affection pain" subscale of McGill's Pain Perception Scales SES. Among the SF-36 scales, the factor "general health" is not represented by the HLQ scales. The factors, "gastric symptoms" and the "heart symptoms" from the GBB 24 scales and "sensory pain" from the SES are not represented by the HLQ.
According to the diagnostic spectrum (Table 5), the values of the scale "Motility (MOT)" and "Physical Complaints (PHY)" show particularly low values in patients suffering from rheumatic diseases. Also, in contrast with other scales of the HLQ, these two appear to have little correlation with age, which indicates a suitable discriminatory power of the HLQ considering age and different types of disease.
The results from the responsiveness analysis are presented in Table 6. We found a high sensitivity of the HLQ-scales to change within the treatment with particularly high significant changes in the mean and calculated effect sizes between 0.39 (Digestive Well Being) and 0.92 (Mental Balance).
The aim of our study was to confirm the structure and consistency of the HLQ. Surprisingly, we found that the original scales presented earlier  were not in accordance with the results of this factor analysis. However, the scales "IPI-Initiative Power and Interest", "SOCI – Social Interaction", "MB – Mental Balance", "MOT – Motility", "PHY-Physical Complaints", "DWB – Digestive Well-Being" show a good reliability and sufficiently differentiate the diagnostic groups, especially between those patients suffering with connective tissue and soft tissue disorders from those with metabolic and nutritional disorders or hypersensitivity reactions.
Although the HLQ sub-scales "Initiative Power and Interest", "Social Interaction" and "Mental Balance" of the HLQ correlate well with the corresponding SF-36 scales and with Zerssens Bf-S Mood-Scale, and thus indicate that these qualities share several interconnections, our findings also showed that the HLQ provides several aspects of health such as "Appetite and Digestive Affections" which are not well covered by existing QoL-measures. Nevertheless, with only two items, the subscale "digestive well-being" has to be strengthened by additional items. This is also true for the scale related to physical complaints and pain. With correlation values of 0.11 (physical total scale of the SF-36) and 0.29 (sensory pain SES), it is quite obvious that this scale is deficient and needs an upgrade in respect to quality and number of items.
As, according to  internal consistency reliability is a poor predictor of responsiveness, we measured the responsiveness of the HLQ directly using Cohen's effect size. Together with the highly significant results of the t-test statistics and being aware of the methodological limitations which are immanent in obtaining results on a questionnaires responsiveness by means of effect sizes , we can nevertheless conclude that the HLQ shows sufficient responsiveness for the use in a clinical setting.
In our opinion, the HLQ is more sensitive to health changes brought about by Complementary Therapies including anthroposophic medicine or homeopathy. This does not mean that the HLQ is only suitable for such therapies. Although, there is a trend to consider QoL-questionnaires being specific for special complementary therapies such as mistletoe treatment in cancer patients , we do not favor such labels, as this might result in an inflation of "new" QoL-measures for each new therapeutic situation .
QoL is a multidimensional construct composed of functional, physical, emotional, social and spiritual well-being [29, 30] with, several interconnections between distinct constructs of well-being. The HLQ scales "Social Interaction", "Mental Balance", "Motility", and "Physical Complaints" share similarities with the these constructs, but highlights two further significant topics, i.e. "Initiative Power and Interest" and "Digestive Well-Being". The highly relevant topic of spirituality and illness is addressed in another instrument developed by our group, the SpREUK questionnaire, with its sub-scales "Search for meaningful support", "Positive interpretation of disease", "Trust in external guidance", "Support through spirituality/religiosity" [22, 31, 32].
Our evaluation indicates an adequate representation of aspects like "mental well-being" and "depression" which are essential in defining QoL, and shows special features of the HLQ that highlights its' uniqueness in the group of generic QoL-measures. Particularly in clinical studies in which, because of feasibility or patient compliance the use of huge psychometric test batteries is inappropriate, the HLQ now serves as a economic test-instrument. To conclude, we can state that this study presents necessary foundations and developments for existing and future studies that wish to use the HLQ as a reliable and valid instrument.
Spitzer WO: State of science 1986: Quality of life and functional status as target variables for research. Journal of Chronic Diseases 1987, 40: 465–471. 10.1016/0021-9681(87)90002-6
Frost MH, Sloan JA: Quality of life measurements: a soft outcome – or is it? Am J Manag Care 2002, 8: 574–9.
Lorenz W, Ollenschläger G, Geraedts M, Gerlach FM, Gandjour A, Helou A, Kirchner H, Koller M, Lauterbach K, Reinauer H, Sitter H, Thomeczek C: Das Leitlinien-Manual. Entwicklung und Implementierung von Leitlinien in der Medizin. ZaeFQ 2001,95(Suppl I):1–84.
Bradley C, Gamsu DS: Guidelines for encouraging psychological well-being: Report of a working group of the World Health Organisation Regional Office for Europe and International Diabetes Federation European Region St Vincent Declaration Action Programme for Diabetes. Diabetic Medicine 1994, 11: 510–516.
Patrick DL, Deyo RA: Generic and disease-specific measures in assessing health status and quality of life. Medical Care 1989, 27: 217–232.
Gold S, Heesen C, Schulz H, Guder U, Mönch A, Gbadamosi J: Disease specific quality of life instruments in multiple sclerosis: Validation of the Hamburg Quality of Life Questionnaire in Multiple Sclerosis (HAQUAMS). Multiple Sclerosis 2001, 7: 119–130. 10.1191/135245801678227649
Finlay AY, Khan GK: Dermatology Life Quality Index (DLQI) – A simple practical measure for routine clinical use. Clinical and Experimental Derm 1994, 19: 210–16.
Colwell HH, Mathias SD, Pasta DJ, Henning JM, Hunt RH: Development of a health-related quality-of-life questionnaire for individuals with gastroesophageal reflux disease: a validation study. Dig Dis Sci 1999, 44: 1376–83. 10.1023/A:1026647701477
Hannoveraner Konsens: Empfehlungen zur gesundheitsökonomischen Evaluation. Zeitschrift für Allgemeinmedizin 1996, 72: 485–490.
Schulte M, Kümmell HC: Entwicklung eines Fragebogens zur Lebensqualität auf der Grundlage des anthroposophischen Menschenbildes. Der Merkurstab 1996, 49: 109–122.
Schulte M, Heckmann C, Kümmell HC: Der Herdecker Fragebogen zur Lebensqualität (HLQ): Entwicklung, Ergebnisse, Verwendungsmöglichkeiten. In Akademische Forschung in der Anthroposophischen Medizin. Edited by: Heusser P. Bern: Peter Lang Verlag; 2000:249–253.
Doerfler W, Wisniowski C, Gerhard I: Behandlung des klimakterischen Symptomenkomplexes mit der Traditionellen Chinesischen Medizin (TCM). 169. Tagung der Mittelrheinischen Gesellschaft für Geburtshilfe und Gynaekologie, Homburg/Saar, München: Alete Wissenschaftl. Dienst 1997, 153.
Beer AM, Ostermann T, Matthiessen PF: Evaluation of in-hospital treatment with naturopathic methods – the Blankenstein model. Part I: Patients and therapeutic concepts. Forsch Komplementarmed Klass Naturheilkd 2001, 8: 6–13. 10.1159/000057189
Ostermann T, Beer AM, Matthiessen PF: Evaluation of inpatient naturopathic treatment – the Blankenstein model. Part II: Effective strength and health status of patients over the course of time. Forsch Komplementarmed Klass Naturheilkd 2002, 9: 269–76. 10.1159/000067520
Bullinger M, Kirchberger I: SF-36 Fragebogen zum Gesundheitszustand, Handanweisung. Göttingen: Hogrefe Verlag für Psychologie; 1998.
Zerssen D: Die Befindlichkeitsskala. Beltz-Test GmbH; 1976.
Brähler E, Scheer JW: Der Gießener Beschwerdebogen (GBB). Bern: Verlag Hans Huber; 1995.
Geissner E: Die Schmerzempfindungsskala SES – Ein differenziertes und veränderungssensitives Verfahren zur Erfassung chronischer und akuter Schmerzen. Rehabilitation 1995, 34: XXXV-XLIII.
Beer AM, Ostermann T, Matthiessen PF: Changes in quality of life during acute inpatient naturopathic treatment – results of the Blankenstein model. Gesundheitswesen 2001, 63: 242–247. 10.1055/s-2001-12917
Kroz M, von Laue HB, Zerm R, Girke M: Development of a questionnaire for endogenous regulation – a contribution for salutogenesis research. Forsch Komplementarmed Klass Naturheilkd 2003, 10: 70–77. 10.1159/000071665
Grimley DM, Bellis JM, Prochaska JO, Riley GE: Assessing the stages of change and decision making for contraceptive use for the prevention of pregnancy, sexually transmitted diseases, and acquired immunodeficiency syndrome. Health Educ Q 1993, 20: 455–470.
Büssing A, Ostermann T, Matthiessen PF: Role of Religion and Spirituality in Medical patients – Confirmatory results with the SpREUK questionnaire. Health Quality Life Outcomes 2005, 3: 10. 10.1186/1477-7525-3-10
Schumacher J, Leppert K, Gunzelmann T, Strauss B, Braehler E: Die Resilienzskala – Ein Fragebogen zur Erfassung der psychischen Widerstandsfähigkeit als Personmerkmal. Z f Klinische Psychologie, Psychiatrie und Psychotherapie 2004, 53: 16–39.
Hays RD, Hayashi T: Beyond internal consistency reliability: rationale and user's guide for the Multitrait Analysis Program (MAP) on the microcomputer. Behav Res Methods Instrum Comput 1990, 22: 167–175.
Puhan MA, Bryant D, Guyatt GH, Heels-Ansdell D, Schunemann HJ: Internal consistency reliability is a poor predictor of responsiveness. Health and Quality of Life Outcomes 2005, 3: 33. 10.1186/1477-7525-3-33
Middel B, van Sonderen E: Statistical significant change versus relevant or important change in (quasi) experimental design: Some conceptual and methodological problems in estimating magnitude of intervention-related change in health services research. International Journal of Integrated Care 2002, 2: 1–22.
Kirchberger I, Wetzel D, Finger T: Development and validation of an instrument to measure the effects of a mistletoe preparation on quality of life of cancer patients: the Life Quality Lectin-53 (LQL-53) Questionnaire. Qual Life Res 2004, 13: 463–479. 10.1023/B:QURE.0000018481.33943.97
Ostermann T: Fragen zur Mistel. Journal Club. Forsch Komplementärmed 2004, 11: 361–62.
Peterman AH, Fitchett G, Brady MJ, Hernandez L, Cella D: Measuring spiritual well-being in people with cancer: the functional assessment of chronic illness therapy – Spiritual Well-being Scale (FACIT-Sp). Ann Behav Med 2002, 24: 49–58. 10.1207/S15324796ABM2401_06
Brady MJ, Peterman AH, Fitchett G, Mo M, Cella D: A case for including spirituality in quality of life measurement in oncology. Psychooncology 1999, 8: 417–428. 10.1002/(SICI)1099-1611(199909/10)8:5<417::AID-PON398>3.3.CO;2-W
Ostermann T, Bussing A, Matthiessen PF: Pilot study for the development of a questionnaire for the measuring of the patients' attitude towards spirituality and religiosity and their coping with disease (SpREUK). Forsch Komplementarmed Klass Naturheilkd 2004, 11: 346–353. 10.1159/000082816
Büssing A, Ostermann T, Matthiessen PF: Search for meaningful support and the meaning of illness in German cancer patients. Anticancer Res 2005, 25: 1449–55.
TO performed the statistical analysis, was responsible for the methodological setting of the study and has written most parts of the manuscript, AB has written some parts of manuscript, AMB is director of the study centre and was the clinical supervisor, PFM conceived and designed the study. All authors have read and approved the final manuscript.
About this article
Cite this article
Ostermann, T., Büssing, A., Beer, AM. et al. The Herdecke questionnaire on quality of life (HLQ): Validation of factorial structure and development of a short form within a naturopathy treated in-patient collective. Health Qual Life Outcomes 3, 40 (2005). https://doi.org/10.1186/1477-7525-3-40