- Short report
- Open Access
Reliability and validity of the SF-36 Health Survey Questionnaire in patients with brain tumors: a cross-sectional study
Health and Quality of Life Outcomesvolume 15, Article number: 92 (2017)
Deterioration of health related quality of life (HRQoL) is common in brain tumor patients. This study evaluated validity and reliability of the Medical Outcomes Study 36-Item Short Form (SF-36) in patients with brain tumors.
Two hundred and seventy-seven patients admitted for brain tumor surgery were evaluated for HRQoL (SF-36 questionnaire); depressive symptoms (Beck Depression Inventory-II or BDI-II); and functional status (Barthel index or BI). Final histological diagnosis was obtained from pathology reports.
Two-hundred and twenty-seven (completion rate of 82%) patients (69% women; mean age 55.8 ± 14.4 years) completed the SF-36 questionnaire. The most common brain tumor diagnosis was meningioma (40%), followed high-grade glioma (19%). Missing data rates were ≤4%. Internal consistency was adequate for all (Cronbach α ≥ .728) but Social Functioning (Cronbach α = .527) and General Health (Cronbach α = .693) subscales. Ceiling (≥36%) and floor (≥22%) effect rates were the greatest for the Role Limitations subscales. The SF-36 subscales pertaining physical health correlated the strongest with the BI score, while the SF-36 subscales pertaining emotional health correlated the strongest with the BDI-II score. Patients with mild-moderate depressive symptoms (BDI-II score ≥20) scored lower across all SF-36 subscales, and handicap patients (BI score <90) scored the lower across all, but Mental Health, subscales.
The SF-36 is a valid and reliable instrument in brain tumor patients and therefore can be reliably applied for evaluation of HRQoL in neuro-oncology setting. Further studies exploring other psychometric properties of the SF-36 in brain tumor patients across disease progression stages are warranted.
Health-related quality of life (HRQoL) is commonly used as an outcome measure [1,2,3] and has prognostic significance in brain tumor patients. For example, improved HRQoL is associated with longer survival of high-grade glioma patients  while poor HRQoL predicts shorter survival of low-grade glioma patients .
Numerous multi-item questionnaires are used for evaluation of HRQoL in patients with brain tumors . The Medical Outcomes Study 36-Item Short Form (SF-36) health survey comprehensively evaluates patient perceived health status across broad physical and emotional health domains . It is the most commonly used generic HRQoL assessment tool worldwide, including patients with cancer [7, 8]. Due to extensive use and experience across patients and general population, the generic SF-36 questionnaire also allows to compare patient perceived health status across a variety of disorders and with the general population. Adequate psychometric properties of the SF-36 were documented in cancer patients  and in cancer survivors . However, there are no studies specifically evaluating reliability and validity of the SF-36 questionnaire in patients with established brain tumor diagnosis.
The goal of the study was to evaluate reliability and validity of the SF-36 questionnaire in patients diagnosed with brain tumors.
Consecutive adult patients admitted for surgery of primary and metastatic brain tumors at the Department of Neurosurgery of Hospital of Lithuanian University of Health Sciences, Kaunas, Lithuania, in a period from March 2010 until June 2011 were invited in the study. Patients who were unable to speak Lithuanian, or with severe cognitive or neurologic deficits were excluded.
The study was approved by the Ethics Committee for Biomedical Research at the Lithuanian University of Health Sciences, Kaunas, Lithuania. All patients gave signed informed consent.
All study patients were approached within 3 days before elective brain tumor surgery and were evaluated for sociodemographic characteristics (age, gender, highest educational degree and marital status) and previous brain tumor treatments. During the same visit patients were evaluated for HRQoL (SF-36 questionnaire ) and depressive symptom severity (Beck Depression Inventory-II (BDI-II) ) Patients were asked to complete the SF-36 and BDI-II questionnaires themselves and were given an opportunity to ask questions if any. Functional status was evaluated by the study investigator during the same visit using the Barthel index (BI) . The BDI-II and BI were selected to determine construct validity of the SF-36.
Brain tumor diagnoses were obtained from the final pathology reports. Patients diagnosed with high-grade glioma and metastatic tumor were categorized as having malignant brain tumor.
The SF-36 questionnaire includes eight multiple-item subscales that evaluate physical function, social functioning, role limitations due to physical problems, role limitations due to emotional problems, mental health, vitality, pain, and general health perception . Total score on each SF-36 subscale ranges between 0 and 100. Greater score indicates better HRQoL.
BDI-II has 21 items designed to evaluate depressive symptom severity during the previous 2 weeks . Total BDI-II score ranges between 0 and 63 with greater score indicating more severe depressive symptoms. Patients were dichotomized as having minimal (BDI-II score <20) or moderate to severe (BDI-II score ≥20) depressive symptoms. Scores on the BDI-II items 12 (interest in other people) and 17 (fatigue) were selected to evaluate convergent validity of the SF-36 subscales of Social Functioning and Vitality, respectively. Examination of individual BDI-II items is recommended for more detailed assessment of patients’ clinical presentation . Adequate construct validity and psychometric properties of the Lithuanian translation of the BDI-II were previously documented in patients with brain tumors .
Functional status was measured with the BI . Total BI score ranges from 0 to 100, with lower score indicating greater functional impairment. Patients were dichotomized as functionally dependent (BI score <90) or as functionally independent (BI score ≥90). The BI is a commonly used instrument for evaluation of functional status in hospital setting in Lithuania .
Data were analyzed using the PASW for Windows (IBM Corporation, Chicago, Illinois). Two-tailed probability values of <0.05 were considered as statistically significant.
Age, gender, brain tumor grade, and proportion of functionally dependent and moderately-to-severely depressed patients were compared in responders vs. non-responders using Pearson X 2 test and Mann-Whitney test. Internal consistency of the SF-36 subscales was evaluated using the Cronbach’s coefficient α. Internal consistency was considered adequate if Cronbach’s coefficient α values were >0.70 [15, 16]. Floor and ceiling effects for each SF-36 subscale were defined as the proportion of patients who achieved the lowest or the highest possible score, respectively, on each subscale. Floor and ceiling effect was considered present if at least 15% of respondents reached the lowest or the highest possible score, respectively .
Convergent validity was measured by calculating the Spearman correlation coefficient of all SF-36 subscale scores with scores on the BI, SF-36 Physical and Mental component summary, BDI-II total, and BDI-II items 12 (interest in other people) and 17 (fatigue). It was expected that scores on the SF-36 subscales pertaining to mental health would correlate the strongest with the SF-36 Mental component summary score and BDI-II score, while sores on the SF-36 subscales pertaining to physical health would correlate the strongest with the SF-36 Physical component summary score and BI score. Furthermore, strong correlation was expected between the SF-36 Social Functioning and Vitality scores with scores on the BDI-II items 12 (interest in other people) and 17 (fatigue), respectively. Construct validity of the SF-36 was evaluated by comparing SF-36 scores as a function of tumor grade (malignant brain tumor vs. not-malignant tumor), patient age (≥50 years vs. <50 years), depressive symptom severity (moderate-severe vs. minimal-mild depressive symptoms) and functional status (functionally dependent vs. independent) using Mann-Whitney test. It was hypothesized that greater age, diagnosis of malignant brain tumor, moderate to severe depressive symptom severity and functional dependence would be associated with lower physical and mental component scores of the SF-36.
Two hundred and seventy-seven patients were administered the battery of questionnaires and 227 (response rate of 82%) patients completed the SF-36 questionnaire (Table 1). The proportion of patients with moderate-severe depressive symptoms was lower in responders vs. non-responders (0% vs. 19%, p = 0.01). There were no statically significant differences between responders and non-responders in age (p = 0.334), gender (p = 0.243) brain tumor grade (p = 0.928) and BI score (p = 0.271). The majority of the study patients were women (69%). Mean age of patients was 55.8 ± 14.4 years. The BDI-II was completed by 203 (89%) patients and BI was administered to 213 (94%) patients. The most common brain tumor diagnosis was meningioma (40%), followed high-grade glioma (19%).
Mean scores on the SF-36 subscales are presented in Table 2. Missing data rates were low (≤4%) for all subscales. The lowest SF-36 scores were for the General Health and Role Limitations Due To Physical Problems subscales while the highest scores were for the Social Functioning and Physical Functioning subscales. Cronbach’s coefficients α were greater than .70 for all but General Health (.693) and Social Functioning (.527), subscales. Ceiling effect was present for the SF-36 subscales of Role Limitations Due To Emotional Problems (51%), Role Limitations Due To Physical Problems (36%), Social Functioning (27%) and Pain (23%) subscales. Floor effect was present for the SF-36 subscales of Role Limitations Due To Physical Problems (26%) and Role Limitations Due To Emotional Problems (22%). As expected, BI and SF-36 Physical component summary scores correlated stronger with the SF-36 subscales pertaining to physical health (Spearman rho values range from .269 to .487 and from .542 to .829, respectively) relative to the SF-36 subscales pertaining to emotional health (Spearman rho values range from .235 to .311 and from .539 to .643, respectively) (Table 3). Alternatively, the SF-36 Mental component summary score correlated stronger with the SF-36 subscales pertaining to emotional health (Spearman rho values range from .746 to .820) than with the SF-36 subscales pertaining to physical health (Spearman rho values range from .486 to .682). The BDI-II total scores correlated the strongest with the SF-36 subscales of Vitality (rho = −.602) and Mental Health (rho = −.601). BDI-II items evaluating social interactions and fatigue correlated the strongest with the SF-36 Mental Health subscale followed by the SF-36 Social Functioning (rho = −.463) and Vitality (−.440) subscales, respectively. Known-group comparison analyses demonstrated the expected findings, i.e., more advanced age, diagnosis of malignant brain tumor, moderate to severe depressive symptom severity and functional dependence were associated with lower SF-36 scores, indicating adequate discriminative abilities of the SF-36 (Fig. 1). Malignant brain tumor patients scored significantly lower on the SF-36 Physical Functioning (p = 0.001), Social Functioning (p = 0.042) and Role Limitations Due To Physical Functioning (p = 0.002) and Emotional Functioning (p = 0.034) subscales relative to patients with not malignant brain tumors. Patients aged ≥50 years scored significantly lower on the SF-36 Physical Functioning (p < 0.001), General Health (p = 0.001), Vitality (p = 0.002) and Mental Health (p = 0.04) subscales relative to patients < 50 years old. Mild to moderate depressive symptoms were associated with lower scores on all SF-36 subscales (all p-values ≤0.006) and functional dependence was associated with lower scores on all (all p-values <0.004) but Mental Health subscale of the SF-36.
The study findings indicate that the SF-36 is a reliable and valid instrument for assessment of HRQoL in patients with brain tumors.
The proportion of missing data was low (≤4%) for all SF-36 subscales corresponding to previous studies in healthy participants and in patients with somatic disorders. For example, missing data rates were comparable in the original Medical Outcomes Study from the United States (range from 1.1 to 5.9%)  and in a large sample (n = 6822) of Dutch healthy participants and patients with migraine and various cancers (range from 1 to 6.6%) . Ceiling and floor effects were present for the Role Limitations and Emotional Functioning subscales indicating low sensitivity of the SF-36 to monitor subtle variations of functional status and emotional functioning in brain tumor patients. High ceiling effect rate can be explained by dichotomous answers to the SF-36 questionnaire items on the respective subscales. Higher ceiling effect (76%) of the SF-36 Physical and Emotional Role Limitations subscales were previously reported in childhood cancer survivors . Ceiling effect observed for the SF-36 Social Functioning and Bodily Pain subscales indicates that social functioning impairment and pain are not severe symptoms in brain tumor patients. Required threshold rate for floor and ceiling effect was not reached for other SF-36 subscales  suggesting adequate sensitivity of the SF-36 for evaluating perceived health status of brain tumor patients.
The SF-36 subscales had adequate internal consistency satisfying the recommended threshold value of .70 , with the exception of the General Health and Social Functioning subscale. Low Cronbach’s α values of the SF-36 Social Functioning subscale was previously reported in Spanish patients with coronary artery disease (Cronbach’s α = .45)  and in Dutch cancer patients (Cronbach’s α = .50) . Therefore, other instruments can be considered for assessment of social functioning in hospitalized brain tumor patients.
As expected, scores on the SF-36 subscales pertaining to perceived physical and emotional health status demonstrated good agreement with other measures of physical and mental health status, respectively, indicating adequate criterion validity of the SF-36 questionnaire. These findings correspond to previous studies in patients with neurological disorders [20, 21] and non-CNS cancers [9, 10]. Our findings suggest that other instruments assessing general health status and social functioning in in-hospital population of cancer patients can be considered.
The SF-36 demonstrated good discriminate abilities. Patients with malignant brain tumors had worse physical and social functioning and greater role limitations relative to patients with not malignant tumors. Due to progressive and incurable nature, malignant brain tumors are associated with progressive disability  and worsening of mental health status . Age of 50 years or greater was associated with worse physical functioning, general health and mental health and with greater fatigue symptom severity, echoing with previous studies demonstrating detrimental effect of age on HRQoL in patients with brain tumors [23, 24]. Moderate-severe depressive symptoms were associated with worse HRQoL across all SF-36 dimensions. Mood symptoms and disorders are common [25,26,27] and were document as independent predictors of worse HRQoL of brain tumor patients [3, 26]. Functional dependence was associated with worse HRQoL in all but mental health domains suggesting that functional status should be considered important for HRQoL of brain tumor patients. The observed statically significant differences of the SF-36 subscale scores between patient subgroups were also clinically relevant because they exceed a 5 point threshold that was suggested by Ware  and is commonly used across patient populations . However, others reported that minimal clinically important thresholds are different across diseases and SF-36 subscales . For example, a study in vestibular schwannoma patients found that the minimal clinically important difference of the SF-36 was 7 points for the Mental Health Component Summary score and 8 points for the Physical Health Component Summary score . Further studies should clarify clinically important thresholds of the SF-36 subscales across brain tumor patients.
Numerous generic and brain tumor specific questionnaires are used for evaluation of HRQoL in patients with brain tumors . Generic HRQoL instruments can allow to evaluate broader domains of impairment imposed by a brain tumor and allow to compare HRQoL across disorders and with the general population. For example, scores on all SF-36 subscales in our cohort of brain tumor patients was lower when comparing to previously published SF-36 scores in the general population , indicating that brain tumor is associated with significant impairment of perceived physical and mental health status. On the other hand, generic questionnaires do not collect information on all the areas of well-being and functioning that may be important to patients with brain neoplasms, meaning they may not be sensitive enough to assess changes. The Brain cancer-specific Quality of Life Questionnaire (QLQ-BN20) of the European Organization for Research and Treatment of Cancer (EORTC) was specifically developed to evaluate HRQoL in patients with brain neoplasms  and demonstrated adequate cross-cultural psychometric properties. Brevity and disease specificity are the major advantages of the EORTC-BN20 scale. The decision to choose HRQoL instrument should be selected based on clinical and research needs.
Heterogeneous sample in terms of brain tumor histological diagnosis prevented from evaluating psychometric properties of the SF-36 questionnaire in more homogenous group of patients. Results should be generalized with caution to patients at different disease progression and to patients receiving adjuvant therapies (i.e., radiotherapy or chemotherapy) because only patients admitted for brain tumor surgery and not receiving adjuvant therapies were studied. In addition, this setting of recruitment may include a selection bias, because patients hospitalized just before surgery may be stressed and have more complaints, thereby leading to over-representation of severe symptoms. Studies addressing psychometric properties of the SF-36 questionnaire at different stages of disease progress are encouraged.
This is the first study demonstrating that the SF-36 questionnaire has adequate reliability and validity in patients with brain tumors. The SF-36 questionnaire can be considered for assessment of HRQoL in neuro-oncology setting. Other psychometric properties of the SF-36 questionnaire, including Reliability (test-retest reliability, measurement error), structural validity, responsiveness (smallest detectable change) and interpretability (minimal important change) remain to be addressed in brain tumor patients.
Beck Depression Inventory-II
Health related quality of life
The medical outcomes study 36-item short form
Taphoorn MJ, Sizoo EM, Bottomley A. Review on quality of life issues in patients with primary brain tumors. Oncologist. 2010;15:618–26.
Outcomes of cancer treatment for technology assessment and cancer treatment guidelines. American Society of Clinical Oncology. J Clin Oncol. 1996, 14:671–679.
Bunevicius A, Tamasauskas S, Deltuva V, Tamasauskas A, Radziunas A, Bunevicius R. Predictors of health-related quality of life in neurosurgical brain tumor patients: focus on patient-centered perspective. Acta Neurochir. 2014;156:367–74.
Bosma I, Reijneveld JC, Douw L, Vos MJ, Postma TJ, Aaronson NK, Muller M, Vandertop WP, Slotman BJ, Taphoorn MJ, et al. Health-related quality of life of long-term high-grade glioma survivors. Neuro Oncol. 2009;11:51–8.
Mainio A, Tuunanen S, Hakko H, Niemela A, Koivukangas J, Rasanen P. Decreased quality of life and depression as predictors for shorter survival among patients with low-grade gliomas: a follow-up from 1990 to 2003. Eur Arch Psychiatry Clin Neurosci. 2006;256:516–21.
Cheng JX, Zhang X, Liu BL. Health-related quality of life in patients with high-grade glioma. Neuro Oncol. 2009;11:41–50.
Ware Jr JE, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30:473–83.
Ware Jr JE. SF-36 health survey update. Spine. 1976;25:3130–9.
Treanor C, Donnelly M. A methodological review of the Short Form Health Survey 36 (SF-36) and its derivatives among breast cancer survivors. Qual Life Res. 2015;24:339–62.
Reulen RC, Zeegers MP, Jenkinson C, Lancashire ER, Winter DL, Jenney ME, Hawkins MM. The use of the SF-36 questionnaire in adult survivors of childhood cancer: evaluation of data quality, score reliability, and scaling assumptions. Health Qual Life Outcomes. 2006;4:77.
Beck AT, Steer RA, Brown GK. BG: Manual for the Beck Depression Inventory (BDI-II). 2nd ed. San Antonio: The Psychological Association; 1996.
Mahoney FI, Barthel DW. Functional evaluation: the Barthel Index. Md State Med. 1965:61–5
Brantley PJ, Dutton Gareth R, Wood Karen B. The Beck Depression Inventory - II (BDI-II) and the Beck Depression Inventory-Primary Care (BDI-PC). In: Maruish ME, editor. The use of psychological testing for treatment planning and outcome assessment. 3rd ed. Mahwah: Lawrence Erlbaum Associates; 1999. p. 313–25.
Bunevicius A, Deltuva V, Tamasauskas S, Tamasauskas A, Bunevicius R. Screening for psychological distress in neurosurgical brain tumor patients using the Patient Health Questionnaire-2. Psychooncology. 2013;22:1895–1900.
Nunnaly J, Bernstein I. Psychometric Theory. New York: McGraw-Hill Book Co; 1994.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.
McHorney CA, Ware Jr JE, Lu JF, Sherbourne CD. The MOS 36-item Short-Form Health Survey (SF-36): III. Tests of data quality, scaling assumptions, and reliability across diverse patient groups. Med Care. 1994;32:40–66.
Aaronson NK, Muller M, Cohen PD, Essink-Bot ML, Fekkes M, Sanderman R, Sprangers MA, te Velde A, Verrips E. Translation, validation, and norming of the Dutch language version of the SF-36 Health Survey in community and chronic disease populations. J Clin Epidemiol. 1998;51:1055–68.
Alonso J, Prieto L, Anto JM. [The Spanish version of the SF-36 Health Survey (the SF-36 health questionnaire): an instrument for measuring clinical results]. Med Clin. 1995;104:771–6.
Anderson C, Laubscher S, Burns R. Validation of the Short Form 36 (SF-36) health survey questionnaire among stroke patients. Stroke. 1996;27:1812–6.
Krokavcova M, van Dijk JP, Nagyova I, Rosenberger J, Gavelova M, Gdovinova Z, Groothoff JW. Perceived health status as measured by the SF-36 in patients with multiple sclerosis: a review. Scand J Caring Sci. 2009;23:529–38.
Osoba D, Brada M, Prados MD, Yung WK. Effect of disease burden on health-related quality of life in patients with malignant gliomas. Neuro Oncol. 2000;2:221–8.
Boele FW, Rooney AG, Grant R, Klein M. Psychiatric symptoms in glioma patients: from diagnosis to management. Neuropsychiatr Dis Treat. 2015;11:1413–20.
Cheng JX, Liu BL, Zhang X, Lin W, Zhang YQ, Liu WP, Zhang JN, Lin H, Wang R, Yin H. Health-related quality of life in glioma patients in China. BMC Cancer. 2010;10:1471–2407.
Rooney AG, McNamara S, Mackinnon M, Fraser M, Rampling R, Carson A, Grant R. Frequency, clinical associations, and longitudinal course of major depressive disorder in adults with cerebral glioma. J Clin Oncol. 2011;29:4307–12.
Pelletier G, Verhoef MJ, Khatri N, Hagen N. Quality of life in brain tumor patients: the relative contributions of depression, fatigue, emotional distress, and existential issues. J Neurooncol. 2002;57:41–9.
Bunevicius A, Deltuva V, Tamasauskas S, Tamasauskas A, Laws Jr ER, Bunevicius R. Low triiodothyronine syndrome as a predictor of poor outcomes in patients undergoing brain tumor surgery: a pilot study: clinical article. J Neurosurg. 2013;118:1279–87.
Ware JE, Snow KK, Kosinski M, Gandek B. SF-36 health survey: manual and interpretation guide. Boston: New England Medical Center, the Health Institute; 1993.
Hammerlid E, Taft C. Health-related quality of life in long-term head and neck cancer survivors: a comparison with general population norms. Br J Cancer. 2001;84:149–56.
Wyrwich KW, Tierney WM, Babu AN, Kroenke K, Wolinsky FD. A comparison of clinically important differences in health-related quality of life for patients with chronic lung disease, asthma, or heart disease. Health Serv Res. 2005;40:577–91.
Carlson ML, Tveiten OV, Yost KJ, Lohse CM, Lund-Johansen M, Link MJ. The Minimal Clinically Important Difference in Vestibular Schwannoma Quality-of-Life Assessment: An Important Step beyond P < .05. Otolaryngol Head Neck Surg. 2015;153:202–8.
Leplege A, Ecosse E, Verdier A, Perneger TV. The French SF-36 Health Survey: translation, cultural adaptation and preliminary psychometric evaluation. J Clin Epidemiol. 1998;51:1013–23.
Osoba D, Aaronson NK, Muller M, Sneeuw K, Hsu MA, Yung WK, Brada M, Newlands E. The development and psychometric validation of a brain cancer quality-of-life questionnaire for use in combination with general cancer-specific questionnaires. Qual Life Res. 1996;5:139–50.
I would like to thank Prof. Vytenis Deltuva and Dr. Sarunas Tamasauskas for administrative support and assistance with data collection.
This study was funded by the Research Council of Lithuania (grant number: MIP-044/2015).
Availability of data and materials
Data is available upon request.
AB prepared the study protocol, participated in data collection and drafted the manuscript.
The author declares that he/she has no competing interests.
Consent for publication
Ethics approval and consent to participate
The study was approved by the Ethics committee. All patients gave signed consent to participate.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.