Viability of a MSQOL-54 general health-related quality of life score using bifactor model
Health and Quality of Life Outcomes volume 19, Article number: 224 (2021)
MSQOL-54 is a multidimensional, widely-used, health-related quality of life (HRQOL) instrument specific for multiple sclerosis (MS). Findings from the validation study suggested that the two MSQOL-54 composite scores are correlated. Given this correlation, it could be assumed that a unique total score of HRQOL may be calculated, with the advantage to provide key stakeholders with a single overall HRQOL score. We aimed to assess how well the bifactor model could account for the MSQOL-54 structure, in order to verify whether a total HRQOL score can be calculated.
A large international database (3669 MS patients) was used. By means of confirmatory factor analysis, we estimated a bifactor model in which every item loads onto both a general factor and a group factor. Fit of the bifactor model was compared to that of single and two second-order factor models by means of Akaike information and Bayesian information criteria reduction. Reliability of the total and subscale scores was evaluated with Mc Donald’s coefficients (omega, and omega hierarchical).
The bifactor model outperformed the two second-order factor models in all the statistics. All items loaded satisfactorily (≥ 0.40) on the general HRQOL factor, except the sexual function items. Omega coefficients for total score were very satisfactory (0.98 and 0.87). Omega hierarchical for subscales ranged between 0.22 to 0.57, except for the sexual function (0.70).
The bifactor model is particularly useful when it is intended to acknowledge multidimensionality and at the same time take account of a single general construct, as the HRQOL related to MS. The total raw score can be used as an estimate of the general HRQOL latent score.
Over the last two decades, health related quality of life (HRQOL) measures have been increasingly included into research studies of neurodegenerative disorders, including multiple sclerosis (MS) [1,2,3]. Importantly, HRQOL instruments can disclose aspects of disease which are not considered by standard clinical tools, and that would otherwise go unrecognized. In addition, HRQOL instruments can help clinicians appreciate patient priorities particularly in terms of treatment goals, facilitate physician–patient communication, and promote shared decision making .
The Multiple Sclerosis Quality of Life-54 items (MSQOL-54) inventory was designed to address the need for HRQOL measures to be used in quality of care and clinical effectiveness research. Thus, the MSQOL-54 comprehensively assesses the HRQOL of patients with MS, an unpredictable chronic neurological disorder which affects 2.8 million people worldwide [5, 6].
Compared to other instruments, its main strength is that it combines a generic- and a disease-targeted approach. In fact, the MSQOL-54 is a multidimensional, MS-specific HRQOL instrument, based on the generic SF-36  supplemented with 18 MS-specific items . This approach allows to compare HRQOL in MS with that in other diseases and with the general population using the generic score, in addition to allowing a sensitive measure for within-disease comparisons.
It consists of 52 items combined in 12 subscales, and two single items. Two composite scores (Mental Health Composite, MHC, and Physical Health Composite, PHC) are determined by aggregating scores of the pertinent subscales . Psychometric properties like construct and content reliability, discrimination [9,10,11], and responsiveness  have been rigorously documented. It was developed in US English, and clinically validated in various languages [9,10,11, 13,14,15,16], including Italian . Despite these key advantages, MS patients were not involved in its development .
In the validation work of the MSQOL-54, Vickrey et al.  reported a quite high correlation (r = 0.66) between the two composite scores. Given this correlation, it could be hypothesized that a unique total score of HRQOL may be calculated, with the benefit to provide patients, clinicians and researchers with a single overall HRQOL assessment, to assess for example, treatment response or modify treatment plan. In this very context, applying a bifactor model to the MSQOL-54 items could be particularly useful, as it is intended to acknowledge multidimensionality and, at the same time, take account of a single general construct , as the HRQOL related to MS is. The bifactor model may constitute an alternative to the more widely-used second-order models, or correlated-traits . By definition, the bifactor model is employed so that each item loads on a general factor and only one group factor, and the general and group factors are all uncorrelated to each other . For each single item, the general factor captures what the item shares with all the other items and the group factor reflects what the item shares with the other items belonging to the same subscale, once the influence of the general factor has been removed. That is, all the covariation between items and all the covariation between subscale scores is captured by the general factor that is a broad latent dimension made of all the subscale contents. Bifactor modeling is generally used to test multifaceted constructs , and so far, has been used mainly in the area of intelligence research [19, 20], and in the study of personality [21, 22]. However, this has rarely been applied in neurology and MS research, except for a few studies [23,24,25].
In the present study, our primary aim was to apply the bifactor model to the MSQOL-54 items in order to verify whether a total HRQOL score could be calculated. Second, if the bifactor model fitted the data well, we aimed to evaluate the measurement invariance of MSQOL-54 items across age and gender.
To perform the present secondary analysis, we used data drawn from different datasets collected utilizing the MSQOL-54 within ongoing or completed projects conducted in Italy and Australia .
We obtained the data collected with the English version from the ‘HOLISM study’, an observational international study, whose methods and results have been reported elsewhere [27, 28]. Briefly, participants from Europe, Australasia, North America, and other countries were recruited in 2014 via online platforms (e.g. websites, and forums involving MS patients, and social media). The study aimed to provide an overview of risk-modifying behaviors and current lifestyle of a large international cohort of MS patients to analyze the association between these variables and disease progression. Patients with ≥ 18 years, and who could undertake an English language survey were included. In the present study, we used baseline data from English-speaking countries only: 840 (41%) from North America, 797 (39%) from Australasia, and 427 (20%) from UK and Ireland.
We obtained the data collected with the Italian version from the datasets (i.e. baseline data for longitudinal studies/trials) of the following research projects:
The ‘Care system project’ [29, 30], an observational study aimed to assess MS patients’ perceived levels of well-being and ill-being (overall, 662 MS patients from 8 MS centers, recruited between 2012 and 2017). Patients with ≥ 18 years, having a clinically-definite MS diagnosis  for at least 3 years, and having a caregiver, were included. Patients with neurological disorders other than MS, psychiatric disorders, Expanded Disability Status Scale (EDSS ) ≥ 8, being in the active phase of MS, severe cognitive impairment, were excluded.
The study ‘An abbreviated computerized version of the MSQOL-54: Development and preliminary validation using Confirmatory Factor Analysis and Item Response Theory’ [33, 34], which developed an abbreviated version of the MSQOL-54. We used data from 564 MS patients recruited at 5 MS centers, between 2005 and 2012 who participated in the retrospective phase of the study . Patients with ≥ 18 years, having a clinically-definite MS diagnosis and fluent in Italian, were included.
Other research projects carried out in 5 Italian MS centers. Patients with ≥ 18 years, having a clinically-definite MS diagnosis able to read and understand Italian, were included. Overall, 379 MS patients, recruited between 2005 and 2017, were included in the present study.
All these projects were approved by local ethics committees (St Vincent’s Hospital Melbourne Human Research Ethics Committee [LRR 055/12]; Università di Milano; San Raffaele Hospital, Milano; University Polyclinic Hospital G. Rodolico, Catania; University of Florence; S. Anna Hospital, Como; Hospital of Vaio-Fidenza, Fidenza; University ‘G. D’Annunzio’, Chieti; University of Bari; San Camillo- Forlanini Hospital, Rome; University Hospital ‘San Luigi Gonzaga’, Orbassano; Fondazione IRCCS Istituto Neurologico ‘C. Besta’, Milano; IRCCS S. Lucia Foundation, Rome). Patients gave written or online informed consent to be included in the original projects. Additional consent was not required for this secondary analysis, for which patients’ privacy and anonymity were guaranteed.
Records were included in the database if the following variables were available: MS diagnosed (according to any criteria, Italian sample) or disclosed by a physician (English-speaking sample); patient age ≥ 18 years; gender; level of disability (EDSS, Italian sample; PDDS , English-speaking sample), and disease duration.
The goodness of fit of the original second-order factor model comprising two factors, the novel second-order factor model comprising one factor, and the bifactor model was tested using confirmatory factor analysis (CFA).
According to the original factor structure of the MSQOL-54, in the two second-order factor model, it was hypothesized that 52 items loaded in 12 first-order factors and two second-order factors, corresponding to the PHC and MHC  (Additional file 1). The remaining two items (i.e. item 2 ‘Compared to one year ago, how would you rate your health in general now?’, and item 50 ‘Overall, how satisfied were you with your sexual function during the past 4 weeks?’) were not included in this model, as well in the other models, because they are single items.
In the single second-order factor model, the first-order factors were the same as in the original model, and one second-order factor was imposed, called ‘HRQOL’ (Additional file 2).
In the bifactor model, it was hypothesized that 50 items loaded onto the general HRQOL factor and on their specific group factors, whereas the two items forming the overall QOL subscale (items 53 and 54) were loaded only onto the general factor, because the bifactor model needs each group factor to be composed of at least three items to be identified (Additional file 3).
Global fit of the models was evaluated with three approximate indices recommended by Kline , namely, the root mean square error of approximation (RMSEA), the comparative fit index (CFI), and the standardized root mean square residual (SRMR). As a rule of thumb, RMSEA under 0.08 represents good fit and values below 0.05 represent very good fit ; SRMR values under 0.08 indicate good fit, and values greater than 0.10 indicate poor fit ; concerning CFI, values above 0.95 are indicative of good fit , and, as for other incremental fit indices, values below 0.90 indicate that models “can usually be improved substantially” . Akaike Information Criterion (AIC)  and Bayesian Information Criterion (BIC) [41, 42] were used for model comparisons. The model with lower AIC and BIC values was chosen as the best model to fit the data.
To evaluate the relative strength of the general HRQOL factor to group factors, magnitude of loadings was considered (values ≥ 0.40 were considered satisfactory , and explained common variance (ECV) and percentage of uncontaminated correlations (PUC) were calculated . A high ECV value or a moderate ECV value supplemented with a high PUC value (> 0.90) indicated that data were sufficiently “unidimensional” . To judge the degree to which total raw scores reflected a common single factor, the McDonald’s coefficient omega hierarchical (ωH) was computed. High values meant that the total raw score was a reliable measure of the general factor. Further, to evaluate the reliability considering all sources of common variance (general and group factor), the McDonald’s coefficient omega (ω) was calculated. Both omega hierarchical and omega were also calculated for each subscale to evaluate how much subscale scores were reliable measures of the corresponding specific latent variables, once items’ common variance due to the general factor was removed (ωS), and how reliable they were considering all sources of common variance.
Finally, we used CFA to evaluate the measurement invariance of MSQOL-54 across gender (male [26%]; female [74%]), and age (using the median of 44 years old as cut-off). Three increasingly constrained levels of measurement invariance (i.e. configural, metric, scalar) were assessed using multi-group CFA. We used the same criteria as above to assess the model fit.
In line with Chen , a worsening of CFI exceeding the cut-off of 0.010, accompanied by a change of ≥ 0.030 in SRMR or a change of ≥ 0.015 in RMSEA was deemed a signal of lack of metric invariance; as regards the scalar invariance, the threshold values for RMSEA and CFI were identical to those used for metric invariance, whereas it was 0.010 for SRMR. To liken the fit of two nested models, the χ2 difference test was not employed, as it is responsive to sample size, therefore usually providing significant results with large sample sizes .
All models were estimated using the software Mplus 7.0 with the maximum likelihood estimation with robust standard errors (MLR) .
The database consisted of 3669 MS patients (mean age 43.8 years [range 18–87], 74% women, 54% with a mild level of disability (measured with the self-reported PDDS), and mean disease duration of 7.2 years [0–48]) (Table 1). Of these, 1605 (44%) were Italian (mean age 40.9 years, 62% women, 69% with a mild disability level) and 2064 English-speaking participants (mean age 46.1 years, 83% women, 54% with a mild disability level). Compared to Italians, English-speaking participants were older, had a higher percentage of women, and had longer disease duration (p < 0.001) (Table 1).
The goodness-of-fit statistics of the three alternative CFA models are reported in Table 2.
The (original) two second-order factor model fitted the data quite well (RMSEA = 0.055; CFI = 0.888, SRMR = 0.064), only the CFI index was slightly under the cut-off value. The single second-order factor model showed similar values (RMSEA = 0.056; CFI = 0.884, RMRS = 0.068), but in terms of AIC and BIC values it was outperformed by the two second-order factor model. The bifactor model (‘bifactor 1’) produced apparently good fit measures, but the solution was inadmissible because one item of the social function subscale (item 20 ‘During the past 4 weeks, to what extent has your physical health or emotional problems interfered with your normal social activities with family, friends, neighbors, or groups’) showed a negative residual variance (Additional file 4). An inspection of the loading estimates revealed that one item (item 51 ‘During the past 4 weeks, to what extent have problems with your bowel or bladder function interfered with your normal social activities with family, friends, neighbors, or groups?’) was not a good indicator of social functioning, once parceling out the general factor. In fact, a supplementary analysis conducted with the three items of the social function subscale showed that the zero order correlations between item 51 and items 20 and 33 were 0.40 and 0.39, respectively. Further, partial correlations between the same items, after controlling for HRQOL subscale score, were lower (0.24 and 0.22, respectively) (Additional file 5). Therefore, the bifactor 1 solution was inadmissible, being necessary to re-specify a second bifactor model. In the ‘bifactor 2’ the three items of the social function subscale (20, 33, and 51) loaded onto the general factor only, and, to account for the group specificity of item 20 and item 33, residuals of these two items were allowed to correlate. This last model had satisfactory fit (RMSEA = 0.055; CFI = 0.892, RMRS = 0.062), and both AIC and BIC statistic values were better than those of the one and two second-order factor models (AIC = 1,710,637; BIC = 1,711,910; Table 2).
Standardized factor loadings for the revised bifactor model are shown in Table 3.
All items loaded satisfactorily on the general (HRQOL) factor (loading ≥ 0.40), the only exception being item 24 (‘Have you been a very nervous person?’), and the four items belonging to the sexual function scale.
Loadings on the group factors were all ≥ 0.40, except for three items (item 23 ‘Did you feel full of pep?’, item 27 ‘Did you have a lot of energy?’, and item 32 ‘Did you feel rested on waking in the morning?’) of the energy subscale, two items of health perceptions (i.e. item 34 ‘I seem to get sick a little easier than other people’ and item 36 ‘I expect my health to get worse’), and one of the emotional wellbeing subscale (item 26 ‘Have you felt calm and peaceful’).
ECV value was 0.51 (indicating that 51% of the common variance was due to the general HRQOL factor) and PUC was 0.92, denoting that the data were sufficiently ‘unidimensional’.
Omega value for the total raw score was 0.98, suggesting that the reliability considering all sources of common variance (general factor and group factors) was very high. Moreover, omega hierarchical value of the general factor was 0.87, indicating that the total raw score was a reliability measure of the general HRQOL factor.
As shown in Table 4, for the majority of the subscales, omega hierarchical value (ωS) was around 0.50, whereas it was very low (≤ 0.35) for three subscales (i.e. energy, health perceptions, and health distress)—meaning that summed scores of items belonging to these subscales were not a reliable measure of their respective domain latent variable once the general HRQOL was taken into account—and it was high (0.70) for sexual function subscale. For the latter subscale, it seems that the specific group factor accounted for more variance than the general factor, indicating that items belonging to this subscale were more likely to reflect a specific domain of HRQOL (related to sexual function) than a common general construct of HRQOL.
First, the model was estimated to evaluate the measurement invariance of MSQOL-54 across gender (Table 5, upper part). Results showed that the model produced an acceptable fit for configural invariance (RMSEA = 0.055; CFI = 0.892; SRMR = 0.063). Considering the model where loadings were imposed to be identical across gender, indices of fit were satisfactory, and worsening of the unrestrained model was insignificant (ΔRMSEA < 0.001; ΔCFI = − 0.006; ΔSRMR = 0.008), hence providing evidence of metric invariance. With regard to the scalar invariance (i.e. intercepts and loadings imposed to be invariant across groups), the model fitted the data well (RMSEA = 0.054; CFI = 0.885; SRMR = 0.063). Finally, examining the variations in fit indices when compared with the metric invariance model, cut-off values were met, supporting the scalar invariance.
Second, the model was estimated to evaluate the measurement invariance of MSQOL-54 across age (using the median of 44 years as cut-off) (Table 5, bottom part). Here, the results showed that the model produced acceptable fit for configural invariance (RMSEA = 0.054; CFI = 0.893; SRMR = 0.059), metric invariance (RMSEA = 0.054; CFI = 0.887; SRMR = 0.067), and scalar invariance (RMSEA = 0.054; CFI = 0.885; SRMR = 0.063). All the changes in fit indices across the models were satisfactory.
As far as we know, this was the first study applying the bifactor model to the MSQOL-54 in a large international database of MS patients.
The bifactor model with one general HRQOL factor and 10 specific group factors achieved acceptable fit and outperformed both the original two second-order factor model and the single second-order factor model. Also, our findings supported measurement invariance of the questionnaire across age and gender, suggesting that it has the same meaning across these socio-demographic variables, and that patients having the same ratings on MSQOL-54 general or domain factors would attain the identical value on the observed variable, regardless of sub-group membership.
Generally, the factor loadings were substantially high both on the general and the group factors, and the ECV was about 50%, indicating that MSQOL-54 items contribute to essentially the same extent to both the general HRQOL factor and to the group factors. Despite this, the data can be deemed sufficiently ‘unidimensional’, because the MSQOL-54 consists of several subscales composed of few items each, and this implies that the vast majority of correlations between items (PUC = 92%) reflect general factor variance only. Furthermore, the satisfactory value of the coefficient omega hierarchical indicated that the total raw score is a reliable measure of the general HRQOL latent variable. Taken together, all these results support the hypothesis that the MSQOL-54 has a sufficient ‘unidimensional’ structure, and thus it is appropriate to calculate a total HRQOL score.
Among the 52 items analyzed in the study—it is noteworthy to remember that items 2 and 50 were excluded from the analysis as they are single items—the weaker indicators of the general HRQOL dimension were the four items of the sexual function subscale. Considering the omega hierarchical value, the sexual function subscale is more likely to reflect a specific domain of HRQOL (namely related to sexual function) than a common general construct of HRQOL. In fact, this is the only subscale that showed an omega hierarchical value ≥ 0.70.
Another issue derives from the social function subscale. The three items of this subscale loaded onto the general factor only because one of them (item 51, dealing with bowel or bladder) was not a good indicator of social functioning, and a group factor needs at least 3 items to be identified. Thus, it was not possible to evaluate the contribution of the relative group factor.
This study has important implications for clinical practice and research. For clinical practice, it could be crucial to provide health professionals and MS patients with feedback using a single HRQOL total score, which includes aspects of HRQOL not captured by the 10 group factors—as well as with subscale scores, to add granularity. The total HRQOL score could be useful also to identify patient subgroups—with different disease forms as well as levels of disability—in order to deliver personalized interventions addressing, for example, self-efficacy or resilience. On the other hand, for researchers, it could be easier to calculate and interpret a unique total HRQOL score, when using such measure in clinical trials or other research studies. Moreover, the present results can be a stimulus for future research aimed at revising the MSQOL-54 questionnaire. Specifically, our findings highlight the need to enlarge the number of items measuring the social function subscale, because one of the three items of this subscale was not a good indicator. Furthermore, we suggest revising the sexual function subscale items by broadening the content domain so as to include also intimacy and sexual pleasure, as three of the four items from this subscale originated from Medical Outcomes Study sexuality functioning scale which focus on performance indicators .
In the present study there were a number of limitations, some of which are reported elsewhere . This secondary analysis was carried out in a large cross-sectional international MS database and should be confirmed in an independent sample, using a prospective longitudinal design. Stability of the factor structure was not established, as the data were not collected using longitudinal assessments. Further, criterion validity of the total HRQOL score should be assessed by correlating it with other pertinent questionnaires.
To conclude, this study adds new knowledge to the factorial structure of the MSQOL-54, in that a bifactor model fits the data well, outperforming the two second-order models. Therefore, it is appropriate to calculate a total HRQOL score, including all the original subscales/domains. Based on these results, in future research, items should be calibrated using item response theory in order to assess whether a multidimensional computerized adaptative version of the MSQOL-54 is feasible. Further work to integrate / revise selected items is suggested.
Availability of data and materials
The dataset generated and analysed during the current study is available in the Zenodo repository, https://doi.org/10.5281/zenodo.4591136.
Akaike Information Criterion
Bayesian Information Criterion
Confirmatory factor analysis
Comparative fit index
Explained common variance
Expanded Disability Status Scale
Health-related quality of life
Mental Health Composite
Maximum likelihood estimation with robust standard errors
The Multiple Sclerosis Quality of Life-54 items
Patient Determined Disease Steps
Percentage of uncontaminated correlations
Physical Health Composite
Root mean square error of approximation
Standardized root mean square residual
Nortvedt MW, Riise T. The use of quality of life measures in multiple sclerosis research. Mult Scler. 2003;9:63–72.
Mitchell AJ, Benito-Leon J, Gonzalez JM, Rivera-Navarro J. Quality of life and its assessment in multiple sclerosis: integrating physical and psychological components of wellbeing. Lancet Neurol. 2005;4:556–66.
Miller DM, Allen R. Quality of life in multiple sclerosis: determinants, measurement, and use in clinical practice. Curr Neurol Neurosci Rep. 2010;10(5):397–406. https://doi.org/10.1007/s11910-010-0132-4.
Solari A. Role of health-related quality of life measures in the routine care of people with multiple sclerosis. Health Qual Life Outcomes. 2005;3:16.
Atlas of MS 2020. https://www.msif.org/wp-content/uploads/2020/10/Atlas-3rd-Edition-Epidemiology-report-EN-updated-30-9-20.pdf. Accessed 23 February 2021.
Compston A, McDonald I, Noseworthy J, Lassmann H, Miller D, Smith K, et al editors. McAlpine’s multiple sclerosis. 4th ed. Edinburgh: Churchill Livingstone Elsevier; 2006.
Ware JE, Snow KK, Kosinski M, Gandek B, editors. SF-36 health survey manual and interpretation guide. Boston, MA: The Health Institute; 1993.
Vickrey BG, Hays RD, Harooni R, Myers LW, Ellison GW. A health-related quality of life measure for multiple sclerosis. Qual Life Res. 1995;4:187–206.
Solari A, Filippini G, Mendozzi L, Ghezzi A, Cifani S, Barbieri E, et al. Validation of Italian multiple sclerosis quality of life 54 questionnaire. J Neurol Neurosurg Psychiatry. 1999;67:158–62.
Idiman E, Uzunel F, Ozakbas S, Yozbatiran N, Oguz M, Callioglu B, et al. Cross-cultural adaptation and validation of multiple sclerosis quality of life questionnaire (MSQOL-54) in a Turkish multiple sclerosis sample. J Neurol Sci. 2006;240:77–80.
Taoussi KEA, Haddou EAB, Benomar A, Abouqal R, Yahyaoui M. Quality of life and multiple sclerosis: Arabic language translation and transcultural adaptation of MSQOL-54. Revue Neurol. 2012;168:444–9.
Giordano A, Pucci E, Naldi P, Mendozzi L, Milanese C, Tronci F, et al. Responsiveness of patient-reported outcome measures in multiple sclerosis relapses: the REMS study. J Neurol Neurosurg Psychiatry. 2009;80:1023–8.
Acquadro C, Lafortune L, Mear I. Quality of life in multiple sclerosis: translation in French Canadian of the MSQoL-54. Health Qual Life Outcomes. 2003;1:70.
Yamamoto T, Ogata K, Katagishi M, Shimizu H, Ogawa M, Yamamura T, et al. Validation of the Japanese-translated version Multiple Sclerosis Quality of Life-54 instrument. Rinsho Shinkeigaku. 2004;44:417–21.
Pekmezovic T, Kisic Tepavcevic D, Kostic J, Drulovic J. Validation and cross-cultural adaptation of the disease- specific questionnaire MSQOL-54 in Serbian multiple sclerosis patients sample. Qual Life Res. 2007;16:1383–7.
Füvesi J, Bencsik K, Benedek K, Mátyás K, Mészáros E, Rajda C, et al. Cross-cultural adaptation and validation of the ‘Multiple Sclerosis Quality of Life Instrument’ in Hungarian. Mult Scler. 2008;14:391–8.
Chen FF, West SG, Sousa KH. A comparison of bifactor and second-order models of quality of life. Multivar Behav Res. 2006;41(2):189–225.
Reise SP, Moore TM, Haviland MG. Bifactor Models and Rotations: Exploring the Extent to which Multidimensional Data Yield Univocal Scale Scores. J Pers Assess. 2010;92:544–59.
Gustafsson J, Balke G. General and specific abilities as predictors of school achievement. Multivar Behav Res. 1993;28:407–34.
Luo D, Petrill SA, Thompson LA. An exploration of genetic g: hierarchical factor analysis of cognitive data from the Western Reserve Twin Project. Intelligence. 1994;18:335–47.
Bludworth JL, Tracey TJG, Glidden-Tracey C. The bi-level structure of the Outcome Questionnaire-45. Psychol Assess. 2010;22:350–5.
Brouwer D, Meijer RR, Weekers AM, Baneke JJ. On the dimensionality of the Dispositional Hope Scale. Psychol Assess. 2008;20:310–5.
Chilcot J, Norton S, Kelly ME, Moss-Morris R. The Chalder Fatigue Questionnaire is a valid and reliable measure of perceived fatigue severity in multiple sclerosis. Mult Scler. 2016;22:677–84.
Chamot E, Kister I, Cutter GR. Item response theory-based measure of global disability in multiple sclerosis derived from the performance scales and related items. BMC Neurol. 2014;3:192. https://doi.org/10.1186/s12883-014-0192-1.
Mokkink LB, Knol DL, Uitdehaag BM. Factor structure of Guy’s Neurological Disability Scale in a sample of Dutch patients with multiple sclerosis. Mult Scler. 2011;17(12):1498–503.
Giordano A, Testa S, Bassi M, Cilia S, Bertolotto A, Quartuccio ME, et al. Assessing measurement invariance of MSQOL-54 across Italian and English versions. Qual Life Res. 2020;29(3):783–91.
Hadgkiss EJ, Jelinek GA, Weiland TT, Pereira NG, Marck CH, van der Meer DM. Methodology of an international study of people with multiple sclerosis recruited through web 2.0 platforms: demographics, lifestyle, and disease characteristics. Neurol Res Int. 2013;2013:580–96.
Jelinek GA, De Livera AM, Marck CH, Brown CR, Neate SL, et al. Lifestyle, medication and socio-demographic determinants of mental and physical health related quality of life in people with multiple sclerosis. BMC Neurol. 2016;16:235.
Bassi M, Falautano M, Cilia S, Goretti B, Grobberio M, et al. Illness perception and well-being among persons with multiple sclerosis and their caregivers. J Clin Psychol Med Settings. 2016;23:33–52.
Bassi M, Falautano M, Cilia S, Goretti B, et al. The coexistence of well- and ill-being in persons with multiple sclerosis, their caregivers and health professionals. J Neurol Sci. 2014;2014(337):67–73.
Polman CH, Reingold SC, Banwell B, Clanet M, Cohen JA, et al. Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann Neurol. 2011;69:292–302.
Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. 1983;33:1444–52.
Rosato R, Testa S, Bertolotto A, Confalonieri P, Patti F, et al. Development of a short version of MSQOL-54 using factor analysis and item response theory. PLoS ONE. 2016;11:e0153466.
Rosato R, Testa S, Bertolotto A, Scavelli F, Giovannetti AM, et al. Prospective validation of the abbreviated, electronic version of the MSQOL-54. Mult Scler. 2018;25(6):856–66.
Hohol MJ, Hohol MJ, Orav EJ, Weiner HL. Disease steps in multiple sclerosis: a simple approach to evaluate disease progression. Neurology. 1995;1995(45):251–5.
Kline RB. Principles and practice of structural equation modeling. New York: Guilford Publications; 2015.
Browne MW, Cudeck R. Alternate ways of assessing model fit. In: Bollen KA, Long JS, editors. Testing structural equation models. Newbury Park, CA: Sage Publication; 1993. p. 136–62.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Bentler PM, Bonett DG. Significance tests and goodness-of-fit in the analysis of covariance structures. Psychol Bull. 1980;88:588–606.
Akaike H. A new look at the statistical model identification. IEEE Trans Autom Control. 1974;19:716–23.
Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6:461–4.
Stone M. Comments on Model Selection Criteria of Akaike and Schwarz. J Roy Stat Soc: Ser B (Methodol). 1979;41:276–8.
Peipert JD, Cella D. Bifactor analysis confirmation of the factorial structure of the Functional Assessment of Cancer Therapy-General (FACT-G). Psychooncology. 2019;28:1149–52.
Reise SP. Invited paper: the rediscovery of bifactor measurement models. Multivar Behav Res. 2012;47(5):667–96.
Chen FF. Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct Equ Model. 2007;14(3):464–504.
Muthén LK, Muthén BO, editors. Mplus user’s guide. 6th edn. Los Angeles, CA: Muthén & Muthén; 1998–2011.
Sherbourne C. Social functioning: sexual problems measures. In: Stewart AL, Ware JE, editors. Measuring functioning and well-being: the medical outcomes study approach. North Carolina: Duke University Press; 1992. p. 194–204.
We thank all the MS patients who participated.
Ethical approval and consent to participate
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.
Patients gave written informed consent to being included in the original projects. Additional consent was not required for this secondary analysis, for which patients’ privacy and anonymity was guaranteed.
Consent for publication
The authors declare that they have no conflict of interest. AL has received personal compensation for consulting, serving on a scientific advisory board, speaking or other activities from Biogen, Merck Serono, Mylan, Novartis, Roche, Sanofi/Genzyme, Teva. Her institutions have received research grants from Novartis.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Configuration of the confirmatory two second-order factors model including 12 first-order factors and two second-order factors. CF, cognitive function; EB, emotional wellbeing; EN, energy; HD, health distress; HP, health perceptions; HRQOL, health-related quality of life; PH, physical health; QOL, overall quality of life; RLEP, role limitations due to emotional problems; RLPP, role limitations due to physical problems; SeF, sexual function; SF, social function.
Configuration of the confirmatory single second-order factor model, including 12 first-order factors and one second-order factor. CF, cognitive function; EB, emotional wellbeing; EN, energy; HD, health distress; HP, health perceptions; HRQOL, health-related quality of life; PH, physical health; QOL, overall quality of life; RLEP, role limitations due to emotional problems; RLPP, role limitations due to physical problems; SeF, sexual function; SF, social function.
Configuration of the Bifactor 1 model, including 11 group factors and one general factor; items of the overall QOL subscale (53, 54) loaded onto the general factor only, and their residuals were allowed to correlate. CF, cognitive function; EB, emotional wellbeing; EN, energy; HD, health distress; HP, health perceptions; HRQOL, health-related quality of life; PH, physical health; QOL, overall quality of life; RLEP, role limitations due to emotional problems; RLPP, role limitations due to physical problems; SeF, sexual function; SF, social function.
Standardized factor loadings in the Bifactor 1 model.
Correlations between items of the social function subscale.
About this article
Cite this article
Giordano, A., Testa, S., Bassi, M. et al. Viability of a MSQOL-54 general health-related quality of life score using bifactor model. Health Qual Life Outcomes 19, 224 (2021). https://doi.org/10.1186/s12955-021-01857-y
- Multiple sclerosis
- Bifactor model
- Factor analyses
- Health-related quality of life