Skip to main content

Fatigue in chronically critically ill patients following intensive care - reliability and validity of the multidimensional fatigue inventory (MFI-20)



Fatigue often occurs as long-term complication in chronically critically ill (CCI) patients after prolonged intensive care treatment. The Multidimensional Fatigue Inventory (MFI-20) has been established as valid instrument to measure fatigue in a wide range of medical illnesses. Regarding the measurement of fatigue in CCI patients, the psychometric properties of the MFI-20 have not been investigated so far. Thus, the present study examines reliability and validity of the MFI-20 in CCI patients.


A convenience sample of n = 195 patients with Critical Illness Polyneuropathy (CIP) or Myopathy (CIM) were recruited via personal contact within four weeks (t1) following the transfer from acute care ICU to post-acute ICU at a large rehabilitation hospital. N = 113 (median age 61.1 yrs., 72.6% men) patients were again contacted via telephone three (t2) and six (t3) months following the transfer to post-acute ICU. The MFI-20, the Euro-Quality of Life (EQ-5D-3 L) and the Structured Clinical Interview for the Diagnostic and Statistical Manual of mental disorders DSM-IV (SCID-I) were applied within this prospective cohort study.


The internal consistency Cronbach’s α was adequate for the MFI-total and all but the subscale Reduced Motivation (RM) (range: .50–.91). Item-to-total correlations (range: .22–.80) indicated item redundancy for the subscale RM. Confirmatory Factor analyses (CFAs) revealed poor model fit for the original 5-factor model of the MFI-20 (t2/t3, Confirmatory Fit Index, CFI = .783/ .834; Tucker-Lewis Index, TLI = .751/ .809; Root Mean Square Error of Approximation, RMSEA = .112/ .103). Among the alternative models (1-, 2-, 3-factor models), the data best fit to a 3-factor solution summarizing the highly correlated factors General −/ Physical Fatigue/ Reduced Activity (GF/ PF/ RA) (t2/ t3, CFI = .878/ .896, TLI = .846/ .869, RMSEA = .089/ .085, 90% Confidence Interval .073–.104/ .066–.104). The MFI-total score significantly correlated with the health-related quality of life (range: −.65-(−).66) and the diagnosis of major depression (range: .27–.37).


In the present sample of CCI patients, a reliable and valid factor structure of the MFI-20 could not be ascertained. Especially the subscale RM should be revised. Since the factors GF, PF and RA cannot be separated from each other and the unclear factorial structure in the present sample of CCI patients, the MFI-20 is not recommended for use in this context.

Trial registration

German Clinical Trials Registration DRKS00003386. Registered 13 December 2011, retrospectively registered.


Fatigue is a common long-term complication after treatment on Intensive Care Unit (ICU) and goes along with a negative impact on the patients’ health-related quality of life [1,2,3]. A small group of seriously ill patients needs prolonged mechanical ventilation, ongoing dependence from invasive critical care techniques and persistent monitoring of e.g. cardiopulmonary functions. These patients are at increased risk to become chronically critically ill (CCI) [4, 5]. CCI patients are often faced with initial severe acute inflammatory events that cause long-term alterations in the innate and/ or acquired immune system [6]. Consequential, damages at the myelin of the peripheral nerves may follow. The extended immobilization of CCI patients leads to adverse physiological alterations and deconditioning in multiple organ systems such as muscles, bones, joints, endocrine system. The prolonged treatment on ICU may contribute to a loss of muscle mass and maximal attainable muscle tension, increasing the risk for the development of Critical Illness Polyneuropathy (CIP) or Critical Illness Myopathy (CIM). In this context, a state of ongoing exhaustion or fatigue may occur which cannot be substantially alleviated by rest and may impair the patients’ rehabilitation process [5]. Moreover, fatigue may be accompanied by additional symptoms such as neurological, immunological, gastrointestinal, genitourinary symptoms and impaired energy balance [7, 8].

Especially CCI patients are confronted with symptoms of fatigue during their physical and cognitive rehabilitation process following the ICU treatment [3, 5]. CCI patients reported symptoms of chronic exhaustion even five years after ICU discharge [9]. Additionally, the present research showed a profound overlap between the construct of fatigue and depressive symptoms [10, 11]. Hence, health care providers should be aware of fatigue and its consequences in CCI patients.

Until now, a uniform definition or pathophysiology as well as gold standard for the assessment of fatigue do not exist. Within clinical research, self-report questionnaires such as the Multidimensional Fatigue Inventory (MFI-20) [12] are commonly used to assess the severity of fatigue symptoms (e.g. in chronic fatigue syndrome, cancer, Parkinson’s disease, craniopharyngeoma). At present, findings regarding the reliability and validity of the MFI in CCI patients are lacking. There is need for a valid instrument to measure fatigue in these patients in order to complete a patient-centered outcome set, assessing impairments following ICU discharge in these patients. Above, the valid assessment of fatigue in CCI patients allows a more meaningful evaluation of the effectiveness of interventional trials targeted on the improvement of fatigue post-discharge [13]. Following, the aim of the present study was to examine the measurement properties (internal consistency, measurement error, structural -, convergent -, discriminant validity, floor/ ceiling effects) of the MFI-20 in patients with chronic critical illness.


Participants and study procedures

N = 195 patients were consecutively enrolled in a rehabilitation hospital (Bavaria Clinic Kreischa) where they were weaned from long-term ventilation. For study participation they had to fulfill the diagnosis of a Critical Illness Polymyopathy (CIP, ICD-10: G62.80) or Critical Illness Myopathy (CIM, ICD-10: G72.80). A convenience sample of patients (see Fig. 1) with the following further inclusion criteria participated: sufficient German language skills, ICU stay of at least six days, alert and able to understand the questionnaires, transfer from ICU at acute care hospital during the weeks before inclusion in the present study. The patients were asked for study participation orally at the Bavaria Clinic Kreischa. A short cognitive test, named Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) [14, 15], was applied in order to preclude cognitive impairment. The patients were informed about the study protocol and informed consent for study participation was received within four weeks following the transfer from ICU at acute care hospital to the post-acute ICU at the rehabilitation hospital (t1).

Fig. 1
figure 1

Study flow diagram. n = 113 patients with CCI were interviewed at t2, n = 91 patients with CCI were interviewed at t3. CAM-ICU: Confusion Assessment Method for the Intensive Care Unit; CIP/ CIM: Critical Illness Polyneuropathy/ Critical Illness Myopathy; SCID: Structured Clinical Interview for DSM-IV

A detailed description of the study procedure has been already published elsewhere [16]. At t1, basic medical and sociodemographic data (marital status, educational level, work status, age, gender) were obtained from the patient record forms. Clinical data contained the clinical diagnoses, the severity of medical illnesses, the occurrence of sepsis, the sites of infections, the length of mechanical ventilation and the length of ICU stay. The latter two were completed at t2 and t3. The severity of the medical illnesses was assessed indirectly via the Barthel-Index (BI) [17]. The BI is a measure of performance in activities of daily living and of the early rehabilitation state (e.g. intensive care supervision, tracheostomy tube management, mechanical ventilation, confusion, severe impairment of communication, and dysphagia). It was assessed at admission and discharge from rehabilitation hospital by a trained study nurse. A minimum value of − 325 and a maximum value of 100 could be reached. Higher values indicate a better performance.

At t2 (three months post-transfer) and at t3 (six months post-transfer), the patients were contacted via telephone. During the telephone interviews, the following instruments with relevance for the present research question were applied: the MFI-20 [12], the questionnaire Euro-Quality of Life (EQ-5D-3 L) [18] and the Structured Clinical Interview for the Diagnostic and Statistical Manual of mental disorders DSM-IV (SCID-I) [19].


The severity of fatigue was assessed with the Multidimensional Fatigue Inventory-20 (MFI-20) [12, 20] three months and six months following the transfer from acute care ICU via telephone interview. The MFI-20 is a 20-item self-report measurement of fatigue. It covers the five dimensions General Fatigue (GF), Physical Fatigue (PF), Mental Fatigue (MF), Reduced Motivation (RM) and Reduced Activity (RA). Each subscale contains two positively (e.g. „I feel very active.“) and two negatively (e.g. „I tire easily.“) formulated items. Items are rated on a 5-point Likert scale (range 1 „yes, that is true “to 5 „no, that is not true“) which are summed up to a simple total score with a minimum value of 4 (absence of fatigue) and a maximum value of 20 for each subscale. A total fatigue score is calculated as the sum of the subscale scores (range 20–100). Higher total scores indicate higher levels of fatigue. Validity has been shown for different participant populations e.g. cancer patients, army recruits, chronic fatigue syndrome. Internal consistency has been shown to be good for the GF, PF and MF dimensions (Cronbach’s α .84) [12] and adequate for the subscales RA and RM (Cronbach’s α > .65) [12].

The health-related quality of life was measured with the questionnaire Euro-Quality of Life (EQ-5D-3 L) [18] at t2 and t3. The EQ-5D-3 L assesses five dimensions (mobility, self-care, usual activities, pain/ discomfort and anxiety/ depression) which are rated within three severity levels (no problems, some or moderate problems, extreme problems or unable). A single one-dimensional index value is generated based on a simple sum score according to Hinz et al. [21]. This sum score was subjected to a linear transformation leading to values between 0 and 100. Higher values indicate a higher health-related quality of life. In the present study Cronbach’s α for the EQ-5D-3 L was .74 at t2 and .75 at t3.

The diagnosis of major depression was ascertained via SCID-I at t2 and t3. A clinical psychologist with at least five years of clinical practice applied the SCID-I via telephone contact.

The present study protocol was in consent with the Declaration of Helsinki and a positive votum was received by the Ethics Committee (No 3278–10/11) of the Friedrich-Schiller-University, Jena, Germany.

Statistical analysis

Sociodemographic and item characteristics were displayed as frequencies. Mean values, standard deviations, medians and interquartile ranges were reported depending on the distribution of the dependent variables. Chi-squared tests or Fisher’s exact tests were applied to compare categorical data. Mann-Whitney U tests were calculated to compare ordinally or non-normally distributed data. Standardized Cronbach’s α, item-total subscale (corrected-to-total) and inter-item correlations were calculated as parameters of internal consistency. Cronbach’s α values >.70, item-total subscale (corrected-to-total) correlations ≥ .30 and inter-item correlations of .30 to .70 were acceptable [22, 23]. Measurement error, defined as systematic or random error not attributed to true changes in fatigue between the two consecutive assessments of the MFI, was calculated according to Elbers et al. [22] using Bland and Altman plots [24]. The limits of agreement were determined using the mean difference values (MFI-total, MFI subscales) between t2 and t3 ± 1.96 x standard deviation (SD) of the difference. A linear regression between difference and mean scores was applied in order to detect a proportional bias. MFI subscales were intercorrelated controlling for age and gender [12, 20, 23]. Confirmatory factor analyses (CFA) was applied using the maximum likelihood estimation (MLE) in order to assess the structural validity of the MFI-20 in the present sample of CCI patients at t2 and t3 (see Table 4, Additional file 1: Table S3). Separate CFAs were applied in the sample of CCI patients at t2 (n = 113) and t3 (n = 91). The MFI-20 may be regarded as structurally valid if the items adequately represent the proposed latent factors or measurement model. The following models were evaluated: model A assuming the original five latent fatigue factors (GF, PF, MF, RA, RM), model B assuming one single underlying latent factor, model C with a 2-factor model combining model A and B. Since the subscales representing the three factors GF, PF and RA showed high intercorrelations (see Table 3), these factors were summarized and tested in model D (3-factor model). In models A, C, and D, the latent fatigue factors were allowed to covary with mean values fixed to 0 and variances to 1. The fitness of each model with the data were evaluated using comparative fit indices (CFI, cut-off > .95; Tucker-Lewis Index, TLI,cut-off > .95) and absolute fit indices (Chi2 goodness-of-fit, Root Mean Square Error of Approximation, RMSEA, cut-off <.08) [25, 26]. A Chi2 value smaller than the number of degrees of freedom (at p > .05), can be regarded to be good. The CFA was carried out using IBM® SPSS® AMOS 24.0.0. The convergent validity was assessed via Spearman’s and point-bisearial correlation with the health-related quality of life and the SCID-based diagnosis of major depression. Above, the convergent validity was evaluated by composite reliability (CR > .7) and average variance extracted (AVE > .5) value of each factor. Discriminant validity was evaluated by maximum shared variance (MSV < AVE), average shared variance (ASV < AVE) and square root of AVE greater than inter-factor correlations [27]. CR, AVE, MSV, ASV and square root of AVE were calculated using stats tool package. Floor and ceiling effects were ascertained if more than 15% of the CCI patients had either the lowest or highest possible score on the MFI subscales at t2 or t3. For the statistical analyses SPSS 24.0 was used. All results were considered significant at p ≤. 05 (two-tailed).


Of the N = 352 potentially to be enrolled patients at the post-acute ICUs of the Bavaria Clinic Kreischa, n = 195 (55.4%) patients could be successfully interviewed within four weeks, n = 113 (57.9%) three months and n = 91 (46.7%) six months following the transfer from acute care ICU (see Fig. 1). The characteristics of the sample of CCI patients at t2 (n = 113) are summarized in Table 1. Non-participants and followed-up patients showed a similar age and gender distribution. More non-participants than followed-up patients had a lower degree of education. The followed-up patients showed significantly lower Barthel indices at discharge from post-acute ICU as well as from rehabilitation hospital compared with the non-participants. The mean total fatigue score was 55.9 (SD = 16.5). The mean values for the MFI subscales ranged from 8.7 for MF to 13.4 for PF. Additional file 2: Table S1 presents the medical comorbidities in both groups showing that non-participants are more often affected by specific medical comorbidities than followed-up patients. Significantly higher rates were observed for pneumonia, hypertension, organic brain syndrome and neurological disorders, by tendency only for diabetes, kidney diseases and sleep apnea.

Table 1 Descriptive characteristics of chronically critically ill (CCI) patients three months (t2) following discharge from ICU at acute care hospital (n = 113). Non-participants were defined as all patients of the potentially to be enrolled patients who could not be interviewed at t1, t2 and t3 for different reasons (see Flow chart, Fig. 1)


The internal consistency values were reasonsable for the MFI-total (Cronbach’s α = .91) and four of the five subscales (GF, PF, RA, MF, Cronbach’s α range: .69–.86). An inadequate value (Cronbach’s α = .50) was received for the subscale RM (Table 2). Regarding the inter-item correlations, a mean value of ≥ .30 was ascertained for the MFI-total and all but the RM subscale (.20, range .06–.32). The lowest correlation was present between item 15 („I have a lot of plans.“) with item 18 („I don’t feel like doing anything.”) (.06) and item 12 („I feel rested.“) (.02), suggesting item redundancy for item 15. Item-total correlations were ≥ .30 for all but the subscale RM (range: .22–.37) as well as the MFI-total (range: .23–.71) (Table 2). Pearson item subscale correlation coefficients ranged from .11 to .58 (absolute values) with the lowest and non-significant values for items 6, 11 and 20. Of them, only the removal of item 20 (“Physically I feel I am in an excellent condition.”) would lead to an increase of Cronbach’s α for the respective subscale (Additional file 3: Table S2). Values at t2 and t3 were not systematically different as shown in the Bland and Altman plots (Additional file 4: Figure S1).

Table 2 Item characteristics and internal consistency reliabilities for the MFI-20 subscales and total fatigue score in patients with chronic critical illness three months (t2) following the transfer from ICU at acute care hospital (n = 113)

Pairwise correlations between the MFI subscales

There was no effect of gender for all the MFI subscales. Only the MFI subscale GF was positively associated with age (Spearman’s rho = .24, p = .01). Nevertheless, partial correlations were calculated controlling for age and gender. At t2, the MFI subscales showed medium-sized to high correlations between each other, ranging from .37 to .77. At t3, the pairwise correlations ranged from .55 to .76 (see Table 3).

Table 3 Pairwise correlations between the MFI subscales at t2 and t3

Structural validity

Fit statistics revealed a poor model fit for all models at both time points (Table 4, Additional file 1: Table S3). According to the original 5-factor model (A), all but one standardized regression weights β (corresponding to factor-item correlation coefficients) were > .05 at p < .001. The lowest estimates were observed for item 15 for models A and B (t2/ t3, model A: β = .363/ .465, model B: β = .242/ .358). For model C, only the items of the MF subscale showed highly significant β coefficients at both t2 and t3. The best fit to the data (RMSEA ≤ .089) could be obtained for the 3-factor solution both at t2 and t3. All but items 9 and 16 were highly significantly correlated to the subscales. When each of the five factors was evaluated in separate CFAs, the factor RM turned out to show the poorest fit indices at both t2 and t3.

Table 4 MFI-20 models tested via Confirmatory Factor Analysis (CFA) in n = 113 patients three months post-ICU (t2)

Convergent and discriminant validity

According to model A, the composite reliability (CR) was appropriate (CR > .7) for all MFI subscales at both time points. AVE was acceptable (> .5) besides for RM. Discriminant validity (MSV < AVE, ASV < AVE) could not be ascertained at both time points. The square root of AVE was greater than the inter-factor correlations only for the subscale MF.

According to model C, the CR was appropriate (CR > .7) only for MF at both time points. AVE was inappropriate and only acceptable (> .5) for MF at t2. Discriminant validity (MSV < AVE, ASV < AVE) could be ascertained at both time points.

Referring to model D, the CR was acceptable for all subscales besides RM. AVE was only appropriate for MF. Discriminant validity (MSV > AVE) was only appropriate for MF as well. Square root of AVE was smaller than inter-factor correlations besides for MF at t3.

Since no unequivocal factor model of the MFI-20 could be confirmed in the present sample of CCI patients, correlations were calculated only with the MFI-total but not with the single MFI subscales. The relationship between the MFI-total and the health-related quality of life (EQ-5D-3 L) yielded significant high-sized correlations at both t2 and t3 (range: − 65-(−)66). Significant small- to medium-sized correlations (range: .27–.37) could be obtained with the diagnosis of major depression (see Table 5).

Table 5 Convergent validity. Spearman’s correlation between the total score of the MFI-20 and EQ-5D-3 L, point-biserial correlation between the MFI-total and the diagnosis of major depression according to SCID-I

Floor and ceiling effects

There was neither a floor nor a ceiling effect for the subscales GF, PF, RA and RM. For the MF subscale, floor effects were detected. N = 30 patients (26.5%) had the lowest possible test score at t2, n = 19 patients (20.9%) at t3.


Fatigue is a common symptom in patients following ICU stay [13]. It is supposed that especially CCI patients are affected by a persisting feeling of exhaustion [3, 9]. However, to what extent the Multidimensional Fatigue Inventory (MFI) is a reliable and valid measure of fatigue in CCI patients, remains to be elucidated. Therefore, the present study investigated the measurement properties of the MFI in a sample of patients at three and six months following the transfer from acute care ICU to post-acute ICU at the rehabilitation hospital.

Overall, in the present sample of CCI patients the MFI values of the subscales General Fatigue (GF), Physical Fatigue (PF) and Reduced Activity (RA) were higher than those of cancer patients [28]. Likewise, CCI patients showed higher values in the above mentioned MFI subscales compared with a representative German population, whereas the subscales RM and MF were similarly scored [20]. The internal consistency was adequate for the MFI-total and the GF, PF, MF, RA subscales suggesting unidimensionality of the appropriate subscales. For the RM subscale a value smaller than the cut-off criterion of .70 for Cronbach’s α was shown. The other reliability tests (corrected item-to-total and inter-item correlation) showed inadequate values for the MFI-total and the RM subscale (values < .30). This is in accordance with the literature showing low reliability for the RM subscale either [12, 29]. One explanation might be a possible response dependency between items of this subscale [29]. Alternatively, it can be supposed that the content of the items does not correctly mirror the circumstances of the CCI patients. These often old-aged patients are in a fatal situation with a high ongoing mortality rate, increased risk for recurrent complications and persistent suffering from profound functional impairments to master basic activities of daily life. Thus, given the vague situation of CCI patients, the item content (especially from item 15: I have a lot of plans.) seems to be irrelevant for them, leading to inconsistent response patterns [12]. With respect to measurement error, the Bland and Altman plots suggest overall agreement between the assessment of fatigue at t2 and t3, with a wide range of MFI values as shown by the limits of agreement.

Floor and ceiling effects could be precluded for the MFI-total and all but the MF subscale. For the latter, 26.5% of the patient sample achieved the lowest possible test score at t2 and 20.9% at t3. This finding confirms existing results in patients with chronic illnesses (e.g. [22]) and a population-based sample of well participants [23].

In the present sample of CCI patients, a reliable factor structure of the MFI-20 could not be ascertained. Fit statistics revealed poor model fit for the original 5-factor model as originally suggested by Smets et al. [12]. Alternative models (1-factor, 2-factor) evaluated in our sample also showed no satisfying fit to the data. Fit indices rather hint towards a 3-factor solution summarizing the subscales GF/ PF/ RA. In line, other studies (e.g. [12, 22, 30]) also could not fully replicate the five factors of the MFI-20 and summarized a GF/ PF-factor. However, the model fit indices are not unequivocal since the Chi2 values surmount the degrees of freedom and were significant at p < .05. One reason for the lack of a reliable factor structure may be the high intercorrelations between the fatigue subscales as formerly shown in different patient samples (e.g. radiotherapy patients/ chronic fatigue patients/ psychology or medical students/ army recruits [12]; cancer patients [31, 32]; haemodialysis patients [26]). In the present study, the strong associations (r ≥ .70) between the subscales GF, PF and RA hint toward a large amount of shared variance. However, the 3-factor model did not reveal satisfying model fit indices as well. Regarding the high intercorrelations particularly between the subscales GF, PF, RA and the unclear factorial structure [20], the MFI-total is a more valid score for fatigue than the single MFI-subscale scores.

Consequently, a low validity of the MFI-20 has to be supposed in CCI patients. The latter has been already shown in a similar setting investigating fatigue in chronically ill haemodialysis patients [26]. Similar reasons for the lacking reliability of the factor structure can be supposed in our sample, including comprehension difficulties and a high relation of item content to the multimorbid as well as potentially life-threatening health status in the present study population. In this context, the item content of some items seems to be not relevant or inappropriate (e.g. item 6 „I think I do a lot in a day.“). In the aftermath of the ICU treatment, CCI patients are mainly involved with the recovery from the serious physical illness and their rehabilitation process [33]. The item content is therefore primarily attributed to the chronic morbidity already present in the forefront of the ICU treatment or following ICU and not to fatigue per se [26]. In line, patients particularly pay attention to the physical aspects of fatigue (e.g. vitality, activity) rather than mental complaints (e.g. concentration, mood difficulties). This is supported by significantly higher values for physical fatigue than for mental fatigue at both t2 and t3 in our CCI patients. Moreover, fatigue was predominantly related to somatic or medical reasons (e.g. efforts during rehabilitation, musculoskeletal deconditioning and general weakness following long duration of immobility, sleep disorders, pain, dyspnoea, dialysis, medication).

The models which were examined by us, only partly agreed with the criteria for convergent and discriminant validity suggested by Hair et al. [27]. Good agreement could be achieved for the subscale MF, poor agreement was obvious for RM. Above, significant correlations between fatigue and related constructs (e.g. health-related quality of life) could be shown. This has been already proven in other samples (e.g. Chronic Fatigue Syndrome, ovarian carcinoma patients) using different measurements (e.g. Short Form-36, SF-36) (see [23, 34]). Above, a significant association with depression has been shown in our sample of CCI patients and corroborates former findings where recurrent or chronic fatigue has been turned out to be a risk factor for major depression and, vice versa [35, 36]. In line, a high rate of psychiatric comorbidity has been reported in patients with chronic fatigue syndrome [11].

Our results should be interpreted in the context of the methodological strengths and limitations. The main strengths of the present study includes the investigation of a homogeneous sample of CCI patients, a prospective-longitudinal design, the long-term assessment up to six months following ICU treatment, the application of the SCID-I, and the oral data assessment via direct interviewing. The latter ensured that no missing values were produced. However, the present results should be cautiously evaluated. The kind of medical interventions or rehabilitation programmes (e.g. physiotherapy, neuropsychological interventions, ergotherapy) taking place between the assessment time points (t2 and t3) were not controlled for and might have compromised the comparability of fatigue assessment. Likewise, in the present study the time period between the assessment points was too long (three months) in order to gain an appropriate estimate for the reproducibility of the fatigue measurement over time or to calculate test-retest reliability. Although three months represents a commonly accepted interval for measuring the rate of stress-related disorders following ICU [16], future studies should apply a much shorter time frame e.g. between three and seven days when the test-retest reliability of patient-reported outcome measurements is of interest (for a systematic review see [37]). Information about the pre-ICU severity of fatigue could not be obtained, therefore the severity of fatigue found in the present study cannot be attributed to the ICU treatment as causative factor. Moreover, the rate of non-participants, although similar to other studies on patient samples after long-term mechanical ventilation (e.g. [3, 38, 39]), was quite high. The sample size of CCI patients was small. Therefore, conclusions based on the present sample should be drawn with utmost caution. Future studies are needed to replicate the results of this present study in a larger sample of CCI patients, additionally using other fatigue measures such as the Chalder Fatigue Questionnaire in order to ascertain convergent validity. Furthermore, the diagnosis of major depression should be assessed using DSM-V criteria [40]. Above, future studies should also address the minimal clinically important differences of the MFI-20 and its subscales in CCI patients (for a systematic review see [41]_ENREF_[35]).


To conclude, the present study shows that the MFI-20 cannot be regarded as valid instrument for use in clinical practice and research, measuring fatigue as multidimensional construct following ICU treatment in CCI patients. Particularly the subscale Reduced Motivation showed insufficient reliability as well as validity and should be interpreted with caution. The factorial structure of the original MFI-20 could not be unequivocally approved. Our data rather hint towards a 3-factor factor solution combining General -, Physical Fatigue and Reduced Activity. Because of the unclear factorial structure, the MFI-20 cannot be considered as appropriate tool for the assessment of fatigue in CCI patients following ICU treatment. The present results demand replication in a larger sample of CCI patients. Future research according to clinical important changes of fatigue symptoms in these patients are needed using anchor-based responsiveness. Above, future research should investigate the appropriateness of the MFI-sum score in CCI patients using Rasch analysis.



Adjusted Goodness of Fit Index


Average Shared Variance


Average Variance Extracted




Cognitive Assessment for the Intensive Care Unit


Chronic Critical Illness


Confirmatroy Factory Analysis


Critical Illness Myopathy


Critical Illness Polyneuropathy


Composite Reliability


Diagnostic and Statistical Manual of mental disorders

EQ-5D-3 L:

Euro-Quality of Life;


General Fatigue


Goodness of Fit Index


Intensive Care Unit


Mental Fatigue


Multidimensional Fatigue Inventory


Maximum Shared Variance


Principal Component Analysis


Physical Fatigue


Reduced Activity


Reduced Motivation


Structured Clinical Interview for the DSM-IV


Short Form


  1. Chaboyer W, Grace J. Following the path of ICU survivors: a quality-improvement activity. Nurs Crit Care. 2003;8(4):149–55.

    Article  PubMed  Google Scholar 

  2. Steenbergen S, Rijkenberg S, Adonis T, Kroeze G, van Stijn I, Endeman H. Long-term treated intensive care patients outcomes: the one-year mortality rate, quality of life, health care use and long-term complications as reported by general practitioners. BMC Anesthesiol. 2015;15:142.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Spadaro S, Capuzzo M, Valpiani G, et al. Fatigue in intensive care survivors one year after discharge. Health Qual Life Outcomes. 2016;14(1):148.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Nelson JE, Cox CE, Hope AA, Carson SS. Chronic critical illness. Am J Respir Crit Care Med. 2010;182(4):446–54.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Thomas DC, Kreizman IJ, Melchiorre P, Ragnarsson KT. Rehabilitation of the patient with chronic critical illness. Crit Care Clin. 2002;18(3):695–715.

    Article  PubMed  Google Scholar 

  6. Benjamim CF, Hogaboam CM, Kunkel SL. The chronic consequences of severe sepsis. J Leukoc Biol. 2004;75(3):408–12.

    Article  CAS  PubMed  Google Scholar 

  7. Fukuda K, Straus SE, Hickie I, Sharpe MC, Dobbins JG, Komaroff A. The chronic fatigue syndrome: a comprehensive approach to its definition and study. International Chronic Fatigue Syndrome Study Group. Ann Intern Med. 1994;121(12):953–9.

    Article  CAS  PubMed  Google Scholar 

  8. Carruthers BM, van de Sande MI, De Meirleir KL, et al. Myalgic encephalomyelitis: international consensus criteria. J Intern Med. 2011;270(4):327–38.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Jaenichen D, Brunkhorst FM, Strauss B, Rosendahl J. Physical and mental long-term sequelae following intensive care of severe sepsis in patients and relatives. Psychother Psychosom Med Psychol. 2012;62(9–10):335–43.

    PubMed  Google Scholar 

  10. Fuller-Thomson E, Nimigon J. Factors associated with depression among individuals with chronic fatigue syndrome: findings from a nationally representative survey. Fam Pract. 2008;25(6):414–22.

    Article  PubMed  Google Scholar 

  11. Nater UM, Lin JM, Maloney EM, et al. Psychiatric comorbidity in persons with chronic fatigue syndrome identified from the Georgia population. Psychosom Med. 2009;71(5):557–65.

    Article  PubMed  Google Scholar 

  12. Smets EM, Garssen B, Bonke B, De Haes JC. The multidimensional fatigue inventory (MFI) psychometric qualities of an instrument to assess fatigue. J Psychosom Res. 1995;39(3):315–25.

    Article  CAS  PubMed  Google Scholar 

  13. Hashem MD, Nallagangula A, Nalamalapu S, et al. Patient outcomes after critical illness: a systematic review of qualitative studies following hospital discharge. Crit Care. 2016;20(1):345.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ely EW, Margolin R, Francis J, et al. Evaluation of delirium in critically ill patients: validation of the confusion assessment method for the intensive care unit (CAM-ICU). Crit Care Med. 2001;29(7):1370–9.

    Article  CAS  PubMed  Google Scholar 

  15. Klugkist M, Sedemund-Adib B, Schmidtke C, Schmucker P, Sievers HH, Huppe M. Confusion assessment method for the intensive care unit (CAM-ICU): diagnosis of postoperative delirium in cardiac surgery. Anaesthesist. 2008;57(5):464–74.

    Article  CAS  PubMed  Google Scholar 

  16. Wintermann GB, Brunkhorst FM, Petrowski K, et al. Stress disorders following prolonged critical illness in survivors of severe sepsis. Crit Care Med. 2015;43(6):1213–22.

    Article  PubMed  Google Scholar 

  17. O’Sullivan SB, Schmitz TJ. Physical rehabilitation. Philadelphia, PA: F.A. Davis; 2007.

    Google Scholar 

  18. Rabin R, de Charro F. EQ-5D: a measure of health status from the EuroQol group. Ann Med. 2001;33(5):337–43.

    Article  CAS  PubMed  Google Scholar 

  19. Wittchen H-U, Zaudig M, Fydrich T. Strukturiertes Klinisches Interview für DSM-IV. : Göttingen: Hogrefe; 1997.

  20. Schwarz R, Krauss O, Hinz A. Fatigue in the general population. Onkologie. 2003;26(2):140–4.

    CAS  PubMed  Google Scholar 

  21. Hinz A, Kohlmann T, Stobel-Richter Y, Zenger M, Brahler E. The quality of life questionnaire EQ-5D-5L: psychometric properties and normative values for the general German population. Qual Life Res. 2014;23(2):443–7.

    Article  PubMed  Google Scholar 

  22. Elbers RG, van Wegen EE, Verhoef J, Kwakkel G. Reliability and structural validity of the multidimensional fatigue inventory (MFI) in patients with idiopathic Parkinson's disease. Parkinsonism Relat Disord. 2012;18(5):532–6.

    Article  PubMed  Google Scholar 

  23. Lin JM, Brimmer DJ, Maloney EM, Nyarko E, Belue R, Reeves WC. Further validation of the Multidimensional Fatigue Inventory in a US adult population sample. Popul Health Metr. 2009;7:18.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10.

    Article  CAS  PubMed  Google Scholar 

  25. Hu LT, Bentler PM. Evaluating model fit. In: Hoyle R, editor. Structural equation modeling: concepts, issues, and applications. Thousand Oaks: Sage; 1995. p. 76–99.

    Google Scholar 

  26. Chilcot J, Guirguis A, Friedli K, et al. Measuring fatigue using the multidimensional fatigue Inventory-20: a questionable factor structure in Haemodialysis patients. Nephron. 2017;136(2):121–6.

    Article  CAS  PubMed  Google Scholar 

  27. Hair J, Black W, Babin B, Anderson R. Multivariate data analysis. Upper Saddle River: Prentice-Hall, Inc; 2010.

    Google Scholar 

  28. Singer S, Kuhnt S, Zwerenz R, et al. Age- and sex-standardised prevalence rates of fatigue in a large hospital-based sample of cancer patients. Br J Cancer. 2011;105(3):445–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Dencker A, Sunnerhagen KS, Taft C, Lundgren-Nilsson A. Multidimensional fatigue inventory and post-polio syndrome - a Rasch analysis. Health Qual Life Outcomes. 2015;13:20.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Fillion L, Gelinas C, Simard S, Savard J, Gagnon P. Validation evidence for the French Canadian adaptation of the multidimensional fatigue inventory as a measure of cancer-related fatigue. Cancer Nurs. 2003;26(2):143–54.

    Article  PubMed  Google Scholar 

  31. Hagelin CL, Wengstrom Y, Runesdotter S, Furst CJ. The psychometric properties of the Swedish multidimensional fatigue inventory MFI-20 in four different populations. Acta Oncol. 2007;46(1):97–104.

    Article  PubMed  Google Scholar 

  32. Buss T, Kruk A, Wisniewski P, Modlinska A, Janiszewska J, Lichodziejewska-Niemierko M. Psychometric properties of the polish version of the multidimensional fatigue Inventory-20 in cancer patients. J Pain Symptom Manage. 2014;48(4):730–7.

    Article  PubMed  Google Scholar 

  33. Myhren H, Ekeberg O, Toien K, Karlsson S, Stokland O. Posttraumatic stress, anxiety and depression symptoms in patients during the first year post intensive care unit discharge. Crit Care. 2010;14(1):R14.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Holzner B, Kemmler G, Meraner V, et al. Fatigue in ovarian carcinoma patients: a neglected issue? Cancer. 2003;97(6):1564–72.

    Article  PubMed  Google Scholar 

  35. Addington AM, Gallo JJ, Ford DE, Eaton WW. Epidemiology of unexplained fatigue and major depression in the community: the Baltimore ECA follow-up, 1981-1994. Psychol Med. 2001;31(6):1037–44.

    Article  CAS  PubMed  Google Scholar 

  36. McCann K, Boore JR. Fatigue in persons with renal failure who require maintenance haemodialysis. J Adv Nurs. 2000;32(5):1132–42.

    Article  CAS  PubMed  Google Scholar 

  37. Paiva CE, Barroso EM, Carneseca EC, et al. A critical analysis of test-retest reliability in instrument validation studies of cancer patients under palliative care: a systematic review. BMC Med Res Methodol. 2014;14:8.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Jubran A, Lawm G, Duffner LA, et al. Post-traumatic stress disorder after weaning from prolonged mechanical ventilation. Intensive Care Med. 2010;36(12):2030–7.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Chiumello D, Taccone P, Berto V, et al. Long-term outcomes in survivors of acute respiratory distress syndrome ventilated in supine or prone position. Intensive Care Med. 2012;38(2):221–9.

    Article  CAS  PubMed  Google Scholar 

  40. American Psychiatric Association A. Diagnostic and statistical manual of mental disorders: DSM-5 TM. 5th ed. Arlington, VA: American Psychiatric Association; 2013.

    Book  Google Scholar 

  41. Nordin A, Taft C, Lundgren-Nilsson A, Dencker A. Minimal important differences for fatigue patient reported outcome measures-a systematic review. BMC Med Res Methodol. 2016;16:62.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We would like to kindly thank Stefan Rueckriem, Sara Wuestemann, Clara L. Buck, Christine Schier, and Corinna Klotzsche for their dedicated support in patient enrollment and data assessment. We also thank Stefan Rueckriem, Prof. Frank Oehmichen and Prof. Marcus Pohl for supervising data acquisition and contributing their expertise for the improvement of the present study design.


This study was supported by the German Federal Ministry of Education and Research grant 01EO1002.

Availability of data and materials

Data sharing consent was not obtained. Requests for data access may be sent to the corresponding author and will be judged individually with regard to the Data Protection Act of Saxony.

Author information

Authors and Affiliations



GBW performed the statistical analyses and wrote the manuscript. JR designed the study protocol and proof-read the manuscript. KW, BS, AH and KP contributed to the study conception and improved the final version of the manuscript. KP supervised the enrollment of the patients. All authors meet all authorship requirements, read and approved the final manuscript version.

Corresponding author

Correspondence to Gloria-Beatrice Wintermann.

Ethics declarations

Ethics approval and consent to participate

All participants gave written informed consent for study participation. The study was conducted in accordance with the Declaration of Helsinki. The study protocol was approved by the Ethics Committee (No 3278–10/11) of the Friedrich-Schiller-University, Jena, Germany.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S3. MFI-20 models tested via Confirmatory Factor Analysis (CFA) in n = 91 patients six months post-ICU (t3). (DOCX 16 kb)

Additional file 2:

Table S1. Medical comorbidities of the patients being followed-up (n = 113) and the non-participants (n = 239). Patients who could not be included or followed-up for different reasons were referred to as non-participants. (DOCX 17 kb)

Additional file 3:

Table S2. Correlation coefficients of the MFI-20 items with the subscales. The item-subscale correlation coefficients are significant with **p < .01 or ***p < .001. (DOCX 15 kb)

Additional file 4:

Figure S1. Bland and Altman plot comparing the five subscales and the total score of the MFI-20 between t2 (three months post-ICU) and t3 (six months post-ICU). Bold lines represent the mean differences, dotted lines represent the 95% limits of agreement (see Additional files). Differences and mean values were created using MFI-20 scores of t2 and t3. (JPEG 2516 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wintermann, GB., Rosendahl, J., Weidner, K. et al. Fatigue in chronically critically ill patients following intensive care - reliability and validity of the multidimensional fatigue inventory (MFI-20). Health Qual Life Outcomes 16, 37 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: