- Open Access
Psychometric validation of the Chinese version of the Short Inflammatory Bowel Disease Questionnaire and evaluation of its measurement invariance across sex
Health and Quality of Life Outcomes volume 19, Article number: 253 (2021)
This study aimed to evaluate the psychometric properties of the Chinese version of the Short Inflammatory Bowel Disease Questionnaire (C-SIBDQ), and its measurement invariance across sex in Chinese patients with inflammatory bowel disease (IBD).
Between September 2018 and July 2021, 284 patients with IBD were recruited from a spleen and stomach clinic. All participants completed the C-SIBDQ, 12-item Short-Form Health Survey (SF-12), nine-item Patient Health Questionnaire Depression Scale (PHQ-9), and the seven-item Generalized Anxiety Disorder Scale (GAD-7). Floor and ceiling effects were evaluated by testing frequencies and composition ratios for the minimum and maximum C-SIBDQ scores. Exploratory and confirmatory factor analysis (CFA) were used to evaluate the C-SIBDQ’s factor structure and construct validity. Convergent validity was evaluated through examining bivariate correlations between the C-SIBDQ and the SF-12, PHQ-9, and GAD-7. Internal consistency reliability and retest reliability were evaluated by respectively calculating the Cronbach’s α and the intraclass correlation coefficient (ICC) among a subsample (n = 79) after 2 weeks. The measurement invariance across sex was evaluated through multiple-group CFA.
The C-SIBDQ scores showed no floor or ceiling effects and had a single-factor structure and good convergent validity, with significant correlations with the SF-12, PHQ-9 and GAD-7. Good internal consistency (Cronbach’s α = 0.920) and test–retest reliability (ICC = 959) were observed. The C-SIBDQ also showed measurement invariance across sex, and females showed higher C-SIBDQ scores than males.
The C-SIBDQ has high reliability, validity, and stability across sex, and can be used in clinics to assess the health-related quality of life of patients with IBD.
Inflammatory bowel disease (IBD) represents a group of chronic nonspecific intestinal inflammatory diseases with unclear etiology, including ulcerative colitis (UC) and Crohn’s disease (CD) . In China, there were approximately 350,000 IBD cases in total between 2005 and 2014; however, according to the Chinese Center for Disease Control and Prevention, this figure is expected to reach 1.5 million by 2025 . IBD is characterized as a long-term disease with difficult treatment and easy susceptibility to relapse; it also imposes serious economic pressure, medical burden, and psychological burden for patients, their family caregivers, and society , and has become a major public health problem that needs to be solved urgently.
Health-related quality of life (HRQoL) refers to one’s perception of their physical and mental well-being and the effect of disease and/or its treatment on this perception . Recently, an obvious trend has been observed regarding the more frequent usage of patient HRQoL assessments as outcome indicators for chronic conditions, including IBD . A previous systematic review demonstrated that the Inflammatory Bowel Disease Questionnaire (IBDQ) developed by Guyatt et al. , which is widely used to assess disease-specific HRQoL among patients with IBD, is one of the most reliable and valid evaluation tools in this regard. The IBDQ comprises 32 items and four dimensions: intestinal symptoms, systemic symptoms, social function, and emotional function . The IBDQ has been psychometrically validated among Chinese populations . However, it should be noted that, IBDQ administration is relatively time-consuming (approximately 20 min) due to its large number of items, ; thus, respondents are prone to fatigue and poor compliance. From the perspective of survey efficiency, questionnaire-design experts generally believe that short, simple, and easy-to-answer questionnaires are most appropriate, as they can reduce the burden on the respondents and surveyors .
In an attempt to improve the efficiency of the survey and reduce its burden, Irvine et al.  developed a short version of the IBDQ (known as the “SIBDQ”); consequently, they found that the SIBDQ has high reliability and validity. Patients who are fully competent in terms of reading and writing English can complete the SIBDQ in approximately 5 min. The SIBDQ comprises 10 items and contains the same four dimensions as the IBDQ ; thus, it can rapidly evaluate the physical, social, and emotional status in patients with IBD. It has been widely used worldwide, both in clinical practice and academic research. However, further cross-cultural verification and psychometric validation is required in non-English speaking countries, since SIBDQ was originally developed in English. German  and Portuguese  versions of SIBDQ have been successfully verified and have reported good reliability and validity. However, to the best of our knowledge, there has been no psychometric validation of the Chinese version of the SIBDQ (C-SIBDQ). A previous China-based study used the SIBDQ to investigate the HRQoL of patients with IBD during the Coronavirus Disease 2019 pandemic . It is not feasible to use the C-SIBDQ without conducting a psychometric validation, as it is not clear whether its assessment of HRQoL among Chinese populations is valid and credible. Verification of the C-SIBDQ would help to ensure the standardization of assessment results and make them more scientific and reliable.
Measurement invariance is an important index that reflects the validity of a scale; it refers to measuring whether a tool has the same structure and/or meaning across different groups (e.g., between males and females) or across different time points . Previous studies have shown statistically significant differences between males and females regarding SIBDQ scores . However, it is not clear whether this difference is due to sex or the structure of the measurement instrument; moreover, existing studies have not examined the measurement invariance of the SIBDQ across sex. In order to accurately compare the measurement results of a self-report tool across multiple groups, it is essential to establish the cross-group measurement invariance of the scale . Exploring the measurement invariance of the SIBDQ would be beneficial for improving the accuracy of HRQoL assessments of patients with IBD and ensuring comparability between subgroups.
Thus, the objectives of the present study are to evaluate, in the context of Chinese patients with IBD, the psychometric properties of the C-SIBDQ and its measurement invariance across sex.
From September 2018 to July 2021, a total of 284 patients with IBD were recruited from a spleen and stomach clinic in Jinan, China. Convenience sampling was used to select participants from among the patients attending the clinic. The inclusion criteria were: (1) clinically diagnosed with IBD; (2) aged 18 years or older; (3) willing to voluntarily participate in the survey (after being fully appraised of the study content and goals); and (4) able to read and understand all of the questionnaire content. The exclusion criterion was having a severe cognitive impairment such as dementia. Two weeks after the initial sample had completed the study questionnaire, 78 members of the sample completed the C-SIBDQ retest.
This study was ethically approved by the Medical Ethics Committee of the Affiliated Hospital of Shandong University of Traditional Chinese Medicine (Identification Code: 2017-010-KY). All participants provided written informed consent.
Demographic and clinical characteristics questionnaire
The participants were administered a questionnaire that collected demographic and clinical characteristics, including age, sex, residence, education level, marital status, type of IBD, and disease activity.
The Mayo score  was used to assess the disease activity of UC. The Mayo score included four clinical indicators: bowel frequency, rectal bleeding, endoscopy results, and overall doctor evaluation. Each indicator was rated using a 4-point Likert scale (0–3 points), and the total score ranged from 0 to 12. The scores were categorized as remission (0–2), mild (3–5), and moderate (6–10) and severe (11–12).
The Crohn's Disease Activity Index (CDAI)  was used to assess the disease activity of CD. CDAI comprised eight scoring indicators: the number of loose stools, the number of days of abdominal pain, general health, extraintestinal manifestations and complications, opioid antidiarrheals, abdominal masses, hematocrit reduction, and standard weight deviation. The total score was obtained by computing the weighted sum of all the eight indicators scores, ranging from 0 to 600 points. The CDAI scores were categorized as remission (< 150), mild (150–219), and moderate (220–450) and severe (> 450), respectively.
The original SIBDQ is a self-report questionnaire that comprises 10 items and measures four dimensions: bowel symptoms, emotional function, systemic symptoms, and social function. For each item, respondents are asked to indicate, in regard to the past 2 weeks, their IBD symptoms and the impact of IBD on their overall feelings and mood. Each item is scored using a 7-point Likert scale (ranging from 1 to 7), with total scores ranging from 10 to 70; the higher the score, the better the respondent’s HRQoL. In this study, considering that the Chinese version of the IBDQ has been verified and is widely used in China [20, 21], for the present investigation we directly used all 10 items from the existing C-SIBDQ, without further translation.
12-item Short-Form Health Survey
The 12-item Short-Form Health Survey (SF-12) is a universal assessment tool for quality of life , and has previously been found to have good reliability and validity among patients with IBD . The scale contains 12 items and two components: physical health and mental health. The total score is determined by converting the sum of the item scores to a 0–100-point range using a standardized method . The higher the score, the better the respondent’s quality of life. In this study, the Cronbach's α coefficient for this scale was 0.874. We used the SF-12 as an evaluation indicator of the convergent validity of the C-SIBDQ.
9-item Patient Health Questionnaire Depression Scale
The 9-item Patient Health Questionnaire Depression Scale (PHQ-9) is a tool for measuring depressive symptoms and was compiled based on the nine diagnostic criteria of depression stipulated in the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV) . It has been validated among patients with IBD . The scale contains nine items, and each item is scored using a 4-point Likert-type scale (0 = “nothing at all,” 3 = “every day”). The higher the score, the more severe the respondent’s depressive symptoms. For this study, the Cronbach’s α coefficient for the scale was 0.917. The PHQ-9 was used as an evaluation indicator of the convergent validity of the C-SIBDQ.
7-item Generalized Anxiety Disorder Scale
The 7-item Generalized Anxiety Disorder Scale (GAD-7) is a tool for measuring generalized anxiety disorder based on the diagnostic criteria for anxiety disorder stipulated in the DSM-IV . It has been validated among patients with IBD . The scale contains seven items, and each item is scored using a 4-point Likert-type scale (0 = “nothing at all,” 3 = “every day”). The higher the score, the more severe the respondent’s generalized anxiety disorder. For this study, the Cronbach’s α coefficient for this scale was 0.951. The GAD-7 was used as an evaluation indicator of the convergent validity of the C-SIBDQ.
SPSS version 25 (IBM SPSS Statistics, Armonk, NY, USA) was used to perform descriptive analysis, Student’s t-test, Pearson’s correlation analysis, exploratory factor analysis (EFA), and an internal consistency test. Mplus 7.0 was used to perform confirmatory factor analysis (CFA). Continuous variables, such as C-SIBDQ scores, were described using means ± standard deviations (SDs), and categorical variables, such as sex, were described using n (%).
Floor and ceiling effects were evaluated based on the frequencies and composition ratios of the minimum and maximum C-SIBDQ scores. Minimum and maximum composition ratios of less than 15%, respectively, were considered to indicate an absence of floor or ceiling effects . Reliability was evaluated using internal consistency and intraclass correlation coefficient (ICC) in the overall and retest sample, respectively. Validity evaluation was computed using construct and convergence validity. Construct validity was evaluated through factor analysis. We randomly divided the total sample into two groups: sample A (n = 142; 86 males, 56 females) and sample B (n = 142; 89 males, 53 females). Through a chi-squared test and Student’s t-test, we determined that sample A and B did not statistically significantly differ in terms of demographic characteristics, clinical characteristics, or C-SIBDQ scores. Sample A was used for EFA. The Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy and Bartlett’s test of sphericity were used to assess the EFA suitability of the C-SIBDQ data [30, 31]; when the KMO value is greater than 0.50 and Bartlett’s sphericity test is significant, the data are considered suitable for factor analysis . Sample B was used for CFA with maximum likelihood estimation. We used chi-square value/degrees of freedom (χ2/df), comparative fit index (CFI), the Tucker-Lewis index (TLI), and root mean square error of approximation (RMSEA) to evaluate the CFA model fit. A χ2/df of less than 5.000, CFI and TLI of greater than 0.900, and an RMSEA of less than 0.080 indicate good model fit .
Multiple-group CFA was used to assess the C-SIBDQ’s measurement invariance across sex. First, we constructed factor structure models of the C-SIBDQ for male (n = 175) and female (n = 109) samples, respectively. Second, we constructed a configural invariance model to evaluate whether the factor structures for the male and female samples were consistent. Third, we constructed a metric invariance model by limiting factor loading. Finally, on the basis of this limited factor loading, we constructed a scalar invariance model by limiting the intercept. If the changes in the fitting indices CFI (ΔCFI) and RMSEA (ΔRMSEA) from the configural invariance model to the metric invariance model to the scalar invariance model were less than 0.010 , this would indicate that the C-SIBDQ has measurement invariance. In this study, the significance level of all hypothesis tests was less than 0.05.
Participants’ demographic and clinical characteristics
The final sample included 284 patients with IBD (249 with UC, 35 with CD). Their mean age was 41.63 ± 11.88 years (range: 19–79 years). Most were male (61.6%), lived in urban areas (67.3%), had university or higher education level, and were married (88.0%). The average duration of disease was 5.54 ± 5.57 years. The mean scores for the SF-12, PHQ-9, and GAD-7 were 59.55 ± 20.27, 9.10 ± 6.25, and 7.05 ± 5.51, respectively. More details are shown in Table 1.
Score distribution of the C-SIBDQ
The mean, SD, skewness, kurtosis, and minimum and maximum values for the C-SIBDQ were 46.96, 12.52, − 0.60, − 0.24, 14.00, and 70.00, respectively. All skewness and kurtosis values were between − 1 and 1, indicating that the C-SIBDQ data in this study followed normal univariate distribution. The number of participants, who scored minimum and maximum values, were two and three, respectively, giving constituent ratios of 0.7% and 1.1%, respectively; this indicated that this scale had no floor or ceiling effects.
EFA and CFA were used to evaluate the construct validity of the C-SIBDQ. First, based on Sample A, we conducted the KMO test and Bartlett’s test of sphericity. The KMO value was 0.917, and the Bartlett’s test of sphericity statistic was 799.136 (P < 0.001); this indicated that the C-SIBDQ was suitable for factor analysis. Second, we conducted principal component analysis with orthogonal rotation on the 10 items; consequently, one factor was extracted with eigenvalues of 5.641, indicating that it explained 56.4% of the total variance. All items had factor loadings of 0.557 or larger on this factor (Table 2). Third, using the data for Sample B, we performed CFA on the single-factor structure obtained through EFA. The CFA results showed that the single-factor model had a good fit to the data (χ2/df = 28.759/22 = 1.307, RMSEA = 0.047, CFI = 0.993, TLI = 0.985). Therefore, according to the EFA and CFA results, the C-SIBDQ showed a single-factor structure.
The results of the Pearson’s correlation analysis illustrated that the C-SIBDQ is positively correlated with SF-12 scores (r = 0.790, P < 0.001), and negatively correlated with PHQ-9 (r = − 0.675, P < 0.001) and GAD-7 scores (r = − 0.628, P < 0.001).
Internal consistency and test–retest reliability
The internal consistency of the C-SIBDQ was good (Cronbach’s α was 0.920). High test–retest reliability (ICC = 0.959) was found for a sub-sample (n = 78) measured 2 weeks later.
Measurement invariance across sex
The measurement invariance of the C-SIBDQ across sex was also tested. The results of the CFA for the male and female samples showed that the model fit the data well, indicating that multiple-group CFA was appropriate. In addition, the fit indices of the configural invariance model, metric invariance model, and scalar invariance model were satisfactory. When comparing the metric invariance model with the configural invariance model and the scalar invariance model, both ΔRMSEA and ΔCFI were less than 0.010. This indicated that the C-SIBDQ has measurement invariance across sex. Table 3 shows the fit indices for all measurement invariance models and the inter-model differences.
Sex difference in the C-SIBDQ
The t-test results showed a statistically significant sex difference in C-SIBDQ scores (t = 2.591, P = 0.010), with females (mean = 49.39, SD = 12.07) showing higher scores than males (mean = 45.46, SD = 12.59).
This study verified the reliability and validity of the C-SIBDQ and its measurement invariance across sex. To the best of our knowledge, this study is the first to evaluate the psychometric properties and measurement invariance of the C-SIBDQ among Chinese patients with IBD. The results showed that the C-SIBDQ is suitable for this population and has good psychometric characteristics. This means that the C-SIBDQ is a relatively short and easy-to-use tool that community and clinical staff can utilize to assess the HRQoL of patients with IBD in China.
The C-SIBDQ showed no floor or ceiling effects, indicating sufficient reactivity and content validity . In addition, our study showed that a newly established single-factor structure is the most suitable factor structure for the C-SIBDQ; the C-SIBDQ is inconsistent with the four-factor structure of the original SIBDQ  and the versions in other languages . The previous four-factor structure comprised intestinal symptoms, social function, emotional function, and systemic symptoms, which provides a detailed reflection of HRQoL for certain populations . However, from the perspective of psychosomatic medicine, the physical and psychological conditions of a disease are interrelated and influenced by social culture . This means that social culture may induce factor differences. For example, although both the original version of the SIBDQ and the Portuguese version contain a four-factor structure [11, 13], there are differences between the two versions regarding which items are assigned to each factor; this is because differing languages and cultural backgrounds can affect patients’ understanding of the SIBDQ . The difference in the factor structure does not affect the application of the C-SIBDQ because all 10 items showed good factor loadings and were retained. Moreover, the score for the C-SIBDQ was determined to be related to the scores for the SF-12, PHQ-9, and GAD-7, showing good convergent validity; this finding is similar to those of previous validation studies . Further, a previous study found that the active inflammatory state of IBD intersects with the pathobiology of depression and anxiety , while a questionnaire-based study also found depression and anxiety to relate to the disease activity of IBD and the HRQoL of patients with IBD . Both of these studies support our above-mentioned results concerning convergent validity.
Our research showed that the C-SIBDQ’s Cronbach’s α and ICC were 0.920 and 0.959, respectively, indicating good internal consistency reliability and test–retest reliability. This result is similar to those for the Portuguese and German versions [12, 13]. In the verification of the Portuguese version, Cronbach’s α and ICC were both determined to be 0.80, while in the German version, the SIBDQ’s Cronbach’s α and ICC were determined to be 0.84 and 0.60, respectively. However, compared to these findings, the C-SIBDQ reports higher reliability. In addition, the reliability measurement result for the C-SIBDQ is consistent with the levels observed in previous China-based surveys that used other evaluation tools (e.g., the IBDQ) .
We verified the newly established single-factor structure’s measurement invariance (configuration, metric, scalar) for male and female samples. The results showed that the C-SIBDQ can reliably account for differences between male and female patients. This is certainly important, because it has previously been reported that there are differences in SIBDQ scores between sex groups . Our study also showed that females (mean = 49.39, SD = 12.07) have higher C-SIBDQ scores than males (mean = 45.46, SD = 12.59). Therefore, measurement invariance across sex was supported; that is, the difference in C-SIBDQ scores reflected the true difference between males and females.
This verification of the C-SIBDQ has important practical significance for patients with IBD, caregivers, and medical staff. First, its low number of items and easy-to-use format can afford quick evaluations of the HRQoL of patients with IBD in clinical environments, which can help medical staff understand patients’ problems and implement targeted treatment. Second, in relation to clinical intervention research, researchers can use it to evaluate the effectiveness of treatment or nursing measures. Third, patients can use the C-SIBDQ for self-assessment in order to determine their own HRQoL and whether they require medical attention. Caregivers can also provide diet management and psychological support to patients based on the results of the C-SIBDQ.
This study has the following limitations. First, we only recruited participants from one clinic; thus, the results may not be representative of all patients with IBD in China. In the future, a more representative sample should be used to verify the reliability and validity of the C-SIBDQ. Second, the number of UC and CD patients in our study was not balanced; we did not group these for verification because there were fewer CD patients than UC patients. In the future, in order to verify the applicability of the C-SIBDQ for UC and CD populations, respectively, and compare the two, it will be necessary to collect larger sample sizes from multiple clinics. Third, this study did not investigate other disease-related factors affecting HRQoL among patients with IBD, such as the course of disease, comorbidities, and extraintestinal diseases. These disease factors should be considered when evaluating HRQoL in IBD patients in the future.
Implications for future studies
With the development of clinimetrics, more and more scholars believe psychometrics is restricted as it: evaluates component homogeneity, directs insufficient attention towards sensitivity, and insufficiently evaluates clinical utility [38,39,40]. Clinimetrics aims to evaluate the sensitivity, clinical utility and validity of the patient-related scale from a clinical perspective [41, 42]. The systematic review suggests that psychometrics and clinical measurement should be combined in the development and verification of patient -related scales to make sure that the scales are credible and effective in both psychological and clinical measurements . Therefore, future studies need to explore the clinical characteristics of C-SIBDQ further. Future researchers can analyze the sensitivity of C-SIBDQ and its ability to distinguish between the HRQoL of patients with different disease characteristics (such as disease course), according to the clinimetric criteria for patient-reported outcome measures . In addition, the predictive effectiveness of C-SIBDQ—whether it can predict the future development or recurrence of IBD—can also be examined.
The C-SIBDQ has high reliability, validity, and stability across sex, and can be used in clinics as a tool for assessing the quality of life of patients with IBD.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author via email.
Inflammatory bowel disease
Health-related quality of life
Inflammatory Bowel Disease Questionnaire
Short Inflammatory Bowel Disease Questionnaire
Chinese version of the Short Inflammatory Bowel Disease Questionnaire
12-Item Short-Form Health Survey
9-Item Patient Health Questionnaire Depression Scale
7-Item Generalized Anxiety Disorder Scale
Intraclass correlation coefficient
Exploratory factor analysis
Confirmatory factor analysis
Comparative fit index
Root mean square error of approximation
Xavier RJ, Podolsky DK. Unravelling the pathogenesis of inflammatory bowel disease. Nature. 2007;448:427–34.
Kaplan GG. The global burden of IBD: from 2015 to 2025. Nat Rev Gastroenterol Hepatol. 2015;12:720–7.
Collaborators GIBD. The global, regional, and national burden of inflammatory bowel disease in 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet Gastroenterol Hepatol. 2020;5:17–30.
Ebrahim S. Clinical and public health perspectives and applications of health-related quality of life measurement. Soc Sci Med. 1995;41:1383–94.
Maunder RG, Cohen Z, McLeod RS, Greenberg GR. Effect of intervention in inflammatory bowel disease on health-related quality of life: a critical review. Dis Colon Rectum. 1995;38:1147–61.
Chen XL, Zhong LH, Wen Y, Liu TW, Li XY, Hou ZK, Hu Y, Mo CW, Liu FB. Inflammatory bowel disease-specific health-related quality of life instruments: a systematic review of measurement properties. Health Qual Life Outcomes. 2017;15:177.
Guyatt G, Mitchell A, Irvine EJ, Singer J, Williams N, Goodacre R, Tompkins C. A new measure of health status for clinical trials in inflammatory bowel disease. Gastroenterology. 1989;96:804–10.
Leong RW, Lee YT, Ching JY, Sung JJ. Quality of life in Chinese patients with inflammatory bowel disease: validation of the Chinese translation of the Inflammatory Bowel Disease Questionnaire. Aliment Pharmacol Ther. 2003;17:711–8.
Irvine EJ, Feagan BG, Wong CJ. Does self-administration of a quality of life index for inflammatory bowel disease change the results? J Clin Epidemiol. 1996;49:1177–85.
Schaeffer NC, Presser S. The science of asking questions. 2003; 29:65–88.
Irvine EJ, Zhou Q, Thompson AK. The Short Inflammatory Bowel Disease Questionnaire: a quality of life instrument for community physicians managing inflammatory bowel disease. CCRPT Investigators. Canadian Crohn’s Relapse Prevention Trial. Am J Gastroenterol. 1996;91:1571–8.
Rose M, Fliege H, Hildebrandt M, Körber J, Arck P, Dignass A, Klapp B. Validation of the new German translation version of the “Short Inflammatory Bowel Disease Questionnaire” (SIBDQ). Z Gastroenterol. 2000;38:277–86.
Roseira J, Sousa HT, Marreiros A, Contente LF, Magro F. Short Inflammatory Bowel Disease Questionnaire: translation and validation to the Portuguese language. Health Qual Life Outcomes. 2021;19:59.
Yu M, Ye Z, Chen Y, Qin T, Kou J, Tian D, Xiao F. Questionnaire assessment helps the self-management of patients with inflammatory bowel disease during the outbreak of Coronavirus Disease 2019. Aging (Albany NY). 2020;12:12468–78.
Putnick DL, Bornstein MH. Measurement invariance conventions and reporting: the state of the art and future directions for psychological research. Dev Rev. 2016;41:71–90.
Zhang CK, Hewett J, Hemming J, Grant T, Zhao H, Abraham C, Oikonomou I, Kanakia M, Cho JH, Proctor DD. The influence of depression on quality of life in patients with inflammatory bowel disease. Inflamm Bowel Dis. 2013;19:1732–9.
Meredith W. Measurement invariance, factor analysis and factorial invariance. Psychometrika. 1993;58:525–43.
Schroeder KW, Tremaine WJ, Ilstrup DM. Coated oral 5-aminosalicylic acid therapy for mildly to moderately active ulcerative colitis. A randomized study. N Engl J Med. 1987;317:1625–9.
Best WR, Becktel JM, Singleton JW, Kern F Jr. Development of a Crohn’s disease activity index. National Cooperative Crohn’s Disease Study. Gastroenterology. 1976;70:439–44.
Ren WH, Lai M, Chen Y, Irvine EJ, Zhou YX. Validation of the mainland Chinese version of the Inflammatory Bowel Disease Questionnaire (IBDQ) for ulcerative colitis and Crohn’s disease. Inflamm Bowel Dis. 2007;13:903–10.
Fu H, Kaminga AC, Peng Y, Feng T, Wang T, Wu X, Yang T. Associations between disease activity, social support and health-related quality of life in patients with inflammatory bowel diseases: the mediating role of psychological symptoms. BMC Gastroenterol. 2020;20:11.
Ware J Jr, Kosinski M, Keller SD. A 12-Item Short-Form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care. 1996;34:220–33.
Burisch J, Weimers P, Pedersen N, Cukovic-Cavka S, Vucelic B, Kaimakliotis I, Duricova D, Bortlik M, Shonova O, Vind I, et al. Health-related quality of life improves during one year of medical and surgical treatment in a European population-based inception cohort of patients with Inflammatory Bowel Disease: an ECCO-EpiCom study. J Crohns Colitis. 2014;8:1030–42.
Ware JE, Keller SD, Kosinski M. SF-12: How to score the SF-12 physical and mental health summary scales. Boston: Health Institute, New England Medical Center; 1995.
Levis B, Benedetti A, Thombs BD. Accuracy of Patient Health Questionnaire-9 (PHQ-9) for screening to detect major depression: individual participant data meta-analysis. BMJ. 2019;365:l1476.
Litster B, Bernstein CN, Graff LA, Walker JR, Fisk JD, Patten SB, Bolton JM, Sareen J, El-Gabalawy R, Marrie RA. Validation of the PHQ-9 for suicidal ideation in persons with inflammatory bowel disease. Inflamm Bowel Dis. 2018;24:1641–8.
Spitzer RL, Kroenke K, Williams JB, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Internal Med. 2006;166:1092–7.
Bernstein CN, Zhang L, Lix LM, Graff LA, Walker JR, Fisk JD, Patten SB, Hitchon CA, Bolton JM, Sareen J, et al. The validity and reliability of screening measures for depression and anxiety disorders in inflammatory bowel disease. Inflamm Bowel Dis. 2018;24:1867–75.
Liu QM, Wang LJ. t-Test and ANOVA for data with ceiling and/or floor effects. Behav Res Methods. 2021;53:264–77.
Williams B, Onsman A, Brown T. Exploratory factor analysis: a five-step guide for novices. Australas J Paramed. 2010; 8.
Hair JF, Black WC, Babin BJ, Anderson RE, Tatham RL. Multivariate data analysis. Upper Saddle River: Prentice Hall; 1998.
Measuring Model Fit. http://davidakenny.net/cm/fit.htm.
Widaman KF, Reise SP. Exploring the measurement invariance of psychological instruments: applications in the substance use domain. In: The science of prevention: methodological advances from alcohol and substance abuse research. Washington, DC: American Psychological Association; 1997. p. 281–324.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.
Fava GA, Sonino N, Wise TNJK. The psychosomatic assessment: strategies to improve clinical practice. Adv Psychosom Med. 2012;32:1–18.
Bernstein CN. Psychological stress and depression: risk factors for IBD? Dig Dis. 2016;34:58–63.
Cao Q, Huang YH, Jiang M, Dai C. The prevalence and risk factors of psychological disorders, malnutrition and quality of life in IBD patients. Scand J Gastroenterol. 2019;54:1458–66.
Carrozzino D, Patierno C, Guidi J, Berrocal Montiel C, Cao J, Charlson ME, Christensen KS, Concato J, De Las CC, de Leon J, et al. Clinimetric criteria for patient-reported outcome measures. Psychother Psychosom. 2021;90:222–32.
Fava GA, Rafanelli C, Tomba E. The clinical process in psychiatry: a clinimetric approach. J Clin Psychiatry. 2012;73:177–84.
Fava GA, Ruini C, Rafanelli C. Psychometric theory is an obstacle to the progress of clinical research. Psychother Psychosom. 2004;73:145–8.
Fava GA, Tomba E, Sonino N. Clinimetrics: the science of clinical measurements. Int J Clin Pract. 2012;66:11–5.
Bech P. Clinical psychometrics. Hoboken: Wiley; 2012.
Tomba E, Bech P. Clinimetrics and clinical psychometrics: macro- and micro-analysis. Psychother Psychosom. 2012;81:333–43.
We thank all the participants recruited in this study.
The study was supported by the National Natural Science Foundation of China (81673969). The funders did not participate in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Ethics approval and consent to participate
This study was ethically approved by the Medical Ethics Committee of the Affiliated Hospital of Shandong University of Traditional Chinese Medicine (Identification Code: 2017-010-KY). All participants provided written informed consent.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Sun, D., Chi, L., Liu, J. et al. Psychometric validation of the Chinese version of the Short Inflammatory Bowel Disease Questionnaire and evaluation of its measurement invariance across sex. Health Qual Life Outcomes 19, 253 (2021). https://doi.org/10.1186/s12955-021-01890-x
- Inflammatory bowel disease
- Quality of life
- Assessment tool
- Measurement invariance