Health-related quality of life in Iranian adolescents: a psychometric evaluation of the self-report form of the PedsQL 4.0 and an investigation of gender and age differences
Health and Quality of Life Outcomes volume 19, Article number: 108 (2021)
Research on the psychometric properties of the Persian self-report form of the Pediatric Quality of Life Inventory Version 4.0 (PedsQL 4.0) in adolescents has several gaps (e.g., convergent validity) that limit its clinical application and therefore the cross-cultural impact of this measure. This study aimed at investigating the psychometric properties of the PedsQL 4.0 and the effects of gender and age on quality of life in Iranian adolescents.
The PedsQL 4.0 was administered to 326 adolescents (12–17 years). A subsample of 115 adolescents completed the scale two weeks after the first assessment. Confirmatory Factor Analysis (CFA), correlation of the PedsQL 4.0 with the Weiss Functional Impairment Rating Scale-Self-report (WFIRS-S), and Item Response Theory (IRT) analysis were conducted to examine validity. Cronbach’s alpha, McDonald’s Omega, and Intra class correlation (ICC) were calculated as well to examine reliability. Gender and age effects were also evaluated.
Internal consistency and test–retest reliability of the total PedsQL 4.0 scale was .92 and .87, respectively. The PedsQL 4.0 scores showed negative moderate to strong correlations with the WFIRS-S total scale. The four-factor model of the PedsQL 4.0 was not fully supported by the CFA—the root mean square error of approximation and the comparative fit index showed a mediocre and poor fit, respectively. IRT analysis indicated that all items of the PedsQL 4.0 fit with the scale and most of them showed good discrimination. The items and total scale provided more information in the lower levels of the latent trait. Males showed significantly higher scores than females in physical and emotional functioning, psychosocial health, and total scale. Adolescents with lower ages showed better quality of life than those with higher ages in all scores of the PedsQL 4.0.
The PedsQL 4.0 showed good psychometric properties with regard to internal consistency, test–retest reliability, and convergent validity in Iranian adolescents, which supports its use in clinical settings among Persian-speaking adolescents. However, factor structure according to our CFA indicates that future work should address how to improve fit. In addition, studies that include PedsQL 4.0 should consider gender and age effects were reported.
Health-related quality of life refers to one's perception and subjective appraisal of his/her health and well-being within a cultural context . According to the World Health Organization , health is not just the absence of disorder and weakness, but the presence of physical, mental, and social well-being. Given these definitions, health encompasses well-being in different domains of functioning and quality of life includes well-being in those domains.
Health-related quality of life has been recognized as an important outcome measure in health care services and clinical decisions. There are various measures for assessing health-related quality of life and one of these measures is the Pediatric Quality of Life Inventory, Version 4.0 (PedsQL 4.0) . The PedsQL 4.0 is a multidimensional instrument measuring physical, emotional, social, and school functioning that has been translated into numerous languages. This scale consists of child self-report forms for ages 5–7, 8–12, and 13–18 years and parent proxy-report forms for ages 2–4 (toddler), 5–7 (young child), 8–12 (child), and 13–18 (adolescent) . The Persian version of the PedsQL 4.0 has been psychometrically evaluated in several studies on healthy and patient samples of Iranian children and adolescents [5,6,7,8,9]. Among these studies, one study  used a sample of children and adolescents diagnosed with Type 1 Diabetes and their parents. The sample in two studies [6, 7] included healthy and chronically ill children and their parents. Participants in one study  consisted of a group of children with attention-deficit/hyperactivity disorder (ADHD) and their parents and a school-based control group with their parents. The sample of one study  consisted of school children and their parents. While these studies make important contributions, to our knowledge, only one study  has assessed the psychometric properties of the Persian self-report form of the PedsQL 4.0 in adolescents. Moreover, no previous study has examined the convergent validity of the PedsQL 4.0 in Iranian adolescents. Additionally, only one study  has assessed and compared the PedsQL 4.0 scores between male and female adolescents ages 13–18 and no study has addressed the age differences in PedsQL 4.0 scores in Iranian adolescents. In Amiri et al. , the PedsQL 4.0 self-report form showed construct validity and good internal consistency for the subscales (r = 0.68 to 0.78) and for the total scale (r = 0.88). However, convergent validity, test–retest-reliability, and age differences in the PedsQL 4.0 were not assessed. There is a need for studies to more fully examine the psychometric properties of the Persian self-report form of the PedsQL 4.0 in adolescents to improve real-world assessment in clinical and school settings.
The first aim of this study is to assess the validity and reliability of the Persian self-report form of the PedsQL 4.0 in a healthy sample of Iranian adolescents. The second aim of this study is to investigate the effects of gender and age on the PedsQL 4.0 scores. Given the finding that among Iranian adolescents, males have better quality of life than females in the physical and emotional subscales, and total scale of the PedsQL 4.0 , we hypothesized that males will show better quality of life in physical functioning, emotional functioning, and total scale of the PedsQL 4.0.
Participants were taken from four public secondary schools through multistage sampling. All participants provided assent and their parents provided written consent. Participants received information about the aim of the study and necessary instructions for answering the questionnaires. In order to assess the test–retest reliability, a subsample of 115 participants completed the PedsQL 4.0 two weeks after the first administration. These participants were selected from classes that had enough time to allow students participate in retest evaluation. Students completed the questionnaires in a classroom setting. Inclusion criteria were ages ≥ 12 to ≤ 17 years old and enrolled in grades 7 to 12. All students with the mentioned criteria were eligible to be included in this study. Psychopathology was not assessed and was not part of the study inclusion criteria. This study was approved by the Research Ethics Committee of School of Psychology of Shiraz University and also by the Research Ethics Committee of the Shiraz School Board. This study was done in accordance with the ethical codes of the Psychology and Counseling Organization of Iran.
Pediatric quality of life inventory version 4.0-self-report (PedsQL 4.0)
In this study, the self-report form of the PedsQL 4.0 was used for all age groups. The PedsQL 4.0 includes 23 items covering four subscales: physical functioning (8 items, e.g., "It is hard for me to do sports activity or exercise"), emotional functioning (5 items, e.g., "I feel sad or blue"), social functioning (5 items, e.g., "Other kids don't want to be my friend"), and school functioning (5 items, e.g., "I have trouble keeping up with my schoolwork"). In addition to computing the score of each of the mentioned subscales, psychosocial health summary score was obtained by calculating the sum of item scores in the emotional, social, and school functioning subscales divided by the number of responded items . The respondent is instructed to rate the severity of his/her problem in each item based on a 5-point Likert response scale (0 = never a problem; 1 = almost never a problem; 2 = sometimes a problem; 3 = often a problem; 4 = almost always a problem) during the last month. The scores are reversed based on a 0 to 100 scale (0 = 100, 1 = 75, 2 = 50, 3 = 25, and 4 = 0) and higher scores indicate better quality of life.
Weiss functional impairment rating scale-self-report (WFIRS-S)
The WFIRS-S  was developed to measure the impact of emotional and behavioral problems on functional impairment. The WFIRS-S includes 69 items assessing an adolescent’s or an adult’s functioning across seven domains: family (8 items, e.g., "Problems taking care of your family"), work (11 items, e.g., "Problems working in a team"), school (10 items, e.g., "Problems meeting minimum requirements to stay in school" life skills (12 items, e.g., "Problems with sleeping"), self-concept (5 items, e.g., "Feeling incompetent"), social (9 items, e.g., "Problems making friends") and risk (14 items, e.g., "Being involved with the police"). The items can be rated on a 0 (never or not at all) to 3 (very often or very much) Likert scale or can be rated as “not applicable”. The WFIRS-S has been shown to be a valid and reliable rating scale in measuring functional impairment . The Persian version of the WFIRS-S has been psychometrically assessed in Iranian adolescents and the internal consistency and test–retest reliability were respectively 0.94 and 0.80 for the total scale .
Statistical analyses were performed using IBM SPSS Statistics 22, IBM SPSS Amos 24, Stata 14, and IRTPRO 5.0.
Mean, standard deviation, median, skewness, and kurtosis were determined for the PedsQL 4.0 items.
Internal consistency of each subscale and total scale of the PedsQL 4.0 was assessed by Cronbach’s alpha and McDonald’s Omega . Internal consistency is considered acceptable, good, and excellent for Cronbach’s alpha coefficients greater than 0.7, 0.8, and 0.9, respectively . The Cronbach’s alpha requires the assumption of tau-equivalence  and shows bias in estimating the internal consistency for Likert type rating response scales , therefore, McDonald’s Omega was also estimated to evaluate the internal consistency. McDonald’s Omega values above 0.7 and 0.8 can be interpreted as acceptable and good estimates of internal consistency, respectively [17, 18]. Test–retest reliability was measured by two-way random effects model, absolute agreement intraclass correlation coefficient (ICC) . ICC between 0.61–0.80 is interpreted as a good reliability and between 0.81–1.00 is considered excellent .
Construct validity of the PedsQL 4.0 was assessed in this study. Construct validity is defined as the extent to which an instrument assesses the hypothesis or theory it aimed to measure . Different components of the construct validity including convergent validity and factorial validity were examined. Moreover, Item Response Theory (IRT) as a modern psychometric method was performed for assessing the construct validity. Convergent validity of the PedsQL 4.0 with the WFIRS-S was measured by computing Spearman’s rank correlation coefficient. We used WFIRS-S to measure the convergent validity of the PedsQL 4.0 for two reasons. First, the WFIRS-S measures functional impairment which has been considered in conceptualization of the quality of life . Second, the previous studies [13, 22] showed moderate to strong correlations between the PedsQL 4.0 and WFIRS that we expected this correlation size in our study. Because higher scores in the PedsQL 4.0 indicate better quality of life and lower scores in the WFIRS-S show better functioning in life domains, negative correlations between the scores of two scales was expected. Correlation coefficients below 0.29 is considered small, between 0.30 and 0.49 is moderate, and greater than 0.50 is high . In order to assess the factorial validity of the PedsQL 4.0, Confirmatory Factor Analysis (CFA) was conducted. For the CFA, the four-factor model of the scale confirmed in other studies [24, 25] was assessed. The goodness of fit of the CFA model was examined by using the comparative fit index (CFI) and the root mean square error of approximation (RMSEA). Rigdon  argues that the RMSEA appears to be a better index in confirmatory contexts, while CFI is an appropriate index in exploratory contexts. CFI  is an incremental fit index which is based on the comparison of a hypothesized model with a null model and ranges between 0 (poor fit) and 1.00 (perfect fit). A CFI ≥ 0.95 shows a good fit . RMSEA  is one of the absolute measures of fit which determines how well a hypothesized model fits a perfect model. A value of RMSEA < 0.08 indicates an appropriate fit and a value < 0.1 indicates a mediocre fit . For the IRT evaluation, the graded response model (GRM) was applied . GRM is appropriate when dealing with ordered polytomous items such as Likert response items. For each item, the goodness-of-fit index (the p-value of S-χ2 < 0.001) [32, 33], discrimination parameter, and threshold or difficulty parameter were calculated. Discrimination parameter refers to the ability of an item in distinguishing between different levels of the latent trait. Threshold or difficulty parameter demonstrates the level of the latent trait in which the probability of response to or above a given category equals 0.5. Category characteristic curve and item information curve for each item and test information curve for the total scale of the PedsQL 4.0 were analyzed. Category characteristic curve represents the probability of responding to a given category of an item as a function of the latent trait. Information curve indicates the amount of information provided by an item or a scale at various levels of the latent trait. The discrimination parameter of an item determines the height of the information curve and the threshold parameter(s) determines the location of the information curve. Higher information curve shows more precision for an item in measuring the latent trait.
Age and gender differences
Kruskal–Wallis test was used to assess the effects of gender and age on the PedsQL 4.0 subscales, psychosocial health scale, and total scale. Dunn’s test was performed for pairwise comparisons. A p-value less than 0.05 was regarded as statistically significant.
The sample of the current study were 326 male and female adolescents (mean age = 14.84, SD = 1.78) who were studied in grades 7 to 12 of public secondary schools of Shiraz, Iran (see Table 1 for more details).
Table 2 shows the mean, standard deviation, median, skewness, and kurtosis for the items of the PedsQL 4.0. For the physical functioning items, the mean scores ranged from 79.31 (SD = 27.37) to 96.70 (SD = 12.96). The mean scores of the emotional functioning items were between 75.40 (SD = 29.31) and 85.63 (SD = 25.97). For the social functioning items, the mean scores varied in the range of 89.23 (SD = 20.68) and 93.71 (SD = 16.25). The mean scores of the school functioning items were between 79.09 (SD = 25.59) and 86.81 (SD = 22.38). For all items of the PedsQL 4.0, the skewness was lower than 1 except for item 4 of the emotional functioning with a skewness of − 0.84. All items of the physical functioning except for item 4, item 4 of the emotional functioning, all items of the social functioning, and all items of the school functioning except for item 2 showed a kurtosis higher than 1.
The Cronbach’s alpha was good (α = 0.80–0.83) for the subscales of the PedsQL 4.0 and was excellent for the psychosocial health (α = 0.90) and total scale (α = 0.92) of the PedsQL 4.0. The McDonald's Omega was good for the subscales (ωt = 0.81 to 0.84), psychosocial health (ωt = 0.90), and total scale (ωt = 0.92) of the PedsQL 4.0 (Table 3).
As shown in Table 3, the test–retest reliability was good for the school functioning (ICC = 0.74) and was excellent for the physical functioning, emotional functioning, social functioning, psychosocial health, and total scale of the PedsQL 4.0 (ICCs = 0.80–0.88).
Correlations between the PedsQL 4.0 and the total scale of the WFIRS-S indicated that physical and social functioning subscales had moderate and significant correlations with the WFIRS-S total scale (r = − 0.44 and r = − 0.48, respectively; p = 0.001). Emotional and school functioning subscales, psychosocial health summary score, and total scale of the PedsQL 4.0 showed high and significant correlations with the WFIRS-S total scale (r = − 0.54 to − 0.62; p = 0.001).
The results of CFA for the four-factor model of the PedsQL 4.0 showed that the RMSEA was 0.08 and the CFI was 0.84.
Table 4 demonstrates the goodness-of-fit index, the discrimination parameter, and the threshold parameter for 23 items of the PedsQL 4.0. The p-value of S-χ2 showed that all items fit the scale.
In the physical functioning subscale, the discrimination parameter of the items ranged from 1.19 to 3.2 (Table 4). Item 8 was the highest discriminating item (a = 3.02) and item 1 was the lowest discriminating item (a = 1.19). In terms of the severity (Table 4), item 1 (b1 = − 4.18, b2 = − 3.08, b3 = − 1.65, b4 = − 1.01) and item 5 (b1 = − 3.25, b2 = − 3.04, b3 = − 2.22, b4 = − 1.83) emerged at lower levels of the latent trait (physical functioning) and item 2 (b1 = − 2.79, b2 = − 1.98, b3 = − 1.12, b4 = − 0.46) and item 8 (b1 = − 2.37, b2 = − 1.95, b3 = − 1.10, b4 = − 0.63) were endorsed at higher levels of the latent trait (physical functioning).
In the emotional functioning subscale, the discrimination parameter of the items ranged from 1.90 to 2.46 (Table 4). Item 3 was the highest discriminating item (a = 2.46) and item 4 was the lowest discriminating item (a = 1.90). In terms of the severity (Table 4), item 1 (b1 = − 3.02, b2 = − 2.17, b3 = − 1.06, b4 = − 0.49) emerged at lower levels of the latent trait (emotional functioning) and item 5 (b1 = − 2.21, b2 = − 1.41, b3 = − 1.56, b4 = − 0.08) was endorsed at higher levels of the latent trait (emotional functioning).
In the social functioning subscale, the discrimination parameter of the items ranged from 1.64 to 2.59 (Table 4). Item 5 was the highest discriminating item (a = 2.59) and item 1 was the lowest discriminating item (a = 1.64). For items 1 and 2, only three threshold indices were obtained because the frequency of response to the first category for item 1 and second category for item 2 was zero. In terms of the severity (Table 4), items emerged at similar levels of the latent trait (social functioning).
In the school functioning subscale, the discrimination parameter of the items ranged from 1.08 to 2.72 (Table 4). Item 1 was the highest discriminating item (a = 2.72) and item 4 was the lowest discriminating item (a = 1.08). In terms of the severity (Table 4), item 4 (b1 = − 4.27, b2 = − 3.32, b3 = − 1.90, b4 = − 0.83) emerged at lower levels of the latent trait (school functioning) and item 2 (b1 = − 2.32, b2 = − 1.85, b3 = − 0.94, b4 = − 0.06) was endorsed at higher levels of the latent trait (school functioning).
Figure 1 shows the category characteristic curves for the highest discriminating item and the lowest discriminating item of each subscale of the PedsQL 4.0. As can be observed in the category characteristic curves, the items generally were endorsed in below average levels of the latent trait which shows lower levels of quality of life.
Figure 2 shows the item information functions for the items of each subscale of the PedsQL 4.0. Item 8 in the physical functioning subscale, items 3 and 5 in the emotional functioning subscale, items 5 and 3 in the social functioning subscale, and items 1 and 3 in the school functioning subscale provided more information than other items. Figure 3 shows the test information function for the PedsQL 4.0 total scale. As can be seen in Fig. 3, the items of the PedsQL 4.0 provided more information for the lower levels of latent trait with the range of − 2.4 to − 0.8.
Gender and age effects on PedsQL 4.0 scores
Table 5 shows the results of the Kruskal–Wallis test to compare males and females in the PedsQL 4.0 scores. There were significant differences between males and females in Physical Functioning (χ2(1) = 25.76, p = 0.001) and Emotional Functioning (χ2(1) = 22.81, p = 0.001). Male adolescents showed higher mean rank scores (better quality of life) than females in the physical functioning and emotional functioning subscales. Significant effects were found between males and females in the psychosocial health scale (χ2(1) = 9.97, p = 0.002) the total scale (χ2(1) = 13.89, p = 0.001). Male adolescents showed higher mean rank scores than females on the total scale (Table 5).
Table 6 shows the results of the Kruskal–Wallis test to compare age groups in the PedsQL 4.0 scores. There were significant differences in physical functioning (χ2(5) = 18.53, p = 0.002), emotional functioning (χ2(5) = 28.83, p = 0.001), social functioning (χ2(5) = 16.54, p = 0.005), school functioning (χ2(5) = 29.57, p = 0.001) subscales, psychosocial health scale (χ2(5) = 34.86, p = 0.001), and total scale (χ2(5) = 34.71, p = 0.001) of the PedsQL 4.0 among ages (Table 6). Pairwise comparisons indicated that adolescents in lower ages have significantly higher mean scores in all subscales, psychosocial health, and total scale of the PedsQL 4.0 than adolescents in higher ages (Table 6).
This study assessed the psychometric properties of the Persian version of the PedsQL 4.0 self-report form in Iranian adolescents ages 12–17. The gender and age effects on the PedsQL 4.0 scores were examined. Internal consistency was good to excellent—across both methods of measurement (i.e., Cronbach’s alpha coefficient and McDonald’s omega)—for the subscales and total scale of the PedsQL 4.0. Internal consistencies in this study are similar to the results in Varni, Burwinkle, Seid, and Skarr’s study  and higher than in another study  in Iranian adolescents with a similar age (13–18 years) to our sample. In this study, we also assessed test–retest reliability, convergent validity of the PedsQL 4.0 with a measure of performance in functional domains of life (i.e., WFIRS-S), and the age differences in the PedsQL 4.0 scores—these outcomes have not been examined in previous studies of a Persian-speaking sample and are discussed in greater detail below.
Descriptive investigation of the PedsQL 4.0 items showed that the mean score for most items was close to 80 or above this score. All items showed a negative skewness. These findings show that the participants normally report no major problem in most items of the PedsQL 4.0. These results could be expected in healthy school-based samples and are consistent with the results of previous studies with similar samples [7, 24].
This is the first study reporting the test–retest reliability of the PedsQL 4.0 specifically in an Iranian sample of adolescents ages 12–17 years. Test–retest reliability was 0.87 for the total scale and psychosocial health and between 0.74 and 0.88 for the subscales. In a study of Iranian children and adolescents ages 8–18 years, test–retest reliability of the self-report form of the PedsQL 4.0 was 0.87 for the total scale and 0.71 to 0.80 for the subscales . The difference between the results of test–retest reliability for the subscales of the PedsQL 4.0 in the current study and the results of Pakpour et al.’s study  might be explained by the age difference between the sample of two studies.
The current study is the first to assess the convergent validity of the PedsQL 4.0 with another measure in Iranian adolescents. Correlation between the psychosocial health score of the PedsQL 4.0 and functional impairment on the WFIRS-S was negative and high. The correlation between physical functioning and functional impairment was negative and moderate. The stronger correlation of functional impairment with psychosocial health score than with physical functioning reflects more overlap between the PedsQL 4.0 emotional, social, and school functioning subscales with the WFIRS-S content.
The findings of CFA showed that the fit of the four-factor structure of the PedsQL 4.0 was mediocre according to the RMSEA and poor based on CFI. This finding shows that the four-factor model of the PedsQL 4.0 was not fully supported by the CFA indices. The four-factor structure of the PedsQL 4.0 was confirmed in Kook and Varni’s  study in which a larger sample of children with a broader age range (8–18 years) was used in comparison to our study. The results of Varni et al.’s study  showed that the four-factor model of the PedsQL 4.0 was acceptable but the five-factor model including physical, emotional, social, school, and missed school factors indicated a marginally superior fit relative to the four-factor model.
The results of the IRT analysis showed that all items of the PedsQL 4.0 had good fitness with the scale. The discrimination parameter indicated that most items of the PedsQL 4.0 were highly discriminating (a = 1.45–3.02) . Among the PedsQL 4.0 items, item 8 of the physical functioning, item 3 of the emotional functioning, item 5 of the social functioning, and item 1 of the school functioning showed the highest discrimination and items 1 and 4 of the physical functioning and items 4 and 5 of the school functioning showed the lowest discrimination. Lower discriminating power for items 4 and 5 of the school functioning was also found in another study by Hill et al. . Items 4 and 5 assessed the degree to which a respondent misses school because of not feeling well or to go to the doctor/hospital. The events proposed in these items for missing the school are more frequent in populations with chronic disorders  not in healthy individuals that consists our sample. Therefore, these items provided low information and were less precise for measuring quality of life in our sample. The evaluation of the threshold parameter demonstrated that all items of the PedsQL 4.0 were located at the lower half of the latent trait. The evaluation of the information provided by each item of the PedsQL 4.0 and by the total scale indicated that this measure would be more precise and useful for measuring the lower levels of quality of life.
The investigation of gender and age effects on the PedsQL 4.0 scores showed that males perceived their function as better in physical and emotional domains relative to female adolescents. This finding is consistent with studies conducted in other countries [38,39,40] and with a study on Iranian adolescents . Higher rated physical functioning in males than females may reflect a gender-based disparity in physical strength. Moreover, lower physical functioning and emotional functioning in females than males may have been caused by gender differences in mood problems. Females are not only more vulnerable to have depression, but also, they are more seriously affected by chronic depression, for example they show lower levels of quality of life when they have chronic depression . Epidemiological studies have indicated that females report more internalizing problems than males [42, 43]. Our findings related to gender differences in quality of life can also demonstrate the sensitivity of the PedsQL 4.0 to identify these differences. Our cross-sectional study of age on quality of life showed that older adolescents reported more difficulties in all domains of quality of life relative to adolescents with the lower ages. Adolescence is an intermediate developmental stage between childhood and adulthood, and represents a phase of life involving diverse physical, psychological, and social changes and experiences that could influence health-related quality of life .
In this study, health-related quality of life was measured by adolescents’ self-report. Future studies should assess the test–retest reliability of the PedsQL 4.0 in adolescents by using parent proxy-report. In this study, only one measure was used to examine the convergent validity of the PedsQL 4.0. Future studies should consider additional measures to assess the convergent validity of the PedsQL 4.0. Another limitation of our study was that the same reporting source (i.e., self-report) was used for our outcome measures, so greater convergence among measures was expected. It is suggested that future studies consider different reporting source for measuring outcome variables.
The findings of our study showed that the PedsQL 4.0 has acceptable psychometric properties as shown by excellent internal consistency, high test–retest reliability, and negative moderate to strong convergent validity with a functional impairment scale (WFIRS-S). Therefore, it is a reliable and valid scale in measuring health-related quality of life in a healthy sample of Iranian adolescents. The PedsQL 4.0 would be a useful instrument for persons with lower levels of quality of life. Significant gender and age differences in quality of life should be considered as important factors in assessment of quality of life in adolescents. In addition, the four-factor structure of the PedsQL 4.0 was not supported—alternative factor structures (e.g., the PedsQL 4.0 second-order factor model consisting of physical functioning and psychosocial health factors  and the five-factor model including physical, emotional, social, school, and missed school factors of the PedsQL 4.0 ) need to be studied in future research.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
- PedsQL 4.0:
Pediatric quality of life inventory version 4.0
Weiss Functional Impairment Rating Scale-Self-report
Confirmatory factor analysis
Comparative fit index
Root mean square error of approximation
Intraclass correlation coefficient
Item response theory
Graded response model
Baumann C, Erpelding ML, Perret-Guillaume C, et al. Health-related quality of life in French adolescents and adults: norms for the DUKE Health Profile. BMC Public Health. 2011;11:401.
World Health Organization. Constitution of the World Health Organization basic document. World Health Organization; 1948.
Varni JW, Seid M, Kurtin PS. PedsQL 4.0: reliability and validity of the Pediatric Quality of Life Inventory version 4.0 generic core scales in healthy and patient populations. Med Care. 2001;39(8):800–12.
Varni JW, Limbers CA, Burwinkle TM. How young can children reliably and validly self-report their health-related quality of life? An analysis of 8,591 children across age subgroups with the PedsQL 4.0 Generic Core Scales. Health Qual Life Outcomes. 2007;5:1.
Jafari P, Forouzandeh E, Bagheri Z, Karamizadeh Z, Shalileh K. Health related quality of life of Iranian children with type 1 diabetes: reliability and validity of the Persian version of the PedsQLTM Generic Core Scales and Diabetes Module. Health Qual Life Outcomes. 2011;9:104.
Gheissari A, Farajzadegan Z, Heidary M, et al. Validation of Persian version of PedsQLTM 4.0TM generic core scales in toddlers and children. Int J Prev Med. 2012;3(5):341–50.
Amiri P, Ardekani EM, Jalali-Farahani S, et al. Reliability and validity of the Iranian version of the Pediatric Quality of Life InventoryTM 40 Generic Core Scales in adolescents. Qual Life Res. 2010;19(10):1501–8.
Jafari P, Ghanizadeh A, Akhondzadeh S, Mohammadi MR. Health-related quality of life of Iranian children with attention deficit/hyperactivity disorder. Qual Life Res. 2011;20(1):31–6.
Pakpour AH, Yekaninejad MS, Zarei F, Hashemi F, Steele MM, Varni JW. The PedsQLTM Oral Health Scale in Iranian children: reliability and validity. Int J Paediatr Dent. 2011;21(5):342–52.
Varni JW, Seid M, Kurtin PS. PedsQL 4.0: reliability and validity of the Pediatric Quality of Life Inventory version 4.0 generic core scales in healthy and patient populations. Med Care. 2001;39:800–12.
Zinbarg RE, Revelle W, Yovel I, Li W. Cronbach’s α, Revelle’s β, and McDonald’s ω H: their relations with each other and two alternative conceptualizations of reliability. Psychometrika. 2005;70:123–33.
Weiss MD, McBride NM, Craig S, Jensen P. Conceptual review of measuring functional impairment: findings from the Weiss Functional Impairment Rating Scale. Evid Based Ment Health. 2018;21(4):155–64.
Hadianfard H, Kiani B, Weiss MD. Psychometric properties of the Persian version of the Weiss functional impairment rating scale-self-report form in Iranian adolescents. J Atten Disord. 2017;23(13):1600–9.
George D, Mallery P. SPSS for Windows step by step: a simple guide and reference. 11.0 update. 4th ed. Allyn & Bacon; 2003.
Graham JM. Congeneric and (essentially) tau-equivalent estimates of score reliability: what they are and how to use them. Educ Psychol Measur. 2006;66(6):930–44. https://doi.org/10.1177/0013164406288165.
Zumbo BD, Gadermann AM, Zeisser C. Ordinal versions of coefficients alpha and theta for Likert rating scales. J Mod Appl Stat Methods. 2007;6(1):21–9.
Hermsen LA, Leone SS, Smalbrugge M, Knol DL, van der Horst HE, Dekker J. Exploring the aggregation of four functional measures in a population of older adults with joint pain and comorbidity. BMC Geriatr. 2013;13:119. https://doi.org/10.1186/1471-2318-13-119.
Feißt M, Hennigs A, Heil J, et al. Refining scores based on patient reported outcomes—statistical and medical perspectives. BMC Med Res Methodol. 2019;19:167. https://doi.org/10.1186/s12874-019-0806-9.
McGraw KO, Wong SP. Forming inferences about some intraclass correlation coefficients. Psychol Methods. 1996;1:30–46. https://doi.org/10.1037/1082-989X.1.1.30.
Bartko JJ. The intraclass correlation coefficient as a measure of reliability. Psychol Rep. 1966;19:3–11.
Ginty AT. Construct validity. In: Gellman MD, Turner JR, editors. Encyclopedia of behavioral medicine. Springer; 2013.
Tarakçıoğlu MC, Memik NÇ, Olgun NN, Aydemir Ö, Weiss MD. Turkish validity and reliability study of the Weiss Functional Impairment Rating Scale-Parent Report. Atten Defic Hyperact Disord. 2015;7(2):129–39. https://doi.org/10.1007/s12402-014-0158-6.
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Lawrence Erlbaum; 1988.
Kook S, Varni JW. Validation of the Korean version of the pediatric quality of life inventoryTM 4.0 (PedsQLTM) generic core scales in school children and adolescents using the rasch model. Health Qual Life Outcomes. 2008;6:41. https://doi.org/10.1186/1477-7525-6-41.
Petersen S, Hägglöf B, Stenlund H, Bergström E. Psychometric properties of the Swedish PedsQL, Pediatric Quality of Life Inventory 4.0 generic core scales. Acta Paediatr. 2009;98(9):1504–12. https://doi.org/10.1111/j.1651-2227.2009.01360.x.
Rigdon EE. CFI versus RMSEA: a comparison of two fit indexes for structural equation modeling. Struct Equ Model Multidiscip J. 1996;3(4):369–79.
Bentler PM. Comparative fit indices in structural equation models. Psychol Bull. 1990;107:238–46. https://doi.org/10.1037/0033-2909.107.2.238.
Hu L, Bentler PM. Evaluating model fit. In: Hoyle R, editor. Structural equation modeling, issues, concepts, and applications. Sage; 1995. p. 76–99.
Steiger JH. Structural model evaluation and modification: an interval estimation approach. Multivar Behav Res. 1990;25:173–80.
MacCallum RC, Browne MW, Sugawara HM. Power analysis and determination of sample size for covariance structure modeling. Psychol Methods. 1996;1(2):130–49. https://doi.org/10.1037/1082-989x.1.2.130.
Samejima F. Estimation of latent ability using a response pattern of graded scores. Psychometrika. 1969;34(S1):1–97.
Yang F, Zhao F, Zheng Y, Li G. Modification and verification of the Infant-Toddler Meaningful Auditory Integration Scale: a psychometric analysis combining item response theory with classical test theory. Health Qual Life Outcomes. 2020;18(1):367.
Chiarotto A, Bishop A, Foster NE, Duncan K, Afolabi E, Ostelo RW, et al. Item response theory evaluation of the biomedical scale of the Pain Attitudes and Beliefs Scale. PLoS ONE. 2018;13(9):e0202539.
Varni JW, Burwinkle TM, Seid M, Skarr D. The PedsQL 4.0 as a pediatric population health measure: feasibility, reliability, and validity. Ambul Pediatr. 2003;3(6):329–41.
Varni JW, Limbers CA, Newman DA, Seid M. Longitudinal factorial invariance of the PedsQL 4.0 Generic Core Scales child self-report version: one year prospective evidence from the California State Children’s Health Insurance Program (SCHIP). Qual Life Res. 2008;17(9):1153–62.
Baker FB, Kim SH. The basics of item response theory using R. Springer; 2017.
Hill CD, Edwards MC, Thissen D, Langer MM, Wirth RJ, Burwinkle TM, et al. Practical issues in the application of item response theory. Med Care. 2007;45(Suppl 1):S39–47.
Upton P, Eiser C, Cheung I, et al. Measurement properties of the UK-English version of the Pediatric Quality of Life Inventory 4.0 (PedsQL) generic core scales. Health Qual Life Outcomes. 2005;3:22.
Avison WR, Mcalpine DD. Gender differences in symptoms of depression among adolescents. J Health Soc Behav. 1992;33(2):77–96.
Reinfjell T, Diseth TH, Veenstra M, Vikan A. Measuring health-related quality of life in young adolescents: reliability and validity in the Norwegian version of the Pediatric Quality of Life Inventory 4.0 (PedsQL) generic core scales. Health Qual Life Outcomes. 2006;4:61.
Kornstein S, Schatzberg A, Thase M, Yonkers K, McCullough J, Keitner G, et al. Gender differences in chronic major and double depression. J Affect Disord. 2000;60(1):1–11.
Rescorla L, Ivanova MY, Achenbach TM, et al. International epidemiology of child and adolescent psychopathology II: integration and applications of dimensional findings from 44 societies. J Am Acad Child Adolesc Psychiatry. 2012;51(12):1273-1283.e8.
Philipp J, Zeiler M, Waldherr K, et al. Prevalence of emotional and behavioral problems and subthreshold psychiatric disorders in Austrian adolescents and the need for prevention. Soc Psychiatry Psychiatr Epidemiol. 2018;53(12):1325–37.
Frisén A. Measuring health-related quality of life in adolescence. Acta Paediatr. 2007;96(7):963–8.
The authors thank all study participants for providing data for this study.
No funding was provided for this study.
Ethics approval and consent to participate
This study was approved by the Research Ethics Committee of School of Psychology of Shiraz University and also by the Research Ethics Committee of the Shiraz School Board. All participants provided assent and their parents provided written consent.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Hadianfard, H., Kiani, B., Azizzadeh Herozi, M. et al. Health-related quality of life in Iranian adolescents: a psychometric evaluation of the self-report form of the PedsQL 4.0 and an investigation of gender and age differences. Health Qual Life Outcomes 19, 108 (2021). https://doi.org/10.1186/s12955-021-01742-8
- PedsQL 4.0
- Quality of life
- Internal consistency
- Test–retest reliability
- Construct validity
- Item response theory (IRT)
- Convergent validity