Attitudes toward and beliefs about obese persons across Hong Kong and Taiwan: wording effects and measurement invariance
Health and Quality of Life Outcomes volume 17, Article number: 134 (2019)
The psychosocial consequences of obesity are important but often underrated. The Attitudes Toward Obese Persons (ATOP) and Beliefs About Obese Persons (BAOP) scales used to measure weight-related bias have little psychometric information, especially in East Asian contexts. The objective of this study was to use rigorous statistical methods to demonstrate the psychometric properties of these two instruments in Hong Kong and Taiwanese college students.
A convenience sample of 707 students was recruited from the universities in Hong Kong and Taiwan. Several competing confirmatory factor analyses (CFAs) were conducted to confirm the factorial structure of the ATOP and BAOP. The best fit models for the ATOP and BAOP were chosen for the examination of the measurement invariance across subcultures. We then compared configurable models with or without loading and/or intercept constrained before correlating the latent constructs between the best models for the ATOP and BAOP.
The comparison in multiple CFAs found that the model with one factor and two correlated-wording-method factors outperformed the other models for both the ATOP and BOAP. However, the internal consistency was suboptimal (ATOP: α = .56 to .80; BTOP: α = .57 to .65) and the measurement invariance was somewhat unsupported among the Hong Kong and Taiwan samples. Moreover, after controlling wording effects, the latent construct of the ATOP was moderately associated with that of BAOP (r = .356; p < .001).
Chinese versions of the ATOP and BAOP can be treated as a unidimensional factor for use in Hong Kong and Taiwan university students. However, further refinements of both instruments may be needed before using them to capture the social attitudes and beliefs toward obesity individuals, which is expected to advance our understanding of weight-related bias in East Asian contexts.
Obesity is a serious global epidemic , including in many East Asian countries. For example, 35% of Hong Kong adults are overweight  and 43% of Taiwanese adults are overweight . These rising rates pose public health challenges given the negative effects of overweight and obesity on physical health (e.g., high risk of cardiovascular disease, type II diabetes, and hypertension) [4, 5] and psychosocial well-being (e.g., high risk of depressive symptoms and decreased quality of life) [6,7,8]. However, the psychosocial consequences of obesity are often underestimated. For example, weight-based teasing, bullying, and discrimination may, in turn, cause internalizing or externalizing emotional problems. Previous research has shown that being teased or bullied because of weight was associated with higher odds of depression or suicidal ideation in obese youth . Moreover, empirical studies revealed that people with obesity had a lower chance than their normal-weight counterparts of being recruited by employers . In addition, healthcare providers have been shown to have negative attitudes toward people with obesity , which may jeopardize their treatment quality. Given that weight bias (or weight stigma) is a major contributor to psychosocial difficulties affecting people with obesity, it is imperative that public health professionals begin to tackle this issue. As revealed in recent studies, Hong Kong and Taiwan populations have a similar issue on weight-related stigma, which is linked to negative psychological outcomes [11,12,13].
Given its critical role, studies measuring weight bias (including the biased attitude toward and beliefs about people with obesity) have been expanding in recent decades [14,15,16]. The Attitudes Toward Obese Persons (ATOP) and Beliefs About Obese Persons (BAOP) scales are commonly used instruments; however, only two studies have thoroughly examined their psychometric properties [17, 18]. Additionally, other studies have revised the ATOP and BAOP into different versions of factorial structures [19,20,21]. However, to the best of our knowledge, earlier works reported a three-factor structure for the ATOP and a one-factor structure for BAOP [17, 18], but no studies have conducted a confirmatory factor analysis (CFA) to examine the factor structures of the ATOP and BAOP. Specifically, as most studies treated the ATOP as a unidimensional tool [22, 23], we should obtain evidence showing that the ATOP can be used based on a one-factor structure. Furthermore, no studies have considered the wording effects of these instruments (i.e., the positively worded and negatively worded items used in the ATOP and BAOP) or measurement invariance issues. Given that several studies have identified the influences of wording effects on the factorial structure of an instrument [24,25,26,27], we should not ignore the potential wording effects in the ATOP and BAOP when examining their factorial structures. We then hypothesized that the factorial structures of the ATOP and BAOP would be more consolidated if potential wording effects were accounted for in the CFA models.
Another important psychometric issue is measurement invariance; that is, whether the scale assesses the same construct in two different populations, including factorial structures and item descriptions. A prerequisite to comparing or combining the instrument scores between two populations is that the instrument is measurement equivalent across the two populations [28, 29]. Measurement invariance is extremely important when conducting cross-cultural studies or subcultural analysis. Therefore, Hong Kong and Taiwan, which share the Chinese culture but have developed distinctive subcultures due to previous history of different colonization (Hong Kong used to be governed by the United Kingdom and Taiwan used to be governed by Japan), could be a valuable venue for developing and examining the psychometric properties of the ATOP and BAOP across these two areas. Importantly, if the ATOP and BAOP are measurement invariant across Hong Kong and Taiwan subcultures, future studies on weight bias can confidently use both instruments to describe and compare attitudes toward and beliefs about people with obesity.
The study purpose is to translate, adapt, and examine the psychometric properties of the Chinese versions of the ATOP and BAOP in adult samples recruited in Taiwan and Hong Kong. We applied rigorous statistical methods (i.e., competing CFA models) to demonstrate the properties of the two instruments. Moreover, in the psychometric testing, we carefully considered the issues of wording effects and measurement invariance across Hong Kong and Taiwan samples.
The study was approved by the Ethics committee of The Hong Kong Polytechnic University (Ref number: HSEARS20161214002) and the procedures were carried out in accordance with the Declaration of Helsinki. All participants received clear information about the study, fully understood the study purpose, and all signed a written informed consent.
Participants and study design
We used convenience sampling and cross-sectional design to recruit both Hong Kong and Taiwan participants between March and July 2017. Hong Kong participants were from one university (located in Kowloon), and the Taiwan participants were from five universities (one located in Northern Taiwan, one in Central Taiwan, and three in Southern Taiwan). The inclusion criteria were (1) aged over 18 years; (2) agreed to participate; (3) understood written Chinese in traditional characters. The exclusion criteria were (1) had cognitive impairments or had difficulties in understanding questionnaires; (2) had a physical disability that causes difficulties in answering questionnaires.
For the recruitment in Hong Kong, the corresponding author contacted two colleagues teaching in health-related program (both are in occupational therapy) and one colleague not in health-related program (in mechanical engineering) to assist in inviting participants. For the recruitment in Taiwan, the second author contacted one acquaintance in Central Taiwan and another in Southern Taiwan to invite participants from their Introduction to Medical Management and Introduction to Epidemiology courses. The second author also invited participants from her Introduction to Psychology course. The fourth author recruited participants from her Developmental Psychology and Introduction of Family Life Education courses in Northern Taiwan. After obtaining the approval from the university professors to distribute questionnaires during their class, several research assistants (or the professors) used the last 20 min of a class to describe the study purpose and recruit participation. If the students were willing to participate, they first signed a written informed consent, and then completed a background information sheet, the ATOP, and the BAOP. In total, 400 students in Hong Kong and 307 in Taiwan turned in the written informed consents and the questionnaires.
We adopted the rule of thumb in factor analysis (i.e., 15 cases per item) to determine our sample size. Given that the ATOP has 20 items and the BAOP has 8 items, we used the item number in the ATOP for sample size estimation: 20 multiplied by 15 equals to 300 participants. Because we were unsure whether Hong Kong and Taiwan participants could be analyzed together, we proposed to have 300 participants in each area. Thus, the sample size of 400 in Hong Kong and that of 307 in Taiwan were sufficient for our psychometric testing.
Translation procedure for the ATOP and BAOP
After contacting the developer of the ATOP and BAOP (Prof. Allison), we learned that both instruments had never been translated into Chinese, and we obtained the permission to translate them. In order to ensure their linguistic validity, we adopted a standard translation procedure including forward translation, back translation, and reconciliation [30, 31]. Two independent Hong Kong translators who were fluent in English and were majoring in psychology did the forward translations. After receiving the two independent translations, the corresponding author worked with a research assistant with a Bachelor degree in psychology to reconcile the two forward translations. The back translation was done by one mainland Chinese translator with a bachelor’s degree in English who has been living in the U.S. in an immersion program for 1 year. The third author compared the back translation to the original version and provided additional comments to revise. After revising all the comments, the final translated versions of the ATOP and BAOP were circulated among the first, second, and corresponding authors to ensure its readability for Taiwan and Hong Kong populations.
Given the similarity in Hong Kong and Taiwanese cultures, we did not further adopt the approach of transadaptation to modify the questionnaires so that we could evaluate the cross-cultural psychometric properties of the translated ATOP and BAOP.
Demographics were assessed using a background information sheet that asks the age, gender, height, weight, major in the university, and self-perceived weight status (underweight, normal-weight, or overweight).
The Attitudes Toward Obese Persons Scale contains 20 items rated on a six-point Likert-type scale (− 3 = strongly disagree to 3 = strongly agree). The ATOP was originally adapted from the Attitudes Toward Disabled Persons Scale . After reverse coding the 13 negatively worded items, summing the 20-item scores and adding 60 to the summated score, a higher score indicates more positive attitudes toward people with obesity . Three factors (Different Personality; Social Difficulties; Self-Esteem) have been extracted from the ATOP scale in the original version  and the Turkish version . However, the internal consistency has never been tested for each factor, and most studies treat the ATOP as a unidimensional scale [22, 23]. In terms of the entire ATOP, the internal consistency ranged between 0.80 and 0.84 .
The Beliefs About Obese Persons Scale, a scale measures the extent that an individual believes obesity is under the control of a person with obesity, contains 8 items rated on a six-point Likert-type scale (− 3 = strongly disagree to 3 = strongly agree). After reverse coding the 6 negatively worded items, summing the 8-item scores and adding 24 to the summated score, a higher score indicates stronger beliefs that people with obesity cannot control their weight status . The BAOP is treated as a unidimensional scale [22, 23], and the one-factor structure has been supported in its original and Turkish versions [17, 18]. In addition, the internal consistency of the BAOP ranged between 0.65 and 0.82 .
We first separately present the characteristics of Hong Kong and Taiwan participants using mean (for continuous data) or frequency (for categorical data). Then, we compared the differences in the characteristics between Hong Kong and Taiwan participants using independent t-test (for continuous data with normal distribution), Mann-Whitney test (for continuous data with non-normal distribution) or χ2 test (for categorical data). Using the Shapiro-Wilk test, age, body mass index (BMI), and BAOP score were found to be non-normally distributed in both Hong Kong and Taiwan samples; ATOP score was normal distributed in both Hong Kong and Taiwan samples. Moreover, we used Cohen’s d to present the effect size of the differences in ATOP and BAOP scores, where a value > 0.2 indicates nonnegligible . Afterward, we calculated the internal consistency using Cronbach’s α and McDonald’s ω, where > 0.7 indicates satisfactory , for both the ATOP (including the three factors and the entire the ATOP) and the BAOP.
Several CFAs with a diagonal weighted least squares (DWLS) estimator were conducted to confirm the factorial structure of the ATOP and BAOP. In the ATOP, we compared five models to understand its structure: a three-factor structure (Different Personality, Social Difficulties, and Self-Esteem) without wording factors (Fig. 1a), a one-factor structure (ATOP) without wording factors (Fig. 1b), a one-factor structure (ATOP) with one wording factor (negative wording; Fig. 1c), a one-factor structure (ATOP) with two correlated wording factors (positive and negative wordings; Fig. 1d), and a one-factor structure (ATOP) with two uncorrelated wording factors (positive and negative wordings; Fig. 1e). We did not test the three-factor structure with wording factor(s) because this would make the CFA model too complicated and might violate the principle of parsimony. In addition, most studies apply the ATOP as a one-factor structure rather than a three-factor structure [22, 23].
In the BAOP, we compared four models to confirm its unidimensional structure: a one-factor structure without wording effects (Fig. 2a), a one-factor structure (BAOP) with one wording factor (negative wording; Fig. 2b), a one-factor structure (BAOP) with two correlated wording factors (positive and negative wordings; Fig. 2c), and a one-factor structure (BAOP) with two uncorrelated wording factors (positive and negative wordings; Fig. 2d). All the models were examined using the following fit indices to decide whether they are supported: comparative fit index (CFI) and Tucker-Lewis index (TLI) > 0.95 [33, 34]; root mean square error of approximation (RMSEA) < 0.06 ; standardized root mean square residual (SRMR) < 0.08 . In addition to using the aforementioned to fit indices, we adopted expected cross-validation index (ECVI) to compare these CFA models, and a lower value of ECVI indicates a better model . If the fit indices in the models with wording effect(s) (i.e., Fig. 1c to e for ATOP; Fig. 2b to d for BAOP) outperformed he models without wording effect (i.e., Fig. 1a and b for ATOP; Fig. 2a for BAOP), we might conclude that ATOP or BAOP contains wording artifacts.
After the best model among the five ATOP models and the best among the four BAOP models were confirmed, we used the two best models (one in the ATOP and another in the BAOP) to test the measurement invariance across subcultures (Hong Kong and Taiwan). According to a review , we constructed three submodels for the best models in the ATOP and BAOP: a configural model that did not constrain any factor loadings or item intercepts across Hong Kong and Taiwan samples; a loading constrained model that constrained all the factor loadings but not item intercepts across Hong Kong and Taiwan samples; and a loading and intercept constrained model that constrained all the factor loadings and all the item intercepts across Hong Kong and Taiwan samples. We then compared loading constrained model to configural model, and loading and intercept constrained model to loading constrained model. The measurement invariance is supported when ∆CFI > -0.01, ∆RMSEA< 0.015, and ∆SRMR< 0.01 in the model comparisons . However, some argue that when the constrained model has satisfactory fit indices (i.e., CFI > 0.95, RMSEA< 0.06, and SRMR< 0.08), we still can claim the measurement invariance as supported even the values of ∆CFI, ∆RMSEA, and ∆SRMR are unsatisfactory . Moreover, we used the best models to investigate the Pearson correlation between the latent ATOP and latent BAOP constructs to demonstrate the concurrent validity, and we expected the correlation above 0.3; i.e., a moderate correlation .
In addition, we used analysis of variance to compare the ATOP and BAOP scores among the participants who had different self-perceived weight status. Specifically, we compared whether the ATOP and BAOP scores were different among self-perceived overweight, self-perceived normal-weight, and self-perceived underweight groups. Bonferroni adjustment was applied to the post hoc comparisons.
IBM SPSS 23.0 (IBM Corp., Armonk, NY) was used to perform descriptive and inferential statistics for participants’ characteristics and the differences in ATOP and BAOP scores. R software was used to conduct CFAs and internal consistency: CFAs using the lavaan package  and internal consistency using Psych package .
There were slightly fewer male participants than female participants. More than half of the Hong Kong participants were majoring in a health-related program, whereas less than one third of the Taiwan participants were majoring in health-related program. The Hong Kong participants had significantly lower BMI values than did the Taiwan participants. No significant difference was found in age between Hong Kong and Taiwan participants (Table 1). The Hong Kong participants had lower scores than did the Taiwan participants on both the ATOP and BAOP; however, a significant difference was found in BAOP (Cohen’s d = 0.24) but not in the ATOP (Cohen’s d = 0.07).
The Cronbach’s α calculated from the entire sample (i.e., Hong Kong and Taiwan participants) in the ATOP was 0.67 for Different Personality factor; 0.56 for Social Difficulties factor; 0.65 for Self-Esteem factor; 0.79 for the entire ATOP. The Cronbach’s α calculated from the entire sample in BAOP was 0.61. Table 2 further reported the Cronbach’s α calculated from the Hong Kong and Taiwan samples separately. Apart from Cronbach’s α, we calculated the McDonald’s ω for both ATOP (Entire sample: 0.86; Hong Kong sample: 0.79; Taiwan sample: 0.84) and BAOP (Entire sample: 0.78; Hong Kong sample: 0.75; Taiwan sample: 0.85).
The CFA results of the three-factor structure of the ATOP were unsatisfactory: CFI = 0.862, TLI = 0.842, RMSEA (90% CI) = 0.072 (0.067, 0.077), and SRMR = 0.078. Additionally, the one-factor structure of the ATOP was not supported (Detailed fit indices information in Table 3). However, after considering the wording effects in the one-factor structure, all the fit indices were substantially improved (CFI = 0.904 to 0.986, TLI = 0.888 to 0.982, RMSEA = 0.024 to 0.060, and SRMR = 0.041 to 0.049). Among all the models, the model with one ATOP trait and two correlated-wording-method factors performed the best. In addition, the model had the smallest value in ECVI (0.476 vs. 0.597 to 1.601). Similar findings in terms of the wording effects were shown in the BAOP. Although all the BAOP models had satisfactory fit indices (Detailed fit indices information in Table 3), the model with one BAOP trait and two correlated-wording-method factors outperformed other models, including the smallest value of ECVI (0.024 vs. 0.103 to 0.119; Table 3).
Because of the excellent fit indices, we additionally tested the measurement invariance across Hong Kong and Taiwan for both the ATOP and BAOP using the model with one trait (ATOP or BAOP) and two correlated-wording-method factors (Table 4). A slightly high value was found in ∆RMSEA (0.019) when we tested the invariance of factor loading in the ATOP. In addition, ∆CFI (− 0.010) and ∆SRMR (0.007 and 0.003) both supported the invariance of factor loadings and item intercepts in the ATOP. In terms of the BAOP, only invariance of factor loading was supported by ∆CFI (− 0.004) but not by other fit indices (∆RMSEA = 0.017, ∆SRMR = 0.013); invariance of item intercept was not supported by any fit indices (∆CFI = -0.029, ∆RMSEA = 0.026, and ∆SRMR = 0.012). Moreover, after controlling for wording effects, the latent construct of the ATOP was moderately associated with the latent construct of BAOP (r = 0.356; p < 0.001). Specifically, the model used for assessing correlation between ATOP and BAOP loaded all negatively worded items from ATOP and BAOP on the single negative wording construct; all positively worded items from ATOP and BAOP on the single positive wording construct.
In the comparisons between the three groups in different self-perceived weight status, we found that self-perceived overweight group had significantly lower ATOP score than did the self-perceived normal-weight group (difference = 3.02; p = 0.04); self-perceived underweight group had significantly higher BAOP score than did the self-perceived normal-weight and self-perceived overweight groups (difference = 1.62 and 1.41, respectively; p < 0.05).
Our study examined the psychometric properties of two commonly used instruments of weight bias (ATOP and BAOP) and extended the usage for East Asian populations. Specifically, we found that both instruments had a justifiable unidimensional structure, though the wording effects should be taken into account. In addition, we found that the ATOP and BAOP were not completely measurement invariant across Hong Kong and Taiwan university students. Moreover, as our results showed that some domains in the ATOP and the entire BAOP had low internal consistencies, further refinements are needed to improve both instruments when used in Hong Kong and Taiwan populations. Despite this, the use of the entire ATOP may be appropriate because the single-factor structure of ATOP outperformed its three-factor structure.
Given the limited evidence on the psychometric properties of the ATOP and BAOP, we can only compare our results to two previous studies [17, 18]. Nevertheless, our findings somewhat corresponded to the previous evidence. Regarding the correlation between the ATOP and BAOP, Allison et al.  found moderate associations (r = 0.40 to 0.45) between the two instruments; Dedeli et al.  reported moderate correlations (r = 0.54 to 0.68); our findings on the correlation between the ATOP and BAOP latent constructs was moderate as well (r = 0.356). Specifically, the correlation found in our study controlled for the wording effects. The correlation between the ATOP and BAOP corresponds to the health psychology theories (e.g., Theory of Planned Behavior) that attitude is correlated with belief . However, some may argue that the moderate correlation between the ATOP and BAOP is not strong enough to declare their validity. Nevertheless, our results clearly showed that the scores of ATOP and BAOP significantly differed in the groups with different self-perceived weight status. This finding corresponded to recent studies showing that self-perceived overweight adolescents might have more weight bias [42, 43]. Therefore, we suggest that using this as an external criterion, our results supported the validity of ATOP and BAOP in measuring weight bias in youth.
Our study showed that the Cronbach’s α was over 0.7 for the entire ATOP, which is an acceptable value . However, the internal consistency for the entire BAOP and some domains of the ATOP was unsatisfactory. As compared to previous studies, the psychometric performance of our Chinese versions of ATOP and BAOP was somewhat inferior to those of the English (α = 0.80 to 0.84 for the ATOP and 0.65 to 0.82 for BAOP in diverse American samples ) and Turkish versions (α = 0.86 for the ATOP and 0.84 for BAOP ). However, the low internal consistencies can be justified by the two considerations: (1) Cronbach’s α increases with the number of items in a scale; therefore, the unsatisfactory results for some ATOP domains and BAOP may due to the few number of items; (2) given our study aim is to use ATOP and BAOP for heterogeneous sample (e.g., Hong Kong and Taiwan people), it is acceptable to have a Cronbach’s α value lower than 0.7.
Moreover, our CFA results may demonstrate the impact of wording effects. The CFA model that did not account for wording effects performed the worst as compared with other CFA models taking wording effects into account. Specifically, even if we reverse recoded the scores of negatively worded items to align their directions to the scores of positively worded items; the reverse recoded scores may not be comparable to the scores of positively worded items. For example, if a respondent strongly agrees with the item “Most obese people feel that they are not as good as other people” the respondent would not necessarily strongly disagree with an item worded, “Most obese people feel that they are as good as other people”. Instead, the respondent might only agree in the later item. Based on the CFA results, we suggest that the common practice of treating the ATOP and BAOP as unidimensional structures [22, 23] may be acceptable. In other words, the ATOP and BAOP scoring methods can accurately assess attitudes toward and beliefs about people with obesity.
Another important finding from the CFA is that Hong Kong and Taiwan university students may not interpret the items of ATOP and BAOP, although both were compatible with a unidimentional structure, completely in the same way, given that the measurement invariance was not supported by all the fit indices. We proposed several possibilities for the different interpretations between Hong Kong and Taiwan university students. First, the subcultures as well as Confucian philosophy, political and economic systems may have a significant role in social values, which might give participants in Hong Kong and Taiwan different attitudes toward obesity. Although both areas are affiliated with Chinese culture, Hong Kong and Taiwan have been colonized by the UK and Japan, respectively, for a long time . Different colonizers’ cultures differentiated the education systems in these two areas  and subsequently led to dissimilar comprehension in phrases or sentences. Second, the primary languages used in Hong Kong and Taiwan are different. Although both Hong Kong and Taiwan university students were able to speak Mandarin, most Hong Kong university students communicate in Cantonese in their daily lives. In contrast, a great proportion of Taiwan students speak Mandarin and Taiwanese interchangeably in their daily lives. As a result, the use of different spoken languages was likely to influence their interpretations of the items. Future studies are thus warranted to investigate our aforementioned postulations. Third, the majors in universities were different between the participants of Hong Kong and those of Taiwan. In Hong Kong, 54.3% majored in health-related disciplines, compared to 29.3% in Taiwan. Education and knowledge about health might affect participants’ sensitivity or attitudes towards obesity and stigma, which may explain why samples in Hong Kong and Taiwan might have interpreted items on ATOP and BAOP somewhat differently.
There are some limitations in the study. First, all the participants were university students. Therefore, both Hong Kong and Taiwan samples represented populations with a high level of education. Given that educated people may have better health literacy , their attitudes and beliefs might not be representative of those with a low level of education. Second, the convenience sampling conducted in a small number of universities might also restrict the generalizability of our results. Third, although we observed that wording was a potential method effect in the factorial structure, we did not conduct an experimental study to confirm such an effect. Specifically, an experimental study comparing the current version of the ATOP (or BAOP) to the ATOP (or BAOP) with all items worded positively could provide useful evidence. However, such an experimental study is outside of the scope of our investigation, and we encourage future studies to use experimental designs. Fourth, as we did not collect other data that can be used as external criteria, other than self-perceived weight status, we were unable to examine the convergent and discriminant validity for both the ATOP and BAOP. Furthermore, as we did not collect the ATOP and BAOP at different time points, we were unable to investigate the test-retest reliability for both instruments. Given that convergent validity, discriminant validity, and test-retest reliability are important information for an instrument, future studies are strongly recommended to test these properties. Fifth, we acknowledged somehow the existence of cultural differences in Hong Kong and Taiwan. This may explain less than desirable measurement invariance results, which may be improved if a transadaptation method being applied to modify the items. Last, different educational backgrounds (i.e., health-related vs. non-health-related undergraduate training) in the samples of Hong Kong and Taiwan may partly account for the result of measurement variance in this study. Replication of this study is recommended, controlling for the potential confounding effects of demographic characteristics, such as educational backgrounds. Future studies will contribute to further clarifying if Chinese versions of ATOP and BAOP could be used both in Hong Kong and Taiwan.
In summary, the Chinese versions of both the ATOP and BAOP might have suboptimal psychometric properties because of their low internal consistency, especially the subscales in the ATOP and the entire BAOP. However, the suboptimal internal consistency can be justified by the small number of items in ATOP domains and BAOP. Also, the use of the ATOP and BAOP across heterogeneous sample justifies the low internal consistency. Although the common usage of the ATOP and BAOP as a unidimensional structure can be supported by our findings, future refinements are necessary for strengthening both instruments. We believe that further studies on weight bias using reliable measurements are needed in Hong Kong and Taiwan to fill the literature gap.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Attitudes Toward Obese Persons
Beliefs About Obese Persons
Confirmatory factor analysis
Comparative fit index
Expected cross-validation index
Root mean square error of approximation
Standardized root mean square residual
World Health Organization. Global health observatory: Obesity. 2013 https://www.who.int/gho/ncd/risk_factors/overweight/en/. Accessed 22 Dec 2017.
Ko GTC. Obesity in Hong Kong – risk and burden. Hong Kong Med Diary. 2010;15:9–11.
Chang H-C, Yang H-C, Chang H-Y, Yeh C-J, Chen H-H, Huang K-C, Pan W-H. Morbid obesity in Taiwan: prevalence, trends, associated social demographics, and lifestyle factors. PLoS One. 2007;12:e0169577.
Reilly JJ, Methven E, McDowell ZC, Hacking B, Alexander D, Stewart L, Keinar CJ. Health consequences of obesity. Arch Dis Child Fetal Neonatal Ed. 2003;88:748–52.
Sweeting NH. Gendered dimensions of obesity in childhood and adolescence. Nutr J. 2008;7:1.
Lin Y-C, Latner JD, Fung XCC, Lin C-Y. Poor health and experiences of being bullied in adolescents: self-perceived overweight and frustration with appearance matter. Obesity. 2017; epub ahead of print.
Lin C-Y, Su C-T, Wang J-D, Ma H-I. Self-rated and parent-rated quality of life (QoL) for community-based obese and overweight children. Acta Paediatr. 2013;102:e114–9.
Su C-T, Wang J-D, Lin C-Y. Child-rated versus parent-rated quality of life of community-based obese children across gender and grade. Health Qual Life Outcomes. 2013;11:206.
Puhl RM, Latner JD. Stigma, obesity, and the health of the nation's children. Psychol Bull. 2007;133:557–80.
Puhl RM, Heuer CA. The stigma of obesity: a review and update. Obesity. 2009;17:941–64.
Cheng MY, Wang S-M, Lam YY, Luk HT, Man YC, Lin C-Y. The relationships between weight bias, perceived weight stigma, eating behavior and psychological distress among undergraduate students in Hong Kong. J Nerv Ment Dis. 2018;206:705–10.
Lin C-Y, Strong C, Latner JD, Lin Y-C, Tsai M-C, Cheung P. Mediated effects of eating disturbances in the association of perceived weight stigma and emotional distress. Eat Weight Disord. 2018; epub ahead of print.
Wong PC, Hsieh Y-P, Ng HH, Kong SF, Chan KL, Au TYA, Lin C-Y, Fung XCC. Investigating the self-stigma and quality of life for overweight/obese children in Hong Kong: a preliminary study. Child Indic Res. 2018; epub ahead of print.
Latner JD, O’Brien KS, Durson LE, Brinkman LA, MacDonald T. Weighing obesity stigma: the relative strength of different forms of bias. Int J Obes. 2008;32:1145–52.
Puhl RM, Suh Y. Health consequences of weight stigma: implications for obesity prevention and treatment. Curr Obes Rep. 2015;4:182–90.
Tomiyama AJ. Weight stigma is stressful. A review of evidence for the cyclic obesity/weight-based stigma model. Appetite. 2014;82:8–15.
Allison DB, Basile VC, Yuker HE. The measurement of attitudes toward and beliefs about obese persons. Int J Eat Disord. 1991;10:599–607.
Dedeli O, Bursalioglu SA, Deveci A. Validity and reliability of the Turkish version of the attitudes toward obese persons scale and the beliefs about obese persons scale. Clin Nurs Stud. 2014;2:105–17.
Ambwani S, Thomas KM, Hopwood CJ, Moss SA, Grilo CM. Obesity stigmatization as the status quo: structural considerations and prevalence among young adults in the U.S. Eat Behav. 2014;15:366–70.
Snethen JA, Treisman RA, Buseh AG, Kelber ST. Exploring future nurses' attitudes and beliefs related to childhood overweight. J Nurs Educ. 2014;53:S101–5.
Tsai T-I, Luck L, Jefferies D. Wilkes L. Collegian: A tool to measure the attitudes and beliefs of nursing students towards childhood overweight/obesity; 2017. epub ahead of print
Lillis J, Luoma JB, Levin ME, Hayes SC. Measuring weight self-stigma: the weight self-stigma questionnaire. Obesity. 2010;18:971–6.
Puhl RM, Masheb RM, White MA, Grilo CM. Attitudes toward obesity in obese persons: a matched comparison of obese women with and without binge eating. Eat Weight Disord. 2010;15:e173–9.
Chang K-C, Wang J-D, Tang H-P, Cheng C-M, Lin C-Y. Psychometric evaluation using Rasch analysis of the WHOQOL-BREF in heroin-dependent people undergoing methadone maintenance treatment: further item validation. Health Qual Life Outcomes. 2014;12:148.
Lee C-T, Lin C-Y, Tsai M-C, Strong C, Lin Y-C. Psychometric evaluation and wording effects on the Chinese version of the parent-proxy Kid-KINDL. Health Qual Life Outcomes. 2016;14:123.
Lin C-Y, Luh W-M, Cheng C-P, Yang A-L, Ma H-I. Evaluating the wording effect and psychometric properties of the Kid-KINDL: using the multitrait-multimethod approach. Eur J Psychol Assess. 2014;30:100–9.
Lin C-Y, Strong C, Tsai M-C, Lee C-T. Raters interpret positively and negatively worded items similarly in a quality of life instrument for children: Kid-KINDL. Inquiry. 2017;54:1–7.
Gregorich SE. Do self-report instruments allow meaningful comparisons across diverse population groups? Testing measurement invariance using the confirmatory factor analysis framework. Med Care. 2006;44:78–94.
Vandenberg RJ, Lance CE. A review and synthesis of the measurement invariance literature: suggestions, practices, and recommendations for organizational research. Organ Res Methods. 2000;3:4–70.
Eremenco SL, Cella D, Arnold BJ. A comprehensive method for the translation and cross-cultural validation of health status questionnaires. Eval Health Prof. 2005;28:212–32.
Strong C, Lin Y-C, Tsai M-C, Lin C-Y. Factor structure of sizing me up, a self-reported weight-related quality of life instrument, in community children across weight status. Child Obes. 2017;13:111–9.
Yuker HE, Block JR. Research with the attitudes towards disabled persons scale (ATDP) 1960–1985. Hempstead: Hofstra University, Center for the Study of Attitudes Toward Persons with Disabilities; 1986.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Chang C-C, Su J-A, Lin C-Y. Using the affiliate stigma scale with caregivers of people with dementia: psychometric evaluation. Alzheimers Res Ther. 2016;8:45.
Cook KF, Kallen MA, Amtmann D. Having a fit: impact of number of items and distribution of data on traditional criteria for assessing IRT’s unidimensionality assumption. Qual Life Res. 2009;18:447–60.
Cheng C-P, Luh W-M, Yang A-L, Su C-T, Lin C-Y. Agreement of children and parents scores on Chinese version of pediatric quality of life inventory version 4.0: further psychometric development. Appl Res Qual Life. 2016;11:891–906.
Chen FF. Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct Equ Model. 2007;14:464–504.
Wu T-H, Chang C-C, Chen C-Y, Wang J-D, Lin C-Y. Further psychometric evaluation of the Self-Stigma Scale-Short: measurement invariance across mental illness and gender. PLoS One. 2015;10:e0117592.
Rosseel Y, Oberski D, Byrnes J, Vanbrabant L, Savalei V, Merkle E, et al. Package ‘lavaan’. 2017; https://cran.r-project.org/web/packages/lavaan/lavaan.pdf. Accessed 22 Dec 2017.
Revelle W, Revelle MW. Package ‘psych’. 2017; https://cran.r-project.org/web/packages/psych/psych.pdf. Accessed 22 Dec 2017.
Ajzen I. The theory of planned behavior. Organ Behav Hum Decis Process. 1991;50:179–211.
Lin Y-C, Latner JD, Fung XCC, Lin C-Y. Poor health and experiences of being bullied in adolescents: self-perceived overweight and frustration with appearance matter. Obesity. 2018;26:397–404.
Lee K, Dale J, Guy A, Wolke D. Bullying and negative appearance feedback among adolescents: is it objective or misperceived weight that matters? J Adolesc. 2018;63:118–28.
Janz NK, Becker MH. The health belief model: a decade later. Health Educ Behav. 1984;11:1–47.
Cheung GW, Chow IH-S. Subcultures in greater China: a comparisons of managerial values in the People’s republic of China, Hong Kong, and Taiwan. Asia Pac J Manag. 1999;16:369–87.
Morris P. Asia’s four little tiger: a comparison of the role of education in their development. Comp Educ. 1996;32:95–110.
Webbink D, Martin NG, Visscher PM. Does education reduce the probability of being overweight? J Health Econ. 2010;29:29–38.
We deeply appreciate all the participants, all the teachers who helped us recruit the participants, and a group of undergraduate students (Ms. Isabelle Cheng, Ms. Koey Lam, Ms. Jessica Luk, Ms. Renee Man, and Ms. Sin Man Yeung) in the Department of Rehabilitation Sciences, The Hong Kong Polytechnic University who assisted in data collection and data entry.
This research was supported in part by (received funding from) the startup fund in the Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong.
Ethics approval and consent to participate
The present study protocol has been approved by the Institutional Review Board (IRB) in the Ethics Committee of The Hong Kong Polytechnic University (Ref number: HSEARS20161214002).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Tsai, MC., Strong, C., Latner, J.D. et al. Attitudes toward and beliefs about obese persons across Hong Kong and Taiwan: wording effects and measurement invariance. Health Qual Life Outcomes 17, 134 (2019). https://doi.org/10.1186/s12955-019-1198-6