Skip to main content

Chinese cross-cultural adaptation and validation of the Oxford shoulder score



The Oxford Shoulder Score (OSS) is a reliable and valid construct measuring non-specific shoulder pain, which are widely used to evaluate shoulder related quality of life. This study was to cross-culturally adapt and psychometrically validate a simplified Chinese version of the OSS (SC-OSS).


Cross-cultural adaptation was performed according to the international recognized guidelines. Consecutive patients with nonspecific shoulder pain were recruited to test the psychometric properties of SC-OSS. Item response trend and item-total correlation were evaluated to measure homogeneity. Principal component analysis (PCA) was used to evaluate the factorial structure. Cronbach's α and intra-class correlations were used to determine the reliability. Construct validity was analyzed by evaluating the correlations between SC-OSS and the Constant-Murley shoulder outcome score (CMSOS), the short form (36) health survey (SF-36) containing eight domains, and pain visual analogue scale (VAS).


Overall, 121 patients were recruited. Each of the 12 items was properly responded and correlated with the total items. PCA extracted one factor for SC-OSS. SC-OSS had excellent reliability, with a Cronbach's α of 0.92 and intra-class correlation coefficient of 0.97 (95 % CI: 0.94-0.98). SC-OSS had a high correlation with CMSOS, physical functioning (PF) and bodily pain (BP) domains of SF-36 and VAS (r = -0.70, -0.65, -0.53, and -0.66, respectively). SC-OSS moderately correlated with role-physical (RP), social functioning (SF), general health perception (GH) and vitality (VT) (r = -0.45, -0.42, -0.39 and -0.36, respectively), but had a low correlation with role-emotional (RE) and mental health (MH) domains of SF-36 (r = -0.28 and -0.23, respectively).


SC-OSS demonstrated excellent acceptability, internal consistency, reliability and construct validity, which can be recommended for application in mainland China.


Shoulder pain is a common disabling orthopedic condition experienced by many patients worldwide, just following low back pain and neck pain as the third most common disease in orthopedic visits [36]. The lifetime prevalence of shoulder pain in the general population is 6.7-66.7 % [22]. Shoulder pain is also a major problem in China. One previous study reported a 38.8 % one-year prevalence rate in the working population [43]. The most common causes of persistent shoulder pain are rotator cuff disorders, adhesive capsulitis, and glenohumeral osteoarthritis [25]. In addition, shoulder pain greatly affects arm function and leads to disability, which remains an enormous burden for the society [24, 25].

Recently, more and more healthcare providers are concerned about the quality of life of patients with shoulder pain [20, 37]. Interventions that better alleviate symptoms and improve shoulder functions are selected based on patients' reports [6, 37]. Therefore, measurement tools for patient-reported outcomes are essential and critical in the management of shoulder pain. Although there have been several validated self-reported questionnaires for low back pain and neck pain in China [3942], scarce constructs are available for Chinese physicians to evaluate the status of shoulder pain and the effectiveness of interventions. To evaluate the quality of life in patients with shoulder pain, several constructs have been developed such as CMSOS [8], disabilities of the arm, shoulder and hand (DASH), simple shoulder test (SST), and shoulder pain and disability index (SPADI) [1]. Each has its advantages in measuring pain perception and functional disability. Currently, only CMSOS is available for Chinese patients [16, 23]. However, CMSOS can be affected by surgeon bias [30]. As a result, new patients-based constructs for patients with shoulder pain are needed. Therefore, it is imperative to develop a valid and easy administrative construct specific for Chinese patients with shoulder pain.

Cross-cultural adaptation of an existing measure may render the adapted measure as an objective uniform criterion for international or multi-center clinical trials [3, 17]. Due to differences in idioms and traditions in daily life, translation of a questionnaire directly from another language is inadequate for application of the questionnaire in a new culture. Therefore, cultural adaptation of the tool is vital, and psychometric properties must be tested [3, 17].

The Oxford Shoulder Score (OSS), first described by Dawson et al. in 1996, is a shoulder pain specific questionnaire for the evaluation of pain perception and daily function in patients suffering from shoulder pain, with excellent internal consistency, reliability and validity [10]. It has been widely accepted for its simplicity and easy administration for doctors and patients, and has been applied in several clinical conditions such as shoulder surgery [10], rotator cuff injury [34], and frozen shoulder [7]. Currently, it has been translated and validated in Dutch [4], German [19], Danish [15], Korean [28], Turkish [35], Norwegian [12] and Italian [26]. However, no valid simplified Chinese version is available.

Therefore, the aim of this study was to cross-culturally adapt and psychometrically evaluate the simplified Chinese version of OSS (SC-OSS) in patients with shoulder pain in mainland China.


Translation and cross-cultural adaptation

The English version of OSS was translated and cross-culturally adapted according to international guidelines for the process of cross-cultural adaptation of self-report measures [10, 17]. Owing to the acquirement of much experience in validation of many chronic pain related questionnaires by our group [40, 41], the employed process was just similar as the translation and back-translation method. Then, a pilot test of the pre-final version of SC-OSS was carried out in 30 patients with shoulder pain. Each patient completed SC-OSS and was subsequently interviewed about any difficulties in completing the questionnaire or understanding the purpose and meaning of each question. The expert committee, consisting of four forward and backward translators, two orthopedic surgeons, one rehabilitation physician, one physical therapist, one language expert and two patients with shoulder pain, reviewed the findings and developed the final SC-OSS (Appendix), which was subjected to further psychometric testing.


Ethnical Han Chinese outpatients over 18 years of age with a confirmed diagnosis of non-specific shoulder pain for over 4 weeks due to rotator cuff pathology or inflammatory arthritis were eligible for recruitment in the Orthopedic Department at the Changhai Hospital between November 2013 and May 2014. Approximately 50 % of the eligible patients were randomly selected to take “test and retest” with an interval of 3 to 5 days. Patients who had shoulder pain with specified causes, such as shoulder instability, acute traumatic shoulder injury, tuberculosis or tumor, and who were unwilling to complete the questionnaires were excluded from the study. Demographic and clinical data and medical history were obtained, and a physical examination was performed at admission. Radiological images such as X-ray, computerized tomography, or magnetic resonance imaging were performed if they were required to evaluate the disease or confirm the diagnosis. The study was approved by the Human Research Ethics Committee of the Changhai Hospital, and written informed consent was obtained from every participant.



OSS contains 12 items, each of which scores from 0 (worst) to 4 (best), according to a modified version [10], with the total score ranging from 0 (worst) to 48 (best). In the original version, each question was scored from 1 to 5, with 1 representing the best outcomes [10]. Unintuitive findings reported by surgeons using the original version led to the modified scoring system [11].


CMSOS is widely used for assessing the outcomes of the treatment of shoulder disorders, especially with surgeries, including pain perception scoring, functional assessment, range of motion and strength measures [9]. CMSOS consists of four variables with two objective (pain and activities of daily life) and two subjective domains (active range of motion and shoulder strength). It has been trans-culturally adapted into Chinese with excellent reliability and validity [23].

SF-36 health survey

SF-36 is widely applied to assess the health status of patients [38]. It contains eight domains, including physical functioning (PF), role limitations due to physical health (role-physical, or RP), bodily pain (BP), general health perception (GH), social functioning (SF), role limitations due to emotional problems (role-emotional or RE), vitality (VT), and mental health (MH). A higher score indicates a healthier status and less function loss. SF-36 has also been translated and culturally adapted into Chinese [21].

Pain VAS

Pain VAS allows patients to rate their pain intensity on a 100 mm line anchored with two endpoints labeled 0 (No Pain) and 100 (Worst Possible Pain). It is highly accepted and widely used as a psychometric tool for its simplicity and good reliability, validity, and responsiveness [2].

Psychometric testing

Response trend

The scores obtained for each of the items among the patients should be normally distributed, such that the mean score obtained on an item is close to the center of the possible response range of scores available for that item. Items with a mean score near the extreme of a possible range have low variance, whereas items that vary over a narrow range have poor correlations with other items [14].

Therefore, the standardized values of skewness (Z-skewness) were computed for each item. Any items with a skewness value within 1.96 were defined to have a response trend of normal distribution [14].

Ceiling and floor effects

Ceiling and floor effects were also analyzed, with such effects being present if over 15 % of participants achieved the highest or lowest score, respectively.


A measure should be homogeneous; items should assess different facets of the same construct and correlate moderately with each other, and more importantly, each item should correlate with the total items. Therefore, internal consistency and item-total correlations were determined as functions of homogeneity. Internal consistency was calculated as Cronbach’s α, which was considered acceptable if the value was over 0.70 [27]. An item-total correlation coefficient was determined by Pearson’s correlation analysis. An item that had an item-total correlation coefficient less than 0.20 was eliminated [32].

Analysis for the factorial structure

The structure of SC-OSS was explored. Principal Component Analysis (PCA) with both varimax and oblimin rotations were adapted. The Kaiser-Meyer-Olkin measure and Bartlett's test of sphericity were conducted. The factorial structure was determined by the scree plot of the eigenvalues against the component numbers.

Test-retest reliability

To assess the reproducibility of SC-OSS, the intra-class correlation coefficient (ICC) was tested between the scores obtained at the first test and the second test. An ICC value was interpreted as good (>0.60) and excellent (> 0.80) reliability [31]. Also, a Bland-Altman plot was carried out to assess within subject variation and limits of agreement [5].

Construct validity

Construct validity was analyzed by Pearson’s correlation coefficients for the correlations between SC-OSS and other shoulder pain related measures (i.e. CMSOS, SF-36 and VAS). Correlation was interpreted as poor, fair, moderate, very good and excellent when r = |0.00-0.20|, r = |0.21-0.40|, r = |0.41-0.60|, r = |0.61-0.80| and r = |0.80-1.00|, respectively [13].

Statistical analysis

It has been advised that at least 100 patients are necessary for internal consistency analysis and 50 for appropriate analyses of reliability and construct validity [33]. Therefore, over 100 patients were recruited in the study. Statistical analysis was performed using the Statistical Package for the Social Sciences (SPSS) version 18.0 (SPSS, Chicago, IL). Mean values are reported with standard deviation (SD), and ICC values are presented with 95 % confidence intervals (CIs). A P value of less than 0.05 was considered statistically significant for all analyses.


Translation and cross-cultural adaptation

There were no major linguistic problems in the translation. Some discrepancies were encountered due to cultural differences. Specifically, the third item “car” was replaced by “private automobile,” as car has many meanings in Chinese such as small cars, sport-utility vehicle or jeep. “Chopsticks and spoons” were substituted for the fourth item “knife and fork” because Chinese usually do not use a knife and fork. Some male participants reported that they did not do household work so the fifth item was amended as “if you are doing the household shopping, could you do it on your own”. In addition, “bowl” was used to replace the sixth item “tray” since most Chinese people commonly use a bowl to hold food.

Characteristics of patients

Overall, 121 eligible patients were recruited in the study. Of the eligible patients, 55 were selected and participated in the test-retest. The mean completion time was 2.1 (±1.2) minutes. Nearly all of the participants filled out the questionnaire properly. Only 1 or 2 items were left blank in 18 participants with a responding rate was 95.9 % (1426/1452) for total items. The clinical and demographic characteristics of the patients are summarized in Table 1.

Table 1 Baseline demographic and clinical characteristics of patients with non-specific shoulder pain

Response trend and ceiling and floor effects

Z-skewness showed that none of the items were distributed outside 1.96. The item-total correlation analysis revealed that each item had a moderate to good item-total correlation, and none had a Pearson coefficient of less than 0.30. Detailed results are demonstrated in Table 2. No ceiling or floor effect was detected in the total or separate items.

Table 2 Respond trend, corrected item-total correlation and factor loading for each item of SC-OSS

Factorial structure

The Bartlett’s Test of sphericity failed to show that the correlation matrix was suitable for analysis of the factorial structure with adjustment (P < 0.001). Both the eigenvalues and the scree plot (Fig. 1) suggested one latent factorial structure, which jointly accounted for 54.2 % of the total variance.

Fig. 1
figure 1

The scree plot of the eigenvalues against the component numbers for SC-OSS


High internal consistency and test-retest reliability were confirmed, as the Cronbach’s α was 0.92 and the ICC was 0.97 (95 % CI: 0.94-0.98) for SC-OSS. The Bland-Altman plot demonstrated the differences between scores from the two test sessions for the individual patients and the overall means of the two sessions (Fig. 2). No systematic bias was found, indicating favorable test-retest agreement of SC-OSS.

Fig. 2
figure 2

The Bland-Altman plot for test-retest agreement of SC-OSS. The differences between patients for SC-OSS from two test sessions were plotted against the mean of the two session total scores. The line indicates the 95 % (±1.96 standard deviation) limits of agreement

Construct validity

SC-OSS had correlated highly with CMSOS, VAS, and PF and BP domains of SF-36 (r = -0.66,-0.70, -0.65, and -0.53, respectively). SC-OSS correlated moderately with domains of SF-36 relating to function loss due to chronic pain, including RP, SF, GH and VT (r = -0.45, -0.42, -0.39 and -0.36, respectively). Finally, SC-OSS had low correlation with domains of SF-36 unrelated to shoulder pain, including RE and MH in SF-36 (r = -0.28 and -0.23) (Table 3).

Table 3 Comparison on reliability and construct validity among different versions of OSS


In the present study, the English version of the OSS was successfully adapted and psychometrically validated into simplified Chinese. Statistical analysis revealed that all items of SC-OSS were well distributed and moderately correlated with each other, with excellent internal consistency, test-retest reliability and construct validity. Factor analysis demonstrated that SC-OSS was uniform and well-structured for shoulder pain in Chinese patients.

To evaluate the quality of life in patients with shoulder pain, several constructs have been developed such as CMSOS [8], OSS [10], disabilities of the arm, shoulder and hand (DASH), simple shoulder test (SST), and shoulder pain and disability index (SPADI) [1]. Each has its advantages in measuring pain perception and functional disability. Currently, only CMSOS is available for Chinese patients [16, 23]. However, CMSOS can be affected by surgeon bias [30]. Specifically, two components of CMSOS are evaluated by the doctor, which leads to major variations among clinical settings [30]. Also, the reliability of CMSOS has been challenged due to the lack of standardization in the assessment procedures [8, 30]. As a result, new patients-based constructs for patients with shoulder pain are needed. Of the other four constructs mentioned above, the OSS is the most cross-culturally adapted and validated and has been proven to be simple, acceptable and reliable in many different cultures and areas [4, 13, 16, 21, 27, 29, 36]. Hence, we decided to cross-culturally adapt the OSS, for Chinese patient with shoulder pain, as well as provide an international validated tool for multi-center research on quality of life.

In the present study, the acceptance of SC-OSS was high as all the items were well responded by Chinese patients with non-specific shoulder pain, with a responding rate of 95.9 %. High responding rates were also demonstrated in Italian [26], Turkish [35], German [19] and Korean [28] patients for their corresponding version. In the present study, the mean completion time was 2.1 min, similar to that reported for the Turkish and Korean versions. A longer completion time (3 min and 25 s) was reported for the German study, and a completion time was not reported in the Dutch and Italian studies. The short completion time and high responding rate indicates that SC-OSS is highly acceptable, and thus could be easily administered with little effort.

During the cross-cultural adaptation process some minor modifications had to be made due to cultural differences between Chinese and Western patients. Homogeneity analysis revealed that none of the items should be omitted. Item-total correlation demonstrated that each item made a contribution to the sum of the construct. Similar results were also found in English [10], Turkish [35], Korean [28] and German190 studies. The English study applied the ceiling or floor effect, which has a similar statistical meaning to the response trend, and also showed good homogeneity [8]. Homogeneity analyses were not reported in the Italian [26] and Dutch [4] studies. Therefore, SC-OSS is homogeneous in measuring shoulder pain in a Chinese cultural background.

OSS was designed to assess the function of shoulder movement. The total items could be divided into two subgroups: pain related items and interferences of shoulder function related items. Yet, one factor structure was obtained in SC-OSS. No description of facture analysis were reported in other versions of OSS, including Dutch [4], German,19 Danish [15], Korean [28], Turkish [35], Norwegian [12] and Italian versions [26]. Therefore, the factor structure of OSS needs more investigation in different cultural settings.

SC-OSS had good internal consistency and reproducibility, indicating excellent reliability. The Cronbach’s α was 0.92, almost the same as that reported in the English and other language versions, indicating that OSS remains stable in different cultures. In addition, the ICC was 0.97, which is also consistent with that reported in most studies, suggesting that the OSS remains stable over time.

The construct validity of SC-OSS was tested against CMSOS, VAS and SF-36, as these constructs are commonly used in China. As expected, SC-OSS highly correlated with CMSOS, similar to the English, Dutch, German, Italian and Korean studies. However, regarding the correlation with VAS, the Korean study [28] verified a low to moderate correlation (r = 0.34), whereas our study observed a high correlation (r = 0.70). This discrepancy is probably due to the difference in patient demographic characteristics (i.e. a female dominant sample with younger age in our study versus a male dominant sample with older age in the Korean study). However, since no other studies reported a correlation between OSS and VAS, this assumption should be further tested and other factors contributing to the discrepancy should also be explored. SF-36 has been applied worldwide for evaluating patient quality of life, especially in chronic pain patients [18]. It has been used in most studies evaluating the construct validity of the OSS. In the present study, SC-OSS highly correlated with PF and BP, moderately correlated with RP, SF, GH and VT, and had a low correlation with RE and MH. Since the OSS focused on shoulder pain perception and functional disability, it is reasonable that SC-OSS has correlated highly with PF and BP, but had a low correlation with RE and MH. This phenomenon has also been demonstrated in the original English OSS version [10], indicating that the OSS has excellent construct validity across different cultures. However, it should be noted that the English version [10], Italian [26], Dutch [4] and German [19] versions adapted a 1 to 5 rating scale, whereas the Turkish [35] and present study adapted a revised scoring 0-4 rating scale. The different scoring may have led to slight discrepancies in evaluating the correlations between the OSS and other instruments.

There are some limitations in this study. First, the patients in this study are from a single center, which may not fully represent the whole country. Multi-center large sample study is favored. Second, responsiveness (i.e. sensitivity to change, reflects the ability to detect clinically significant changes) of SC-OSS was not determined, so as the German, Korean, Turkish and Italian studies.


SC-OSS was successfully cross-culturally adapted and psychometrically validated in Chinese patients with non-specific shoulder pain. SC-OSS demonstrates high acceptance, excellent internal consistency, reliability, and solid construct validity. These findings provide Chinese surgeons and investigators with a tool to evaluate patients with non-specific shoulder pain.



Oxford shoulder score


Simplified Chinese version of the OSS


Principal component analysis


Constant-Murley shoulder outcome score


he short form (36) health survey


Physical functioning




Bodily pain


General health perception


Social functioning






Mental health


Visual analogue scale


Intra-class correlation coefficient


Confidence intervals


Disabilities of the arm, shoulder and han


Simple shoulder test


Shoulder pain and disability index


  1. Angst F, Schwyzer HK, Aeschlimann A, Simmen BR, Goldhahn J. Measures of adult shoulder function: Disabilities of the Arm, Shoulder, and Hand Questionnaire (DASH) and its short version (QuickDASH), Shoulder Pain and Disability Index (SPADI), American Shoulder and Elbow Surgeons (ASES) Society standardized shoulder assessment form, Constant (Murley) Score (CS), Simple Shoulder Test (SST), Oxford Shoulder Score (OSS), Shoulder Disability Questionnaire (SDQ), and Western Ontario Shoulder Instability Index (WOSI). Arthritis Care Res (Hoboken). 2011;63 Suppl 11:S174–88. doi:10.1002/acr.20630.

    Article  Google Scholar 

  2. Barten JA, Pisters MF, Huisman PA, Takken T, Veenhof C. Measurement properties of patient-specific instruments measuring physical function. J Clin Epidemiol. 2012;65(6):590–601. doi:10.1016/j.jclinepi.2011.12.005.

    Article  CAS  PubMed  Google Scholar 

  3. Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976). 2000;25:3186–91.

    Article  CAS  Google Scholar 

  4. Berendes T, Pilot P, Willems J, Verburg H, te Slaa R. Validation of the Dutch version of the Oxford Shoulder Score. J Shoulder Elbow Surg. 2010;19(6):829–36. doi:10.1016/j.jse.2010.01.017.

    Article  PubMed  Google Scholar 

  5. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10.

    Article  CAS  PubMed  Google Scholar 

  6. Carter MJ, Mikuls TR, Nayak S, Fehringer EV, Michaud K. Impact of total shoulder arthroplasty on generic and shoulder-specific health-related quality-of-life measures: a systematic literature review and meta-analysis. J Bone Joint Surg Am. 2012;94(17):e127.

    Article  PubMed  Google Scholar 

  7. Clement RG, Ray AG, Davidson C, Robinson CM, Perks FJ. Frozen shoulder : long-term outcome following arthrographic distension. Acta Orthop Belg. 2013;79(4):368–74.

    PubMed  Google Scholar 

  8. Conboy VB, Morris RW, Kiss J, Carr AJ. An evaluation of the Constant-Murley shoulder assessment. J Bone Joint Surg Br. 1996;78(2):229–32.

    CAS  PubMed  Google Scholar 

  9. Constant CR, Murley AH. A clinical method of functional assessment of the shoulder. Clin Orthop Relat Res. 1987;214:160–4.

    PubMed  Google Scholar 

  10. Dawson J, Fitzpatrick R, Carr A. Questionnaire on the perceptions of patients about shoulder surgery. J Bone Joint Surg Br. 1996;78(4):593–600.

    CAS  PubMed  Google Scholar 

  11. Dawson J, Rogers K, Fitzpatrick R, Carr A. The Oxford shoulder score revisited. Arch Orthop Trauma Surg. 2009;129(1):119–23. doi:10.1007/s00402-007-0549-7. Epub 2008 Jan 9.

    Article  PubMed  Google Scholar 

  12. Ekeberg OM, Bautz-Holter E, Tveita EK, Keller A, Juel NG, Brox JI. Agreement, reliability and validity in 3 shoulder questionnaires in patients with rotator cuff disease. BMC Musculoskelet Disord. 2008;9:68. doi:10.1186/1471-2474-9-68.

    Article  PubMed Central  PubMed  Google Scholar 

  13. Feise RJ, Michael Menke J. Functional rating index: a new valid and reliable instrument to measure the magnitude of clinical change in spinal conditions. Spine (Phila Pa 1976). 2001;26:78–86. discussion 87.

    Article  CAS  Google Scholar 

  14. Field A: Discovering statistics using SPSS: Sage publications; 2009. (ISBN No. 1847879071)

  15. Frich LH, Noergaard PM, Brorson S. Validation of the Danish version of Oxford Shoulder Score. Dan Med Bull. 2011;58:A4335.

    PubMed  Google Scholar 

  16. Ge Y, Chen S, Chen J, Hua Y, Li Y. The development and evaluation of a new shoulder scoring system based on the view of patients and physicians: the Fudan University shoulder score. Arthroscopy. 2013;29(4):613–22. doi:10.1016/j.arthro.2012.11.009.

    Article  PubMed  Google Scholar 

  17. Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46(12):1417–32.

    Article  CAS  PubMed  Google Scholar 

  18. Hawker GA, Mian S, Kendzerska T, French M. Measures of adult pain: Visual Analog Scale for Pain (VAS Pain), Numeric Rating Scale for Pain (NRS Pain), McGill Pain Questionnaire (MPQ), Short-Form McGill Pain Questionnaire (SF-MPQ), Chronic Pain Grade Scale (CPGS), Short Form-36 Bodily Pain Scale (SF-36 BPS), and Measure of Intermittent and Constant Osteoarthritis Pain (ICOAP). Arthritis Care Res (Hoboken). 2011;63 Suppl 11:S240–52. doi:10.1002/acr.20543.

    Article  Google Scholar 

  19. Huber W, Hofstaetter JG, Hanslik-Schnabel B, Posch M, Wurnig C. The German version of the Oxford Shoulder Score--cross-cultural adaptation and validation. Arch Orthop Trauma Surg. 2004;124(8):531–6.

    Article  PubMed  Google Scholar 

  20. Kindermans HP, Huijnen IP, Goossens ME, Roelofs J, Verbunt JA, Vlaeyen JW. "Being" in pain: the role of self-discrepancies in the emotional experience and activity patterns of patients with chronic low back pain. Pain. 2011;152(2):403–9. doi:10.1016/j.pain.2010.11.009.

    Article  PubMed  Google Scholar 

  21. Li L, Wang HM, Shen Y. Chinese SF-36 Health Survey: translation, cultural adaptation, validation, and normalisation. J Epidemiol Community Health. 2003;57(4):259–63.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. Luime JJ, Koes BW, Hendriksen IJ, Burdorf A, Verhagen AP, Miedema HS, et al. Prevalence and incidence of shoulder pain in the general population; a systematic review. Scand J Rheumatol. 2004;33(2):73–81.

    Article  CAS  PubMed  Google Scholar 

  23. Ma T, Kao MJ, Lin IH, Chiu YL, Chien C, Ho TJ, et al. A study on the clinical effects of physical therapy and acupuncture to treat spontaneous frozen shoulder. Am J Chin Med. 2006;34(5):759–75.

    Article  PubMed  Google Scholar 

  24. Maund E, Craig D, Suekarran S, Neilson A, Wright K, Brealey S, et al. Management of frozen shoulder: a systematic review and cost-effectiveness analysis. Health Technol Assess. 2012;16(11):1-264. doi:10.3310/hta16110.

    Article  CAS  PubMed  Google Scholar 

  25. Meislin RJ, Sperling JW, Stitik TP. Persistent shoulder pain: epidemiology, pathophysiology, and diagnosis. Am J Orthop (Belle Mead NJ). 2005;34(12 Suppl):5–9.

    PubMed  Google Scholar 

  26. Murena L, Vulcano E, D'Angelo F, Monti M, Cherubino P. Italian cross-cultural adaptation and validation of the Oxford Shoulder Score. J Shoulder Elbow Surg. 2010;19(3):335–41. doi:10.1016/j.jse.2009.07.068.

    Article  PubMed  Google Scholar 

  27. Nunnally J. BI: Psychometric Theory. San Francisco: McGraw-Hill; 1994.

    Google Scholar 

  28. Roh YH, Noh JH, Kim W, Oh JH, Gong HS, Baek GH. Cross-cultural adaptation and validation of the Korean version of the Oxford shoulder score. Arch Orthop Trauma Surg. 2012;132(1):93–9. doi:10.1007/s00402-011-1393-3.

    Article  PubMed  Google Scholar 

  29. Rocourt MH, Radlinger L, Kalberer F, Sanavi S, Schmid NS, Leunig M, et al. Evaluation of intratester and intertester reliability of the Constant-Murley shoulder assessment. J Shoulder Elbow Surg. 2008;17(2):364–9. doi:10.1016/j.jse.2007.06.024.

    Article  PubMed  Google Scholar 

  30. Roy JS, MacDermid JC, Woodhouse LJ. A systematic review of the psychometric properties of the Constant-Murley score. J Shoulder Elbow Surg. 2010;19(1):157–64. doi:10.1016/j.jse.2009.04.008.

    Article  PubMed  Google Scholar 

  31. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8.

    Article  CAS  PubMed  Google Scholar 

  32. Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use. New York: Oxford University Press; 1995.

    Google Scholar 

  33. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.

    Article  PubMed  Google Scholar 

  34. Tudisco C, Bisicchia S, Savarese E, Fiori R, Bartolucci DA, Masala S, et al. Single-row vs. double-row arthroscopic rotator cuff repair: clinical and 3 Tesla MR arthrography results. BMC Musculoskelet Disord. 2013;14:43. doi:10.1186/1471-2474-14-43.

    Article  PubMed Central  PubMed  Google Scholar 

  35. Tugay U, Tugay N, Gelecek N, Ozkan M. Oxford Shoulder Score: cross-cultural adaptation and validation of the Turkish version. Arch Orthop Trauma Surg. 2011;131(5):687–94. doi:10.1007/s00402-010-1242-9.

    Article  PubMed  Google Scholar 

  36. van der Windt DA, van der Heijden GJ, Scholten RJ, Koes BW, Bouter LM. The efficacy of non-steroidal anti-inflammatory drugs (NSAIDS) for shoulder complaints. A systematic review. J Clin Epidemiol. 1995;48(5):691–704.

    Article  PubMed  Google Scholar 

  37. Veehof MM, Oskam MJ, Schreurs KM, Bohlmeijer ET. Acceptance-based interventions for the treatment of chronic pain: a systematic review and meta-analysis. Pain. 2011;152(3):533–42. doi:10.1016/j.pain.2010.11.002.

    Article  PubMed  Google Scholar 

  38. Ware Jr JE, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med care. 1992;30:473–83.

    Article  PubMed  Google Scholar 

  39. Wei X, Wang Z, Yang C, Wu B, Liu X, Yi H, et al. Development of a simplified Chinese version of the Hip Disability and Osteoarthritis Outcome Score (HOOS): cross-cultural adaptation and psychometric evaluation. Osteoarthritis Cartilage. 2012;20(12):1563–7. doi:10.1016/j.joca.2012.08.018.

    Article  CAS  PubMed  Google Scholar 

  40. Wei X, Yi H, Wu B, Qi M, Liu X, Chen Z, et al. A valid cross-culturally adapted simplified Chinese version of the Quebec Back Pain Disability Scale. J Clin Epidemiol. 2012;65(12):1321–8. doi:10.1016/j.jclinepi.2012.06.007.

    Article  PubMed  Google Scholar 

  41. Wei X, Zhu X, Bai Y, Wu D, Chen J, Wang C, et al. Development of the Simplified Chinese Version of the Spinal Appearance Questionnaire: cross-cultural adaptation and psychometric properties evaluation. Spine (Phila Pa 1976). 2012;37(17):1497–504. doi:10.1097/BRS.0b013e3182407e25.

    Article  Google Scholar 

  42. Yi H, Ji X, Wei X, Chen Z, Wang X, Zhu X, et al. Reliability and validity of simplified Chinese version of Roland-Morris questionnaire in evaluating rural and urban patients with low back pain. PLoS One. 2012;7(1):e30807. doi:10.1371/journal.pone.0030807.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  43. Yu S, Lu ML, Gu G, Zhou W, He L, Wang S. Musculoskeletal symptoms and associated risk factors in a large sample of Chinese workers in Henan province of China. Am J Ind Med. 2012;55(3):281–93. doi:10.1002/ajim.21037.

    Article  PubMed  Google Scholar 

Download references


This project is supported by the National Natural Science Foundation of China (Grant No. 81171766 and 81572211).

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Xianzhao Wei or Zimin Wang.

Additional information

Competing interests

The authors, their immediate families, and any research foundation with which they are affiliated did not receive any financial payments or other benefits from any commercial entity related to the subject of this article.

Authors‘ contributions

Conception and design: XZW, ZMW. Data analysis and interpretation: XMX, XZW and FW. Collection and assembly of data: XMX, FW, XLW. Manuscript writing: XMX, XZW, FW. Manuscript modifying: XZW and ZMW. Manuscript Final approval of manuscript: XMX, XZW, FW, XLW, and ZMW.

Ximing Xu, Fei Wang and Xiaolin Wang contributed equally to this work.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, X., Wang, F., Wang, X. et al. Chinese cross-cultural adaptation and validation of the Oxford shoulder score. Health Qual Life Outcomes 13, 193 (2015).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: