- Open Access
Shoulder pain and disability index: cross cultural validation and evaluation of psychometric properties of the Spanish version
Health and Quality of Life Outcomes volume 13, Article number: 200 (2015)
The Shoulder Pain Disability Index (SPADI) is a recently published but widely used outcome measure.
This study included 136 patients with shoulder disorders. SPADI was first translated and back-translated and then subjected to psychometric validation. Participants completed the Spanish versions of the SPADI, general health (SF-12), the Simple Shoulder Test (SST), Disability of Arm, Shoulder, and Hand (DASH) questionnaires and a pain intensity visual analog scale (VAS).
The factors explained 62.8 % of the variance, with an internal consistency of α = 0.916 and 0.860, respectively. The confirmatory factor analysis showed a Comparative Fit Index of 0.82 and a Normed Fit Index of 0.80. The Root Mean Square Error of Aproximation was 0.12. The x 2 test for the 2-factor model was significant (x 2 = 185.41, df = 62, p < 0.01). The test-retest reliability was high, with an item ranging of the interclass correlation coefficient (ICC) from 0.89 to 0.93. The ICC for the total score was 0.91 (95 % CI 0.88 to 0.94). Measurement error by minimal detectable change (MDC)95 was 12.2 %. In the construct validity analysis, strong positive correlations were observed between Spanish Version of the SPADI and DASH (pain: r = 0.80; p < 0.01; disability: r = 0.76; p < 0.01). Moderate positive correlations were observed between Spanish Version of the SPADI and VAS (pain: r = 0.67; p < 0.01; disability: r = 0.65; p < 0.01). Moderate negative correlations were obtained between Spanish Version of the SPADI and SST-Sp (pain: r = −0.71; p < 0.01; disability: r = −0.75; p < 0.01). However, pain total Spanish Version of the SPADI was only weakly correlated with physical and mental components of SF-12 (both r = 0.40; p < 0.01).
This Spanish version of SPADI demonstrated satisfactory psychometric properties in a patient sample in the hospital setting.
Health related patient reported outcome measures (PROMs) is considered essential to determine the impact of disease on the life of individuals, taking account not only of the clinical diagnosis of a disease but also of its impact .
Shoulder pain is one of the most common causes of musculoskeletal pain, with a prevalence of 20–33 % in the general population . Shoulder disorders are responsible for major medical, social, and economic costs . They are often accompanied by pain and restricted movement, hampering certain activities  and compromising psychological and social wellbeing .
Various questionnaires are available in English to evaluate the impact of shoulder disorders on function. With regard to other languages, it is recommended to translate, culturally adapt, and validate existing instruments in order to avoid the further proliferation of different methods [6–8].
The original version of the Shoulder Pain Disability Index (SPADI) is a quality of life questionnaire developed to evaluate the pain and disability associated with shoulder dysfunction . The SPADI is a 13-item shoulder function index on the ability of responders to carry out basic activities of daily living. Each item is scored by a numeric rating scale that ranges from 0 (no pain/no difficulty) to 10 (worst pain imaginable/so difficult it required help). SPADI provides a pain scale (5 items; scale score range 0–50 points, expressed as percentage) and a disability scale (8 items; scale score range 0 – 80 points, expressed as percentage). The two scale scores are averaged to derive a total Spanish Version of the SPADI score (0–100 points). A higher score indicates greater pain-related disability . The original version of the SPADI was initially proposed as a visual analog scale (VAS) and subsequently validated as a numerical scale to enable administration by telephone, obtaining reliable and valid results .
Previous systematic reviews have found no single questionnaire to superior to others in terms of administrative burden or measurements properties . There are other validated questionnaires already available as Disability of Arm, Shoulder, and Hand (DASH) or Simple Shoulder Test (SST) but several studies have recommended SPADI also, as specific scale for the shoulder due to its easy administration and rapid completion (3–10 min), which is facilitated by the brevity of the questions and the numerical response scale used (from 0 to 100) [11–13]. Systematic reviews have described SPADI as one of the highest-quality questionnaires related to the upper extremity and have endorsed its utilization [10, 13].
Although the SPADI questionnaire has been validated in German , Slovenian , Turkish , Italian , Portuguese , Persian , and Danish , it has not yet been validated in Spanish.
The objective of this study was to report the procedure followed for the cross-cultural adaptation and subsequent validation of a Spanish version of SPADI, including an examination of its psychometric properties.
A two-stage observational study was conducted. The first stage comprised the translation and cross-cultural adaptation of SPADI, while the second stage consisted of a prospective evaluation of the internal consistency, reliability, construct validity and measurement error of Spanish Version of the SPADI.
Stage 1- Translation and cross cultural adaptation
Two physicians, with adequate expertise in shoulder disorders management and both lenguages, and an independent native professional interpreter translated the English version of SPADI into Spanish and organized a meeting to take account of possible cultural issues. A back-translation process was carried out by a specialist translator to guarantee the conceptual equivalence of the terms used, as recommended in the literature [21, 22]. People involved in translation worked independently. In a second meeting, we compared the two versions and found no appreciable differences between them. A final version of the Spanish Version of the SPADI was agreed and tested in a pilot study with 40 patients (24 females, age = 45,6 ± 13.0 years) with shoulder problems (fractures and tendinopaties) recruited from among rehabilitation outpatients at hospital setting. This pilot study included cognitive debriefing standardised interviews carried out for one member of the research staff to assess its comprehensibility and ensure that the items retained the meaning of the original version.
Stage 2- Evaluation of psychometric properties
Participants and procedure
This questionnaire validation study included 136 volunteers with different shoulder disorders recruited from among rehabilitation outpatients in a hospital setting. Inclusion criteria were the presence of a shoulder disorder and the availability of a diagnosis by a specialist rehabilitation physician; diagnoses were classified into six subcategories (Table 1). Exclusion criteria were age under 18 years and inadequate command of Spanish to complete the questionnaires. All patients signed their informed consent to participation in the study, which was approved by the Research Ethics Committee of our hospital.
Participants who met the selection criteria completed a Spanish version of the SF-12 , SST , DASH  questionnaires and VAS in the hospital with the assistance of rehabilitation service staff. SF-12 (Version 1) is a self-administered instrument with 12 items on physical and mental health status; responses are scored (for intensity or frequency) on a Likert-type scale (3–6 points according to the item). These items are used to calculate the physical and mental summary measures. This instrument has shown adequate reliability (ICC = 0.73-0.86) . SF12 was used to check discriminant construct validity. The SST is a 12-item shoulder function scale on the ability (yes/no) of respondents to perform 12 activities of daily living (ADLs). The total SST score (0 to 100) expresses the percentage of items with a positive response. The Spanish version of the SST was recently validated and showed adequate reliability (ICC = 0.69–0.94) . The 30-item DASH measures the function and symptoms of patients with upper extremity musculoskeletal disorders. The total score ranges from 0 (best state) to 100 (worst). The Spanish version of DASH has been validated and showed adequate reliability (Cronbach alpha = 0.96) . SST and DASH were usded to check convergent validity. Pain intensity was tested using a VAS (0 to 10).
A randomly selected subgroup (n = 56) of the total sample repeated the questionnaires after an interval of 24–48 h to study the reliability of the Spanish Version of the SPADI using a test-retest methodology. This sample was selected using a random numeric sequence generated by a computer. A brief interval of 24–48 h was selected to avoid fluctuations in the severity of the symptoms. The questionnaires were administrated in same conditions used in the all sample.
Sample size was selected in accordance with recommendations to include 4–10 subjects per variable, with a minimum sample size of 100 subjects to ensure stability of the variance–covariance matrix in the confirmative factor analysis .
Means and standard deviations of the demographic variables were calculated. Construct validity and factor structure were then determined from maximum likelihood extraction (MLE) with varimax rotation, establishing the satisfaction of the following three criteria as a priori extraction requirement: scree-plot inflection, eigenvalue >1.0, and variance >10 %. The fit of confirmatory factor analysis was considered to be acceptable if the comparative fit index (CFI) and the normalized fit index (NFI) were greater than 0.90, with root mean square error of approximation (RMSEA) values equal to or less than 0.08 [27, 28] The internal consistency of measures was evaluated by determining the Cronbach alpha coefficient in an expected range of 0.70 to 0.90 . The test-retest reliability was analyzed by using the type 2.1 interclass correlation coefficient, and the error sensitivity was calculated with the MDC95 analysis of Stratford. Where MDC = 1.96 × SEM × square root of 2 . The construct validity was determined by comparing Spanish Version of the SPADI with SF-12, SST, DASH and VAS scores. Correlations were calculated using Pearson approach. A correlation value below 0.25 indicates a weak relationship, a value between 0.25 and 0.50 a fair relationship, a value between 0.50 and 0.75 a moderate to strong relationship, and a correlation above 0.75 a strong relationship . Moderate to strong positive (DASH, VAS) and negative (SST) correlations with Spanish Version of the SPADI were expected, with a similar pattern of expected correlations for both dimensions of the index. A lower correlation was expected between Spanish Version of the SPADI and SF-12, which is a generic functionality scale. SPSS version 21.0 for IOS (IBM, Chicago, IL) and LISREL v.8.8 (SSI Inc., Lincolnwood, IL) were used for the statistical analysis.
Potentially eligibles participants in database during the period of the study were 234 patients wich were invited for eligibilty 172 patients. 36 patients refused to participate and finally 136 participants were included. 40 participants completed the second questionnaire of the 56 invited. All participants comprehend the questionnaire and found it easy to complete. No conceptual ambiguities or language difficulties were encountered in the translation of the SPADI (Additional file 1). Minor changes in the Spanish version included the replacement of imperial with metric measures, e.g., “10 lb” becomes “4.5 Kg” in item 12. After the cognitive debriefing interviews ensured that the items retained the meaning of the original version.
Table 1 exhibits the demographic characteristics of the participants and the distribution of diagnoses. In the factorial analysis, the correlation matrix for Spanish Version of the SPADI was adequate according to results of the Kaiser-Meyer-Oklin (0.92) and Bartlett’s sphericity test (p < 0.001). A 62.8 % of the variance was explained by two factors. The item loading is shown in Table 2: factor 2 comprised items 1, 9, and 10, while factor 1 included the remaining items with the exception of item 13, which showed cross-loading. The confirmatory factor analysis showed a Comparative Fit Index of 0.82 and a Normed Fit Index of 0.80. The Root Mean Square Error of Aproximation was 0.12. The x2 test for the 2-factor model was significant (x 2 = 185.41, df = 62, p < 0.01 (Fig. 1).
For the Spanish Version of the SPADI, there were no significant missing responses (<7 missing response in 136 patients), similar level of missing response was found in the rest of questionnaires used in this study. After check the origen of this missing data a missing at random was assumed. A high degree of internal consistency was obtained for each factor α = 0.92 (CI95 % 0.91 to 0.95) and 0.82 (CI95 % 0.76 to 0.86), The test-retest reliability was also evaluated with ICC, with an item ranging from 0.89 to 0.93. The ICC for the total score was 0.91 (95 % CI 0.88 to 0.94). Measurement error by MDC95 was 12.2 %.
The construct validity was confirmed by strong positive correlations between the Spanish Version of the SPADI and DASH (pain: r = 0.80; p < 0.01; disability: r = 0.76; p < 0.01), moderate positive correlations between SPADI and VAS (pain: r = 0.67; p < 0.01; disability: r = 0.65; p < 0.01), moderate negative correlations between SPADI and SST (pain: r = −0.71; p < 0.01; disability: r = −0.75; p < 0.01) and weak positive correlation with physical and mental components of SF-12 (both r = 0.40; p < 0.01).
SPADI demonstrated good internal consistency and convergent validity and reliability in a sample of 136 patients with different shoulder disorders.
The confirmatory factor analysis showed an acceptable fit with a CFI of 0.82 and NFI of 0.80, but the error was higher (RMSEA = 0.12) than the recommended value of 0.08 . The internal consistency value calculated for this version was similar to that obtained for versions validated in other languages [14–20] and within an acceptable range, in common with the original questionnaire and all other published versions . The Spanish version showed a clear difference in the loading of the two factors, with items that evidenced a high correlation with one of the factors and a low correlation with the other. Our finding contrasts with the similar loading of the two factors reported for the original questionnaire  and subsequent versions , which hindered a clear separation between pain and disability dimensions. The fit indices associated with the confirmatory factor analysis model were satisfactory, although the error of approximation was an exception and did not indicate an optimal fit, which may possibly be due to an effect of our specific study population on the factor structure. It should be borne in mind that a slightly increased error does not necessarily imply that the structure of the scale is poor. The good construct validity obtained with this Spanish version supports its usefulness for evaluating patients’ perception of the impact of shoulder lesions.
The test-retest reliability of Spanish Version of the SPADI (0.89–0.93) was superior to that reported for the original questionnaire (0.64–0.66), slightly higher than that of the Persian  and Danish  versions, and similar to that of the Brazilian , German , Turkish , and Slovenian  versions. Perhaps, the reason could be a Spanish sample is more homogenous. The Measurement error by MDC95 was 12.2 %, lower than the original version, was of 18 % .
As expected, the lowest correlation (divergent construct validity) was observed between SPADI and SF-12, reflecting the discriminative validity of the instrument, given that SF-12 is a generic functionality scale. Moderate-strong correlations were found with the specific shoulder-related instruments (SST and DASH), indicating adequate convergent validity. Low-moderate correlation was found between SPADI and VAS, as observed in previous cross-cultural validation studies [14, 20].
The present study population was limited to hospital outpatients in a Spanish urban setting, and different results may be obtained in other types of population. The greater reliability observed for the Spanish version than for the original questionnaire may be attributable to the shorter interval between tests (48 h) which could induce an artificial inflation of correlation coefficients due to recall bias. Studies of this version of the questionnaire are warranted to test its validity in other Spanish-speaking countries (e.g., Latin America, Philippines, etc.). A study strength is that the sample size was larger (n = 136) than in previous validation studies of this instrument, reducing standard errors and supporting its application in individual and group studies . Further research is needed to establish longitudinal validity, responsiveness and thresholds for minimal important change in the Spanish version of SPADI.
In conclusion, the psychometric properties of the Spanish version of SPADI are similar to those of the original questionnaire and subsequent adaptations in different languages, supporting its utilization as a reliable clinimetric instrument in the setting of shoulder disorders.
World Health Organization. ICF - International Classification of Functioning, Disability and Health. Geneva, Switzerland: World Health Organization; 2001. p. 10–20.
McBeth J, Jones K. Epidemiology of chronic musculoskeletal pain. Best Pract Res Clin Rheumatol. 2007;21:403–25.
Virta L, Joranger P, Brox JI, Eriksson R. Costs of shoulder pain and resource use in primary health care: a cost-of-illness study in Sweden. BMC Musculoskelet Disord. 2012;13:17.
Largacha M, Parsons IM, Campbell B, Titelman RM, Smith KL, Matsen III F. Deficits in shoulder function and general health associated with sixteen common shoulder diagnoses: a study of 2674 patients. J Shoulder Elbow Surg. 2006;15:30–9.
Paananen M, Taimela S, Auvinen J, Tammelin T, Zitting P, Karppinen J. Impact of self-reported musculoskeletal pain on health- related quality of life among young adults. Pain Medicine. 2011;12:9–17.
Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phil 1976). 2000;25:3186–91.
Kirkley A, Alvarez C, Griffin S. The development and evaluation of a disease-specific quality-of-life questionnaire for disorders of the rotator cuff: The Western Ontario Rotator Cuff Index. Clin J Sport Med. 2003;13:84–92.
Leggin BG, Michener LA, Shaffer MA, Brenneman SK, Iannotti JP, Williams Jr GR. The Penn shoulder score: reliability and validity. J Orthop Sports Phys Ther. 2006;36:138–51.
Roach KE, Budiman-Mak E, Songsiridej N, Lertratanakul Y. Development of a shoulder pain and disability index. Arthritis Care Res. 1991;4:143–9.
Schmidt S, Ferrer M, González M, González N, Valderas JM, Alonso J, et al. Evaluation of shoulder-specific patient-reported outcome measures: a systematic and standardized comparison of available evidence. J Shoulder Elbow Surg. 2014;23:434–44.
Williams Jr JW, Holleman Jr DR, Simel DL. Measuring shoulder function with the Shoulder Pain and Disability Index. J Rheumatol. 1995;22:727–32.
MacDermid JC, Solomon P, Prkachin K. The Shoulder Pain and Disability Index demonstrates factor, construct and longitudinal validity. BMC Musculoskelet Disord. 2006;7:12.
Bot SD, Terwee CB, van der Windt DA, Bouter LM, Dekker J, de Vet HC. Clinimetric evaluation of shoulder disability questionnaires: a systematic review of the literature. Ann Rheum Dis. 2004;63:335–41.
Angst F, Goldhahn J, Pap G, Mannion AF, Roach KE, Siebertz D, et al. Cross-cultural adaptation, reliability and validity of the German Shoulder Pain and Disability Index (SPADI). Rheumatology (Oxford). 2007;46:87–92.
Jamnik H, Spevak MK. Shoulder Pain and Disability Index: validation of Slovene version. Int J Rehabil Res. 2008;31:337–41.
Bicer A, Ankarali H. Shoulder pain and disability index: a validation study Turkish women. Sind Med J. 2010;5:865–70.
Marchese C, Cristalli G, Pichi B, Manciocco G, Mercante G, Pellini R, et al. Italian cross-cultural adaptation and validation of three different scales for the evaluation of shoulder pain and dysfunction after neck dissection: University of California-Los Angeles (UCLA) Shoulder Scale, Shoulder Pain and Disability Index (SPADI) and Simple Shoulder Test (SST). Acta Otorhinolaryngol Ital. 2012;32:12–7.
Martins J, Napoles B, Hoffman C, Oliveira A. Versão brasileira do Shoulder Pain and Disability Index: tradução, adaptação cultural e confiabilidad. Revista Brasileira Fisioterapia São Carlos. 2010;14:527–36.
Ebrahimzadeh MH, Biriandineiad A, Golhasani F, Moradi A, Vahedi E, Kachooei AR. Cross-cultural adaptation, validation, and reliability testing of the Shoulder Pain and Disability Index in the Persian population with shoulder problems. Int J Rehabil Res. 2013; Oct 9. [Epub ahead of print].
Christiansen DH, Andersen JH, Haahr JP. Cross-cultural adaption and measurement properties of the Danish version of the Shoulder Pain and Disability Index. Clin Rehabil. 2013;27:355–60.
Cuesta-Vargas A, Gonzalez-Sanchez M, Farasyn A. Development of a Spanish version of the “Backache Index” Cross cultural linguistic adaptation and reliability. J Back Musculoskelet Rehabil. 2010;23:105–10.
Muñiz J, Elosua P, Hambleton RK. International test commission guidelines for test translation and adaptation. Psicothema. 2013;25:151–7.
Gandek B, Ware JE, Aaronson NK, Apolone G, Bjorner JB, Brazier JE, et al. Cross-validation of item selectionand scoring for the SF-12 Health Survey in nine countries: results from the IQOLA Project International Quality of Life Assessment. J Clin Epidemiol. 1998;51:1171–8.
Membrilla-Mesa MD, Tejero-Fernández V, Cuesta-Vargas AI, Arroyo-Morales M. Validation and reliability of a Spanish version of Simple Shoulder Test (SST-Sp). Qual Life Res. 2014;24(2):411–6.
Hervás MT, Navarro-Collado MJ, Peiró S, Rodrigo-Pérez JL, López- Mateú P, Martínez-Tello I. Spanish version of the DASH questionnaire.Cross-cultural adaptation, reliability, validity and responsiveness. Med Clin. 2006;127:441–7.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.
Schumacher RE, Lomax RGA. Beginner's guide to structural equation modeling. 14 Mahwah, NJ: Lawrence Erlbaum; 1996. 15.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Mokkink LB, Terwee CB, Knol DL, Stratford PW, Alonso J, Patrick DL, et al. The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content. BMC Med Res Methodol. 2010;10:22.
Stratford PW. Getting more from the literature: Estimating the standard error of measurement from reliability studies. Physiother Can. 2004;56:27–30.
Colton T. Statistics in Medicine. Boston: Little Brown; 1974.
Roddey TS, Olson SL, Cook KF, Gartsman GM, Hanten W. 30.-Comparison of the University of California-Los Angeles Shoulder Scale and the Simple Shoulder Test with the shoulder pain and disability index: single-administration reliabilityand validity. Phys Ther. 2000;80:759–68.
Fayers PM, Machin D. Scores and measurements: validity, reliability, sensitivity. In: Quality of life: The assessment, analysis and interpretation of patient reported outcomes. West Sussex, England: John Wiley& Sons Ltd; 2007.
This study received a grant (8.06/5.42.3064-3) from the Research Office of the University of Malaga.
The authors declare that they have no competing interests.
AIC-V and MAM has made a contribution to the conception and design of this study and to analyze the data. RPC, VTF, LMM and MDMM recruit the patient, collect data and made substantial contributions to the manuscript. All the authors have given final approval of the version to be published.
About this article
Cite this article
Membrilla-Mesa, M.D., Cuesta-Vargas, A.I., Pozuelo-Calvo, R. et al. Shoulder pain and disability index: cross cultural validation and evaluation of psychometric properties of the Spanish version. Health Qual Life Outcomes 13, 200 (2015). https://doi.org/10.1186/s12955-015-0397-z
- Patient reported outcomes