Skip to main content

Cross-cultural adaptation, reliability and validity of the Fremantle Knee Awareness Questionnaire in Italian subjects with painful knee osteoarthritis


Background and aim

Growing attention is being given to utilising physical function measures to better understand and manage knee osteoarthritis (OA). The Fremantle Knee Awareness Questionnaire (FreKAQ), a self-reported measure of body-perception specific to the knee, has never been validated in Italian patients. The aims of this study were to culturally adapt and validate the Italian version of the FreKAQ (FreKAQ-I), to allow for its use with Italian-speaking patients with painful knee OA.


The FreKAQ-I was developed by means of forward–backward translation, a final review by an expert committee and a test of the pre-final version to evaluate its comprehensibility. The psychometric testing included: internal structural validity by Rasch analysis; construct validity by assessing hypotheses of FreKAQ correlations with the knee injury and osteoarthritis outcome score (KOOS), a pain intensity numerical rating scale (PI-NRS), the pain catastrophising scale (PCS), and the Hospital anxiety and depression score (HADS) (Pearson’s correlations); known-group validity by evaluating the ability of FreKAQ scores to discriminate between two groups of participants with different clinical profiles (Mann–Whitney U test); reliability by internal consistency (Cronbach’s alpha) and test–retest reliability (intraclass correlation coefficient, ICC2.1); and measurement error by calculating the minimum detectable change (MDC).


It took one month to develop a consensus-based version of the FreKAQ-I. The questionnaire was administered to 102 subjects with painful knee OA and was well accepted. Internal structural validity confirmed the substantial unidimensionality of the FreKAQ-I: variance explained was 53.3%, the unexplained variance in the first contrast showed an eigenvalue of 1.8, and no local dependence was detected. Construct validity was good as all of the hypotheses were met; correlations: KOOS (rho = 0.38–0.51), PI-NRS (rho = 0.35–0.37), PCS (rho = 0.47) and HADS (Anxiety rho = 0.36; Depression rho = 0.43). Regarding known-groups validity, FreKAQ scores were significantly different between groups of participants demonstrating high and low levels of pain intensity, pain catastrophising, anxiety, depression and the four KOOS subscales (p ≤ 0.004). Internal consistency was acceptable (α = 0.74) and test–retest reliability was excellent (ICC = 0.92, CI 0.87–0.94). The MDC95 was 5.22 scale points.


The FreKAQ-I is unidimensional, reliable and valid in Italian patients with painful knee OA. Its use is recommended for clinical and research purposes.


Knee osteoarthritis (OA) occurs due to a broad variety of factors, including genetic, metabolic, biomechanical, or post-traumatic causes [1]. It usually develops over a 10- to 15-year period due to articular cartilage damage, bony osteophyte formation and sclerosis of the subchondral bone [2]. Patients commonly report local pain after prolonged rest, stiffness, reduced range of motion, muscle weakness, swelling, and locking [3]. These clinical manifestations lead to difficulty in functional activities such as prolonged sitting, walking and climbing stairs, and an overall reduced quality of life [4].

People with persistent pain secondary to knee OA also exhibit disruption of some of the mechanisms thought to underpin body-perception, including impairments in limb laterality recognition [5] as well as reduced proprioceptive [6, 7] and tactile [8] acuity. Based on this premise, a new scale, derived from the Fremantle Back Awareness Questionnaire [9], namely the Fremantle Knee Awareness Questionnaire (FreKAQ), was recently introduced to directly measure altered body-perception specific to the knee. The developers demonstrated its unidimensionality, and that its score can provide a measure of perceptual impairment for patients suffering from knee OA [10]. Further, the FreKAQ showed acceptable internal consistency, test–retest reliability, and construct validity [10]. Indeed, it was correlated with pain intensity, disability, pain related catastrophising, kinesiophobia and anxiety [10].

Consequently, the FreKAQ attracted our attention as a useful means of assessing symptoms of impaired body-perception in subjects suffering from chronic knee OA, in order to comprehensively plan conservative treatments. However, there is potential for considerable variability in the performance of a patient-reported outcome measure when cross-culturally adapted and used in a different country from where it was originally developed [11]. Established methodological standards are advocated when validation studies are conducted, and these studies contribute to increasing the quality of an outcome measure, allowing for more robust comparison of results across different countries [12].

As an Italian version of the FreKAQ has not been cross-culturally adapted and psychometrically analysed, Italian researchers and clinicians are limited from using this instrument. Therefore, the aims of this study were to develop an adapted Italian version of the questionnaire (FreKAQ-I), evaluate—in individuals with painful knee osteoarthritis- its reliability by assessing internal consistency and test–retest reliability, measurement error by assessing minimum detectable change, as well as construct validity by Rasch analysis and testing a priori hypotheses regarding associations between FreKAQ-I and measures of knee pain related disability, pain intensity, pain related catastrophising and mood.


This cross-sectional study was approved by our Local Ethical Committee on the 24/10/2019 (no. 2386), and conducted in accordance with ethical and humane principles of research outlined in the Declaration of Helsinki.


The study involved people attending the Rehabilitation Unit of a Research Hospital in Milan (Italy), between November 2019 and May 2020. The inclusion criteria were: primary knee OA ruled in by X-ray examination [13]; stable knee pain of at least 3 months duration; an age of > 18 years; and fluency in Italian (i.e. the ability to read and write). Exclusion criteria were: other causes of knee pain (e.g. previous lower limb surgery, infection, fracture, osteonecrosis or malignancy); systemic illness; recent myocardial infarctions or cerebrovascular events, ruled in by imaging (radiographs and, in doubtful cases, Computed Tomography or Magnetic Resonance Imaging) and/or case history; mental health/psychiatric problems (Mini-Mental State Examination scale of < 24); and unwillingness or inability to provide informed consent.


Outpatients visiting the centre during the study period were evaluated by two physiatrists, coordinated by the principal investigator. Those satisfying the inclusion criteria were invited to sign a written informed consent form. Once the patients had given their approval to participate in the study, their demographic and clinical characteristics were recorded by a research assistant and participants completed the questionnaires listed below. All participants were asked to complete the FreKAQ-I a second time 8 ± 2 (mean ± standard deviation) days after their initial appointment.


FreKAQ—The questionnaire is self-administered, includes 9 items and investigates neglect-like symptoms, reduced proprioceptive acuity, and issues of body shape and size. Each item has a 5-point Likert scale (ranging from “never” = 0 to “always” = 4), with higher scores indicating greater levels of knee-specific disrupted body-perception [10].

Knee injury and osteoarthritis outcome score (KOOS)—This is a 42-item self-administered questionnaire made of five subscales, labeled: Pain, Symptoms, Activities of Daily Living (ADLs), Sport/Recreation (Sport/Rec), and knee-related Quality of Life (QoL).

In this study, the Sport/Rec subscale was not administered due to the characteristics of the population enrolled.

A 5-point Likert scale ranging from 0 (no problems) to 4 (extreme problems) is used to score each item, and the raw scores of each subscale are separately transformed into a 0–100 scale, with 0 indicating the worst problems and 100 indicating no problems [14].

Pain intensity numerical rating scale (PI-NRS)—An 11-point numerical rating scale ranging from 0 (no pain at all) to 10 (the worst imaginable pain) was used [15]. Participants were asked to rate their current pain intensity at both rest and during movement.

Pain catastrophising scale (PCS)—This is a 13-item self-report questionnaire in which participants are asked to rate the degree to which they have any of the thoughts described in the questionnaire using a five-point scale, ranging from 0 (never) to 4 (always). The total score is calculated by adding the scores of the individual items, and ranges from 0 to 52 [16].

Hospital anxiety and depression score (HADS)—The scale is composed of 14 items that create subscale scores for anxiety (HADS-A, 7 items) and depression (HADS-D, 7 items). The total score for each subscale is calculated by adding the scores of the individual items (0–3) and varies from 0 to 21 [17].

Other than the FreKAQ, all of the above measures previously demonstrated satisfactory psychometric properties in Italian populations [14,15,16,17,18].

Cross-cultural adaptation of the FreKAQ

Adaptation of the FreKAQ was done in accordance with the protocol issued by the American Association of Orthopaedic Surgeon Outcomes Committee. The principles of good practice for the translation and cultural adaptation process for patient-reported outcomes measures based on the report of the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) Task Force were also taken into account [19, 20].

Step 1 Translation into Italian. The items taken from the English version of the original FreKAQ tool [10] were translated into Italian with the aim of retaining the concepts of the original while using culturally and clinically fitting expressions. Two adaptations were made independently by 2 Italian professional translators experienced in the biomedical field. The translators were given a clear explanation of the concepts expressed to capture the conceptual meaning of the items, while keeping the language colloquial and compatible with a reading age of 12 years. Discrepancies between the translators were resolved by consensus. Step 1 ended when a common adaptation was agreed.

Step 2 Back-Translation into English. Two bilingual English native speaking translators, without any biomedical background, independently back-translated the initial adaptation. The principle investigator reviewed these translations and, with the help of the back-translators, made sure the Italian version reflected the same item content as the original version and was conceptually equivalent.

Step 3 Expert committee. To achieve harmonization of the adaptation process, the adaptations were submitted to a bilingual committee of clinicians, methodologists, and translators, chaired by the principle investigator. To identify any discrepancies or mistakes, the committee explored the semantic, idiomatic, and conceptual equivalence of the items and response options. This phase ended when a prefinal version was agreed.

Step 4 Test of the prefinal version. A test of the prefinal version was performed to investigate the level of acceptability, comprehensibility, interpretation and relevance of the adaptation, to highlight any items that may be inadequate at a conceptual level and to identify any other issues that cause uncertainty. Cognitive interviews were conducted by a trained psychologist who administered the new measure to 10 subjects with primary knee OA. These interviews were semi-structured and related to the following items: (1) was the questionnaire acceptable and easy to complete?; (2) how clear and comprehensible was the instruction to you?; (3) how clear and comprehensible were the questions to you?; (4) how clear and comprehensible were the response options to you?; (5) are there any questions which should be phrased in a clearer way?; (6) are there any questions which are too similar or irrelevant?

The Expert Committee reviewed the results from cognitive debriefing with the aim of identifying any change important for amelioration of the Italian prefinal version.

Step 5 Submission to the developer. The material related to the cross-cultural adaptation of the questionnaire was sent to the scale developer (B.M.W) for ultimate review. The final version is available in Additional file 1.

Statistical analysis

The following items were analyzed.


The time needed to answer the questionnaire was recorded. The subjects were asked about any problems they encountered and the data were checked for missing or multiple responses.

Descriptive statistics

Descriptive statistics of collected measures was calculated, including indices of central tendency (mean) and spread (standard deviation), as well as percentage of minimum and maximum scores (floor or ceiling effect was considered to be present if > 15% of the subjects achieved the lowest or highest possible scores, respectively) [21].

Construct validity

  1. 1.

    Internal construct (structural) validity. Rasch analysis (Winsteps software v. 3.68.3) was used to examine the main psychometric properties of the FreKAQ-I with the rating scale model (due to the common response structure). We followed the same procedure detailed in our previous studies [22, 23]. In short, to investigate whether the pattern of responses met the assumptions of the Rasch model, the following psychometric issues were examined:

    1. (a)

      Functioning of the response categories, according to criteria suggested by Linacre [24]

    2. (b)

      Internal construct validity, calculated from each item’s chi-square fit statistics (infit and outfit mean-square statistics, MnSq). Values from 0.70 to 1.30 [25, 26], associated with standardized z values (ZStd) less than 2.0, were considered as an indicator of acceptable fit

    3. (c)

      Reliability, in terms of person and item reliability, providing the degree of replicability of person and item placements along the trait continuum [27]

    4. (d)

      Unidimensionality of the scale, examining the unexplained variance after the Rasch dimension is extracted, as obtained by a Principal Component Analysis of the residuals (PCAr). Additional factors are not likely to be present in the residuals if at least 50% of the variance is explained by the Rasch factor, and the eigenvalue of the first residual factor is ≤ 2 [24]

    5. (e)

      Local independence between items. No residual association among item responses should be found once the dominant factor (Rasch factor) has been conditioned out; a correlation of residuals > 0.30 indicates possible local dependence [28]

  2. 2.

    External construct validity—Based on previous studies describing the development of the Fremantle body awareness questionnaires and the maladaptive perceptions model proposed by Wand et al. [29], and Nishigami et al. [10], we examined external construct validity of the Italian version of the FreKAQ in the following way.

    1. (a)

      As for hypotheses testing, it was hypothesized a priori—that the FreKAQ score would achieve positive significant correlations (at fair to moderate levels, between 0.30 and 0.60) with measures analyzing other constructs pertinent to individuals with knee osteoarthritis, such as the KOOS subscales (symptoms, pain, ADL function and QoL), pain intensity (PI-NRS), pain catastrophizing (PCS), and anxiety/depression levels (HADS). After plot inspection for linearity of associations, correlations were calculated with Spearman’s rank correlation coefficient.

    2. (b)

      The known-group method was applied to evaluate the ability of the FreKAQ scores to discriminate between two groups of participants with different clinical profiles. We assessed: the four KOOS subscales (symptoms, pain, ADL function, and QoL) and pain intensity (dichotomized by group median); pain catastrophizing (< 24 points vs. ≥ 24 points) [30]; anxiety and depression levels (< 8 points vs. ≥ 8 points in each subscale) [31]. Differences between groups were assessed using the Mann–Whitney U test with Bonferroni correction for multiple comparisons with alpha set at 0.0056 (0.05/9).

Construct validity was considered good if ≥ 75% of the hypotheses (14 out of 18) was met.

Reliability and measurement error

Internal consistency was investigated through Cronbach’s alpha (values of > 0.70 being considered acceptable). Test–retest reliability (intraclass correlation coefficient, ICC2.1, with good and excellent reliability respectively indicated by values of 0.70–0.85 and > 0.85) [21] was examined. The SEM was estimated using the formula:

$$SEM = SD\sqrt {1 - ICC_{2,1} }$$

where SD is the baseline standard deviation of the measurements [21].

The minimum detectable change (MDC) was calculated by the equation:

$$MDC = SEM*z\,value*\sqrt 2$$

The 95% confidence level (CI) (MDC95) corresponds to a z value of 1.96.

Analyses were performed with Winsteps v. 3.68.2 (, Beaverton—OR, USA) [24] for Rasch analysis and with R and RStudio [32, 33] for the other analyses.

We determined that a sample size of 101 would provide adequate statistical power (expecting to obtain an ICC of 0.70, with a 95%CI of 0.20) for test–retest reliability [34]. Moreover, in Rasch analysis a sample size of 100 participants is able to ensure item calibration stability within ± 0.5 logits with 95% confidence.



One hundred-thirty subjects were invited to participate, of whom 20 were excluded. The reasons for exclusion were, systemic illness (n = 3), cognitive impairment (n = 3), recent myocardial infarction (n = 2), recent cerebrovascular event (n = 1) and unwillingness to participate (n = 11). Of the remaining 110 subjects, 8 dropped out before starting the study because of, logistic problems (n = 5), economic constraints (n = 1), or personal problems (n = 2). Thus, the final study population consisted of 102 subjects, whose socio-demographic and clinical characteristics are reported in Table 1.

Table 1 Socio-demographic and clinical characteristics of the study population (n = 102)

Translation and cross-cultural adaptation

The translation procedure took 1 month to reach a culturally adapted version, and all the items were easily forward and back-translated. No difficulties were evidenced during the review of the back translations. Some concern was raised related to the difficulty in discriminating between the Italian terms “rarely” versus “occasionally”, and “some of the time” versus “moderate amount of time”. Accordingly, the expert committee decided to simplify the labels of the rating categories maintaining just 5 simple descriptors: “never”, “rarely”, “sometimes”, “often”, “always”. The correctness of the process, the content of the items and the concepts expressed were confirmed by the experts. Then, the research panel reviewed the results from cognitive interviews and made only small adjustments based on interpretations and meanings raised by the subjects in order to improve items comprehension. As no other significant issues were pointed out further, the principal investigator and the Expert committees confirmed the work done and finalized the definitive version of the translation.

Scale psychometric properties


All of the questions were well accepted. The questionnaire was completed in about 4 min. No missing responses or multiple answers were found. There were no further problems in comprehension.

Descriptive statistics

Mean and standard deviation of the scores of each collected measure, as well as percentage of minimum and maximum scores are reported in Table 2. No floor or ceiling effect was observed in any of the questionnaires used, including the FreKAQ-I.

Table 2 Distributions (mean and standard deviation, SD) and percentage of minimum and maximum scores of the scores of the Fremantle Knee Awareness Questionnaire (FreKAQ), Knee injury and osteoarthritis outcome score (KOOS), pain intensity numerical rating scale (PI-NRS), pain catastrophising scale (PCS), and Hospital anxiety and depression score (HADS)

Construct validity

Internal construct (structural) validity. Rasch analysis showed that in the 5-level rating scale the average category measures advanced monotonically and the category oufit mean-square were less than 2.0. However, some step calibrations were disordered, and the differences between step difficulties were quite narrow (always < 0.98 logits).

All items fitted the underlying construct that the scale was intended to measure (Table 3). The mean person ability was − 0.35 logits (range from − 3.29 to 0.85), indicating that the average difficulty of the items (endorsability) was quite well targeted to the mean sample ability (agreeability). The item reliability was 0.98, while person reliability was 0.67. The PCAr confirmed the substantial unidimensionality of the FreKAQ: the variance explained by the Rasch factor was 53.3%, while the unexplained variance in the first contrast showed an eigenvalue of 1.8. No local dependence was detected, i.e. no strong (> 0.0.20) residual correlation between items was found.

Table 3 Summary of Rasch analysis results related to the FreKAQ, containing item-difficulty measures (with standard error, SE) and fit information

External construct validity

External construct validity was considered good as:

  1. 1.

    All a priori hypotheses were confirmed. Related results can be seen in Table 4.

  2. 2.

    As for the known-group method, FreKAQ scores were significantly different between two groups of participants with different levels (high vs. low) of the following clinical variables: the four KOOS subscales (symptoms, pain, ADL function, and QoL); pain intensity; pain catastrophizing; anxiety and depression levels (for all variables, p ≤ 0.004).

Table 4 Construct validity. Spearman’s correlations between the FreKAQ-I and KOOS, PI-NRS, PCS and HADS. Confidence intervals by bootstrap, 1000 repetitions

Reliability and measurement error

Cronbach’s α was 0.74. Test–retest reliability was excellent (ICC = 0.92; CI 0.87–0.94). The SEM was 1.88 and the MDC95 was 5.22, reflecting the smallest change in score that is likely to reflect a true change rather than a measurement error.


This study describes the process of cross-cultural adaptation of the FreKAQ and evaluation of structural validity, reliability, measurement error and construct validity of the questionnaire in Italian-speaking subjects with painful knee OA. The scale confirmed its unidimensionality, displayed good construct validity and acceptable reliability.

Cross-cultural adaptation requires a process of translation, backward translation, expert committee revision and testing of the pre-final version in order to guarantee the meaning of the original items are adequately captured in the Italian language. These recommended guidelines were followed in the current study and all of the steps indicated that cross cultural adaptation of the FreKAQ was successful. The on-field test with cognitive debriefing confirmed the overall general comprehensibility of the translated questionnaire. The decision to simplify the labels of the rating categories maintaining just 5 simple descriptors: “never”, “rarely”, “sometimes”, “often”, “always” is in line with other widely-used scales, such as some of those in the PROMIS item bank [35].

The final version of the questionnaire was highly acceptable, easily understood and capable of being self-administered. The questionnaires burden is also low as it required less than five minutes to complete. It therefore seems to be applicable in everyday clinical practice. However, among various content validity aspects, only the comprehensibility of the FreKAQ was assessed in this study, whereas its relevance and comprehensiveness to measure body perception was not assessed; this research gap should be filled by future studies conducting a thorough assessment of content validity, possibly following the recently published COSMIN methodology [36].

Construct validity

Internal construct (structural) validity

Overall, Rasch analysis of the FreKAQ-I showed that the scale has acceptable psychometric properties. The scale’s unidimensionality was confirmed, all items fit the Rasch model, and no local item dependence was found. In addition, the targeting of item difficulty to patient ability was quite good, and item reliability was high.

The only concern came from the rating scale diagnostics, which showed some category misfunctioning due to the limited respondents’ ability to discern between the three central levels of frequency. This finding is in agreement with observations indicating problems with the use of the 5-grade frequency-related response categories, often called ‘vague quantifiers’; there is neither a formal nor an informal definition given for the meaning of terms such as ‘sometimes’, ‘rarely’, ‘occasionally’, and thus they have fuzzy boundaries [37]. Furthermore, the intervals between category steps were quite narrow, in line with those in the original paper [10] where they were lower than 0.8 logits.

The FreKAQ is based on a formative model, i.e., the latent construct (here “knee awareness”) is determined as a combination of independent—albeit correlated—indicators of the variable under measurement, thus creating a composite index. Items are a mix of positively- and negatively-worded questions respectively related to neglect-like symptoms (items 1–3), reduced proprioceptive acuity (items 4 and 5), and perceived body shape and size (items 7–9), with item 6 alternatively assigned to the second or the third domain.

As such, these items do not necessarily have good correlations or orient in a consistent hierarchy.

External construct validity

As for hypotheses testing, the correlation with a measure of disability (KOOS) was as expected (Table 3), suggesting patients showing high disability due to painful knee OA also display greater levels of body perception disruption specific to the knee. This is in keeping with results reported for the Japanese version of the questionnaire [10], where a moderate correlation with disability was also noted (0.41). However, these authors used a different measure of disability (i.e., The Oxford Knee Score) so a detailed comparison with the current findings cannot be undertaken. As for correlation with pain intensity, estimates were also as expected, with higher levels of pain being associated with higher levels of perceptual dysfunction, in line with results reported for the Japanese version of the questionnaire [10]. Both the Japanese study and the current study examined knee pain intensity at rest and with motion and report remarkably consistent findings. Results from both the Japanese [10] and current study suggest that disrupted body perception is more strongly related to disability than pain in people with painful knee OA. This trend is also seen in various studies of the lumbar spine version of the questionnaire in people with back pain [29, 38,39,40]. Together these results suggest that disrupted body perception may impact more on the functional consequences of pain rather than the experience of pain itself. Future investigations that examine the role of body perception in mediating the relationship between pain and disability may be worth exploring.

With respect to pain catastrophising, we achieved a lower correlation than reported for the Japanese version of the FreKAQ (0.70). It is possible that in the Italian clinical context some FreKAQ and PCS items are interpreted differently than in Japan, hence leading to a different correlation than previously found. As for mood disorders, our correlations substantially replicate the moderate correlations already achieved in the original study (rho = 0.36 and 0.43 with anxiety and depression scores, respectively) [10], suggesting that patients suffering from anxiety and depression are more prone to show disrupted body perception specific to the knee.

Moreover, data related to known-group validity supported previous conceptualizations and reports indicating that a measure of impaired body perception (FreKAQ scores) would be able to discriminate between participants with different levels of knee symptoms, function, pain intensity, pain catastrophizing and psychological distress [10, 29, 41].

Reliability and measurement error

The internal consistency of the FreKAQ-I was acceptable (0.72), but inferior to that reported in the original study (0.88) [10]. However, this low internal consistency is unsurprising, when taking into account both the formative model of the scale, and similar results seen from another modified version of the Fremantle Awareness Questionnaires examining self-perceptions in neck disorders [42]. This result, coupled with the low Rasch person reliability, indicates that the tool is useful for group-level comparisons but less so for clinical application in single individuals (for whom a minimum reliability threshold of 0.85–0.90 is desirable). These results suggest the scale seems able to clearly distinguish between persons with impaired versus not-impaired knee self-perception without permitting a more fine grained assessment of impairment levels.

Test–retest reliability showed an excellent agreement between the results on days 1 and 8, even higher than reported in the original scale [10]. The FreKAQ-I displayed an acceptable measurement error. Given the high degree of repeatability of our results, the SEM and MDC were relatively small: in particular, the MDC indicated—at a 95% confidence level—that, if an individual shows a change of more than 6 points after a given intervention, it would not be due to a measurement error.

This study has some limitations. Firstly, it is a cross-sectional study and responsiveness as well as minimal important change of the FreKAQ-I could not be assessed. Secondly, the relationships between knee-related perceptual dysfunction and physical performance measures were not considered because only questionnaires were used. Thirdly, correlations with other measures including psychological factors (e.g. Tampa Scale of Kinesiophobia or Pain Self-Efficacy Questionnaire) and quality of life issues (e.g. Short-Form Health Survey 36-items) available in Italian language were not analyzed [43,44,45]. Fourthly, our study was restricted to patients with primary knee OA and it is uncertain whether these findings can be extended to patients with other causes of knee pain, and future studies in these populations are recommended.


The Italian version of the FreKAQ shows a one-factor structure, it is reliable and valid, and has an acceptable measurement error. The newly adapted tool can be recommended for clinical and research purposes in order to improve the assessment and treatment planning of patients with painful knee OA in Italy.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.



Confidence interval


Fremantle knee awareness questionnaire


Fremantle knee awareness questionnaire-Italian


Hospital anxiety and depression score


Hospital anxiety and depression score-anxiety


Hospital anxiety and depression score-depression


Intraclass correlation coefficient


International Society for Pharmacoeconomics and Outcomes Research


Knee injury and osteoarthritis outcome score


Minimum detectable change


Mean-square statistics




Principal component analysis of the residuals


Pain catastrophising scale


Pain intensity numerical rating scale


Standard error


Standard error of the mean


Standardized z values


  1. Castaneda S, Roman-Blas JA, Largo R, et al. Osteoarthritis: a progressive disease with changing phenotypes. Rheumatology. 2014;53(1):1–3.

    Article  Google Scholar 

  2. Jüni P, Hari R, Rutjes AW, et al. Intra-articular corticosteroid for knee osteoarthritis. Cochrane Database Syst Rev. 2015;(10):CD005328.

  3. Lespasio MJ, Piuzzi NS, Husni ME, et al. Knee osteoarthritis: a primer. Perm J. 2017;21:16–183.

    Article  Google Scholar 

  4. Mahir L, Belhaj K, Zahi S, Azanmasso H, Lmidmani F, El Fatimi A. Impact of knee osteoarthritis on the quality of life. Ann Phys Rehabil Med. 2016;59(Suppl):e159.

    Article  Google Scholar 

  5. Stanton TR, Lin CW, Smeets RJ, Taylor D, Law R, Lorimer MG. Spatially defined disruption of motor imagery performance in people with osteoarthritis. Rheumatology (Oxford). 2012;51:1455–64.

    Article  Google Scholar 

  6. Cammarata ML, Dhaher YY. Associations between frontal plane joint stiffness and proprioceptive acuity in knee osteoarthritis. Arthritis Care Res (Hoboken). 2012;64:735–43.

    Article  Google Scholar 

  7. Chang AH, Lee SJ, Zhao H, Ren Y, Zhang LQ. Impaired varus-valgus proprioception and neuromuscular stabilization in medial knee osteoarthritis. J Biomech. 2014;47:360–6.

    Article  Google Scholar 

  8. Stanton TR, Lin CW, Bray H, et al. Tactile acuity is disrupted in osteoarthritis but is unrelated to disruptions in motor imagery performance. Rheumatology (Oxford). 2013;52:1509–19.

    Article  Google Scholar 

  9. Wand BM, James M, Abbaszadeh S, George PJ, Formby PM, Smith AJ, et al. Assessing self-perception in patients with chronic low back pain: development of a back-specific body-perception questionnaire. J Back Musculoskelet Rehabil. 2014;27(4):463–73.

    Article  Google Scholar 

  10. Nishigami T, Mibu A, Tanaka K, Yamashita Y, Yamada E, Wand BM, et al. Development and psychometric properties of knee-specific body-perception questionnaire in people with knee osteoarthritis: The Fremantle Knee Awareness Questionnaire. PLoS ONE. 2017;12(6):e0179225.

    Article  Google Scholar 

  11. Prinsen CAC, Mokkink LB, Bouter LM, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27:1147–57.

    Article  CAS  Google Scholar 

  12. Mokkink LB, de Vet HCW, Prinsen CAAC, et al. COSMIN risk of bias checklist for systematic reviews of Patient-Reported Outcome Measures. Qual Life Res. 2018;27(5):1171–9.

    Article  CAS  Google Scholar 

  13. Ornetti P, Brandt K, Hellio-Le Graverand MP, et al. OARSI-OMERACT definition of relevant radiological progression in hip/knee osteoarthritis. Osteoarthr Cartil. 2009;17:856–63.

    Article  CAS  Google Scholar 

  14. Monticone M, Ferrante S, Salvaderi S, Rocca B, Totti V, Foti C, Roi GS. Development of the Italian version of the knee injury and osteoarthritis outcome score for patients with knee injuries: cross-cultural adaptation, dimensionality, reliability, and validity. Osteoarthr Cartil. 2012;20:330–5.

    Article  CAS  Google Scholar 

  15. Salaffi F, Stancati A, Silvestri CA, Ciapetti A, Grassi W. Minimal clinically important changes in chronic musculoskeletal pain intensity measured on a numerical rating scale. Eur J Pain. 2004;8:283–91.

    Article  Google Scholar 

  16. Monticone M, Baiardi P, Ferrari S, et al. Development of the Italian version of the pain catastrophising scale (PCS-I): cross-cultural adaptation, factor analysis, reliability, validity and sensitivity to change. Qual Life Res. 2012;2012:211045–50.

    Google Scholar 

  17. Costantini M, Musso M, Viterbori P. Detecting psychological distress in cancer patients: validity of the Italian version of the Hospital Anxiety and Depression Scale. Support Care Cancer. 1999;7:121–7.

    Article  CAS  Google Scholar 

  18. Monticone M, Ferrante S, Salvaderi S, Motta L, Cerri C. Responsiveness and minimal important changes for the Knee Injury and Osteoarthritis Outcome Score in subjects undergoing rehabilitation after total knee arthroplasty. Am J Phys Med Rehabil. 2013;92:864–70.

    Article  Google Scholar 

  19. Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine. 2000;25:3186–91.

    Article  CAS  Google Scholar 

  20. Wild D, Grove A, Martin M, et al. Principles of good practice for the translation and cultural adaptation process for patient-reported outcomes (PRO) measures: report of the ISPOR task force for translation and cultural adaptation. Value Health. 2005;8:94–104.

    Article  Google Scholar 

  21. Terwee CB, Bot SD, de Boer MR, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.

    Article  Google Scholar 

  22. Burger H, Giordano A, Bavec A, Franchignoni F. The Prosthetic Mobility Questionnaire, a tool for assessing mobility in people with lower-limb amputation: validation of PMQ 2.0 in Slovenia. Int J Rehabil Res. 2019;42:263–9.

    Article  Google Scholar 

  23. Franchignoni F, Magistroni E, Parodi G, Massazza G, Ferriero G, Giordano A. Development of a simplified Cold Intolerance Symptom Severity questionnaire in patients with peripheral nerve injury. Int J Rehabil Res. 2019;42:63–7.

    Article  Google Scholar 

  24. Linacre JM. A user’s guide to Winsteps Ministep. Rasch-model computer programs. Program Manual 4.4.4. Chicago, IL:; 2019.

  25. Tesio L. Measuring behaviours and perceptions: Rasch analysis as a tool for rehabilitation research. J Rehabil Med. 2003;35:105–15.

    Article  Google Scholar 

  26. Smith AB, Rush R, Fallowfield LJ, Velikova G, Sharpe M. Rasch fit statistics and sample size considerations for polytomous data. BMC Med Res Methodol. 2008;8:33.

    Article  Google Scholar 

  27. Bond TG, Fox CM. Applying the Rasch model—fundamental measurement in the human sciences. 3rd ed. New York: Routledge; 2015.

    Book  Google Scholar 

  28. Petrillo J, Cano SJ, McLeod LD, Coon CD. Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples. Value Health. 2015;18:25–34.

    Article  Google Scholar 

  29. Wand BM, Catley MJ, Rabey MI, O’Sullivan PB, O’Connell NE, Smith AJ. Disrupted self-perception in people with chronic low back pain further evaluation of the Fremantle Back Awareness Questionnaire. J Pain. 2016;17:1001–12.

    Article  Google Scholar 

  30. Gauthier N, Thibault P, Sullivan MJ. Catastrophizers with chronic pain display more pain behaviour when in a relationship with a low catastrophizing spouse. Pain Res Manag. 2011;16:293–9.

    Article  Google Scholar 

  31. Snaith RP. The Hospital anxiety and depression scale. Health Qual Life Outcomes. 2003;1(1):29.

    Article  Google Scholar 

  32. R Core Team. R: A Language and Environment for Statistical Computing, Vienna, Austria: R Foundation for Statistical Computing; 2018.

  33. RStudio Team. RStudio: Integrated Development Environment for R, Boston, MA: RStudio, Inc.; 2016.

  34. Zou GY. Sample size formulas for estimating intraclass correlation coefficients with precision and assurance. Stat Med. 2012;31:3972–81.

    Article  CAS  Google Scholar 

  35. Health Measures. Intro to PROMIS® (Patient-Reported Outcomes Measurement Information System). Evanston, IL (USA): Northwestern University. © 2020.

  36. Terwee CB, Prinsen CAC, Chiarotto A, Westerman MJ, Patrick DL, Alonso J, Bouter LM, de Vet HCW, Mokkink LB. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27:1159–70.

    Article  CAS  Google Scholar 

  37. Streiner DL, Norman GR, Cairney J. Health measurement scales: a practical guide to their development and use. 5th ed. Oxford: Oxford University Press; 2015.

    Book  Google Scholar 

  38. Nishigami T, Mibu A, Tanaka K, Yamashita Y, Shimizu ME, Wand BM, Catley MJ, Stanton TR, Moseley GL. Validation of the Japanese version of the Fremantle Back Awareness Questionnaire in patients with low back pain. Pain Pract. 2018;18:170–9.

    Article  Google Scholar 

  39. Janssens L, Goossens N, Wand BM, Pijnenburg M, Thys T, Brumagne S. The development of the Dutch version of the Fremantle Back Awareness Questionnaire. Musculoskelet Sci Pract. 2017;32:84–91.

    Article  Google Scholar 

  40. Ehrenbrusthoff K, Ryan CG, Grüneberg C, Wand BM, Martin DJ. The translation, validity and reliability of the German version of the Fremantle Back Awareness Questionnaire. PLoS ONE. 2018;13(10):e0205244.

    Article  Google Scholar 

  41. Moseley GL, Gallace A, Spence C. Bodily illusions in health and disease: physiological and clinical perspectives and the concept of a cortical “body matrix.” Neurosci Biobehav Rev. 2012;36:34–46.

    Article  Google Scholar 

  42. Onan D, Gokmen D, Ulger O. The Fremantle Neck Awareness Questionnaire in chronic neck pain patients: Turkish version, validity and reliability study. Spine. 2020;45:E163–9.

    Article  Google Scholar 

  43. Monticone M, Giorgi I, Baiardi P, Barbieri M, Rocca B, Bonezzi C. Development of the Italian version of the Tampa Scale of Kinesiophobia (TSK-I): cross-cultural adaptation, factor analysis, reliability, and validity. Spine. 2010;35:1241–6.

    Article  Google Scholar 

  44. Monticone M, Giordano A, Franchignoni F. Scale shortening and decrease in measurement precision: Analysis of the Pain Self-Efficacy Questionnaire and its short forms in an Italian-speaking population with neck pain disorders. Phys Ther. 2021;pzab039.

  45. Apolone G, Mosconi P. The Italian SF-36 survey: translation, validation and norming. J Clin Epidemiol. 1998;51:1025–36.

    Article  CAS  Google Scholar 

Download references


The authors would like to thank all of the subjects who got involved into the study.


The authors received no specific funding for this work.

Author information

Authors and Affiliations



Authors’ contributions MM and FF initiated study conception and design. CS, GS, GL and SR performed the data collection and acquisition of data. IP, AG and FF performed the data analysis. MM, IP, AG and FF interpreted the data. MM and FF wrote the manuscript. MM, IP and FF edited the manuscript. BMW and TN had a role in critical revision. All authors have read and approved the manuscript.

Corresponding author

Correspondence to Igor Portoghese.

Ethics declarations

Ethics approval and consent to participate

This cross-sectional study was approved by our Local Ethical Committee on the 24/10/2019 (no. 2386), and conducted in accordance with ethical and humane principles of research of the Declaration of Helsinki.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

The Italian version of the Fremantle Knee Awareness Questionnaire.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Monticone, M., Sconza, C., Portoghese, I. et al. Cross-cultural adaptation, reliability and validity of the Fremantle Knee Awareness Questionnaire in Italian subjects with painful knee osteoarthritis. Health Qual Life Outcomes 19, 114 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: