The Oxford hip score: the patient's perspective
Health and Quality of Life Outcomes volume 3, Article number: 66 (2005)
In the last 25 years, assessment of orthopaedic intervention has become patient-focused, with the development of self-completion patient-centred outcome measures. The Oxford hip score (OHS) is a joint-specific outcome measure designed to assess disability in patients undergoing total hip replacement (THR). Although the psychometric properties of the OHS have been rigorously examined, there is little research on the patient's perspective of the OHS. Therefore, the aim of this study was to assess whether the OHS is an adequate disability measure from the patient's perspective using qualitative analysis of annotations written on the OHS by patients.
In total, 276 orthopaedic patients completed an OHS between April 2004 and May 2005. One hundred and fifty-six pre-operative patients listed for a THR completed the OHS during a pre-admission assessment clinic, and 120 post-operative patients completed the OHS postally in the home setting. Patients' unprompted annotations in response to the questions on the OHS were recorded and grouped into thematic categories.
In total, 46 (17%) patients made 52 annotations when completing the OHS. These annotations identified five main areas of difficulty that patients experienced: lack of question clarity (particularly concerning the use of aids), difficulty in reporting measurements of pain, restrictive and irrelevant questions, the influence of co-morbidities on responses, and double-barrelled questions.
Although the OHS is a useful short tool for the assessment of disability in patients undergoing THR, this study identified several problem areas that are applicable to patient-centred outcome tools in general. To overcome these current limitations, further work is underway to develop a more individualised patient-centred outcome measure of disability for use in patients with osteoarthritis.
During the last decade, the assessment of outcomes in orthopaedic surgery has shifted from the success or failure of an implant towards patient satisfaction and quality of life. Initially, surgeon assessment of total hip replacement (THR) outcome was accepted, with the development of tools such as the Harris Hip Score and the Charnley score. However, these measures presuppose a concordance between the views of patients and clinicians, which has been shown to be an erroneous assumption [4, 5], particularly in subjective domains such as pain. Consequently, the last 25 years have witnessed the development of generic and disease-specific self-completion patient-centred outcome measures. Generic measures such as the SF-12 and Nottingham Health Profile endeavour to assess all important dimensions of health-related quality of life. Disease-specific tools such as the Arthritis Impact Measurement Scale (AIMS) and the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) focus on specific aspects of disability relating to a particular condition. These are supplemented by joint-specific measures such as the Oxford hip score (OHS) and the Hip Disability and Osteoarthritis Outcome Score (HOOS).
The OHS is a patient-centred questionnaire that is designed to assess functional ability and pain from the patient's perspective. It is a short, twelve-item questionnaire developed for completion by patients undergoing THR and is extensively referenced in the orthopaedic literature [14–21]. The OHS has been demonstrated to be highly sensitive to change in patients undergoing primary THR [12, 16, 19, 21, 22] and revision THR [15, 16]. It correlates well with patient satisfaction [15, 19] and other patient-centred instruments, such as the EuroQol 5D. Responsiveness of the OHS to change has been found to be greater than that of generic measures such as the SF-36 [16, 18] and disease-specific measures such as the WOMAC. The OHS has been utilised in a broad range of contexts, including studies comparing different prostheses, surgeon and patient expectations, and the outcomes of NHS and private patients.
Although the OHS has been shown to have internal consistency and to produce data of high reliability and validity, there is a shortage of published data on the patient's perception of the OHS. During the validation of the questionnaire, there was no reference to difficulties that patients experienced when completing the OHS, beyond a brief statement that "the patients had little difficulty in completing it". Previous research has explored patients' perceptions of the OHS, and found that patients encountered several limitations relating to question specificity, response category clarity, exclusion of co-morbidities, and experience of pain. However, that study was limited by a small sample size, and during the past half decade no further work has been published investigating the patient's experience of the OHS. Therefore, the aim of this study was to determine, from the patient's perspective, whether the OHS is an adequate questionnaire for measuring disability. This was achieved by analysing unprompted, spontaneous annotations made by patients completing a paper copy of the OHS.
Between April 2004 and May 2005, patients attending the orthopaedic pre-admission assessment clinic at the Avon Orthopaedic Centre, under the care of one consultant orthopaedic surgeon (IDL) and awaiting THR, were administered an OHS as part of a routine questionnaire pack used in the clinic. These patients were sampled because they were expected to be unfamiliar with the OHS: the questionnaire was only introduced into routine clinical care in this clinic in April 2004. Between January 2005 and May 2005, consecutive patients with 12 months of follow-up, who had received a THR under the care of the same consultant orthopaedic surgeon (IDL), completed a postally administered OHS as part of their ongoing clinical care. In addition, all patients who had received an IPS Stem (DePuy) between 1997 and 2004, under the care of the same surgeon (IDL), completed a postal OHS as part of another study. During administration, the patients were not instructed to annotate or comment on the questions on the OHS.
The OHS consists of 12 questions about pain and disability experienced over the past four weeks. Each item has five response categories, scored from 1 to 5 (low disability to high disability). The item scores are summed to produce a total score between 12 and 60, with a higher score indicating greater disability. In this study the OHS was presented on a double-sided sheet of A4 paper, with six questions on each side. The response categories to each question were formatted as a Likert scale (Figure 1) with the coding frame integrated into the questionnaire. The questionnaires administered in the pre-admission clinic and the postal questionnaires were formatted identically.
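The scoring rule described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the scoring procedure used by the authors: the function name `score_ohs` and the decision to reject incomplete or out-of-range questionnaires are assumptions made for the example.

```python
from typing import Sequence

N_ITEMS = 12               # the OHS has twelve questions
MIN_ITEM, MAX_ITEM = 1, 5  # each item is scored 1 (low disability) to 5 (high disability)


def score_ohs(responses: Sequence[int]) -> int:
    """Sum twelve item scores into a total between 12 and 60 (hypothetical helper)."""
    if len(responses) != N_ITEMS:
        raise ValueError(f"expected {N_ITEMS} responses, got {len(responses)}")
    for item_number, value in enumerate(responses, start=1):
        if not MIN_ITEM <= value <= MAX_ITEM:
            raise ValueError(f"question {item_number}: score {value} is outside 1-5")
    # A higher total indicates greater disability.
    return sum(responses)
```

A questionnaire with every item answered in the lowest category gives the minimum total of 12, and one answered entirely in the highest category gives the maximum of 60.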
Each questionnaire was reviewed for spontaneously generated annotations and these annotations were then grouped into thematic categories.
In total, 276 patients completed the OHS questionnaire. In the pre-admission assessment clinic, 156 consecutive patients listed for a THR completed an OHS. For patients who attended the clinic twice between April 2004 and May 2005, only the first questionnaire was included in the analysis. Post-operatively, 120 patients completed a postally administered OHS. The mean pre-operative OHS was 44.1 (SD 8.3, range 21–59) and the mean post-operative OHS was 24.1 (SD 11.3, range 12–55). Patients who completed the OHS after surgery had a mean follow-up period of 24 months (SD 19, range 12–77 months). The sample consisted of 169 women (61%) and 107 men (39%) with a mean age of 58 years (SD 15.9, range 14–82 years). Patients' diagnoses are listed in Table 1.
Forty-six (17%) patients annotated a total of 52 questions (Table 2). Five patients drew 16 arrows linking boxes, signalling that they felt they were unable to place themselves in a single category provided by the OHS. Question six, which asks "During the past 4 weeks, for how long have you been able to walk before pain from your hip becomes severe (with or without a stick)?" most frequently elicited annotation, whereas question eleven, which asks "how much has pain from your hip interfered with your usual work (including housework)?" was the only question that was not annotated by any of the patients. The annotations were broadly grouped into five main categories, each highlighting difficulties the patients experienced when completing the OHS (Table 3).
The mean pre-operative OHS of 44.1 and post-operative score of 24.1 are similar to previous results [12, 16, 19], indicating that the sample in this study was representative of other lower limb orthopaedic patients. The pre-operative and post-operative groups were purposively sampled as separate cohorts to avoid familiarity with the OHS, which could compromise the validity of the results. As completion of the OHS was only introduced into this clinic in April 2004, the postal OHS completed by the post-operative patients was likely to be their initial contact with the questionnaire. Similarly, the patients attending the pre-admission assessment clinic should not have previously encountered the questionnaire. However, a limitation of the study was that patients may have previously completed the OHS for their GP or under the care of a different consultant, and this prior exposure to the OHS may have influenced their responses.
This study has highlighted several pitfalls and limitations of the OHS, and of available disability measures in general. However, although the current study identified substantial areas of difficulty, analysing unprompted annotations has limitations. The results are confined to the difficulties encountered by individuals who were self-motivated to comment upon these problems. As a result of this methodology, conclusions are drawn from the responses of only 17% of the patients sampled. For the remaining 83% of patients, the OHS could have been adequate from their perspective; alternatively, they could have encountered problems but not documented them on paper, as they were not instructed to do so. Therefore, further research needs to be undertaken in which patients are explicitly encouraged to comment upon any difficulties when completing the OHS, in order to assess the extent of its applicability. Alternatively, qualitative interviews could be employed to explore the patient's perspective on the OHS in greater depth, although findings from qualitative work have raised similar areas of difficulty to those in the current study.
The five general themes of difficulty that emerged from the analysis of annotations are discussed in more detail below.
Seventeen percent of patients annotated answers they provided on the OHS, suggesting that these patients felt that the questions did not allow them to express themselves adequately.
Lack of question clarity
The aspect of the OHS that appeared to cause the greatest difficulty for the patients, accounting for 29% of annotations, was the lack of question clarity. Within this theme, the predominant area of uncertainty was whether the questions were enquiring about actual level of disability or the level of disability after accounting for the use of aids or specialised devices, such as long-handled shoehorns or helping hands. When responding to question four, which asks respondents "have you been able to put on a pair of socks, stocking or tights?", a number of individuals answered accounting for the use of an aid, and others gave two answers: one referring to the level of disability in performing the activity when using an aid and one when not using an aid. The same lack of clarity has resulted from this question previously. Question six, which asks the respondent "for how long have you been able to walk before pain from your hip becomes severe (with or without a stick)?", acknowledges that many individuals need to use a walking stick. However, the question does not specify whether patients should provide a response for actual or relative disability. Consequently, inconsistent results were obtained, with patients providing two answers, i.e. distance walked with and without a walking stick. The score therefore becomes dependent on whether the respondent chooses to take account of the walking stick. These findings suggest that many respondents perceive the question as ambiguous. Further evidence for the lack of question clarity comes from a large study of pre-operative patients, in which question six was the item most frequently omitted when completing the OHS.
In summary, it appears that the predominant area of ambiguity due to lack of question clarity on the OHS is whether patients should take into consideration the use of aids or specialised devices when responding to questions. Failing to take account of the use of aids and devices, and indeed of any assistance in activities, is a common oversight of many patient-centred measures of disability, such as the WOMAC. Individuals who take into consideration the use of an aid when answering a question will appear less disabled than they are in reality. This lack of clarity could confound results, with the patients who have the highest level of disability, and who use specialised equipment in many activities, appearing to be the least disabled on paper. To enhance question clarity and gain consistent results, it would appear advisable to specify to patients whether they should account for the use of aids or devices when responding to the question. However, modification of validated outcome measures can be fraught with problems, and therefore it may be more advisable to use an outcome tool that considers the modifying effect of aids and assistance on disability.
Measurement of pain
Nearly a quarter of all the annotations provided an explanation of the nature of pain. Patients frequently commented that the intensity of pain can fluctuate greatly over four weeks and that the level of pain is heavily dependent on factors such as medication and activity. As a consequence, several patients felt they could not give an 'average' level of pain for the last four weeks. Therefore, a limitation of the OHS is that it attempts to place patients into a single category of pain when in fact pain, particularly arthritic pain, is not static but dynamic. In a previous study, when interviewed about difficulties encountered when completing the OHS, individuals explained that they learned to ignore the pain, and that it could be masked by medication, and as a consequence they struggled to complete the questions referring to pain. Thus questions relating to 'average' pain appear inadequate to capture the experience of individuals with arthritic pain.
Restrictive and irrelevant questions
Twenty-three percent of annotations were descriptive or explanatory comments, supplementing the information recorded by the question. These annotations included descriptions of pain, the causes of pain, and how activities had to be modified as a consequence of disability, such as climbing stairs backwards. Furthermore, inadequate response categories resulted in 16 arrows being drawn between boxes, indicating that patients were unable to place themselves into a single category. The original article on the OHS does not indicate how these responses should be scored. Although it has recently been suggested that the highest score should be used, it may be argued that this is not a true reflection of the patient's answer and that the clinician is introducing bias by selecting which answer to accept.
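To see how much the choice of rule can matter when a patient links several boxes, the sketch below computes the lowest and highest totals that a set of answers could produce. It is an illustration only: the function name `score_bounds` and the example data are invented, and the source does not describe any such calculation.

```python
from typing import Sequence, Tuple


def score_bounds(responses: Sequence[Sequence[int]]) -> Tuple[int, int]:
    """Return (lowest, highest) possible totals when some items have several boxes marked.

    Each element of `responses` lists the box or boxes (scored 1-5) that the
    patient marked for one question; an arrow linking boxes gives more than one.
    """
    lowest = sum(min(boxes) for boxes in responses)
    highest = sum(max(boxes) for boxes in responses)
    return lowest, highest


# Invented example: eleven single answers of 3, plus one arrow spanning boxes 2 to 4.
example = [[3]] * 11 + [[2, 3, 4]]
print(score_bounds(example))  # (35, 37)
```

Taking the highest score, as has been suggested, would record 37 for this invented questionnaire, although the patient's own answer is consistent with any total from 35 to 37.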
Expansion of answers was necessary for several patients to explain that, although they had answered the question, it was not applicable to them. Comments written in response to question seven, which asks "have you been able to climb a flight of stairs?", suggest that climbing stairs is not applicable to everyone, as some individuals have stair lifts installed or live in a bungalow. In reply to question nine, "have you been limping when walking because of your hip?", a respondent answered that they did not limp but explained that this was because they were confined to a wheelchair. Although the questionnaire accounts for people who cannot drive by asking about difficulty travelling by public transport in question three, this question was not applicable to a patient who used an adapted taxi. The OHS appears to restrict individuals' answers and fails to allow them to express themselves adequately, as well as including questions that are not relevant to all individuals.
Influence of co-morbidities
The OHS was designed as a site-specific outcome measure for orthopaedic evaluation, and as such, has been favoured over more generic outcome measures [16, 21]. However, an underlying theme in the annotations was the difficulty that patients encountered when attempting to separate the disability and pain resulting from the affected hip from that arising from other co-morbidities. Contrary to Dawson and colleagues' finding that the OHS is not influenced by co-morbidities, the effect size of the OHS has been found to be substantially smaller in patients with other mobility-limiting conditions than in patients with unilateral hip osteoarthritis (OA), suggesting that other co-morbidities do influence the OHS. The Oxford knee score, which has a comparable format to the OHS, produced similar results for patients with and without knee pain in the presence of other co-morbidities, providing evidence that the questionnaire is not joint-specific. Patients with consistently high scores on the OHS have been found to suffer from multiple co-morbidities, and patients have verbalised that they find it difficult to separate pain from their hip from pain arising from other sites. Therefore, co-morbidities appear to compromise the specificity of the OHS in evaluating disability resulting from hip symptoms, although joint-specific questionnaires are designed to exclude the effects of co-morbidities.
In addition to the influence of diffuse co-morbidities, patients found it difficult to distinguish between pain originating from bilateral hips, highlighting a limitation of the OHS in considering only a single joint, which does not reflect the pattern of OA. In a sample of 500 OA patients, 53% of patients had more than one symptomatic joint. Recently, this issue has been addressed by the modification of the OHS to ask about bilateral hip joints, although the success of this new design is questionable, as 41% of the patients completed the OHS for the operated side only and 12% of patients did not discriminate between the two joints.
Double-barrelled questions
During the validation process it is advisable to eliminate any double-barrelled questions, yet question three asks two questions in one: "have you had any trouble getting in and out of a car or using public transport because of your hip?". Several patients answered the two parts of the question separately, as it is common to use both modes of transport. Similarly, question four asks three questions in one: "have you been able to put on a pair of socks, stocking or tights?". Again, some patients answered this as three questions, with women often finding tights harder to put on than socks.
The OHS is a useful short tool that is frequently utilised to assess the patient's perception of hip function, mobility and pain. It is quick both for the patient to complete and for the clinician to score. Although the OHS is a widely used and validated patient-centred outcome tool, it appears that it is not without problems, in concordance with previous findings. It is unclear to patients whether the questions are asking about level of disability before or after accounting for the use of aids and devices. Individuals found it difficult to respond to questions about the severity of their symptoms due to the dynamic nature of pain and the use of medications to mask the pain. They also had difficulty separating other co-morbidities from the symptoms of the affected hip. In addition, double-barrelled questions caused confusion, and not all questions on the OHS were relevant, or important, to the patient.
It could be argued that the difficulties patients experience with the OHS are due to the brevity of the scale, and could be reduced by the inclusion of additional questions. However, although there is little research on the problems experienced by patients while completing longer scales, such as the WOMAC or HOOS, it appears that the limitations highlighted in the OHS could also apply to these longer questionnaires. The WOMAC does not account for the use of aids or devices, includes questions asking patients about their average pain level over the past 4 weeks, and has double-barrelled questions such as "what degree of difficulty do you have with getting in/out of bath/shower?". In addition, the WOMAC items have been found to be influenced by other co-morbidities, such as low back pain. Previous research has found that the items on the WOMAC are unimportant, or irrelevant, to some individuals with OA. This latter limitation is applicable to many validated patient-centred outcome measures. No single activity is important to all individuals, nor is the importance of being able to perform that activity necessarily stable over time. Hence, an ideal approach would be to weight each item of disability by the importance of performing that activity. This would allow non-applicable items to be rated as of no importance and thus not contribute to the score, producing a more individualised patient-centred outcome measure. Further work is underway to develop a measure of the personal impact of disability in osteoarthritis.
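The idea of weighting disability by importance can be sketched as a sum of products. This is a minimal illustration under stated assumptions, not the measure under development: the function name `weighted_score`, the importance scale (0 meaning "of no importance") and the example data are all invented for the purpose.

```python
from typing import Sequence


def weighted_score(disability: Sequence[int], importance: Sequence[int]) -> int:
    """Sum of disability x importance over items (illustrative weighting only)."""
    if len(disability) != len(importance):
        raise ValueError("disability and importance must have the same length")
    return sum(d * w for d, w in zip(disability, importance))


# Invented data for three items: the second activity (say, climbing stairs for
# someone living in a bungalow) is rated as of no importance, so it adds nothing.
disability = [4, 5, 2]
importance = [3, 0, 1]
print(weighted_score(disability, importance))  # 14
```

Because the second item carries a weight of zero, its high disability rating does not inflate the total, which is the behaviour the paragraph above describes for non-applicable items.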
Katz JN, Phillips CB, Poss R, Harrast JJ, Fossel AH, Liang MH, Sledge CB: The validity and reliability of a Total Hip Arthroplasty Outcome evaluation questionnaire. J Bone Joint Surg Am 1995, 77: 1528–1534.
Harris WH: Traumatic arthritis of the hip after dislocation and acetabular fractures: treatment by mold arthroplasty. An end-result study using a new method of result evaluation. J Bone Joint Surg Am 1969, 51: 737–755.
Charnley J: The long-term results of low-friction arthroplasty of the hip performed as a primary intervention. J Bone Joint Surg Br 1972, 54: 61–76.
Hewlett SA: Patients and clinicians have different perspectives on outcomes in arthritis. J Rheumatol 2003, 30: 877–879.
Rothwell PM, McDowell Z, Wong CK, Dorman PJ: Doctors and patients don't agree: cross sectional study of patients' and doctors' perceptions and assessments of disability in multiple sclerosis. BMJ 1997, 314: 1580–1583.
Janse AJ, Gemke RJ, Uiterwaal CS, van der Tweel I, Kimpen JL, Sinnema G: Quality of life: patients and doctors don't always agree: a meta-analysis. J Clin Epidemiol 2004, 57: 653–661. 10.1016/j.jclinepi.2003.11.013
Ware J Jr, Kosinski M, Keller SD: A 12-Item Short-Form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care 1996, 34: 220–233. 10.1097/00005650-199603000-00003
Hunt SM, McKenna SP, McEwen J, Backett EM, Williams J, Papp E: A quantitative approach to perceived health status: a validation study. J Epidemiol Community Health 1980, 34: 281–286.
Guyatt G: A taxonomy of health status instruments. J Rheumatol 1995, 22: 1188–1190.
Meenan RF, Gertman PM, Mason JH: Measuring health status in arthritis. The arthritis impact measurement scales. Arthritis Rheum 1980, 23: 146–152.
Bellamy N, Buchanan WW, Goldsmith CH, Campbell J, Stitt LW: Validation study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J Rheumatol 1988, 15: 1833–1840.
Dawson J, Fitzpatrick R, Carr A, Murray D: Questionnaire on the perceptions of patients about total hip replacement. J Bone Joint Surg Br 1996, 78: 185–190.
Nilsdotter AK, Lohmander LS, Klassbo M, Roos EM: Hip disability and osteoarthritis outcome score (HOOS) – validity and responsiveness in total hip replacement. BMC Musculoskelet Disord 2003, 4: 10. 10.1186/1471-2474-4-10
Dawson J, Jameson-Shortall E, Emerton M, Flynn J, Smith P, Gundle R, Murray D: Issues relating to long-term follow-up in hip arthroplasty surgery: a review of 598 cases at 7 years comparing 2 prostheses using revision rates, survival analysis and patient-based measures. J Arthroplasty 2000, 15: 710–717. 10.1054/arth.2000.7109
Dawson J, Fitzpatrick R, Frost S, Gundle R, McLardy-Smith P, Murray D: Evidence for the validity of a patient-based instrument for assessment of outcome after revision hip replacement. J Bone Joint Surg Br 2001, 83: 1125–1129. 10.1302/0301-620X.83B8.11643
Dawson J, Fitzpatrick R, Murray D, Carr A: Comparison of measures to assess outcomes in total hip replacement surgery. Qual Health Care 1996, 5: 81–88.
Field RE, Cronin MD, Singh PJ: The Oxford hip scores for primary and revision hip replacement. J Bone Joint Surg Br 2005, 87: 618–622. 10.1302/0301-620X.87B5.15390
Fitzpatrick R, Dawson J: Health-related quality of life and the assessment of outcomes of total hip replacement surgery. Health Psychol 1997, 12: 793–803.
Fitzpatrick R, Morris R, Hajat S, Reeves B, Murray DW, Hannen D, Rigge M, Williams O, Gregg P: The value of short and simple measures to assess outcomes for patients of total hip replacement surgery. Qual Health Care 2000, 9: 146–150. 10.1136/qhc.9.3.146
Moran M, Khan A, Sochart DH, Andrew G: Expect the best, prepare for the worst: surgeon and patient expectation of the outcome of primary total hip and knee replacement. Ann R Coll Surg Engl 2003, 85: 204–206. 10.1308/003588403321661415
Ostendorf M, van Stel HF, Buskens E, Schrijvers AJ, Marting LN, Verbour AJ, Dhert WJ: Patient-reported outcome in total hip replacement. A comparison of five instruments of health status. J Bone Joint Surg Br 2004, 86: 801–808. 10.1302/0301-620X.86B6.14950
Dawson J, Fitzpatrick R, Murray D, Carr A: The problem of 'noise' in monitoring patient-based outcomes: generic, disease-specific and site-specific instruments for total hip replacement. J Health Serv Res Policy 1996, 1: 224–231.
McMurray R, Heaton J, Sloper P, Nettleton S: Measurement of patient perceptions of pain and disability in relation to total hip replacement: the place of the Oxford hip score in mixed methods. Qual Health Care 1999, 8: 228–233.
Pynsent PB, Adams DJ, Disney SP: The Oxford hip and knee outcome questionnaires for arthroplasty. J Bone Joint Surg Br 2005, 87: 241–248. 10.1302/0301-620X.87B2.15095
Harcourt WG, White SH, Jones P: Specificity of the Oxford Knee status questionnaire. The effect of disease of the hip or lumbar spine on patients' perception of knee disability. J Bone Joint Surg Br 2001, 83: 345–347. 10.1302/0301-620X.83B3.11298
Cushnaghan J, Dieppe P: Study of 500 patients with limb joint osteoarthritis. I. Analysis by age, sex, and distribution of symptomatic joint sites. Ann Rheum Dis 1991, 50: 8–13.
Streiner DL, Norman GR: Selecting the items. In Health Measurement Scales: A Practical Guide to Their Development and Use. Oxford: Oxford University Press; 1995:54–66.
Wolfe F: Determinants of WOMAC function, pain and stiffness scores: evidence for the role of low back pain, symptom counts, fatigue and depression in osteoarthritis, rheumatoid arthritis and fibromyalgia. Rheumatology (Oxford) 1999, 38: 355–361. 10.1093/rheumatology/38.4.355
O'Boyle CA, McGee H, Hickey A, O'Malley K, Joyce CR: Individual quality of life in patients undergoing hip replacement. Lancet 1992, 339: 1088–1091. 10.1016/0140-6736(92)90673-Q
We would like to give thanks to the staff and patients at the Avon Orthopaedic Centre for their cooperation in this study.
No external funding was received for this study.
VW was involved in the acquisition, analysis and interpretation of the data, and drafted the manuscript.
IDL was involved in the conception of the study and revision of the manuscript, and gave final approval of the version to be published.
VJC was involved in the conception and design of the study and revision of the manuscript.
Wylde, V., Learmonth, I.D. & Cavendish, V.J. The Oxford hip score: the patient's perspective. Health Qual Life Outcomes 3, 66 (2005). https://doi.org/10.1186/1477-7525-3-66