A systematic review of mobility instruments and their measurement properties for older acute medical patients

de Morton, Natalie A; Berlowitz, David J; Keating, Jennifer L

doi:10.1186/1477-7525-6-44

Review
Open access
Published: 05 June 2008

Natalie A de Morton¹,²,
David J Berlowitz² &
Jennifer L Keating¹

Health and Quality of Life Outcomes volume 6, Article number: 44 (2008)

Abstract

Background

Independent mobility is a key factor in determining readiness for discharge for older patients following acute hospitalisation and has also been identified as a predictor of many important outcomes for this patient group. This review aimed to identify a physical performance instrument that is not disease specific and that has the properties required to accurately measure and monitor the mobility of older medical patients in the acute hospital setting.

Methods

Databases initially searched were Medline, Cinahl, Embase, Cochrane Database of Systematic Reviews and the Cochrane Central Register of Controlled Trials without language restriction or limits on year of publication until July 2005. After analysis of this yield, a second step was the systematic search of Medline, Cinahl and Embase until August 2005 for evidence of the clinical utility of each potentially suitable instrument. Reports were included in this review if instruments described had face validity for measuring from bed bound to independent levels of ambulation, the items were suitable for application in an acute hospital setting and the instrument required observation (rather than self-report) of physical performance. Evidence of the clinical utility of each potentially suitable instrument was considered if data on measurement properties were reported.

Results

Three instruments, the Elderly Mobility Scale (EMS), the Hierarchical Assessment of Balance and Mobility (HABAM) and the Physical Performance Mobility Examination (PPME), were identified as potentially relevant. Clinimetric evaluation indicated that the HABAM has the most desirable properties of these three instruments. However, the HABAM has a ceiling effect in an older acute medical patient population, and reliability and minimally clinically important difference (MCID) estimates have not been reported for the Rasch refined HABAM. These limitations support the proposal that a new mobility instrument is required for older acute medical patients.

Conclusion

No existing instrument has the properties required to accurately measure and monitor mobility of older acute medical patients.

Background

The functional independence of older people is an important indicator of their health status. Diminished independence in hospitalised older people is associated with increased risk of transfer to nursing home, carer burden, mortality and healthcare costs after discharge [1]. Independent mobility is also a key factor in determining readiness for discharge for older hospitalised patients. An instrument that accurately measures and monitors this important construct for hospitalised older patients would have a range of useful applications in clinical care.

Mobility is the focus of the Timed Up and Go (TUG) [2] and Functional Ambulation Classification (FAC) [3] and a subsection of the Barthel Index (BI) [4–6]. These instruments have limitations for measuring mobility in acutely hospitalised patients or others who exhibit a broad spectrum of ability such as community dwelling older people [7–11]. The FAC is a relatively insensitive measure of change for older acute medical patients [11]. The TUG and the BI have inadequate scale width [7–11] and do not adequately capture changes in physical health for people whose limitations are either severe or relatively modest. The TUG has a floor effect with approximately one-quarter of hospitalised older people unable to complete this test because they are too weak [9]. The BI has a ceiling effect with approximately one quarter of patients scoring within the error margin of the highest score [9]. It has also been argued that the BI is a multidimensional scale (i.e. measures multiple constructs) and consequently summation of BI item scores to obtain a total score does not yield an interpretable index [8].

Many trials in aged care in the acute hospital setting have been confounded by inadequate physical outcomes measures. The importance of measures of physical ability across the spectrum of ability has been argued by those prescribing exercise for older people [12]. Pressure on already limited healthcare resources is predicted to increase as the average population age rises. An outcome measure that can accurately measure mobility is required to identify interventions that optimize physical outcomes of hospitalised older patients and facilitate effective targeting of healthcare services.

When selecting an outcome measure for a particular clinical purpose, there are many factors to consider [13]. No systematic review assists clinicians to determine the most appropriate mobility outcome measure for older general medical patients in the acute care setting. Therefore, the aims of this review were to:

identify potentially relevant instruments for measuring mobility in older acute medical patients.
summarise and compare the relevant clinimetric properties of the included instruments.

Methods

This review was conducted in two phases. Initially, a broad systematic search was performed to identify existing instruments for measuring the mobility of hospitalised older acute medical patients. For each instrument that was included, a second search was conducted to identify papers reporting research into its clinimetric properties. This second phase of searching was not constrained to studies of older patients. Data on the clinimetric properties of identified instruments were subsequently extracted and compared.

Phase One: instrument search

Inclusion and exclusion criteria

Reports were included in this review if they described instruments with face validity for measuring from bed bound to independent levels of ambulation and the items were suitable for testing in an acute care hospital (e.g. did not require a laboratory or large open spaces, were not community-based tests such as transferring in and out of a car). The instrument had to be administered by observation of physical performance to counter assessment limitations associated with cognitive deficits and recall bias in hospitalised older patients. For instruments that measured across multiple domains, the report was included if a subtotal for mobility could be determined. Instrument use in the acute hospital setting is also likely to be influenced by practical factors such as the time required for test administration. Therefore, this review aimed to identify an instrument that could be conducted, if necessary, during a hospital medical ward round. Based on this criterion, instruments that took more than 10 minutes to administer, on average, were excluded. Instruments were also excluded if they were not freeware or required expensive equipment, as cost is likely to be a barrier to clinical use in many acute hospital settings. Since health care providers can also vary from new graduates to experienced and specialised clinicians, it is also important that an appropriate mobility instrument does not require a minimum level of clinical experience to administer and can therefore be applied by all clinical staff. Therefore, instruments were excluded from the review if a report stipulated that a minimum level of clinical experience was required to administer the test. Instruments were also excluded if they were condition specific (e.g. stroke), consisted of only one item or had ambulatory items (i.e. high level items) that were the same as the ambulatory items on the BI, which has a known ceiling effect.
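The exclusion criteria above can be read as a checklist applied to each candidate instrument. The following is a minimal sketch of that reading, not the reviewers' actual procedure; the `Instrument` record, its field names and the reason labels are hypothetical and were introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Instrument:
    # Hypothetical record; each field mirrors one criterion from the text.
    name: str
    mean_admin_minutes: float
    is_freeware: bool
    needs_expensive_equipment: bool
    requires_minimum_experience: bool
    condition_specific: bool
    n_items: int
    ambulatory_items_match_bi: bool


def exclusion_reasons(inst: Instrument) -> List[str]:
    """Return every exclusion criterion that the instrument triggers."""
    reasons = []
    if inst.mean_admin_minutes > 10:
        reasons.append("average administration time over 10 minutes")
    if not inst.is_freeware or inst.needs_expensive_equipment:
        reasons.append("cost")
    if inst.requires_minimum_experience:
        reasons.append("minimum clinical experience required")
    if inst.condition_specific:
        reasons.append("condition specific")
    if inst.n_items == 1:
        reasons.append("single item")
    if inst.ambulatory_items_match_bi:
        reasons.append("ambulatory items same as the BI")
    return reasons
```

An instrument is retained only when the returned list is empty. Collecting all reasons, rather than stopping at the first, makes it possible to report whether an instrument was excluded for one reason only.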

Instrument identification and selection

Electronic databases were searched without language restriction or limits on year of publication until July 2005. A sensitive search was conducted for key search terms for 'older adults', 'mobility' and 'outcome measures'. Search terms for 'older adults' and 'mobility' were limited to the title or abstract to constrain the magnitude of the review yield to a manageable size. The complete search strategy is shown in Appendix 1. Databases searched were Medline, Cinahl, Embase, Cochrane Database of Systematic Reviews and the Cochrane Central Register of Controlled Trials. All papers were screened for mobility instruments that were reported in the title or abstract. Mobility was defined according to the World Health Organisation's International Classification of Functioning (ICF) [14]. Hard copies were obtained of the instruments reported in included papers.

Additional papers were identified by searching the American Physical Therapy Association Catalog of Tests and Measures [15], the UK Chartered Society of Physiotherapy website [16] and the Australian Physiotherapy Association Neurology Special Group Handbook [17]. Two independent reviewers examined hard copies of all included papers and applied inclusion and exclusion criteria. Disagreement between assessors was resolved with discussion.

Phase Two: clinimetric search

In phase one, a finite set of relevant instruments was identified. A second systematic search was then conducted to identify what was known about the clinimetric properties of each instrument. The search strategy is shown in Appendix 2. Medline, Cinahl and Embase were searched until August 2005. Papers were screened based on title and abstract for data on clinimetric properties of relevant instruments. Hard copies of potentially relevant papers were obtained. If a reason for instrument exclusion (criteria described for the phase one search) became apparent while examining clinimetric reports, the instrument was excluded.

Inclusion criteria for phase two were that data were provided on clinimetric properties of instruments identified in phase one and that these data enabled estimation of properties such as reliability, validity, minimally clinically important difference (MCID), responsiveness to change, internal structure/dimensionality or acceptability or feasibility.

Instrument evaluation

Data were extracted for each instrument identified by this review and were summarised under each of the following categories:

Instrument characteristics

The instrument items, response options, scoring system, equipment requirements, time to administer and floor and ceiling effects were extracted.

Internal structure and dimensionality

Data reporting the results of Rasch analysis, factor analysis or Cronbach's alpha were extracted.

Reliability

The following data about reliability of instruments were extracted: the type of reliability study conducted (e.g. inter or intra-rater reliability), the methods employed to conduct the study (e.g. independent assessments or video recording of the same patient assessment), assessor training and the characteristics of the patient group. Reliability estimates are reported using many indices. Any of the following were extracted: intraclass correlation coefficient (ICC), Pearson's r, Spearman's rho, Bland and Altman's limits of agreement [18], the minimal detectable change at the 90% (MDC₉₀) or 95% (MDC₉₅) confidence level, the root mean square of the residuals (RMS) associated with the test-retest regression or the standard error of measurement (SEM). If reliability data were not reported in the units of measurement, the SEM and MDC₉₀ were calculated from related statistics where possible.
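The text does not state which formulas were used to derive the SEM and MDC₉₀ from related statistics. A minimal sketch follows, assuming the commonly used relations SEM = SD × √(1 − ICC) and MDC = z × √2 × SEM, with z = 1.645 for the 90% level. The function names and the example values are illustrative only and are not taken from the review.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a sample SD and a reliability
    coefficient (assumed relation: SEM = SD * sqrt(1 - ICC))."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


def mdc(sem: float, z: float = 1.645) -> float:
    """Minimal detectable change (assumed relation: z * sqrt(2) * SEM).
    z = 1.645 gives MDC90; z = 1.96 gives MDC95."""
    return z * math.sqrt(2.0) * sem


# Illustrative values only: SD of 10 scale points and ICC of 0.91.
example_sem = sem_from_icc(sd=10.0, icc=0.91)
example_mdc90 = mdc(example_sem)
example_mdc95 = mdc(example_sem, z=1.96)
```

Both results are expressed in the units of the instrument, which is what allows a clinician to compare an observed change score with measurement error.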

Validity

Reports of the opinions of experts in the field regarding instrument items or item content were extracted as evidence of face or content validity respectively. Correlational data and associated 95% confidence intervals (e.g. ICCs, Pearson's r, Spearman's rho) were extracted as evidence of convergent (high correlation with measures of related constructs) and discriminant validity (low correlation with measures of unrelated constructs). For groups of patients who are known to differ in their mobility, group mean scores (and standard deviations) and between groups comparison data were extracted as evidence of 'known groups' validity. Data that indicated a relationship between mobility instrument scores and subsequent relevant health outcomes (e.g. a regression model) were extracted as evidence of predictive validity.
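For convergent and discriminant validity, a correlation coefficient and its 95% confidence interval can be recomputed when paired scores or a reported r and sample size are available. The sketch below assumes Pearson's r with a Fisher z-transformation interval; the review does not say how the extracted intervals were originally obtained, and the function names are illustrative.

```python
import math
from typing import Sequence, Tuple


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation of two equal-length samples."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def fisher_ci(r: float, n: int, z_crit: float = 1.96) -> Tuple[float, float]:
    """Approximate confidence interval for r via the Fisher z-transformation.
    Requires n > 3 and |r| < 1."""
    if n <= 3 or abs(r) >= 1.0:
        raise ValueError("need n > 3 and |r| < 1")
    z = math.atanh(r)
    se = 1.0 / math.sqrt(n - 3)
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)
```

A wide interval that spans both low and high correlations gives little support for either convergent or discriminant validity, which is why the interval was extracted alongside the point estimate.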

Minimally clinically important difference

The MCID has been defined by Jaeschke, Singer and Guyatt [19] as "the smallest difference in score in the domain of interest which patients perceive as beneficial...". The MCID tells clinicians how large a change in score must be for patients to perceive it as important. MCID point estimates and associated 95% confidence intervals were extracted from relevant papers. In the absence of reports that provided MCID data, the MCID was estimated using the distribution-based approach recommended by Norman et al. [20].
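The text names the distribution-based approach of Norman et al. [20] without giving the formula. The sketch below assumes the half-standard-deviation rule, in which the MCID is approximated as 0.5 × the SD of scores in the sample of interest. That reading is an assumption here and should be checked against reference [20]; the function name is illustrative.

```python
import statistics
from typing import Sequence


def mcid_half_sd(scores: Sequence[float]) -> float:
    """Distribution-based MCID estimate, assuming the half-SD rule:
    0.5 times the sample standard deviation of the scores."""
    if len(scores) < 2:
        raise ValueError("at least two scores are needed to estimate an SD")
    return 0.5 * statistics.stdev(scores)
```

Because the estimate depends only on the spread of scores, it will differ between samples with different variability, so the sample used should resemble the population in which the instrument will be applied.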

Responsiveness to change

For instruments included in this review, responsiveness indices and associated 95% confidence intervals were extracted. Data reporting significant change scores between assessments in a group of patients who were expected to change were considered adequate evidence of instrument responsiveness to change and were therefore extracted.

Acceptability and feasibility

Relevant data were extracted from any study that formally investigated the acceptability and/or feasibility of an instrument included in this review.

Results

Phase one: instrument search

The search identified 4100 papers. After screening of title/abstract, 3775 papers were excluded. From the remaining 325 papers, 178 assessment measures were identified (see Additional file 1) and hard copies were obtained. Predetermined inclusion and exclusion criteria were applied. Seven physical performance mobility measures were included in this review:

Clinical Outcomes Variable Scale (COVS) [21]
Elderly Mobility Scale (EMS) [22]
General Motor Function Assessment Scale [23]
Goal Attainment Scale [24, 25]
Hierarchical Assessment of Balance and Mobility (HABAM) [26, 27]
Physical Disability Index [28]
Physical Performance and Mobility Examination [29]

Phase two: clinimetric search

After obtaining hard copies of papers that reported the clinimetric properties of the seven remaining instruments, a further four instruments were excluded. Table 1 shows that three instruments were excluded due to a reported average administration time of more than 10 minutes. One instrument was excluded as a minimum of 1 year of clinical experience and 7 hours of training were required to administer the instrument.

Table 1 Reason for exclusion of mobility assessment instruments

Three instruments were included in this review and were subjected to rigorous clinimetric evaluation: the Elderly Mobility Scale (EMS) [22], the Hierarchical Assessment of Balance and Mobility (HABAM) [26, 27] and the Physical Performance Mobility Examination (PPME) [29]. Figure 1 shows a flow diagram of the inclusion and exclusion of instruments in this review (Phase 1). The most common reasons for instrument exclusion were that the items did not measure across the mobility spectrum or that the instrument items measured domains other than mobility. No instrument was excluded due to cost only. For each instrument that was included, Figure 2 shows a flow diagram of the inclusion and exclusion of papers reporting the clinimetric properties of each instrument (Phase 2).

Elderly Mobility Scale

Characteristics

The EMS was developed in the 1990s in England as a mobility assessment tool for frail older adults [22]. The characteristics of the EMS are summarised in Table 2. A ceiling effect has been identified for the EMS. For community dwelling older adults who had experienced a single fall in the previous 6 months, "approximately 50% of single fallers scored 19 – 20" [30] and for twenty healthy 81 to 90 year old women, all scored the highest possible score of 20 on the EMS [22].
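A ceiling effect of this kind can be quantified as the proportion of a sample scoring at, or within a small margin of, the maximum possible score. The sketch below is illustrative only: the function name and the `margin` parameter are assumptions, and no patient data from the cited studies are reproduced.

```python
from typing import Sequence


def proportion_at_ceiling(scores: Sequence[float], max_score: float,
                          margin: float = 0.0) -> float:
    """Fraction of scores that fall within `margin` points of `max_score`."""
    if not scores:
        raise ValueError("scores must not be empty")
    at_ceiling = sum(1 for s in scores if s >= max_score - margin)
    return at_ceiling / len(scores)
```

For the EMS, `max_score=20` with `margin=1` would correspond to the "scored 19 – 20" band quoted above. Setting `margin` to the instrument's measurement error gives the "within the error margin of the highest score" definition used for the BI in the Background.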

Table 2 Characteristics of the EMS, HABAM and PPME
