Giving meaning to the scores of the Amsterdam instrumental activities of daily living questionnaire: a qualitative study
Health and Quality of Life Outcomes volume 20, Article number: 47 (2022)
Everyday functioning is a clinically relevant concept in dementia, yet little is known about the clinical meaningfulness of scores on functional outcome measures. We aimed to establish clinically meaningful scoring categories for the Amsterdam Instrumental Activities of Daily Living Questionnaire (A-IADL-Q), representing no, mild, moderate and severe problems in daily functioning.
Informal caregivers (n = 6) of memory-clinic patients and clinicians (n = 13), including neurologists and nurse specialists, working at various memory clinics in The Netherlands. In focus groups, participants individually ranked nine summaries of fictional patients from least to most impairment in daily functioning. Then, they placed bookmarks to demarcate the thresholds for mild, moderate and severe problems. Individual bookmark placements were then discussed to reach consensus. Clinicians completed a survey in which they placed bookmarks, individually.
While individual categorizations varied somewhat, caregivers and clinicians generally agreed on the thresholds, particularly about the distinction between ‘no’ and ‘mild’ problems. Score categories were no problems (T-score ≥ 60), mild problems (T-score 50–59), moderate problems (T-score 40–49), and severe problems in daily functioning (T-score < 40), on a scale ranging 20–80.
Our findings provide categories for determining the level of functional impairment, which can facilitate interpretation of A-IADL-Q scores. These categories can subsequently be used by clinicians to improve communication with patients and caregivers.
Impairment in daily functioning due to cognitive decline is a core characteristic of dementia . Recent studies have shown that changes in daily functioning, in particular in ‘instrumental activities of daily living’ (IADL) , may occur well before dementia and even as early as the preclinical stage of Alzheimer’s disease [3,4,5,6]. IADL comprise cognitively complex activities such as doing grocery shopping, cooking, and using a computer, and as such, reflect cognitive functions in everyday life. IADL assessments can be helpful for monitoring disease progression and evaluating treatment effects [7, 8].
Impairment in IADL is fundamentally clinically important, as it reflects a person’s inability to live independently. IADL impairment is considered a key element in measuring clinically meaningful treatment effects, because it is related to reduced quality of life, caregiver burden, and apathy [9, 10]. However, a given score on an IADL instrument does not directly indicate whether the level of impairment requires clinical attention . Also, to patients and caregivers, the score itself does not translate to a meaningful concept of problems in daily functioning.
In this study, we set out to investigate the clinical meaningfulness of Amsterdam IADL Questionnaire (A-IADL-Q) scores by establishing clinically meaningful score cutoffs, representing no, mild, moderate and severe problems in daily functioning. Establishing these cutoffs could aid in the meaningful interpretation of A-IADL-Q scores, which could in turn improve communication between clinicians, patients and caregivers.
We asked informal caregivers of patients who visited our outpatient memory clinic between May and August 2019 to participate in a one-time, 3-h focus group. Additionally, we recruited caregivers through our center’s social media accounts. We approached neurologists, geriatricians, nurse specialists and neuropsychologists from various memory clinics in the Netherlands through contacts of the authors and by using a mailing list for members of the Dutch memory clinics network (Nederlands Geheugenpoli Netwerk).
The study was approved by the ethical review board of the VU University Medical Center, and all participants provided written informed consent.
The Amsterdam Instrumental Activities of Daily Living Questionnaire (A-IADL-Q) is an outcome measure that is self-completed by a caregiver and was designed to capture early impairment in daily functioning due to cognitive decline . For the current study, we used the short version of the instrument , which consists of a selection of 30 activities from the original 70-item version. Items were selected based on cross-cultural applicability, frequency of endorsement, and clinical relevance, as judged by clinicians, caregivers and patients . Items are rated on a five-point scale ranging from ‘no difficulty performing the activity’ to ‘unable to perform the activity’. The A-IADL-Q is scored using item response theory (IRT), which accounts for varying ‘difficulty’ of items such that impairment in a more complex activity (e.g., managing the household budget) contributes differently to the total score than impairment in a relatively simple activity (e.g., using the TV remote control). This information is contained in the scoring parameters, as described in detail elsewhere [13, 14]. The total score, or T-score, represents the latent trait of ‘daily functioning’ and is normally-distributed with a mean of 50 and a standard deviation (SD) of 10 in a memory clinic population. Scores thus range from approximately 20–80, with higher scores representing better daily functioning.
We created nine short clinical summaries (‘vignettes’) of fictional patients who had some degree of functional impairment, using combinations of five items of the A-IADL-Q for each vignette. We selected a subset of fifteen items to reduce the number of different activities presented in each vignette and increase comparability between them. The selection was made based on the IRT parameters to have items distributed across the latent trait, so that both more and less impaired ends of the daily functioning spectrum were covered. We then determined what item response category would be most likely to be endorsed given a certain T-score, based on the methods and using an R script adapted from Morgan, Mara . An overview of the most likely item responses of the fifteen items is included in the Additional file 1. The vignettes were created by combining the most likely responses of five items at different T-scores (i.e., different degrees of impairment), and were placed five points (0.5 SD units) apart, ranging from 20 (all most likely item responses were ‘unable to perform’) to 60 (all most likely item responses were ‘no difficulty’). We randomly assigned each vignette a gender, common Dutch surname, random age in the range of 60–70 years, and a stock photo. The vignettes can be found in the Additional file 1.
In the focus groups, we asked each panelist to describe what they considered ‘mild’, ‘moderate’ and ‘severe problems’ in daily functioning, to understand how the panelists defined these categories and create a framework for the subsequent categorization and discussion. Subsequently, panelists individually ordered the vignettes from the one representing the least functional impairment to the one representing the most. Panelists then discussed the order of the vignettes and reached a consensus ordering. Then, panelists individually placed bookmarks between the vignettes to create categories representing no, mild, moderate, and severe problems in daily functioning. This ‘bookmarking’ method was previously developed by Cook and colleagues . Finally, a second group discussion resulted in a consensus categorization. Group discussions were based on the nominal group theory .
Clinicians individually completed an online survey that was modeled after the focus group procedures, and in which they were first asked to describe what they considered ‘mild’, ‘moderate’ and ‘severe problems’. Next, the nine vignettes were presented in order from least to most impaired, and the clinicians were instructed to categorize them into no, mild, moderate and severe problems.
As the clinicians completed the survey independently, consensus between them was determined by taking the mode of the categorization for each vignette (1 = no problems, 2 = mild problems, 3 = moderate problems, 4 = severe problems). The overall consensus categorization was the mode of the three separate consensus categorizations: two from the focus groups with informal caregivers, and the consensus between clinicians. Analyses were performed in R version 4.1.0 .
Forty patient caregivers were invited through the Alzheimer Center Amsterdam to participate in the focus groups. Six individuals (age 68 ± 10 years old, 4 women) agreed to participate and they were spread across two focus groups. Four panelists were partners, and two were adult children of a person with dementia. Clinicians were approached through contacts of the authors, as well as through a mailing list for clinicians working in memory clinics in The Netherlands. Thirteen clinicians (five neurologists, five nurse specialists, two neuropsychologists and a geriatrician; age 46 ± 13 years old, 8 women) completed the survey.
Caregivers and clinicians had differing definitions of what they considered ‘problems in daily functioning’. One caregiver defined ‘problems’ as having any amount of difficulty with performing some activity, whereas another stated that they considered ‘problems’ to be the complete inability to perform an activity. Clinicians wrote that ‘mild problems’ cause minimal impairment predominantly in the most complex activities, whereas ‘severe problems’ imply that a person can no longer function independently. As a result of the various personal definitions, individual categorizations differed slightly, with some panelists categorizing more strictly, where fewer problems were classified as more severe, while others were more lenient, classifying more problems as less severe. Consensus between the focus groups was largely similar, except that in one group, two more vignettes were classified as representing ‘severe problems’, creating a 10-point difference between the cutoffs for ‘severe problems’ in the two groups (see Fig. 1). The vignettes at the extremes, i.e., ‘no problems’ and ‘severe problems’ were classified the same across clinicians and caregivers. The classifications of ‘moderate’ and ‘severe’ problems differed among clinicians, similar to the caregivers.
The final average categorization was as follows: T-scores ≥ 60 were classified as showing ‘no problems’, T-scores 50–59 were classified as ‘mild problems’, T-scores 40–49 as ‘moderate problems’ and T-scores < 40 as ‘severe problems’ (Fig. 1).
In this study, we involved stakeholders to determine clinically meaningful scoring categories for the measurement of functional impairment using the Amsterdam IADL Questionnaire. Informal caregivers and clinicians established categories representing no (T-score ≥ 60), mild (50–59), moderate (40–49), and severe problems in daily functioning (< 40) in IADL.
Clinical meaningfulness in the context of Alzheimer’s disease and related disorders has been gaining attention over recent years [19, 20]. Clinicians have a good understanding of the disease and its effects on patients and caregivers. Still, when conclusions are based solely on judgments by clinicians, these only comprise part of the picture. Especially, caregivers could add a unique perspective since they observe and can therefore reflect on functioning in AD patients in everyday life. This is a major advantage of our study.
The cutoffs between no and mild, and mild and moderate problems were unanimously agreed upon by caregivers and clinicians. This is especially important, as it seems that clear, clinically meaningful distinctions can be made in subtle degrees of IADL impairment. There was, however, some disagreement among caregivers on the precise placement of a cutoff to make the distinction between moderate and severe problems. It is arguable that the difference between these categories is of less importance, as there is already considerable impairment. Our findings also show that the clinical interpretation may depend on individual definitions and opinions, which has likely contributed to the slight differences we found in categorizations. The categories we present may not reflect everyone’s personal interpretation of different degrees of functional impairment.
Nevertheless, the proposed categories can help clarify the meaning of a given score, and thus provide concrete guidance for communicating test results with patients and their caregivers. This is important as many patients and caregivers report unmet information needs, especially about what test results mean [21, 22]. When discussing test results, communication may benefit from the use of clear language and interpretable categories, rather than raw scores. Our study provides such ready-to-use scoring categories for the Amsterdam IADL Questionnaire.
An important strength of this work is that we used a qualitative approach involving stakeholders (both caregivers and clinicians) to determine clinically meaningful categories in the scoring of a functional outcome measure. Limitations of this study include its small sample size, predominance of women, and recruitment in The Netherlands only, which limit the generalizability of our results. A future study should expand on our work by including a larger sample size representing a more diverse group of caregivers. Future work should also focus on the meaningfulness of changes in daily functioning, as changes may be meaningful, even when they fall entirely within the scoring categories we established here.
In conclusion, we used caregiver and clinician input to place thresholds and thus create meaningful categories for assessing the severity of impairment in everyday functioning in the context of AD. Specifically, these categories may be useful for distinguishing absence of any problems from the existence of mild problems, which is relevant in early disease stages. Our findings give meaning to total scores, which in and of their own are usually rather unintuitive. By providing clear language about the level of impairment, the categories could support clinicians in explaining the meaning of test results to patients and their caregivers.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Amsterdam instrumental activities of daily living questionnaire
Instrumental activities of daily living
Scheltens P, Blennow K, Breteler MMB, De Strooper B, Frisoni GB, Salloway S, et al. Alzheimer’s disease. Lancet. 2016;388(10043):505–17.
Lawton MP, Brody EM. Assessment of older people: self-maintaining and instrumental activities of daily living. Gerontologist. 1969;9(3):179–86.
Marshall GA, Aghjayan SL, Dekhtyar M, Locascio JJ, Jethwani K, Amariglio RE, et al. Activities of daily living measured by the Harvard Automated Phone Task track with cognitive decline over time in non-demented elderly. J Prev Alzheimers Dis. 2017;4(2):81–6.
Dubbelman MA, Jutten RJ, Tomaszewski Farias SE, Amariglio RE, Buckley RF, Visser PJ, et al. Decline in cognitively complex everyday activities accelerates along the Alzheimer’s disease continuum. Alzheimers Res Ther. 2020;12:138.
Peres K, Helmer C, Amieva H, Orgogozo JM, Rouch I, Dartigues JF, et al. Natural history of decline in instrumental activities of daily living performance over the 10 years preceding the clinical diagnosis of dementia: a prospective population-based study. J Am Geriatr Soc. 2008;56(1):37–44.
Sikkes SAM, De Rotrou J. A qualitative review of instrumental activities of daily living in dementia: What’s cooking? Neurodegener Dis Manag. 2014;4(5):393–400.
Giebel CM, Sutcliffe C, Stolt M, Karlsson S, Renom-Guiteras A, Soto M, et al. Deterioration of basic activities of daily living and their impact on quality of life across different cognitive stages of dementia: a European study. Int Psychogeriatr. 2014;26(8):1283–93.
Luck T, Riedel-Heller SG, Luppa M, Wiese B, Bachmann C, Jessen F, et al. A hierarchy of predictors for dementia-free survival in old-age: results of the AgeCoDe study. Acta Psychiatr Scand. 2014;129(1):63–72.
Martyr A, Nelis SM, Quinn C, Rusted JM, Morris RG, Clare L, et al. The relationship between perceived functional difficulties and the ability to live well with mild-to-moderate dementia: findings from the IDEAL programme. Int J Geriatr Psychiatry. 2019;34(8):1251–61.
Zanetti O, Geroldi C, Frisoni GB, Bianchetti A, Trabucchi M. Contrasting results between caregiver’s report and direct assessment of activities of daily living in patients affected by mild and very mild dementia: the contribution of the caregiver’s personal characteristics. J Am Geriatr Soc. 1999;47(2):196–202.
Cella D, Choi S, Garcia S, Cook KF, Rosenbloom S, Lai JS, et al. Setting standards for severity of common symptoms in oncology using the PROMIS item banks and expert judgment. Qual Life Res. 2014;23(10):2651–61.
Sikkes SAM, de Lange-de Klerk ES, Pijnenburg YAL, Gillissen F, Romkes R, Knol DL, et al. A new informant-based questionnaire for instrumental activities of daily living in dementia. Alzheimers Dement. 2012;8(6):536–43.
Jutten RJ, Peeters CFW, Leijdesdorff SMJ, Visser PJ, Maier AB, Terwee CB, et al. Detecting functional decline from normal aging to dementia: development and validation of a short version of the Amsterdam IADL Questionnaire. Alzheimers Dement (Amst). 2017;8:26–35.
Sikkes SAM, Knol DL, Pijnenburg YAL, de Lange-de Klerk ES, Uitdehaag BM, Scheltens P. Validation of the Amsterdam IADL Questionnaire(c), a new tool to measure instrumental activities of daily living in dementia. Neuroepidemiology. 2013;41(1):35–41.
Morgan EM, Mara CA, Huang B, Barnett K, Carle AC, Farrell JE, et al. Establishing clinical meaning and defining important differences for patient-reported outcomes measurement information system (PROMIS((R))) measures in juvenile idiopathic arthritis using standard setting with patients, parents, and providers. Qual Life Res. 2017;26(3):565–86.
Cook KF, Cella D, Reeve BB. PRO-bookmarking to estimate clinical thresholds for patient-reported symptoms and function. Med Care. 2019;57(Suppl 1):S13–7.
Jones J, Hunter D. Consensus methods for medical and health services research. BMJ. 1995;311(7001):376–80.
R Core Team. R: A language and environment for statistical computing. 4.1.0 ed2021.
Siemers E, Holdridge KC, Sundell KL, Liu-Seifert H. Function and clinical meaningfulness of treatments for mild Alzheimer’s disease. Alzheimers Dement (Amst). 2016;2:105–12.
Rentz DM, Wessels AM, Bain LJ, Weber CJ, Carrillo MC. Clinical meaningfulness addressed at Alzheimer’s association research roundtable. Alzheimers Dement. 2020;16(5):814.
Visser LNC, Kunneman M, Murugesu L, van Maurik I, Zwan M, Bouwman FH, et al. Clinician-patient communication during the diagnostic workup: the ABIDE project. Alzheimers Dement (Amst). 2019;11:520–8.
Fruijtier AD, Visser LNC, Bouwman FH, Lutz R, Schoonenboom N, Kalisvaart K, et al. What patients want to know, and what we actually tell them: the ABIDE project. Alzheimers Dement (N Y). 2020;6(1):e12113.
The authors would like to thank the caregivers and clinicians who provided their useful input for this study.
The work for this study was supported by public–private funding of which SAMS was the recipient. She received funding from Health-Holland, Topsector Life Sciences & Health (PPPallowance; LSHM20084-SGF, project DEFEAT-AD), and the National Institutes of Health, as well as license fees from Green Valley, VtV Therapeutics, Alzheon, Vivoryon, and Roche, and honoraria from Boehringer and Toyama. All funding is paid to her institution. LNCV is supported by a fellowship grant received from Alzheimer Nederland (WE.15-2019-05) and recipient of ABOARD, which is a public–private partnership receiving funding from ZonMW (#73305095007) and Health ~ Holland, Topsector Life Sciences & Health (PPP-allowance; #LSHM20106). All funding is paid to the institution.
Ethics approval and consent to participate
This study was approved by the ethical review board of the VU University Medical Center. All participants provided written informed consent prior to participation.
Consent for publication
SAMS and PS co-developed the A-IADL-Q, which is freely available for use by academic users and not-for-profit organizations. License fees from for-profit organizations are paid to their institution. The other authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
. Provides additional information on how the vignettes were made, what activities they are composed of, and lists all vignettes shown to participants.
About this article
Cite this article
Dubbelman, M.A., Terwee, C.B., Verrijp, M. et al. Giving meaning to the scores of the Amsterdam instrumental activities of daily living questionnaire: a qualitative study. Health Qual Life Outcomes 20, 47 (2022). https://doi.org/10.1186/s12955-022-01958-2