Skip to main content

Anticipated adaptation or scale recalibration?



The aim of our study was to investigate anticipated adaptation among patients in the subacute phase of Spinal Cord Injury (SCI).


We used an observational longitudinal design. Patients with SCI (N = 44) rated their actual, previous and expected future Quality of Life (QoL) at three time points: within two weeks of admission to the rehabilitation center (RC), a few weeks before discharge from the RC, and at least three months after discharge. We compared the expected future rating at the second time point with the actual ratings at the third time point, using student’s t-tests. To gain insight into scale recalibration we also compared actual and previous ratings.


At the group level, patients overpredicted their improvement on the VAS. Actual health at T3(M = 0.65, sd =0.20)) was significantly lower than the predicted health at T1 of T3 (M = 0.76, sd = 0.1; t(43) = 3.24, p < 0.01), and at T2 of T3(M = 0.75,sd = 0.13; t(43) = 3.44, p < 0.001). Similarly the recalled health at T3 of T2 (M = 0.59, sd = 0.18) was significantly lower than the actual health at T2 (M = 0.67, sd = 0.15; t(43) = 3.26, p <0.01). Patients rated their future and past health inaccurately compared to their actual ratings on the VAS. In contrast, on the TTO patients gave accurate estimates of their future and previous health, and they also accurately valued their previous health. Looking at individual ratings, the number of respondents with accurate estimates of their future and previous health were similar between the VAS and TTO. However, the Bland-Altman plots show that the deviation of the accuracy is larger for the TTO then the VAS. That is the accuracy of 95% of the respondents was lower in the TTO then in the VAS.


Patients at the onset of a disability were able to anticipate adaptation. Valuations given on the VAS seem to be biased by scale recalibration.


Patients experiencing an illness or disability generally give high Quality of Life (QoL) ratings compared to ratings given by people imagining having such an illness or disability [1]. Such an underestimate of QoL by people imagining illness or disability is common. Previous research suggests that this is due to people focusing on negative aspects of a disability (focusing illusion) and a failure to anticipate adaptation [2, 3]. Patients adapt physically as well as psychologically to their disability, leading to an improvement of their QoL. People imagining a disability have a tendency to focus on the incident and fail to anticipate this ability to adapt. As a consequence, they misjudge their long-term emotional responses to such events [4].

Besides underprediction of QoL, overprediction has also been shown. Peeters et al. [5] showed that patients expected greater improvement of their QoL at the onset of a disability than they actually got. These findings may have been led by a measurement bias called scale recalibration. Although patients in this study were recovering from surgery they did not report any actual improvement over time. If the findings of this study were led by scale recalibration, it will become impossible to draw conclusions about anticipated adaptation. We cannot differentiate between scale recalibration and anticipated adaptation.

Together with reprioritization and reconceptualization, scale recalibration is one of the three constituents of response shift. Response shift was initially described by Golembiewski et al. [5]. Sprangers and Schwartz introduced it into QoL research [6]. The first construct, reprioritization describes the process in which a respondent changes the importance of the domains that constitute the underlying construct. For instance, a respondent might first claim that her QoL is mainly based on her physical wellbeing whereas overtime she might find that her physical wellbeing is not that important any more for her QoL. The second contruct, reconceptualization, is a redefinition of the construct QoL. That is, a respondent might initially assume that QoL is based on physical health but overtime she might change her definition of QoL in which physical health is not part of the construct anymore [6]. The third construct, scale recalibration is a process in which the intervals of a scale used to measure a dimension are recalibrated after an intervening event. In the field of QoL research, this event may be a course of treatment, the development of a disability, or the diagnosis of an illness. Such event might cause the intervals on a scale to expand or contract, depending on the change in context [5]. Likert scales have proven to be sensitive to scale recalibration [7].

Constituents of response shift are strongly related to adaptation. Response shift has been described an effect of adaptation [8]. The process of adaptation to chronic illness and disability is dynamic, nonlinear and unpredictable, with individuals undergoing several transitions, with the last transition evidenced by a reconstructed self-identity including appreciation of diverse activities and life goals [9]. Quotes reported in qualitative studies of adaptation (“changing my habits, which means that I changed, I no longer do the same work, I used to work a lot before” [10]), are similar to quotes reported in a qualitative studies on response shift (“I can do some work in the house, like sweeping the floor, so I’m not limited” [11]). This has been argued to be mostly true for the concepts reconceptualization and reprioritization, as scale recalibration refers to the measurement scale [12]. This distinction between scale recalibration on the one hand and reconceptualization and reprioritization on the other, in relation to adaptation has been criticized, with some arguing that the changes in the scale are also part of the adaptation process. As can be read from a thorough discussion in the literature there are different theoretical points of view on the role for scale recalibration [1216]. As response shift literature argues that scale recalibration is part of the response shift and therefore part of the effect of adaptation [6, 8], others have argued that scale recalibration should be regard as measurement error [7, 12].

In the context of this study, we regard scale recalibration as measurement bias and reconceptualization and reprioritization as an effect of adaptation. We do not make a distinction between physical and psychological adaptation, therefore the concept 'adaptation’ refers to aspects such as learning how to handle a wheelchair, finding new hobbies, and learning how to live with a new situation.One type of measures used in the field of QoL research, are preference measurements, or health state valuations, mostly used in the field of decision-making. Whereas descriptive QoL measurements only describe QoL, preference measurements measure both QoL and the valuation of QoL, relative to perfect health and death [17]. One of these preference measurements is the Visual Analog Scale (VAS). In the VAS, patients give valuations of their health by placing a mark on a 100 mm. horizontal or vertical line ranging from perfect health to death. Just like the Likert-scale method the VAS is a rating scale, but mostly without labeling intervals; (as an exception the VAS of the EQ-5D used to estimate health state tariffs does have labeling intervals [18]). When labeling intervals are missing, respondents have the tendency to divide the scale using their own 'intervals’ [19]. Probably these personal 'intervals’ can expand or contract just like the intervals of a likert scale [5]. Lacey et al. [7] found support for the effect of scale recalibration on VAS ratings. Its feasibility makes the VAS a popular method, but a limitation of the VAS is that it does not provide a trade-off valuation, which is required in e.g. the assessment of quality-adjusted life years for decision models or cost-effectiveness analysis [20]. Another instrument often used in health state valuations which does suit this goal is the Time Trade-Off (TTO) [21]. In the TTO, participants choose between a number of life years in their current state of health and a shorter lifespan in perfect health. The severity of the state of health estimated is based on the number of years a person is willing to trade for perfect health. TTO ratings are measured on a time scale, usually in life years. On a time scale, the intervals are stable. It is hard to imagine how 'time’ as a response scale can be the subject of recalibration, unless the survival time prognosis of the respondent changes dramatically. Theoretically, the TTO therefore seems less vulnerable to scale recalibration than the VAS, yet no empirical evidence has been described to ground this assumption.

The aim of our study is to attempt to disentangle scale recalibration and inability to anticipate one’s adaptation, in patients with SCI. We asked patients to predict future and recall former state of health including physical, psychological and social aspects using a VAS and a TTO. We will use a method similar to the then-test assessing improvement in QoL [22] to examine scale recalibration and anticipated adaptation. The then test gives information at the group level which can be used to guide treatment decisions at the group level but will have measurement error when decisions are made at the individual level [23]. Therefore, individual methods such as the Bland-Altman plot will be used as well. Since we expect valuations given on a TTO to be less sensitive to scale recalibration than valuations given on a VAS, as explained above, we compared results for these two measures. Previous research among patients with SCI revealed that for some patients the change over time in satisfaction with life -measured with a likert-scale- was due to scale recalibration [24]. Others found that patients with SCI changed their priorities over time [25]. In the current study, we expected patients to overpredict changes in their state of health over time on both the VAS and TTO, if they overestimated their ability to adapt. However, if they correctly estimate their ability to adapt they will make correct predictions of changes in their state of health on the TTO. On the VAS they will still make overpredictions of these changes, due to scale recalibration.


Participants and procedures

Patients between 18 and 75 years old with SCI who were able to understand and speak Dutch, with sub-acute SCI causing functional losses and problems with daily activities, were asked to participate in our study. Patients with minor functional losses (who had neither problems with walking ability nor problems with bladder or bowel functions) as well as patients with severe emotional or cognitive problems were excluded. Eligible patients were contacted by their attending physician or psychologist in the first few weeks of admission. We collaborated with six Rehabilitation Centers (RCs) in The Netherlands specializing in SCI. Patients who consented to be interviewed were contacted by one of the interviewers.

Patients were interviewed at three time points. These time points were based on clinical moments given by the rehabilitation schedule. This meant that the interviews were scheduled around major events in the rehabilitation trajectory similar for all patients. The first interview took place at the RC as soon as possible after admission, the second interview during active rehabilitation, also at the RC, and the third after discharge when patients were accustomed to their home situation. This interview took place at home or during an out-patient clinic visit to the RC. Along with these clinical moments, a time frame was used; the first interview took place within the first four weeks of admission, the second interview at least two weeks before discharge and a month after the first interview, and the third interview took place at least six months after discharge. Since the rehabilitation period of patients with tetraplegia is generally longer, the time between interviews differed for patients with paraplegia and patients with tetraplegia. For patients with paraplegia, the schedule was therefore an interval of three months between the first and second interview and an interval of one year between the first and third interview. For patients with tetraplegia, the interview schedule included an interval of six months between the first and second interview and an interval of 18 months between the first and third interview. A great deal of effort was put into interviewing all patients within these time frames; exceptions that had to be made are described in the results section. The medical ethics committee of the Leiden University Medical Center (LUMC) and the local ethics committees of the RCs approved the study protocol.

The interview

At all three time points, a face-to-face interview was conducted by one of two trained interviewers following a strict interview protocol. Overall the same questions were asked at the three time points, but small changes were made when necessary due to the different situation patients were in. In the introduction of the interview, patients were instructed not only to take their illness into account in answering questions about their health, but to incorporate the limitations caused by their injury as well. Patients gave a valuation of their own health of the previous week using a VAS and a TTO. In the first and second interviews this was followed by a (predicted) valuation of their state of health at least six months after discharge from the RC when the patient was expected to be home; approximately the time of the third interview. In the third interview, the valuation of their own state of health at that moment was followed by a valuation of their own health as they recalled it to be during the second interview.


The time trade-off (TTO)

Patients rated how many years (x) of their remaining life expectancy (y) they were willing to trade in order to obtain perfect health. Utility was calculated as y-x/y. Life expectancy was derived from Dutch life expectancy tables for their gender and age category [26]. The interviewer asks a participant to choose between a number of years, i.e. their remaining life expectancy, living in the health state to be valued or living a shorter period of time in perfect health. The time in perfect health is varied to obtain an indifference point, the number of years in perfect health equal to a higher number of years in the health state to be valued. For example if a respondent has a life expectancy of 20 years the interviewer asks the respondent to choose between 20 years living in the health state to be valued or living for instance 10 years in perfect health. If the respondent chooses for the health state to be valued the duration in perfect health will increase if the respondent chooses for perfect health the duration will decrease. By changing the duration of perfect health an indifference point can be sought. The indifference point was sought through the bisection method. TTO utilities were elicited using a time-line and board. The descriptions of the health states are placed on the board, and with the sliding time-line the duration of the health state was made visible. The health states were perfect health and the patients’ own health of the previous week, their future, or their previous state of health, respectively. Perfect health was described as full well-being in physical, psychological, and social functioning. Given the severity of SCI and the emotional status of the patients, the lowest trade-off was set at three months living in perfect health.

The visual analogue scale (VAS)

The VAS is a 100 mm horizontal line anchored by death and perfect health. Perfect health was again described as full well-being in physical, psychological and social functioning. The valuation of their own health of the previous week (actual health), of their expected future state of health, and previous state of health were elicited by placing a mark between death and perfect health, and calculated as #mm/100.

Data analyses

Prior to the main analyses, all variables were examined for normality by relating the level of skewness, and kurtosis to the standard deviation. All variables were normally distributed. We used Students’ t-tests to assess differences between the original scores, which we will call the 'actual change’ in health state valuations and the difference between the original scores and scores for predicted health state which we will call the “accuracy in predicting” and the difference between the original scores and the scores for the recalled health which we will call “accuracy in recalling”.

Individual accuracy was tentatively (because of the known moderate ICC reliability of the methods [27, 28]) examined by examining the percentage of respondents who made correct estimates. Given that valuations given on the VAS and TTO both can be calculated to more than two decimals specific, we allowed for an individual rating-error of 0.100. That is if the difference between two ratings was within 0.100 we report this as an equal rating. To further examine the individual accuracy Bland-Altman plots were prepared which visualize the accuracy by showing the 95% confidence interval.


Physicians or psychologists approached 74 patients who met our inclusion criteria. Of these, 14 patients refused due to personal reasons. Ten of these patients refused because they did not want to participate, two patients refused because of their religion, and two patients found it difficult to talk about their state of health. No differences in age and gender were found between patients refusing to participate and patients who participated in the first interview. We had no information on the medical condition of patients who refused.

In total 60 (74-14) patients (81%) agreed to participate and were interviewed at T1. At T2 four patients withdrew for personal reasons and one patient could not be contacted. Three patients had to be excluded:one patient was excluded due to an infection that made active rehabilitation impossible and two patients were not interviewed because the time between the first interview and discharge had been less than a month. Two patients had incomplete data because at the time of their interview the prediction of the future state of health had not been included as yet. At T3, four patients dropped out (two withdrew due to major pain, one had passed away, and one could not be interviewed due to logistic reasons). Finally two respondents were excluded from the analysis who did not want to make a prediction of their future health. In total 44 patients of the initially 74 approached (59%) were included in our analyses, 73% of the 60 patients who took part in the first interview. No statistically significant differences in gender, marital status, children, education level, age and cause of injury were found between patients excluded in the second or third interview and patients included. However, a difference was found in type of injury: more patients with incomplete tetraplegia had dropped out.

Tables 1 and 2 show the demographic characteristics and mean valuations of the included patients.

Table 1 Demographic characteristics of patients with spinal cord injury participating in this study
Table 2 Continuous characteristics of the patients with spinal cord injury participating in this study

Accuracy of predicted and recalled health state valuations compared to actual ratings

Table 3 shows the mean actual, predicted, and recalled valuations of health states. In Figure 1 and Figure 2 the actual, predicted and recalled mean changes are presented. The VAS shows that at the group level patients over-predicted their future and previous health compared to their actual health. Between the first and third interview patients expected significant improvement (difference d = 0.16 (0.14) (t(43) = -7.39, p < 0.001)) but showed only a minor actual improvement (d = 0.06 (0.21) (t(43) = -1.77, p = 0.084)). Between the second and third interview, patients again predicted significant improvement (d = 0.08 (0.11) (t(43) = 4.81, p < 0.001)) but here their valuations for their actual health even slightly decreased (d = -0.02 (0.19) (t(43) = 0.71, p = 0.48)). Finally, they also recalled a small improvement at the group level (d = 0.06 (0.21) (t(43) = 1.91, p = 0.06)).

Table 3 Accuracy of predicted and recalled valuations of patients with Spinal Cord Injury (SCI) on Time Trade-Off (TTO) and Visual Analogue Scale (VAS) at the group level
Figure 1
figure 1

At group level actual change and predicted change at time point 1 and 3 for valuations given for the own health on the Time Trade-Off (TTO) and Visual Analogue Scale (VAS) by patients with Spinal Cord injury.

Figure 2
figure 2

At group level actual change, predicted change and recalled changed between time point 2 and 3 for valuations given for the own health on the Time Trade-Off (TTO) and Visual Analogue Scale (VAS) by patients with Spinal Cord Injury (SCI).

For the TTO on the other hand, at a group level patients gave accurate predictions and had accurate recollections when compared to the actual change. Between the first and third interview patients predicted an improvement of d = 0.18(0.22) (t(43) = 5.36, p <0.001) and their state of health actually improved by d = 0.25 (0.30) (t(43) = -5.43, p < 0.001). Between the second and third interview patients predicted an improvement of d = 0.06 (0.09) (t(43) = 4.27, p < 0.001) and their state of health actually showed an improvement of d = 0.06 (0.30) (t(43) = 1.40, p = 0.17)) at the group level. Finally, they also recalled a small improvement at the group level (d = 0.08 (0.22)(t(43) = 2.33, p = 0.025)). Both the predicted and the recalled valuations did not differ significantly from the actual change (Table 3).

At the individual level 41% of the respondents gave an accurate prediction of the VAS at first interview for their health at the third interview. At the second interview the correct prediction was slightly lower, 37%. Finally, 41% recalled their health during the period of the second interview correctly. For the TTO the percentages were similar, at the first interview 43% gave accurate predictions, at the second interview this was 39% and at the third interview 45% recalled their health during the period of the second interview correctly. Comparing the valuations given on the VAS with valuations given on the TTO it seems that respondents on the VAS more often over predicted and under recalled their improvement (Table 4). This is in line with the findings at the group level.

Table 4 Accuracy of predicted and recalled valuations of patients with Spinal Cord Injury (SCI) on Time Trade-Off (TTO) and Visual Analogue Scale (VAS) at the individual level

The Bland-Altman plots (Figure 3) show the 95% confidence interval of the accuracy. The 95% confidence interval for the VAS of T1 prediction of T3 vs T3 actual is -0.32 -0.53, for T2 prediction of T3 vs T3 actual is -0.30-0.50, and for the T2 actual vs T3 recall of T2 is -0,40 -0.24. The 95% confidence interval of the TTO of T1 prediction of T3 vs T3 actual is -0,62 -0,74, for T2 prediction of T3 vs T3 acutal is -0,61 – 0,62, and for the T2 actual vs T3 recall of T2 is -0,45-0,48.

Figure 3
figure 3

Individual accuracy between predicted health at T1 for actual health at T3, predicted health at T2 for actual health at T3 and recalled health at T3 for actual health at T2 for VAS and TTO.


The aim of our study was to attempt to disentangle scale recalibration and ability to anticipate one’s adaptation, in patients with SCI. We explored whether patients correctly predict their future QoL and recall their previous QoL at the group level. In an effort to disentangle scale recalibration and poor anticipated adaptation we used a then-test approach [22] with valuations given on a TTO and VAS. As hypothesized, at the group level the predicted and recalled improvements on the TTO were similar to the actual improvements. On the VAS patients expected an improvement and recalled improvement, but according to their actual ratings they remained stable. Scale recalibrations were likely to have occurred on the VAS. Over time, patients thus used different standards on the VAS. This is in line with the study of Lacey et al [7] who found proof for the effect of scale recalibration on valuations on the VAS. In contrast, other research, reported that similar differences on the VAS should be explained by recall bias [24]. It seems preliminary to conclude that the change on the VAS is fully caused by scale recalibration or by recall bias. When a then-test approach is used, recall bias and response shift are almost impossible to disentangle in the field of QoL [2931]. In the field of education, the differences between recall bias and response shift can be disentangled since in this field subjective ratings can be compared by objective tests measuring the change in knowledge [32]. Another explanation for our findings is that patients in this study were simply over optimistic about their recovery, possibly based on their theories of stability and change [33]. However, if patients were overly optimistic about their recovery, or if the findings were led by recall bias, we would have expected to find similar results on the TTO.

Regarding to the TTO, patients at the onset of a disability were able to predict their own ability to adapt; they accurately predicted their valuations on the TTO for their future health states and recalled their previous health. Patients were able to estimate correctly their increase in QoL in the first part of their rehabilitation trajectory and the lack of increase in the second part of their rehabilitation trajectory. Previously it was found that during rehabilitation patients with SCI do adapt to their situation both physically as psychologically [34]. In this study we find that at the group level patients accurately estimate the effect of physical and psychological adaptation on their future QoL, where the concept 'adaptation’ refers to aspects such as learning how to handle a wheelchair, finding new hobbies and learn how to live with the new situation, reflecting reprioritization and reconceptualization in terms of quality of life measurement. Compared to a population without any experience of a disability [35], only even limited experience with a disability seems to be sufficient for accurate predictions on a TTO. Even shortly after injury, patients adapt to their disability [34, 36]. The insight into their ability to adapt may have enabled them to anticipate accurately. In contrast, people without any experience of a disability find it difficult to incorporate their ability to adapt [35] even after an exercise making them aware of this adaptation [37]. Obviously, these conclusions are restricted to our findings at the group level.

Utility assessment is only used at the group level, generally for cost-effectiveness analyses, but also for calculation of quality-adjusted life years for guideline development. The methods have insufficient reliability for individual level decision making [21]. Therefore, our main interest lies with the paired comparisons. Most response shift research is based on findings at a group level [3841] although some researchers have examined individual changes [42, 43] The findings at the individual level show that only between 37% and 45% of the respondents were accurate in their predictions. These findings are in line with previous research, McPhail and Haines found that 40% of their respondents made correct predictions [38]. In contrast to the group level data, at the individual patient level the accuracy of the predictions was similar for the TTO (39-45%) and the VAS (37-41%) when we made a restriction up to a difference of 0.100. However, the Bland-Altman plots show that the deviation of the accuracy is larger for the TTO then the VAS. That is, the accuracy of 95% of the respondents was lower in the TTO then in the VAS. The distributions of the errors were uneven for the VAS, though, leading to a difference at the group level. On the VAS, predictions of future health were more often higher than the original valuations and predictions of recalled health were more often lower than the original valuations. This is also in line with what can be expected when ratings are influenced by scale recalibration. After a stressful experience people tend to change their internal standards [6]. If patients with SCI have changed their standards at post-test their valuation of their QoL on the then-test will be lower than the original valuation at pre-test [41]. So despite the poor accuracy at the individual level, the differences in results between the TTO and the VAS seem to point to scale recalibration. We urge others, however, to perform more research on predictions and recall using TTO and VAS, for a proper interpretation of these findings.

The results at the group level in this study are retrieved by using the then-test and the results at the individual level were based on Bland-Altman plots and percentages accuracy. These methods are limited, besides the effect of bias [31] such as recall bias [22] a then test is also influenced by an order effect [44]. To further investigate anticipated adaptation and the effect of scale recalibration on the TTO and the VAS statistical methods should be used in which group and individual effects are analyzed simultaneously [8, 45]. Herefore, the guidelines to investigate response shift as described by Schwartz et al. [8] could be helpful.

Without objective valuations or a so-called 'gold standard’, our findings cannot implicate that valuations given on the TTO are more accurate than valuations given on the VAS. The only implication that may follow from our findings is, that for valuations over time and when subject samples are compared, the VAS seems vulnerable to scale recalibration at the group level [22]. This might lead to less consistent comparisons when using the VAS than when using the TTO. However, the TTO contains other challenges. For instance, people with low numeracy have difficulty answering the TTO [46]. Our data also show that the TTO has more variance between respondents compared to the VAS. This variance directly influences the power of our analyses. The power in both methods is low, due to the rather small sample size of this study. This small sample size is mostly due to practical reasons. First of all, the prevalence of acute SCI is low, but more importantly, we used a longitudinal design. Over time, 16 patients dropped out. It is important to replicate these findings in a larger sample.

In this study, patients with SCI were interviewed shortly after their injury took place. During this period patients are going through an intensive rehabilitation process. Given the rehabilitation process, patients were interviewed in a rehabilitation center. Naturally, this caused some limitation to their ratings. Although patients reported they were able to assess their physical, social and psychological functioning in this unusual situation, the data cannot be generalized to other patient samples. Besides the setting, the order of health state valuations might have influenced our findings as well [47]. Patients first rated their current health, followed by a valuation of their future health (or in the last interview of their previous health). Their current health might have had influence on how the patients have rated their future or previous health. Given that patients improved over time, their ratings of their actual health at post-test will have been higher compared to their then-ratings, which might have negatively influenced the latter. Investigating aspects of response shift among patients with a disability might seem questionable. One might assume that patients with a disability find a certain balance after going through rehabilitation [48]. However, we examined the aspects of response shift in the subinitial phase after injury. From a related study we know that these patients do go through physical as well as psychological adaptation [34]. Moreover it is suggested that response shift is an important aspect of rehabilitation [49]. In his paper van Rijn [49] explains the effect of response shift on the way people adequately adapt to their spinal cord injury.

In this study we assumed that differences in results between TTO and VAS could be attributed to scale recalibration in the VAS. However, the assumption that the TTO is less vulnerable to scale recalibration is only theoretical, we do not have empirical evidence. It might be possible that other underlying causes led these findings. Future research should further examine the effect of scale recalibration on the TTO. It might be possible that other underlying causes led these findings.


We tentatively conclude that the health state valuations given on the TTO reveal that patients with SCI in the subinitial phase of their rehabilitation accurately anticipate their adaptation. They accurately predict the effect of change in values, goals, activities and learning new skills, such as in using a wheelchair, on their QoL. Valuations given on the VAS seem biased by scale recalibration, although we need to remind the reader that the results are preliminary. Theoretically, this paper assumes that the three aspects of response shift can be disentangled, with reconceptualization en reprioritization as aspects of adaptation and scale recalibration as measurement bias. However this point of view has been discussed previously [7, 1114] and needs further empirical and theoretical investigation.



Time trade-off


Visual analogue scale


Quality of life


Rehabilitation center


Spinal cord injury.


  1. Peeters Y, Stiggelbout AM: Health state valuations of patients and the general public analytically compared: a meta-analytical comparison of patient and population health state utilities. Value Health 2010, 13: 306–309. 10.1111/j.1524-4733.2009.00610.x

    Article  PubMed  Google Scholar 

  2. Peeters Y, Vliet Vlieland TP, Stiggelbout AM: Focusing illusion, adaptation and EQ-5D health state descriptions: the difference between patients and public. Health Expect 2012,15(4):367–378. 10.1111/j.1369-7625.2011.00667.x

    Article  PubMed  Google Scholar 

  3. Ubel PA, Loewenstein G, Jepson C: Whose quality of life? a commentary exploring discrepancies between health state evaluations of patients and the general public. Qual Life Res 2003, 12: 599–607. 10.1023/A:1025119931010

    Article  PubMed  Google Scholar 

  4. Gilbert DT, Pinel EC, Wilson TD, Blumberg SJ, Wheatley TP: Immune neglect: a source of durability bias in affective forecasting. J Pers Soc Psychol 1998, 75: 617–638.

    Article  CAS  PubMed  Google Scholar 

  5. Golembiewski RT, Billingsley K, Yeager S: Measuring change and persistence in human affairs - types of change generated by Od designs. J App Behav Sci 1976, 12: 133–157. 10.1177/002188637601200201

    Article  Google Scholar 

  6. Sprangers MA, Schwartz CE: Integrating response shift into health-related quality of life research: a theoretical model. Soc Sci Med 1999, 48: 1507–1515. 10.1016/S0277-9536(99)00045-3

    Article  CAS  PubMed  Google Scholar 

  7. Lacey HP, Loewenstein G, Ubel PA: Compared to what? a joint evaluation method for assessing quality of life. Qual Life Res 2011,20(8):1169–1177. 10.1007/s11136-011-9856-0

    Article  PubMed  Google Scholar 

  8. Schwartz CE, Ahmed S, Sawatzky R, Sajobi T, Mayo N, Finkelstein J, Lix L, Verdam MG, Oort FJ, Sprangers MA: Guidelines for secondary analysis in search of response shift. Qual Life Res 2013. In press

    Google Scholar 

  9. Livneh H, Parker RM: Psychological adaptation to disability: perspectives from chaos and complexity theory. Rehabil Coun Bull 2005, 49: 17–28. 10.1177/00343552050490010301

    Article  Google Scholar 

  10. Rochette A, Tribble DS, Desrosiers J, Bravo G, Bourget A: Adaptation and coping following a first stroke: a qualitative analysis of a phenomenological orientation. Int J Rehabil Res 2006, 29: 247–249. 10.1097/01.mrr.0000230051.24228.c8

    Article  PubMed  Google Scholar 

  11. Westerman MJ, Hak T, Sprangers MA, Groen HJ, van der Wal G, The AM: Listen to their answers! response behaviour in the measurement of physical and role functioning. Qual Life Res 2008, 17: 549–558. 10.1007/s11136-008-9333-6

    Article  PubMed Central  PubMed  Google Scholar 

  12. Ubel PA, Peeters Y, Smith DM: Abandoning the language of “response shift”: a plea for conceptual clarity in distinguishing scale recalibration from true changes in quality of life. Qual Life Res 2010, 19: 465–471. 10.1007/s11136-010-9592-x

    Article  PubMed  Google Scholar 

  13. Sprangers MAG, Schwartz CE: Do not throw out the baby with the bath water: build on current approaches to realize conceptual clarity. Response to ubel, peeters, and smith. Qual Life Res 2010, 19: 477–479. 10.1007/s11136-010-9611-y

    Article  PubMed Central  PubMed  Google Scholar 

  14. Reeve BB: An opportunity to refine our understanding of “response shift” and to educate researchers on designing quality research studies: response to ubel, peeters, and smith. Qual Life Res 2010, 19: 473–475. 10.1007/s11136-010-9612-x

    Article  PubMed  Google Scholar 

  15. Ubel PA, Smith DM: Why should changing the bathwater have to harm the baby? Qual Life Res 2010, 19: 481–482. 10.1007/s11136-010-9613-9

    Article  PubMed  Google Scholar 

  16. Eton DT: Why we need response shift: an appeal to functionalism. Qual Life Res 2010, 19: 929–930. 10.1007/s11136-010-9684-7

    Article  PubMed  Google Scholar 

  17. Stiggelbout AM, De Haes JC: Patient preference for cancer therapy: an overview of measurement approaches. J Clin Oncol 2001, 19: 220–230.

    CAS  PubMed  Google Scholar 

  18. Dolan P, Gudex C, Kind P, Williams A: A social tariff for EuroQol: results from a UK general population survey. Discussion paper 138. York: The University of York, centre for health economics York health economics consorium NHS centre for reviews & dissemination; 1995.

    Google Scholar 

  19. van Osch SMC, Stiggelbout AM: Understanding VAS valuations: qualitative data on the cognitive process. Qual Life Res 2005, 14: 2171–2175. 10.1007/s11136-005-6808-6

    Article  PubMed  Google Scholar 

  20. von Neumann J, Morgenstern O: Theory of games and economic behavior. 1st edition. Princeton: Princeton University Press; 1947.

    Google Scholar 

  21. Torrance GW: Social preferences for health states - empirical evaluation of 3 measurement techniques. Socioecon Plann Sci 1976, 10: 129–136. 10.1016/0038-0121(76)90036-7

    Article  Google Scholar 

  22. Schwartz CE, Sprangers MA: Methodological approaches for assessing response shift in longitudinal health-related quality-of-life research. Soc Sci Med 1999, 48: 1531–1548. 10.1016/S0277-9536(99)00047-7

    Article  CAS  PubMed  Google Scholar 

  23. Wyrwich KW, Norquist JM, Lenderking WR, Acaster S: Methods for interpreting change over time in patient-reported outcome measures. Qual Life Res 2013, 22: 475–483. 10.1007/s11136-012-0175-x

    Article  CAS  PubMed  Google Scholar 

  24. van Leeuwen CMC, Post MWM, van der Woude LHV, de Groot S, Smit C, van Kuppevelt D, Lindeman E: Changes in life satisfaction in persons with spinal cord injury during and after inpatient rehabilitation: adaptation or measurement bias? Qual Life Res 2012, 21: 1499–1508. 10.1007/s11136-011-0073-7

    Article  PubMed Central  PubMed  Google Scholar 

  25. Ditunno PL, Patrick M, Stineman M, Ditunno JF: Who wants to walk? Preferences for recovery after SCI: a longitudinal and cross-sectional study. Spinal Cord 2008, 46: 500–506. 10.1038/

    Article  CAS  PubMed  Google Scholar 

  26. Centraal Bureau voor de statistiek: Overlevingstafels naar leeftijd en geslacht 2005. 2007.

    Google Scholar 

  27. Lenert LA, Sturley A, Watson ME: iMPACT3: Internet-based development and administration of utility elicitation protocols. Med Decis Making 2002, 22: 464–474. 10.1177/0272989X02238296

    Article  CAS  PubMed  Google Scholar 

  28. Tijhuis GJ, Jansen SJT, Stiggelbout AM, Zwinderman AH, Hazes JMW, Vlieland TPMV: Value of the time trade off method for measuring utilities in patients with rheumatoid arthritis. Ann Rheum Dis 2000, 59: 892–897. 10.1136/ard.59.11.892

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  29. Visser MRM, Smets EMA, Sprangers MAG, De Haes HJCJ: How response shift may affect the measurement of change in fatigue. J Pain Symptom Manage 2000, 20: 12–18. 10.1016/S0885-3924(00)00148-2

    Article  CAS  PubMed  Google Scholar 

  30. Schwartz CE, Bode R, Repucci N, Becker J, Sprangers MAG, Fayers PM: The clinical significance of adaptation to changing health: a meta-analysis of response shift. Qual Life Res 2006, 15: 1533–1550. 10.1007/s11136-006-0025-9

    Article  PubMed  Google Scholar 

  31. Nolte S, Elsworth GR, Sinclair AJ, Osborne RH: Tests of measurement invariance failed to support the application of the “then-test”. J Clin Epidemiol 2009, 62: 1173–1180. 10.1016/j.jclinepi.2009.01.021

    Article  PubMed  Google Scholar 

  32. Howard GS, Schmeck RR, Bray JH: Internal invalidity in studies employing self-report instruments - suggested remedy. J Educ Meas 1979, 16: 129–135. 10.1111/j.1745-3984.1979.tb00094.x

    Article  Google Scholar 

  33. Ross M: Relation of implicit theories to the construction of personal histories. Psychol Rev 1989, 96: 341–357.

    Article  Google Scholar 

  34. Edelaar-Peeters Y, Putter H, Snoek GJ, Sluis TAR, Smit CAJ, Post MWM, Stiggelbout AM: The influence of time and adaptation on health state valuations in patients with spinal cord injury. Med Decis Making 2012, 32: 805–814. 10.1177/0272989X12447238

    Article  PubMed  Google Scholar 

  35. Wilson TD, Gilbert DT: Affective forecasting - knowing what to want. Curr Dir Psychol Sci 2005, 14: 131–134. 10.1111/j.0963-7214.2005.00355.x

    Article  Google Scholar 

  36. van Koppenhagen CF, Post MW, van der Woude LH, de Groot S, de Witte LP, van Asbeck FW, HW V d, Lindeman E: Recovery of life satisfaction in persons with spinal cord injury during inpatient rehabilitation. Am J Phys Med Rehabil 2009, 88: 887–895. 10.1097/PHM.0b013e3181b71afe

    Article  PubMed  Google Scholar 

  37. Ubel PA, Loewenstein G, Jepson C: Disability and sunshine: can hedonic predictions be improved by drawing attention to focusing illusions or emotional adaptation? J Exp Psychol Appl 2005, 11: 111–123.

    Article  PubMed  Google Scholar 

  38. McPhail S, Haines T: Patients undergoing subacute rehabilitation have accurate expectations of their health-related quality of life at discharge. Health Qual Life Outcomes 2012, 10: 94. 10.1186/1477-7525-10-94

    Article  PubMed Central  PubMed  Google Scholar 

  39. Balain B, Ennis O, Kanes G, Singhal R, Roberts SN, Rees D, Kuiper JH: Response shift in self-reported functional scores after knee microfracture for full thickness cartilage lesions. Osteoarthritis Cartilage 2009, 17: 1009–1013. 10.1016/j.joca.2009.02.007

    Article  CAS  PubMed  Google Scholar 

  40. Korfage IJ, de Koning HJ, Habbema JDF, Schroder FH, Essink-Bot ML: Side-effects of treatment for localized prostate cancer: are they valued differently by patients and healthy controls? Bju International 2007, 99: 801–806. 10.1111/j.1464-410X.2006.06707.x

    Article  PubMed  Google Scholar 

  41. King-Kallimanis BL, Oort FJ, Visser MR, Sprangers MA: Structural equation modeling of health-related quality-of-life data illustrates the measurement and conceptual perspectives on response shift. J Clin Epidemiol 2009, 62: 1157–1164. 10.1016/j.jclinepi.2009.04.004

    Article  CAS  PubMed  Google Scholar 

  42. McPhail S, Haines T: Response shift, recall bias and their effect on measuring change in health-related quality of life amongst older hospital patients. Health Qual Life Outcomes 2010, 8: 65. 10.1186/1477-7525-8-65

    Article  PubMed Central  PubMed  Google Scholar 

  43. Hinz A, Finck BC, Zenger M, Singer S, Schwalenberg T, Stolzenburg JU: Response shift in the assessment of anxiety, depression and perceived health in urologic cancer patients: an individual perspective. Eur J Cancer Care (Engl ) 2011, 20: 601–609. 10.1111/j.1365-2354.2011.01256.x

    Article  CAS  Google Scholar 

  44. Nolte S, Elsworth GR, Sinclair AJ, Osborne RH: The inclusion of 'then-test’ questions in post-test questionnaires alters post-test responses: a randomized study of bias in health program evaluation. Qual Life Res 2012, 21: 487–494. 10.1007/s11136-011-9952-1

    Article  PubMed  Google Scholar 

  45. Oort FJ: Using structural equation modeling to detect response shifts and true change. Qual Life Res 2005, 14: 587–598. 10.1007/s11136-004-0830-y

    Article  PubMed  Google Scholar 

  46. Woloshin S, Schwartz LM, Moncur M, Gabriel S, Tosteson ANA: Assessing values for health: numeracy matters. Med Decis Making 2001, 21: 382–390. 10.1177/0272989X0102100505

    Article  CAS  PubMed  Google Scholar 

  47. Lacey HP, Fagerlin A, Loewenstein G, Smith D, Riis J, Ubel PA: It must be awful for them: perspective and task context affects ratings for health conditions. Judgement Decision Making 2007, 1: 146–152.

    Google Scholar 

  48. Schwartz CE, Andresen EM, Nosek MA, Krahn GL: Response shift theory: important implications for measuring quality of life in people with disability. Arch Phys Med Rehabil 2007, 88: 529–536. 10.1016/j.apmr.2006.12.032

    Article  PubMed  Google Scholar 

  49. Van Rijn T: A physiatrist’s view of response shift. J Clin Epidemiol 2009, 62: 1191–1195. 10.1016/j.jclinepi.2009.01.023

    Article  PubMed  Google Scholar 

Download references


The authors thank the patients participating in this study and Nanny van Duijn for her help in collecting the data. They especially thank Prof. dr. P.A. Ubel for his valuable comments on an earlier draft of this article.

Y. Edelaar-Peeters and A.M. Stiggelbout were entirely supported by a VIDI-award of the Netherlands Organization for Scientific Research NWO Innovational Research Incentives (grant number 917.56.356).

Author information

Authors and Affiliations


Corresponding author

Correspondence to Yvette Edelaar-Peeters.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

YEP participated in the design of the study, collected the data, performed the statistical analyses, interpreted the findings and wrote the first draft of the manuscript. AMS participated in the design of the study, interpreted the findings and helped to draft the manuscript. Both authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Edelaar-Peeters, Y., Stiggelbout, A.M. Anticipated adaptation or scale recalibration?. Health Qual Life Outcomes 11, 171 (2013).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: