Assessment of retrospective collection of EQ-5D-5L in a US COVID-19 population
Health and Quality of Life Outcomes volume 21, Article number: 103 (2023)
It is imperative to evaluate health related quality of life (HRQoL) pre-COVID-19, but there is currently no evidence of the retrospective application of the EuroQol 5-Dimension, 5 level version (EQ-5D-5L) for COVID-19 studies.
Symptomatic patients with SARS-CoV-2 at CVS Health US test sites were recruited between 01/31/2022-04/30/2022. Consented participants completed the EQ-5D-5L questionnaire twice: a modified version where all the questions were past tense to retrospectively assess pre-COVID-19 baseline QoL, and the standard version in present tense to assess current HRQoL. Duncan’s new multiple range test was adopted for post analysis of variance pairwise comparisons of EQ visual analog scale (EQ VAS) means between problem levels for each of 5 domains. A linear mixed model was applied to check whether the relationship between EQ VAS and utility index (UI) was consistent pre-COVID-19 and during COVID-19. Matching-adjusted indirect comparison was used to compare pre-COVID-19 UI and EQ VAS scores with those of the US population. Lastly, Cohen’s d was used to quantify the magnitude of difference in means between two groups.
Of 676 participants, 10.2% were age 65 or more years old, 73.2% female and 71.9% white. Diabetes was reported by 4.7% participants and hypertension by 11.2%. The estimated coefficient for the interaction of UI-by-retrospective collection indicator (0 = standard prospective collection, 1 = retrospective for pre-COVID-19), -4.2 (SE: 3.2), P = 0.197, indicates that retrospective collection does not significantly alter the relationship between EQ VAS and UI. After adjusting for age, gender, diabetes, hypertension, and percent of mobility problems, the predicted means of pre-COVID-19 baseline EQ VAS and UI were 84.6 and 0.866, respectively. Both means were close to published US population norms (80.4 and 0.851) compared to those observed (87.4 and 0.924). After adjusting for age, gender, diabetes, and hypertension, the calculated ES between pre-COVID-19 and COVID-19 for UI and EQ VAS were 0.15 and 0.39, respectively. Without retrospectively collected EQ-5D-5L, using US population norms tended to underestimate the impact of COVID-19 on HRQoL.
At a group level the retrospectively collected pre-COVID-19 EQ-5D-5L is adequate and makes it possible to directly evaluate the impact of COVID-19 on HRQoL. (ClinicalTrials.gov NCT05160636)
The impact of COVID-19 extends beyond clinical outcomes. To holistically understand the burden of COVID-19, it is important to measure its impact on patients’ quality of life. The EuroQoL Group 5 dimension and 5 levels (EQ-5D-5L) instrument  is an internationally validated questionnaire that is widely used for measuring health-related quality of life (HRQoL) and deriving utilities for estimation of Quality Adjusted Life Years (QALYs). Retrospective utilization of the EQ-5D-5L to assess pre-event status is limited, but the feasibility and validity of its retrospective collection (EQ-5D-5L) to assess past HRQoL was investigated in prior studies [2, 3]. In Lawson et al. (2020) , EQ-5D-5L was collected prospectively in patients 2 weeks prior to their date of elective hip or knee arthroplasty surgery and then retrospectively collected following their operation to recall their pre-operative health status. At a group level the agreement was high between prospective and retrospective measurements indicating retrospective collection could be valid in orthopedic clinical context.
Another study by Rajan et al. (2021)  utilized the EQ-5D-5L prospectively among patients with a stroke at 3 months post-discharge and compared those results with the retrospective collection of their 3rd month EQ-5D-5L at 6th, 9th, or 12th month after hospital discharge. Considerable agreement was observed in mobility, self-care, and usual activities dimensions of EQ-5D-5L based on weighted kappa (range 0.72–0.95), and concordance was good to excellent for utility score based on intraclass correlation coefficients (range 0.79–0.81). The authors concluded that retrospective collection of EQ-5D-5L could be a valid alternative for assessing morbid health status.
Retrospective and prospective collection of EQ-5D-5L have been used in infectious disease PRO studies to establish the baseline pre-infection health status and the morbid health status over the course of the infection [4, 5]. However, the validity of retrospective collection of EQ-5D-5L was not assessed in those studies. A recent study utilizing retrospective and prospective collection of EQ-5D-5L was used in a PRO study of US symptomatic outpatients with positive reverse transcription–polymerase chain reaction for SARS-CoV-2. EQ-5D-5L was used to establish, respectively, the baseline pre-COVID-19 health status and the health status at week 1 and week 4 following infection. Subsequently, the pre-infection and COVID-19 health statuses were compared to evaluate the impact of COVID-19 on HRQoL . This subsequent methodology study utilizing the PRO data collected in the COVID-19 study tests the validity of the use of retrospective collection of EQ-5D-5L for establishing pre-infection status for the first time for COVID-19-related studies.
In the COVID-19 PRO study , participants were recruited between 01/31/2022 and 04/30/2022 among US adult outpatients with ≥ 1 self-reported symptom and positive reverse transcription–polymerase chain reaction test for SARS-CoV-2 at CVS Health test sites (ClinicalTrials.gov NCT05160636). The analytic population for the Di Fusco et al. study was limited to participants unvaccinated or receiving the BNT162b2 COVID-19 vaccine only. To assess the validity of the retrospective collection methodology, all patients that were enrolled were included .
EQ-5D-5L addresses quality of life across five dimensions (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression) and five levels (no problems, slight problems, moderate problems, severe problems, extreme problems/unable). These five domains were converted into the Utility Index (UI) using the US-based weights . Moreover, it addresses a general assessment of health, the EQ visual analog scale (EQ VAS), via a 101-point Visual Analog Scale (EQ VAS) ranging from 0 = “Worst imaginable health state” to 100 = “Best imaginable health state” . On the day of enrollment (~ day 3 after testing positive to SARS-CoV-2) consented participants completed the EQ-5D-5L questionnaire twice a modified version where all the questions were past tense to retrospectively assess pre-COVID-19 baseline HRQoL and, separately, the standard version in present tense to assess current HRQoL. These two versions were administered consecutively with no washout or break between, and in random order to balance the potential responder bias due to the order of administration.
To support the validity of retrospectively collected EQ-5D-5L, the following assumptions were utilized: firstly, lower EQ VAS is associated with higher problem levels within each dimension of EQ-5D-5L. Secondly, our pre-COVID-19 cohort can be considered as a sample from the general US population. After adjusting for major distributional differences, mean EQ VAS should be close to the population norm. Thirdly, retrospective collection does not modify the relationship between EQ VAS and UI. Last, retrospectively collected EQ VAS is more appropriate than the population norm when evaluating COVID-19’s impact on HRQoL.
Categorical variables were described by using frequency and percentages. Continuous variables were described by using means and standard deviations. The analysis of variance (ANOVA) was used to test differences in means between different problem levels within each domain of EQ-5D-5L. Duncan’s new multiple range test was adopted for pairwise comparisons post ANOVA [8, 9], which may evaluate the association between EQ VAS and problem levels.
A linear mixed model was used to characterize the relationship between EQ VAS and UI . The response variable was EQ VAS. The explanatory variables included UI, the indicator for retrospective assessment variable RETRO (1 = retrospective collection for pre-COVID-19, 0 = standard collection for day 3 after COVID-19 testing) and its interaction with UI. The coefficient of the interaction term reflects the magnitude of relationship altered by retrospective assessment between EQ VAS and UI. Random intercepts were incorporated to account for the clusters of assessments within participants.
Matching-adjusted indirect comparison (MAIC)  was used to compare pre-COVID-19 UI and EQ VAS scores with those of the US population. To do that, weights for individuals in our sample were estimated so that weighted percentage of selected patient characteristics matched those published. Then two-sample t-tests were used to test differences between weighted EQ VAS or UI with those in the US population, as reported in the Jiang et al. (2021) face-to-face sample .
The effect size (ES), Cohen’s d, was calculated to assess the magnitude of difference in means between two groups [13, 14]. Specifically, when comparing COVID-19 to pre-COVID-19 baseline, the ES was calculated as mean score differences divided by the standard deviation of score difference. When comparing the COVID-19 to US norm, the ES was calculated as the difference in mean COVID-19 score and US norm, divided by the pooled standard deviation of scores. Values of 0.2, 0.5, and 0.8 standard deviation (SD) units represent small, medium, and large ES, respectively. These cut-off estimates have been widely used to establish important differences in HRQoL studies .
Among 676 participants submitting responses to the EQ-5D-5L questionnaire, missingness was recorded in 7 instances only: 2 participants did not answer the EQ VAS component in the EQ-5D-5L retrospective version and 5 participants missed the EQ VAS component in the standard version. There were no missing data for utility index scores. The impact of missing data was considered minimum, so the analyses were all based on available data.
Of 676 participants, 10.2% were age 65 or older, 73.2% female and 71.9% white. Asthma or chronic lung disease, diabetes, and hypertension were reported by 8.6%, 4.7%, and 11.2% participants, respectively. (Table 1) Descriptive statistics are presented in Supplemental Table 1.
As shown in Fig. 1, worse EQ VAS was associated with a higher level of severity in each EQ-5D-5L domain, for both retrospectively collected pre-COVID-19 baseline and standard collection for day 3 after COVID-19 positive testing. A detailed summary of EQ VAS by EQ-5D-5L domain level for retrospective collection for pre-COVID-19 and standard collection for COVID-19, as well as the pairwise comparisons between levels of each domain, are presented in Supplemental Table 2. For pre-COVID-19 mobility, mean EQ VAS scores were 88.5, 75.9, 57.7 and 50.0 for those reporting no problem, slight problem, moderate problem, and severe problem, respectively. No participants reported ‘unable’ for mobility. According to Duncan’s tests, mean EQ VAS was found to be significantly different among no problem, slight problem, and moderate/severe problem categories. In addition, a summary of EQ VAS by health states with a sample size greater than 10 is presented in Supplemental Table 3.
Compared with the US general population , the current cohort was predominantly female, white, and reported fewer chronic conditions. The pre-COVID-19 baseline mean utility index (UI) of 0.924 and mean EQ VAS of 87.4 were higher than those in the US population (0.851 and 80.4, respectively). In addition, for all five domains, less problems were reported, which were prevalent in 7.1%, 6.7%, 7.9%, 27.3% and 43.5% of the current study cohort versus 28.4%, 6.5%, 24.7%, 51.0%, and 38.4% in Jiang et al. (2021)  for mobility, self-care, usual activity, pain/discomfort and anxiety/depression, respectively.
For the model predicting EQ VAS by using UI, RETRO and their interaction, the estimated coefficient of UI-by-RETRO interaction term was − 4.2 (SE: 3.2), P = 0.197, which indicated that the magnitude of relationship between EQ VAS and UI was not significantly altered by retrospective assessment.
To compare with the US population norms for the EQ-5D-5L , we matched percentages of age category, gender, diabetes, and hypertension in our sample to US population norms, age category (31.6% for 18–34, 48.7% for 35–64 and 19.7% for ≥ 65 years), gender (49.7% for male and 49.8 for female), diabetes (9.8%), and hypertension (23.8%), by using the MAIC approach . After weighting, the effective sample size reduced to 440. The weighted mean of UI and EQ VAS for pre-COVID-19 baseline decreased slightly to 87.0 and 0.922, respectively. Both were significantly higher than US population norms, P < 0.001. When percent of problems with mobility was added, the effective sample size reduced to 291. The weighted mean of UI for pre-COVID-19 baseline became 0.866 (SD: 0.176), which is not statistically different from the US population norm 0.851 (P = 0.253). The weighted mean of EQ VAS was 84.6 (SD:12.8) and higher than the US population norm of 80.4 (P < 0.001). (Table 2)
At approximately day 3 after COVID-19 positive testing, UI and EQ VAS were 0.808 and 73.3, respectively. When compared with baseline assessment, Cohen’s d for UI and EQ VAS were 0.68 and 1.01, a medium-to-large impact on UI and a large impact on EQ VAS, respectively. When compared with US population norms , Cohen’s d for UI and EQ VAS were 0.21 and 0.44, a small impact on UI and a small-to-medium impact on EQ VAS, respectively. However, the percentage of problems with mobility, 17.2%, of COVID-19 cohort was significantly lower than 25.2% of US population , P < 0.001. After matching on percentages of age category (18–29, 30–64 and ≥ 65 years), gender, diabetes, and hypertension, 19.0% with mobility problems was still significantly lower than 25.2%, P < 0.001. The calculated ES for UI and EQ VAS became 0.15 and 0.39, respectively. (Table 2)
In this current study implementing retrospective collection of EQ-5D-5L data, EQ VAS generally declined along with more severe levels in each of 5 domains as expected. Strong empirical evidence indicates that retrospective collection did not materially alter the relationship between EQ VAS and UI. Our current study population was predominantly female, white, and reported fewer impairments on the EQ VAS than the general US population. After adjusting for demographic variables, chronic conditions, and percentage of mobility problems, mean scores of EQ VAS and UI approached those of US population norms. As with related studies which demonstrate the validity of retrospective collection of EQ-5D-5L through direct comparisons with standard collection [2, 3], results from the current study provide evidence to support the validity of retrospective EQ-5D-5L collection when it is not feasible to collect this information prospectively. To our knowledge, however, this is the first study of its kind in an adult outpatient COVID-19 cohort.
The pre-COVID-19 adjusted mean EQ VAS of 84.6 is the same as that reported for the US general population in Cha et al. (2019), the 2017 sample of EQ-5D-3 L sub-study , but still higher than that reported by the face-to-face sample in Jiang et al. (2021) . This difference may be due to under-adjustment for participants’ characteristics and possible bias in retrospective assessment. Another possibility is due to the nature of data collection in our study (online). Jiang et al. (2021)  reported a mean 3.0-point lower in online respondents than the face-to-face respondents. However, differences in administration mode cannot explain the higher adjusted value in our study.
While useful to adjust for differences in participants’ characteristics between study cohort and the cohort deriving US population norms, there are limitations with MAIC. In the current study chronic conditions were reported less than US general population and Jiang et al. (2021) . Nevertheless, conditions that are either not commonly available in both studies or not consistently collected cannot be adjusted, which resulted in adjustment for diabetes and hypertension only. The addition of retrospectively collected problems with mobility for pre-COVID-19 in MAIC brought EQ VAS and UI much closer to US population norms.
When it is not feasible to collect prospective measurements on EQ-5D-5L, as well as EQ VAS, the current evidence suggests that it reasonable to consider retrospective assessment under the situation found in our study, including the short recall period (only a few days) for the retrospective assessment. Especially for mobility, there is high agreement between retrospective collection and standard collection [2, 3]. High percentage of no reported mobility problems is in alignment with an ambulatory cohort comprising of individuals able to walk into a store for COVID-19 testing. Even so, we did not attempt to use problem levels where there were either too few to be reliably adjusted or we were unable to adjust for the level not observed. (Supplemental Table 2)
The retrospective assessment of EQ VAS for pre-COVID-19 may either be recalled directly, or can be reconstructed based on the participant’s current state of COVID-19 and the assumptions of change from pre-COVID-19 to COVID-19. The recall bias occurs because of errors and distortions in the recollection of pre-COVID-19 state as well as in the inferential process from pre-COVID-19 to COVID-19 . As shown in Rajan et al. (2021) , stroke patients tended to underestimate EQ VAS retrospectively, which supports the ‘present state effect’ [17, 18], that is, a person who feels well might think his status improved and therefore tends to underestimate his previous state. Conversely, a patient with COVID-19 who feels bad might think that her status worsened and therefore tends to overestimate her pre-COVID-19 status. As presented in Supplemental Tables 1, the mean EQ VAS score was 85.3 if administered first for pre-COVID-19, and 89.7 if after standard version.
Had EQ-5D-5L not been collected retrospectively, US population norms must be compared to evaluate COVID-19’s impact on HRQoL. The assumption is that COVID-19 causes more problems in mobility, self-care, usual activity, pain/discomfort and anxiety/depression, as well as overall health. However, our current cohort of patients with COVID-19 had less problems with mobility than the US general population, whether matching on a selected patient’s characteristics or not. On the other hand, retrospectively collected baseline percentage of problems with anxiety and depression, 43.5%, was higher than US population norm, 38.4%, based on the cohort pre-pandemic , which is consistent with World Health Organization report about elevated prevalence of anxiety and depression during COVID-19 pandemic . Comparisons of UI and EQ VAS of patients with COVID-19 with population norms cannot account for such differences, which leads to unreliable estimates of COVID-19’s impact. Last, population norms may not be available in the general population. Even for EQ-5D-5L, the norms were developed based on a quota-sampling of the US population, which may not provide a true norm as larger random samples can provide.
Of 39,889 candidates with email invitations, 676 patients consented to participate and completed online surveys. Comparisons between the study cohort and those not included in the study were presented in Supplemental Table 4. Patients with COVID-19 vaccination, 30 years or older, female, white and living in less vulnerable communities were more likely to respond to the study. Study participants were more likely to have one or more comorbidities but had similar percentages of acute COVID-19 symptoms as those not included in the study. Though nonresponse bias is a potential limitation in this study, researchers have shown little relationship between survey response rate and nonresponse bias [20, 21]; nevertheless, that may not be generalizable to self-reported HRQoL in the current setting.
In summary, limitations in this study include: the lack of a “true” comparator for pre COVID-19 HRQoL because one does not exist as of yet, potential bias in retrospective assessment, and unknown sources of nonresponse bias. Further research is needed to directly compare standard prospective assessment of pre-COVID-19 EQ-5D-5L with retrospective assessment. Results in this analysis should therefore be interpreted with caution.
At a group level, the retrospectively collected pre-COVID-19 EQ-5D-5L is adequate when compared with the US population norm and reasonably aligned when compared with standard collection of EQ-5D-5L for COVID-19. Retrospective collection of pre-COVID-19 EQ-5D-5L makes it possible to directly evaluate the impact of COVID-19 on health-related quality of life.
Aggregated data that support the findings of this study are available upon reasonable request from the corresponding author XS, subject to review. These data are not publicly available due to research participant privacy/consent considerations.
Analysis of variance
Health related quality of life
Matching-adjusted indirect comparison
- EQ VAS:
EuroQol visual analogue scale
EuroQol Research Foundation. EQ-5D-5L User Guide, Version 3.0. 2019 [Available from: https://euroqol.org/publications/user-guides.
Lawson A, Tan AC, Naylor J, Harris IA. Is retrospective assessment of health-related quality of life valid? BMC Musculoskelet Disord. 2020;21(1):1–10.
Rajan SS, Wang M, Singh N, Jacob AP, Parker SA, Czap AL, et al. Retrospectively collected EQ-5D-5L data as valid proxies for imputing missing information in Longitudinal Studies. Value Health. 2021;24(12):1720–7.
Barbut F, Galperine T, Vanhems P, Le Monnier A, Durand-Gasselin B, Canis F, et al. Quality of life and utility decrement associated with Clostridium difficile infection in a french hospital setting. Health Qual Life Outcomes. 2019;17(1):1–7.
Glick HA, Miyazaki T, Hirano K, Gonzalez E, Jodar L, Gessner BD, et al. One-year quality of Life Post-Pneumonia diagnosis in japanese adults. Clin Infect Dis. 2021;73(2):283–90.
Di Fusco M, Sun X, Moran MM, Coetzer H, Zamparo JM, Puzniak L, et al. Impact of COVID-19 and effects of BNT162b2 on patient-reported outcomes: quality of life, symptoms, and work productivity among US adult outpatients. J Patient-Reported Outcomes. 2022;6:123.
Pickard AS, Law EH, Jiang R, Pullenayegum E, Shaw JW, Xie F, et al. United States valuation of EQ-5D-5L health states using an international protocol. Value in Health. 2019;22(8):931–41.
Rosner B. Fundamentals of biostatistics. Eighth ed. Boston, MA: Cengage learning; 2015.
Duncan DB. T tests and intervals for comparisons suggested by the data. Biometrics. 1975:339–59.
Brown H, Prescott R. Applied mixed models in medicine. John Wiley & Sons; 2015.
Signorovitch JE, Sikirica V, Erder MH, Xie J, Lu M, Hodgkins PS, et al. Matching-adjusted indirect comparisons: a new tool for timely comparative effectiveness research. Value in Health. 2012;15(6):940–7.
Jiang R, Janssen MB, Pickard AS. US population norms for the EQ-5D-5L and comparison of norms from face-to-face and online samples. Qual Life Res. 2021;30(3):803–16.
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ.: Lawrence Erlbaum Assoc; 1988.
McLeod LD, Cappelleri JC, Hays RD. Best (but oft-forgotten) practices: expressing and interpreting associations and effect sizes in clinical outcome assessments. Am J Clin Nutr. 2016;103(3):685–93. Erratum: 2017; 105:241.
Mouelhi Y, Jouve E, Castelli C, Gentile S. How is the minimal clinically important difference established in health-related quality of life instruments? Review of anchors and methods. Health Qual Life Outcomes. 2020;18(1):1–17.
Cha AS, Law EH, Shaw JW, Pickard AS. A comparison of self-rated health using EQ-5D VAS in the United States in 2002 and 2017. Qual Life Res. 2019;28(11):3065–9.
Blome C, Augustin M. Measuring change in quality of life: bias in prospective and retrospective evaluation. Value in Health. 2015;18(1):110–5.
Kamper SJ, Ostelo RW, Knol DL, Maher CG, de Vet HC, Hancock MJ. Global Perceived Effect scales provided reliable assessments of health transition in people with musculoskeletal disorders, but ratings are strongly influenced by current status. Journal of clinical epidemiology. 2010;63(7):760-6. e1.
World Health Organization. Mental Health and COVID-19: Early evidence of the pandemic’s impact 2022 [Available from: https://www.who.int/publications/i/item/WHO-2019-nCoV-Sci_Brief-Mental_health-2022.1.
Hendra R, Hill A. Rethinking response rates: new evidence of little relationship between survey response rates and nonresponse bias. Eval Rev. 2019;43(5):307–30.
Beehr A, Kim T, Armstrong MW. I. How much do survey response rates affect relationships among variables? Int J Soc Res Methodol. 2022:1–24.
The authors acknowledge Alejandro Cane, Deepa Malhotra and Nancy Gifford (Pfizer employees), Joseph Ferenchick, Shiyu Lin and Shawn Edmonds (CVS Health employees) for specific contributions to this research project. Editorial support was provided by Laura Anatale-Tardiff and Leena Samuel at CVS Health and was funded by Pfizer.
This study was sponsored by Pfizer Inc.
Ethics approval and consent to participate
This study was approved by the Sterling IRB, Protocol #C4591034. Participation in the study was voluntary and anonymous. Consent was obtained electronically via the CVS Health E-Consent platform. Participants were informed of their right to refuse or withdraw from the study at any time. Participants were compensated for their time.
Consent for publication
All authors have given their approval for this manuscript version to be published.
MDF, LP, JMZ and JCC are employees of Pfizer and may hold stock or stock options of Pfizer. XS and HC are employees of CVS Health and may hold stock of CVS health. YPT was employee of CVS Health when current study was conducted.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
About this article
Cite this article
Sun, X., Di Fusco, M., Puzniak, L. et al. Assessment of retrospective collection of EQ-5D-5L in a US COVID-19 population. Health Qual Life Outcomes 21, 103 (2023). https://doi.org/10.1186/s12955-023-02187-x