- Open Access
Development of a rapid point-of-care patient reported outcome measure for cataract surgery in India
Health and Quality of Life Outcomesvolume 16, Article number: 25 (2018)
For patient undergoing cataract surgery in India, existing patient-reported outcome (PRO) measures are either not culturally relevant, have not been adequately validated, or are too long to be used in a busy clinical setting. We sought to develop and validate a brief and culturally relevant point-of-care PRO measure to address this need.
Twelve items from the Indian Visual Functioning Questionnaire (IND-VFQ) were selected based on preliminary data. Patients 18 years and older were prospectively recruited at Aravind Eye Care System in Madurai, India. Clinical and sociodemographic data were collected and the 12-item short-form IND-VFQ (SF-IND-VFQ) was administered pre- and post-operatively to 225 patients; Factor analysis and Rasch modeling was performed to assess its psychometric properties.
One item that did not fit a unidimensional scale and had poor fit with the Rasch model was eliminated from the questionnaire. The remaining 11 items represented a single construct (no residual correlations> 0.1) and were largely unaffected by differential item functioning. Five items had disordered thresholds resolved by collapsing the response scale from four to three categories. The survey had adequate reliability (0.80) and good construct (infit range, 0.77–1.29; outfit range, 0.56–1.30) and content (item separation index, 5.87 logits) validity. Measurement precision was fair (person separation index, 1.97). There was evidence that items were not optimally targeted to patients’ visual ability (preoperatively, − 1.92 logits; overall, − 3.41 logits), though the survey measured a very large effect (Cohen’s d 1.80). In a subset of patients, the average time to complete the questionnaire was 2 min 6.3 s.
The SF-IND-VFQ is a valid, reliable, sensitive, and rapidly administered point-of-care PRO measure to assess changes in visual functioning in patients undergoing cataract surgery in India.
In the United States  India  and elsewhere there is a growing emphasis on understanding the impact of medical interventions on the lives of patients to assess and improve quality of care. In the case of cataract surgery, success is often equated with an improvement in an individual’s best-corrected visual acuity (BCVA). However, it is also meaningful to judge the success of surgery in relation to vision-related quality of life (QOL) and vision-dependent functioning, as these factors likely drive patients to pursue surgery and affect their satisfaction.
Many patient-reported outcome (PRO) measures have been used to assess the impact of cataract on vision-related QOL and functioning [3,4,5,6,7,8,9] and a number of these have been employed among Indian patients undergoing cataract surgery [8,9,10,11]. However, existing instruments may not be culturally relevant to patients in India [3, 4]; have not been adequately validated ; or are too lengthy to be deployed as point-of-care tests in a high-volume clinical setting .
Among existing PRO measures, the Indian Visual Functioning Questionnaire (IND-VFQ) was developed in India using rigorous qualitative methodology ; was validated with Rasch analysis ; and has been shown to detect a change in vision-related QOL after cataract surgery . Nonetheless, with 28 items, the Rasch-modified version of the IND-VFQ was too long for use at the point-of-care in a busy clinical practice at Aravind Eye Care System (AECS) in Madurai, India. Therefore, we sought to develop a rapidly administered short-form version of the IND-VFQ (SF-IND-VFQ) and to test its psychometric properties for patients undergoing cataract surgery at a tertiary eye hospital in south India. Our purpose in validating a rapid point-of-care PRO for cataract surgery is to facilitate patient-centered measurement for quality improvement, monitoring of surgical results, and use in future research studies.
This study was approved by the AECS Institutional Review Board and adhered to all tenets of the Declaration of Helsinki.
The original IND-VFQ was developed around the qualitative insights of patients who participated in 46 focus groups across three different regions of India, including Tamil Nadu . Investigators then generated and tested 45 survey items among subjects with cataract, glaucoma, diabetic retinopathy and age-related macular degeneration . Based on an evaluation using classical test theory, they produced a final 33-item version of the IND-VFQ with four subscales (mobility, activity limitation, psychosocial impact, and visual symptoms). Subsequently, the instrument was refined to a 28-item version based on the results of a Rasch analysis in which the IND-VFQ was administered to patients in Tamil Nadu with cataract .
We used the 28-item IND-VFQ  as the basis for the development of the SF-IND-VFQ. In an exploratory first phase, we administered the 28-item survey to an independent sample of 223 patients at AECS in Madurai, Tamil Nadu, India prior to and one month after cataract surgery. For each item, we calculated the Cohen’s d effect size (d = [mean1 – mean2 / SD]) . We considered for inclusion in our pilot instrument any item with d ≥ 0.5, which is considered a moderate effect size , and for which a minimum of 40% of subjects indicated that they were at least minimally affected before surgery (any response other than “not at all”). Eleven items met these criteria. An expert panel of ophthalmologists (AH, JDS, JRE) then selected one additional item from the activity limitation domain to ensure that each domain was represented by at least 2 items. This resulted in the 12 item pilot questionnaire that we evaluated in this study.
We recruited patients in the free and paid wards at AECS Madurai from June–July 2016. All patients had previously undergone a full ophthalmologic examination at the AECS Cataract and Intraocular Lens Service and had chosen to pursue cataract surgery. Criteria for inclusion were: ≥ 18 years of age; undergoing routine cataract surgery (by phacoemulsification or manual small incision cataract surgery [MSICS]); and planning to return to AECS Madurai for a one-month post-operative examination. Subjects were excluded if they: had impairment that precluded communication with an interviewer; did not speak Tamil; or lived > 50 km from Madurai since these patients commonly seek post-operative care at Aravind satellite clinics. No patients who completed the parent 28-item IND-VFQ in the exploratory phase contributed data to the validation phase of the study.
To recruit patients, the surgery roster for the day was reviewed each morning and those meeting study criteria were noted. Patients were approached in the triage area, told about the study, and informed consent was obtained. A research assistant was present to assist subjects in completing the survey if needed. An observer timed a subset of consecutive patients while they completed the survey either with or without assistance. To collect postoperative data with minimal loss to follow-up, we tagged medical charts of study patients and the study team received an alert when patients returned for their post-operative visit. Responses were reverse coded so that lower values represented poorer visual functioning. We sought a minimum sample size of 150 based on empirical evidence that this would provide person measures stable within ± ½ logit with 99% confidence .
Demographics and clinical measures
We recorded best-corrected visual acuity (BCVA) at each exam using a Snellen chart at 6 m with standard overhead illumination; testing distance was varied if vision was too poor to test at 6 m. Visual acuities were converted to log minimum angle of resolution (logMAR) values. LogMAR values for counting fingers and hand motions were recorded as 2.0 and 3.0, respectively ; perception of light was also recorded as logMAR 3.0 since no standard value existed for this level of vision. The following sociodemographics were collected: age, sex, place of residence, marital status, education, employment status, monthly household income, and whether surgery was paid for or free.
Rash analysis, a form of modern test theory, has emerged as the gold standard methodology for validating visual-functioning questionnaires . In contrast to classical test theory, Rasch analysis accounts for both the difficulty of tasks and the abilities of subjects by modeling the relationship between a latent trait (i.e. a patient’s functional ability) and the items used to measure that trait . Therefore, a Rasch analysis produces estimates of item difficulties, which correspond to the amount of visual ability required by a given task, and person abilities, which are a measure of participants’ ability to perform tasks that require vision. These can be utilized to estimate interval scales rather than summing Likert scores that assume an equal difference between any two consecutive response categories. The results of a Rasch analysis are then compared to normative values to assess the reliability, validity and measurement precision of a PRO measure [18,19,20,21]. Massof has published an extensive discussion of Rasch analysis and its application to PRO measures of visual ability .
The Rasch model is based on the assumption of unidimensionality, which describes the measurement of a single latent construct. Therefore, we performed exploratory factor analysis (EFA) by weighted least squares of the polychoric item-test correlation of pre- and post-test scores . We then performed confirmatory factor analysis (CFA) for a one factor model and calculated residual correlations for each item and for model fit indices, including the root mean square error of approximation (RMSEA), comparative fit index (CFI) and Tucker-Lewis index (TLI). Good fit with a one-factor model is indicated by an RMSEA value < 0.08 and CFI and TLI values ≥0.95 . Factor analyses were performed using MPlus (version 7.0).
Next, after performing EFA and CFA, we fit SF-IND-VFQ responses to the Rasch model using Winsteps software (version 3.92.1). The likelihood-ratio test was significant, suggesting the partial credit model (PCM) was applicable to our data since it allows for response thresholds to vary from item to item . We fit our survey response data to the Rasch model to estimate item and overall model fit by comparing our results to normative values that indicate instrument reliability, precision and validity [18,19,20,21]. Polyserial item-test correlations were calculated, representing the inferred latent correlation between a continuous variable, the test score, and an ordered categorical variable, the item score. For each survey item, we calculated estimates of visual functioning for each response category; response category thresholds; and fit statistics. For all analyses, data from the pre- and post-operative periods were pooled.
In order for data to fit the Rasch model well, there should not be differential item functioning (DIF) . This occurs when item responses vary among subgroups of the population; for example, when men and women have similar visual ability but respond differently to the same item. We performed testing for DIF using logistic regression and the magnitude of DIF was measured by McFadden’s pseudo-R2 criterion. We examined the following variables for DIF: age, sex, location of home (urban versus suburban/rural), marital status, educational attainment, employment status, household income, pay status (whether surgery was paid for or free), and surgery type (phacoemulsification or MSICS). Finally, to test the responsiveness of our instrument to cataract surgery, we calculated the average change in survey score and Cohen’s d, a measure of effect size .
Patient characteristics are presented in Table 1. 225 patients were included with an average age of 60.4 ± 8.6 years and 66.2% were female. About half of patients paid for surgery (51.1%) and underwent MSICS (53.8%), while a majority lived in an urban setting (76.0%); did not complete high school (58.2%); were unemployed (54.2%); were married (71.1%); and had a monthly household income ≤5000 Rupees (about US$78). The mean pre-operative BCVA in the eye undergoing surgery was logMAR 0.46 ± 0.44 (approximately 6/18), which improved to 0.07 ± 0.13 (approximately 6/6) after surgery. The average time to complete the survey for a subset of 18 consecutive patients was 2 min 6.3 s (range, 1 min 9 s to 4 min 37 s).
EFA showed that the first eigenvalue was the only one that was substantially greater than one, which suggested that the instrument was dominated by a single factor; this is illustrated in the scree plot in Additional file 1. In order to test this, we performed CFA for the one factor model. Item 11, “Does bright light hurt your eyes?” had three residual correlations > 0.1 with other items, suggesting that this item may not fit with the single underlying factor. With this item removed, the RMSEA was 0.077, CFI was 0.98 and TLI was 0.98. Of note, the RMSEA, CFI and TLI statistics with item 11 included were 0.073, 0.98, and 0.98, respectively, indicating similar model fit with and without item 11. There were no additional items with residual correlations > 0.1.
Survey items are listed in Table 2. We examined response frequencies and ordering of response thresholds for each item. Items 1, 2, 3, 5 and 7 all had evidence of disordered thresholds. This meant that for these items successive response categories did not correspond to increasing values of visual ability. Therefore, we collapsed the response scale for these items from four to three categories, after which all items had ordered thresholds. We then fit a PCM model to the recoded items.
We examined data for the presence of DIF. We did not detect DIF related to age, sex, location of home, marital status, educational attainment, employment status or household income for any item. Item 2 showed DIF for surgery type, with patients undergoing MSICS being less likely to endorse higher response categories. Item 8 showed non-uniform DIF for pay status, with individuals who had reduced payment or free surgery showing less discrimination than individuals who fully paid for surgery. However, for both items the magnitude of the DIF as measured by McFadden’s pseudo R2 criterion was negligible (R2 < 0.02).
Table 2 provides a summary of the difficulty and fit of each item. The infit and outfit mean-square (MNSQ) statistics provide evidence of construct validity when expected values are close to 1.0, with values from 0.5 to 1.5 being useful for measurement . Values considerably > 1.0 denote randomness in the data (e.g. noise and outliers), while values < 1.0 imply that an item may not be productive for measurement because its values are too predictable. Item 11 was the only item that exhibited significant misfit, with an infit MNSQ of 1.53 and outfit MNSQ of 2.32.
The item-test correlation for item 11 was also low (0.48) compared to other items. These findings suggest that Item 11, which also did not fit the single factor model, could be removed without affecting measurement precision.
The average visual functioning estimates for each response category of each item are reported in Additional file 2 along with category thresholds and misfit statistics for each of the 12 items. The misfit statistics for each item category correspond to the overall data reported in Table 2. However, for item 6 in which there were observed responses for only two of the categories, we detected category misfit due to the small number of observed values.
Table 3 presents reliability values and separation indices, calculated after having omitted Item 11. We found a high item reliability (0.97), indicating that our sample was large enough to achieve statistically reproducible item difficulties. Person reliability was acceptable (0.80), confirming the overall reliability of the questionnaire. Measurement precision, indicated by the person separation index (1.97 logits), approached the commonly accepted threshold of 2.0 logits and a high item separation index (5.87 logits) specified good content validity.
We also calculated the mean item difficulty and person ability, which were 0.00 ± 0.77 and 3.41 ± 2.23 logits, respectively. If items are perfectly targeted to a sample, there should be minimal or no difference between person ability and item difficulty. We observed suboptimal targeting, as the visual ability of subjects was greater than that required by the survey items, which were too easy for this population. The person-item map in Fig. 1 illustrates this, showing a mismatch between item and person scores. Furthermore, the test information plot in Fig. 2 demonstrates that items provided the greatest information for the lowest functioning individuals. This plot shows that the SF-IND-VFQ achieved high information and good precision for all values less than zero (the mean of the person distribution), but that information degraded for higher values. Of note, when only pre-operative data were analyzed targeting remained suboptimal, though the mean difference between person and item measures improved considerably to 1.92 logits.
The average change in SF-IND-VFQ scores after cataract surgery was 2.98 ± 0.11 logits. Cohen’s d was 1.80, indicating a very large effect size .
We found that the 11-item SF-IND-VFQ is a unidimensional instrument capable of detecting changes in visual functioning among Tamil-speaking patients undergoing cataract surgery in India. The culturally-relevant questionnaire was administered rapidly at the point-of-care and had good reliability, content validity and was largely unaffected by DIF.
The purpose of this study was to design a short-form PRO measure that is appropriate as a point-of-care instrument for use in high volume clinical settings in order to monitor outcomes, promote quality improvement, and facilitate outcome measurement in future pragmatic clinical trials and clinical research studies. Traditionally, PRO measures have been employed as research tools and are often too time-consuming to be deployed in a busy clinical setting. However, in more recent years, with the growth of the National Institutes of Health’s Patient-Reported Outcomes Measurement Information System (PROMIS) and the National Health System’s quality improvement initiatives in the United Kingdom, the use of PRO measures in clinical practice has become more common .
The SF-IND-VFQ meets most criteria that Kroenke and colleagues proposed for using PRO measures in clinical practice . Briefly, the survey is setting-appropriate based on its cultural relevance and low respondent burden; can be self-administered and quickly completed; has items that focus on a single symptom; evaluates only one dimension (symptom severity), with no more than four response categories per item; and is widely accessible at no cost. Additional criteria such as actionability (translating scores into concrete actions) and universality (usefulness in other conditions) will be tested in future studies as the SF-IND-VFQ is used more broadly throughout AECS.
One item present in the initial SF-IND-VFQ, “Does bright light hurt your eyes?,” was removed since it did not fit the same unidimensional construct as other items and had poor fit with the Rasch model, as denoted by its MNSQ values. Notably, this was the only item that assessed pain symptoms and this may explain why it did not form part of a unidimensional scale. All other items had goodness-of-fit statistics within the acceptable range, which confirmed the instrument’s construct validity.
Two items retained in the SF-IND-VFQ were affected by DIF, though the magnitude of DIF was negligible for both of these. The first of these items (item 2) asked, “because of your vision how much trouble do you have making out the bumps and holes in the road when walking?” Patients undergoing MSICS, but not phacoemulsification, were less likely to endorse response categories associated with high visual functioning. This may be the case since MSICS is frequently the preferred surgical technique for very advanced cataracts that tend to cause more severe visual dysfunction. Item 8, “because of your eye problem are you ashamed that you can't see?” had non-uniform DIF in which patients who received free surgery showed less discrimination than those who paid for surgery. The reason for this finding is less clear but may be related to understanding or interpretation of the question.
The SF-IND-VFQ was able to discriminate effectively amongst individuals with high levels of visual dysfunction. However, overall the instrument demonstrated suboptimal targeting. The mean difference between person ability and item difficulty was 3.41 logits, suggesting that the amount of visual functioning required by SF-IND-VFQ items was not matched to the visual ability of our subjects. Other cataract surgery PRO measures have also suffered from poor targeting because of items being too easy for patients with cataract [26, 27]. One exception is Catquest-9SF, which showed excellent targeting in a Swedish population (preoperatively, − 0.34 logits) . In our current study, since patients had poorer vision before surgery, SF-IND-VFQ targeting was considerably better preoperatively, though it remained suboptimal (− 1.92 logits).
One reason for suboptimal targeting in this study was that the long-form IND-VFQ was also not well-targeted to patients with cataract . The IND-VFQ was developed through focus groups of patients with cataract, glaucoma, diabetic retinopathy, macular degeneration, and “mixed low vision.” Among the 308 participants from three regions of India, about half (51.9%) had cataracts. In a subsequent study, Finger and colleagues surveyed patients with cataract in south India with the initial 33-item IND-VFQ . Using principal components analysis, they found that the survey contained five subscales (general functioning, mobility, activity limitation, psychosocial impact, and visual symptoms). When Rasch analysis was performed, the visual symptoms subscale had the best targeting (− 0.93 logits), while general functioning (− 2.32), mobility (− 2.94), activity limitation (− 1.93) and psychosocial impact (− 1.39) were not as well targeted. We had similar results when we assessed preoperative targeting (− 1.92 logits), which is comparable to the cross-sectional analysis in Finger et al.’s study of patients with cataract . Since subjects in our study had relatively preserved preoperative visual function (mean BVCA of about 6/18) the instrument may be better targeted to cataract patient populations in India with more advanced disease and this could be explored in future studies.
Given the strong psychometric properties of Catquest-9SF, we had considered employing this instrument for our study rather than validating a new short-form questionnaire. However, although Catquest was well-targeted to patients in Sweden  and several other settings, targeting had not been optimal in all populations . Additionally, we did not believe that all Catquest items were culturally relevant to our patients at AECS. Specifically, items related to reading newspaper text, doing handicrafts, reading text on television, and carrying out a preferred hobby were not thought to be pertinent.
The SF-IND-VFQ was extremely sensitive to changes in visual function after cataract surgery, with the average score improving by 2.98 logits. Additionally, the effect size in this study, measured by Cohen’s d, was 1.80, which is considered very large based on accepted criteria . This was due in part to including only IND-VFQ items that were known to be sensitive to the impact of cataract surgery. The effect size for the SF-IND-VFQ was also high in comparison to other cataract PRO measures. McAlinden and colleagues conducted a study in which 16 PROs were administered to patients having cataract surgery in Australia . Out of this battery, they found that Catquest-9SF had the largest effect size with a Cohen’s d of 1.45.
Although our findings indicate that the SF-IND-VFQ is a valid, reliable and quickly administered PRO, there were several limitations to our study. First, our findings may not be generalizable beyond a tertiary eye center in south India; however, future work should be done to test the measurement properties of the SF-IND-VFQ in different geographic and eye care delivery settings, such as in a lower resource field-based setting in India. Second, we intentionally selected items from the parent IND-VFQ that were most sensitive to the effect of cataract surgery, though we acknowledge that targeting might have been better had we chosen items using different criteria.
All analyses used combined pre-operative and post-operative data. We acknowledge that this violates the Rasch model’s assumption of independence. However, in pooling data we are able to provide item calibrations that can be applied to patients both before and after undergoing cataract surgery, a feature that makes this approach clinically useful. To determine the impact of this violation of the assumption of independence, we analyzed pretest data alone and compared this to results from our pooled analyses. We found no differences in the results for the EFA or CFA, while Rasch person reliability was 0.78 for pretest data alone compared to 0.80 for the pooled data and the 11 retained items continued to show good fit with the model. Of note, we were unable to fit a separate model for the post-operative data since there were few responses indicating severe visual dysfunction, as one would expect due to patients’ improved vision after undergoing surgery.
Building on prior work in India [9,10,11], the current investigation describes the development of a brief and culturally relevant PRO measure that can be quickly administered at the point-of-care. Though further work could be done to improve the targeting of the SF-IND-VFQ, these features suggest that it may be a feasible tool to assess patient-reported visual function in a busy clinical practice. As the original IND-VFQ is available in multiple Indian languages, our short-form version can be easily tested in other parts of India. This questionnaire may prove useful in future work to monitor and improve surgical outcomes and to facilitate clinical research and pragmatic clinical trials at AECS and elsewhere. We have developed a raw-score to Rasch person measure conversion to allow others to use the SF-IND-VFQ without performing their own Rasch analysis (Additional file 3).
Aravind Eye Care System
Best-corrected visual acuity
Confirmatory factor analysis
Comparative fit index
Differential item functioning
Exploratory factor analysis
Indian Visual Functioning Questionnaire
Log minimum angle of resolution
Manual small incision cataract surgery
Patient-Reported Outcomes Measurement Information System
Root mean square error of approximation
Short-Form Indian Visual Functioning Questionnaire
United States Dollars
Basch E. Patient-reported outcomes — harnessing patients’ voices to improve clinical care. N Engl J Med. 2017;376:105–8.
Dang A, Mendon S. The role of patient reported outcomes (PROs) in healthcare policy making. Syst Rev Pharm. 2015;6:1.
Steinberg EP, Tielsch JM, Schein OD, Javitt JC, Sharkey P, Cassard SD, et al. The VF-14. An index of functional impairment in patients with cataract. Arch Ophthalmol. 1994;112:630–8.
Lundstrom M, Behndig A, Kugelberg M, Montan P, Stenevi U, Pesudovs K. The outcome of cataract surgery measured with the Catquest-9SF. Acta Ophthalmol. 2011;89:718–23.
Pesudovs K, Gothwal VK, Wright T, Lamoureux EL. Remediating serious flaws in the National eye Institute visual function questionnaire. J Cataract Refract Surg. 2010;36:718–32.
Mangione CM, Orav EJ, Lawrence MG, Phillips RS, Seddon JM, Goldman L. Prediction of visual function after cataract surgery. A prospectively validated model. Arch Ophthalmol Chic Ill 1960. 1995;113:1305–11.
Gupta SK, Viswanath K, Thulasiraj RD, Murthy GVS, Lamping DL, Smith SC, et al. The development of the Indian vision function questionnaire: field testing and psychometric evaluation. Br J Ophthalmol. 2005;89:621–7.
Nutheti R, Shamanna BR, Nirmalan PK, Keeffe JE, Krishnaiah S, Rao GN, et al. Impact of impaired vision and eye disease on quality of life in Andhra Pradesh. Invest Ophthalmol Vis Sci. 2006;47:4742–8.
Fletcher A, Vijaykumar V, Selvaraj S, Thulasiraj RD, Ellwein LB. The Madurai intraocular lens study. III: visual functioning and quality of life outcomes. Am J Ophthalmol. 1998;125:26–35.
Finger RP, Kupitz DG, Fenwick E, Balasubramaniam B, Ramani RV, Holz FG, et al. The impact of successful cataract surgery on quality of life, household income and social status in South India. PLoS One. 2012;7:e44268.
Khan A, Amitava AK, Rizvi SAR, Siddiqui Z, Kumari N, Grover S. Cost-effectiveness analysis should continually assess competing health care options especially in high volume environments like cataract surgery. Indian J Ophthalmol. 2015;63:496–500.
Murthy GVS, Gupta SK, Thulasiraj RD, Viswanath K, Donoghue EM, Fletcher AE. The development of the Indian vision function questionnaire: questionnaire content. Br J Ophthalmol. 2005;89:498–503.
Finger RP, Kupitz DG, Holz FG, Balasubramaniam B, Ramani RV, Lamoureux EL, et al. The impact of the severity of vision loss on vision-related quality of life in India: an evaluation of the IND-VFQ-33. Invest Ophthalmol Vis Sci. 2011;52:6081–8.
Cohen J. Statistical power analysis for the behavioral sciences. 2nd edition. Abingdon-on-Thames, U.K.: Routledge; 1988.
Linacre JM. Sample size and item calibration stability. Rasch Meas Trans. 1994;7:328.
Holladay JT. Visual acuity measurements. J Cataract Refract Surg. 2004;30:287–90.
Massof RW. Understanding Rasch and item response theory models: applications to the estimation and validation of interval latent trait measures from responses to rating scale questionnaires. Ophthalmic Epidemiol. 2011;18:1–19.
Cappelleri JC, Jason Lundy J, Hays RD. Overview of classical test theory and item response theory for the quantitative assessment of items in developing patient-reported outcomes measures. Clin Ther. 2014;36:648–62.
Petrillo J, Cano SJ, McLeod LD, Coon CD. Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples. Value Health J Int Soc Pharmacoeconomics Outcomes Res. 2015;18:25–34.
Wright B, Linacre J. Reasonable mean-square fit values. Rasch Meas Trans. 1994;8:370.
Wright BD, Stone M. Measurement essentials. Wilmington: Wide Range Inc.; 1999.
Thompson B. Exploratory and confirmatory factor analysis: understanding concepts and applications. American Psychological Association; 2004.
Masters GN. A Rasch model for partial credit scoring. Psychometrika. 1982;47:149–74.
Snyder CF, Aaronson NK, Choucair AK, Elliott TE, Greenhalgh J, Halyard MY, et al. Implementing patient-reported outcomes assessment in clinical practice: a review of the options and considerations. Qual Life Res Int J Qual Life Asp Treat Care Rehabil. 2012;21:1305–14.
Kroenke K, Monahan PO, Kean J. Pragmatic characteristics of patient-reported outcome measures are important for use in clinical practice. J Clin Epidemiol. 2015;68:1085–92.
Pesudovs K, Garamendi E, Keeves JP, Elliott DB. The activities of daily vision scale for cataract surgery outcomes: re-evaluating validity with Rasch analysis. Invest Ophthalmol Vis Sci. 2003;44:2892–9.
Skiadaresi E, Ravalico G, Polizzi S, Lundström M, González-Andrades M, McAlinden C. The Italian Catquest-9SF cataract questionnaire: translation, validation and application. Eye Vis. 2016;3. doi: https://doi.org/10.1186/s40662-016-0043-9.
McAlinden C, Gothwal VK, Khadka J, Wright TA, Lamoureux EL, Pesudovs K. A head-to-head comparison of 16 cataract surgery outcome questionnaires. Ophthalmology. 2011;118:2374–81.
This study was funded by the Heed Ophthalmic Foundation (JRE; no grant number), the National Eye Institute (JRE; K23EY027848) and the University of Michigan Summer Biomedical Research Program (CF and JS; no grant number). No funding agency had any role in the design of the study, collection, analysis or interpretation of data or in the writing of the manuscript.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
Approval for this study was granted by the Aravind Eye Care System Institutional Review Board. Informed consent was obtained from all participants. The study adhered to all tenets of the Declaration of Helsinki.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Scree Plot of Exploratory Factor Analysis. This plot illustrates that the first eigenvalue of the exploratory factor analysis was the only value that was greater than 1. This suggests a unidimensional model. (TIFF 929 kb)
Supplementary tables from Rasch Analysis. These tables report for each of the 12 survey items: the average visual functioning estimates for each response category; category thresholds; and misfit statistics. (DOCX 19 kb)
Raw score to Rasch person-measure conversion. This worksheet allows researchers and clinicians to covert raw SF-IND-VFQ scores to interval scores without performing their own Rasch analysis. Caution should be exercised in applying this Rasch analysis to patient populations that are very different from the one in this study. (XLSX 12 kb)