Evaluation of the Functional Assessment of Cancer Therapy-General (FACT-G) Spanish Version 4 in South America: Classic Psychometric and Item Response Theory Analyses
© Dapueto et al; licensee BioMed Central Ltd. 2003
Received: 16 May 2003
Accepted: 20 August 2003
Published: 20 August 2003
The FACT-G has gone through many validation studies. However, little research has been conducted in South American Spanish speaking patients. The present study aimed to evaluate the FACT-G Spanish Version 4 in Uruguayan cancer patients.
The data analyzed were collected from 309 patients, with various tumor sites, at different stages of disease and receiving different treatments.
Reliability, evaluated using Cronbach's coefficient alpha, showed high internal consistency for the FACT-G total scale and each of its subscales (range = .78 – .91). The FACT-G total score also showed significant mean differences among known groups (performance status, inpatients vs. outpatients) when tested by ANOVA and t-test. When the tumor stage (Local and Regional vs. Metastatic disease) was used as a clinical anchor, the FACT-G total score and the Physical Well-being (PWB) and Functional Well-being (FWB) subscale scores showed mean differences ranging from 5 to 10 points on a scale from 0–108 (effect sizes = 0.30–0.60). Item response theory (IRT)-based evaluation using mean square fit statistics (.60–1.4) criteria showed that only two items misfit: "Estoy satisfecho(a) con mi vida sexual" (I am satisfied with my sex life) and "Estoy satisfecho(a) de cómo estoy enfrentando mi enfermedad" (I am satisfied with how I am coping with my illness).
The results indicated that, using both traditional and IRT approaches, the Spanish FACT-G has good reliability and validity to be used as a QOL instrument among Uruguayan cancer patients.
The Functional Assessment of Cancer Therapy-General (FACT-G) Questionnaire, designed to measure quality of life (QOL) in cancer patients, has gone through many validation studies, both in its English and translated versions [2–5]. However, little research has been conducted in South American Spanish-speaking patients. A previous study performed among Uruguayan cancer patients using FACT-G Spanish Version 2 showed acceptable to good reliability and validity except for the emotional well-being subscale. Cella and his colleagues also recognized this potential problem and subsequently revised the items in a new Spanish Version 3 and in the most recent Spanish Version 4 in order to improve reliability. Major changes include adding one previously available but unscored item to the scoring algorithm ("I worry my condition will get worse" – "Me preocupa que mi enfermedad empeore"), rephrasing some other items to improve readability, and removing the 2-item Relationship with Doctor subscale. These changes underscore the need for additional validation studies in the Spanish-speaking cancer patient population of South America.
Another reason for further psychometric studies on the FACT-G is that, to date, its Spanish version has been validated, like other QOL questionnaires, using classical test theory (CTT) approaches. Traditionally, classical psychometric procedures have dominated health status assessment. More recently, item response theory (IRT) measurement models have entered the field, and researchers are increasingly enthusiastic about the prospect of deriving better definitions of underlying constructs and the opportunity to turn attention away from static tests and scales to items and the incremental information they provide. The present study aimed to evaluate the performance of the FACT-G Spanish Version 4 in Uruguayan cancer patients using both classic psychometric and item response theory based approaches.
The data were collected from cancer patients between 18 and 75 years of age, with various tumor sites, at different stages of disease and under different forms of treatment. To be eligible, patients had to be fluent in Spanish. Potential participants were identified from the daily record of office visits, treatment visits and inpatient hospitalizations. To ensure sufficient experience with treatment-related side effects, patients must have completed a minimum of two cycles of chemotherapy and/or 10 radiation therapy sessions or one month of hormone therapy, and at least one month must have passed since their last surgery. To ensure heterogeneity of socioeconomic features, patients were recruited from one private (Centro de Asistencia del Sindicato Médico del Uruguay – CASMU) and three public (Hospital de Clínicas de la Universidad de la República, Instituto de Oncología del MSP, Servicio de Oncología Radioterápica del Hospital Pereira Rossell) hospitals of the city of Montevideo. Patients with ostensible cognitive deficits or serious psychiatric dysfunctions were excluded. Ability to give informed consent was required. Approval from the corresponding ethics committees was obtained.
Patients were assessed using a battery of instruments. Two physician rated QOL questionnaires, the ECOG Performance Status Rating and the Spitzer's Quality of Life-Index doctor version (QLI-d) were completed either by the treating physician or oncologist. Patient self-reported questionnaires included the Functional Assessment of Cancer Therapy-General (FACT-G) Spanish Version 4, the Spitzer's Quality of Life-Index patient version (QLI-p), the Profile of Mood States, Short Form (POMS-SF) and the Marlowe-Crowne Social Desirability Scale (MCSDS).
ECOG Performance Status Rating (ECOG PSR) is a five-point scale ranging from 0 (fully ambulatory) to 4 (not being able to leave bed).
The Quality of Life Index (QL-I) is a five-item questionnaire in which each item explores one dimension or domain of quality of life: health, activity, daily living, support and outlook. Every item has three response categories indicating different levels of functional impairment. Although it was originally developed as an observer-rated scale (QLI-d), it can also be used as a patient-rated scale (QLI-p). For the purpose of the study, the QL-I was translated into Spanish following a forward and backward translation procedure, carried out by a native English-speaking linguist and a native Spanish-speaking English translator. The Spanish versions of the two QLI questionnaires are available from the authors upon request.
The Functional Assessment of Cancer Therapy-General Questionnaire (FACT-G) Spanish Version 4 is a widely used QOL instrument. It comprises 27 questions that assess four primary dimensions of QOL: physical (PWB; 7 items), social and family (SFWB; 7 items), emotional (EWB; 6 items), and functional well-being (FWB; 7 items). It uses 5-point Likert-type response categories ranging from 0 = 'not at all' to 4 = 'very much'. The total FACT-G score is the sum of the 4 subscale scores and ranges from 0 to 108. Data from a previous study on the FACT-G Spanish Version 2 conducted among Uruguayan cancer patients suggested that its reliability was acceptable to good in all subscales except for the EWB scale. An important question raised by these results was whether these subscales showed sufficient internal consistency to justify their use across cultures and whether the Spanish EWB was equivalent to its English counterpart. Since then, the developers of the FACT-G have revised the questionnaire into its most recent Version 4. Major changes were the inclusion of an additional item ("I worry my condition will get worse" – "Me preocupa que mi enfermedad empeore") in the EWB subscale, the removal of the two-item "Relationship with the Doctor" subscale and the rewording of 12 Spanish items to improve their readability.
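As a rough illustration of the scoring rule just described, the sketch below sums item responses into subscale scores and a total. It assumes responses are already coded 0–4 and that negatively phrased items have been reverse-scored beforehand; the names `SUBSCALE_SIZES`, `subscale_score` and `fact_g_total` are hypothetical, and the official FACT-G scoring guidelines (for example, the handling of missing items) are not reproduced here.

```python
# Hypothetical sketch: sum 0-4 item responses into FACT-G subscale and total scores.
# Assumes items are already oriented so that higher values mean better QOL.

SUBSCALE_SIZES = {"PWB": 7, "SFWB": 7, "EWB": 6, "FWB": 7}  # 27 items in total


def subscale_score(responses, n_items):
    """Return the sum of one subscale's responses after basic validation."""
    if len(responses) != n_items:
        raise ValueError(f"expected {n_items} responses, got {len(responses)}")
    if any(r not in (0, 1, 2, 3, 4) for r in responses):
        raise ValueError("responses must be integers from 0 to 4")
    return sum(responses)


def fact_g_total(answers):
    """answers maps each subscale name to its list of item responses."""
    return sum(
        subscale_score(answers[name], n_items)
        for name, n_items in SUBSCALE_SIZES.items()
    )


# The highest possible response on every item reaches the top of the 0-108 range.
best_case = {name: [4] * n_items for name, n_items in SUBSCALE_SIZES.items()}
print(fact_g_total(best_case))  # 108
```

Keeping the subscale sizes in one mapping makes the 0–108 range follow directly from 27 items scored 0–4.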
The Profile of Mood States, Short Form (POMS-SF) is a widely used scale measuring subjective mood states, such as anxiety, tension, vigor, depression, fatigue and confusion. The POMS-SF is a valid measure of affective states and psychological adjustment in cancer patients and is available in Spanish. A Total Mood Disturbance score (POMS TMD) may be obtained by summing the five scores of the Tension, Depression, Anxiety, Fatigue and Confusion subscales and subtracting the Vigor score from this sum. Only patients with a 6th grade or higher level of reading ability were included in the analysis, according to the instrument developers' instructions.
The Marlowe-Crowne Social Desirability Scale (MCSDS): the 10-item short form of the MCSDS provides a measure of the degree to which participants endorse socially desirable characteristics. A validated Spanish version of the questionnaire was completed by the study participants.
Demographic, disease and treatment information was collected from patients and the treating physician, and was verified by the research assistants against the participants' medical records.
The classical psychometric approach to the analysis of the data consisted of an examination of the reliability and validity of the FACT-G Version 4. Reliability was examined by internal consistency (Cronbach's coefficient alpha) for each subscale and the overall scale. Alpha coefficients of 0.70 or higher were considered acceptable. Construct validity was assessed by comparing mean differences in FACT-G total and subscale scores according to known groups (i.e., patients' performance status, inpatients vs. outpatients) and by studying the correlations between the instruments (convergent and discriminant validity).
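To make the internal consistency criterion concrete, here is a minimal sketch of Cronbach's coefficient alpha using the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of the summed score). The function name, the toy responses and the use of sample variances are illustrative assumptions; this is not the software or the data used in the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances throughout; raises ZeroDivisionError if the
    summed scores have no variance.
    """
    if len(rows) < 2:
        raise ValueError("need at least two respondents")
    k = len(rows[0])
    if k < 2 or any(len(r) != k for r in rows):
        raise ValueError("every respondent needs the same k >= 2 items")
    item_columns = list(zip(*rows))  # one tuple of scores per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Toy data (invented): five respondents answering a three-item subscale.
toy_responses = [[4, 3, 4], [2, 2, 3], [1, 0, 1], [3, 3, 2], [0, 1, 0]]
alpha = cronbach_alpha(toy_responses)
print(f"alpha = {alpha:.2f}, acceptable = {alpha >= 0.70}")
```

The final comparison applies the 0.70 acceptability threshold stated above.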
Several methods have been described to establish the clinical significance of QOL measures. Anchor-based methods examine the relationship between scores on the instrument whose interpretation is under question (target instrument) and some independent measure (an anchor). This approach requires that 1) the anchor is easily interpretable and 2) there is an appreciable association between the target and the anchor. Differences in scores in relation to the clinical anchors can then be used to set the minimum important difference or clinically meaningful change in order to evaluate outcomes in clinical trials. Anchor-based clinical significance was studied in order to determine clinically significant differences in QOL assessments as measured by FACT-G total and subscale scores, using tumor stage as a definite clinical criterion commonly used in the oncology field. Differences of 5 to 10 score points on a 100-point scale are considered relevant in determining the clinical significance of QOL measures. To examine the statistical magnitude of the observed differences, each mean difference score was standardized by relating it to its standard deviation (effect size). An effect size of d = 0.2 was taken to indicate a small difference, d = 0.5 a moderate difference, and d = 0.8 a large one.
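The effect size described above can be sketched as a standardized mean difference. The text does not say which standard deviation was used as the denominator, so the pooled standard deviation below is an assumption, as are the function names, the toy scores and the "negligible" label for values under 0.2.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Standardized mean difference using a pooled sample standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_variance = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_variance)


def effect_size_label(d):
    """Map |d| onto the 0.2 / 0.5 / 0.8 benchmarks quoted in the text."""
    size = abs(d)
    if size >= 0.8:
        return "large"
    if size >= 0.5:
        return "moderate"
    if size >= 0.2:
        return "small"
    return "negligible"


# Toy scores (invented) for two stage groups.
local_scores = [2, 4, 6]
metastatic_scores = [1, 3, 5]
d = cohens_d(local_scores, metastatic_scores)
print(d, effect_size_label(d))  # 0.5 moderate
```

With real data the two lists would hold the FACT-G scores of the groups being compared, for example local versus metastatic disease.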
The item response data were analyzed using Andrich's [19–22] rating scale model (RSM). The RSM is an item response theory (IRT)-based measurement model and has been implemented in the WINSTEPS computer program. The RSM specifies two facets (person latent trait, Bn; item location, Di) and the step thresholds (Fj). The probability of person n responding in response category j to item i can then be expressed by the formula:
ln [Pnij / Pni(j-1)] = Bn - Di - Fj,
in which Pnij is the probability of person n endorsing or choosing category j of item i, Pni(j-1) is the probability of person n endorsing or choosing category j - 1 of item i, Bn is the latent trait measure (e.g., fatigue) of person n, Di is the location of item i, and Fj is the step threshold between categories j - 1 and j. In the present study, for example, F1 is the transition from intensity category 1 ("not at all") to category 2 ("a little bit") and F4 is the transition from category 4 ("quite a bit") to category 5 ("very much"). Each threshold is the point on the latent trait scale (e.g., PWB) at which two consecutive category response curves intersect.
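The adjacent-category formula above determines the full set of category probabilities: accumulating the terms Bn - Di - Fj over the steps gives each category's log-weight relative to the lowest category, and normalizing the weights yields probabilities. The sketch below illustrates this under stated assumptions: categories are indexed 0 to 4 rather than 1 to 5, and the function name and the parameter values are invented for illustration, not estimates from the study or WINSTEPS output.

```python
from math import exp, log


def rsm_category_probs(person, item, thresholds):
    """Category probabilities under the rating scale model.

    person: latent trait Bn; item: location Di; thresholds: step values
    F1..Fm for m + 1 ordered categories (indexed 0..m here).
    """
    # Log-weight of category j is the running sum of (Bn - Di - Fk), k = 1..j.
    log_weights = [0.0]
    for step in thresholds:
        log_weights.append(log_weights[-1] + (person - item - step))
    top = max(log_weights)  # subtract the maximum for numerical stability
    weights = [exp(w - top) for w in log_weights]
    total = sum(weights)
    return [w / total for w in weights]


# Invented values: one person, one item, four steps for a 5-category item.
probs = rsm_category_probs(person=0.5, item=-0.2, thresholds=[-1.5, -0.5, 0.5, 1.5])
print([round(p, 3) for p in probs])

# The log-odds of adjacent categories recover Bn - Di - Fj for each step.
print(round(log(probs[1] / probs[0]), 6))  # 2.2, i.e. 0.5 - (-0.2) - (-1.5)
```

When Bn - Di equals a threshold Fj, categories j - 1 and j are equally likely, which is the intersection point of the two category response curves mentioned above.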
Item fit statistics
In order to examine the fit of each item to measure a unidimensional construct (e.g., PWB), the infit and outfit mean square (MNSQ) item fit statistics provided in the WINSTEPS program were evaluated. Fit implies meeting the measurement requirements of item homogeneity and unidimensionality. It also indicates the validity of the item calibrations and person measures. Item misfit indicates that an item is not measuring the same underlying construct as other items within the same scale.
The infit MNSQ is an information-weighted fit statistic, which is more sensitive to unexpected behavior affecting responses to items near the person's trait level. The weighting reduces the influence of less informative, low variance, off-target responses. The outfit MNSQ is an outlier-sensitive fit statistic, more sensitive to unexpected behavior by persons on items far from the person's trait level. These statistics have an expected value of 1.0, and range from 0 to infinity. Values substantially below 1 indicate local dependencies in the data; values substantially above 1 indicate noise. These values are on a ratio scale, so that 1.2 indicates 20% excess noise. We set .60–1.4 for infit MNSQ and outfit MNSQ as cut-off criteria for items with good fit.
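The two fit statistics can be sketched from their usual Rasch definitions: outfit is the unweighted mean of squared standardized residuals, while infit is the sum of squared residuals divided by the sum of the model variances. The code below is an illustration under those textbook definitions, not the WINSTEPS implementation; the function names and the toy inputs are hypothetical, and the model category probabilities for each person are taken as given.

```python
def expected_and_variance(category_probs):
    """Model-expected score and variance for one person-item encounter."""
    expected = sum(k * p for k, p in enumerate(category_probs))
    var = sum((k - expected) ** 2 * p for k, p in enumerate(category_probs))
    return expected, var


def item_fit(observed, probs_per_person):
    """Return (infit MNSQ, outfit MNSQ) for one item.

    observed: one response per person (0-based category index);
    probs_per_person: model category probabilities for each person.
    """
    squared_residuals, variances = [], []
    for response, category_probs in zip(observed, probs_per_person):
        expected, var = expected_and_variance(category_probs)
        squared_residuals.append((response - expected) ** 2)
        variances.append(var)
    infit = sum(squared_residuals) / sum(variances)  # information-weighted
    outfit = sum(r / v for r, v in zip(squared_residuals, variances)) / len(observed)
    return infit, outfit


def within_criteria(mnsq, low=0.60, high=1.40):
    """Apply the .60-1.4 cut-off range used in this study."""
    return low <= mnsq <= high


# Toy two-category example where every response is equally likely under the model.
infit, outfit = item_fit([0, 1, 1, 0], [[0.5, 0.5]] * 4)
print(infit, outfit, within_criteria(infit), within_criteria(outfit))
# 1.0 1.0 True True
```

Because outfit divides each squared residual by its own variance, a single very unexpected response from an off-target person can inflate it, whereas the pooled denominator of infit dampens that influence.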
Table 1. Socio-demographic and clinical features of the sample (n = 309)
[Table values not recovered; surviving row labels are listed below.]
Lives with family; Lives with others; Lives in institution
Years of education: Primary School <3 years; Primary School 3 to 5 years; Primary School 6 years; High School 1 to 3 years; High School 4 to 6 years
Satisfaction with income: "I can save"; "I get enough, no money problems"; "I get just fair for my needs"; "I don't get enough for my needs"; "I am in need"
Family monthly income in US dollars: 0 – 5000; 192 – 571
Tumor site: Head & Neck; Uterus & Cervix
Mean time since cancer diagnosis was 29 months (SD = 39.5 months; range: 2 weeks to 24.5 years). As for treatment characteristics, 172 patients had undergone surgery (55.7%); 204 received only one treatment modality: 95 of them were treated with chemotherapy (ChT) (30.7%), 98 with radiotherapy (RT) (31.7%) and 11 with hormone therapy (HT) (3.6%). A further 100 patients received a combination of ChT and RT, 4 (1.3%) were treated with RT and HT, and 1 (0.3%) with ChT and HT. The timing of testing in relation to treatment varied across the sample. In 119 cases (38.5%), patients were interviewed during the week following their last treatment (ChT cycle or RT session); in 64 cases (20.7%) there had been between one week and one month since the last treatment; in 44 cases (14.2%) between 3 months and a year; and 82 patients (26.5%) had been off treatment for more than one year.
Table 2. Internal consistency of FACT-G and subscales (Mean ± SD)
Scale | English Version 2 | Spanish Version 2 | Spanish Version 4
PWB (7 items) | 20.5 ± 5.5 | 19.0 ± 5.9 | 15.7 ± 8.9
SFWB (6 items) | – | – | 16.6 ± 5.1
SFWB (7 items)* | 21.9 ± 4.8 | 18.0 ± 5.7 | 18.6 ± 5.9
EWB (5 items) | 14.8 ± 5.0 | 14.2 ± 3.7 | –
EWB (6 items) | – | – | 13.5 ± 6.3
FWB (6 items) | 18.0 ± 6.1 | 12.9 ± 5.6 | 16.0 ± 6.0
Relationship with Doctor (2 items) | 6.9 ± 1.5 | 6.8 ± 1.2 | –
Total FACT-G | 82.1 ± 15.9 | 71.9 ± 15.7 | 61.9 ± 18.4
Total FACT-G (*) | – | – | 63.7 ± 19.0
Differences in FACT-G scores according to criteria groups
[Table values not recovered; surviving group label: 2 – 4]
Mean score differences by patient location
[Table values not recovered]
Differences in FACT-G and subscale scores with tumor stage and effect size
[Table values not recovered; surviving entries: FACT-G Total; 95% confidence intervals 57.5 – 64.0 and 53.0 – 61.1; comparisons Local – Metastatic and Regional – Metastatic]
Correlation between FACT-G, QLI doctor and patient version, POMS-SF, MCSDS-10
[Table values not recovered; surviving row labels: FACT-G; QL-Index doctor version; QLId Daily Living; QL-Index patient version; QLIp Daily Living]
IRT-based analysis of FACT-G Spanish Version 4
[Item measures and fit statistics not recovered; items are listed by subscale.]
Physical Well-Being
I am forced to spend time in bed
I have a lack of energy
Because of my physical condition, I have trouble meeting the needs of my family
I have pain
I am bothered by side effects of treatment
I feel sick
I have nausea
Social/Family Well-Being
I am satisfied with my sex life
I feel close to my friends
I get support from my friends
My family has accepted my illness
I am satisfied with family communication about my illness
I feel close to my partner (or the person who is my main support)
I get emotional support from my family
Emotional Well-Being
I worry that my condition will get worse
I feel sad
I worry about dying
I feel nervous
I am losing hope in the fight against my illness
I am satisfied with how I'm coping with my illness
Functional Well-Being
I am able to work (include work at home)
I am content with the quality of my life right now
I am sleeping well
My work (include work in home) is fulfilling
I am enjoying the things I usually do for fun
I am able to enjoy life
I have accepted my illness
Since most instruments designed to evaluate quality of life have been developed in the United States or in Western Europe, it is necessary to adapt them to be used in other cultural settings. Thus, it is important to produce a culturally equivalent measure that can be used to accurately evaluate different groups of people. The final step in the complex process of cross-cultural adaptation is to validate the instrument through the study of the psychometric properties of the measure.
A validation study of the FACT-G Spanish Version 4 was conducted in a sample of Uruguayan cancer patients. A large variability in the biological and sociodemographic features of the sample was ensured in order to study the general performance of the questionnaire. The frequency of tumor sites closely represents the incidence of solid tumors in the Uruguayan general population.
Reliability analysis showed high internal consistency, as indicated by Cronbach alpha coefficients ranging from .78 to .91. The comparison of these data with the results obtained from the FACT-G English and Spanish Version 2 (Table 2) points to a remarkable improvement in the reliability coefficients of the FACT-G Spanish Version 4 total scale and subscales. Based on these results, it is safe to conclude that the FACT-G Spanish Version 4 shows sufficient internal consistency to justify its use across cultures.
As evidence of construct validity, the FACT-G questionnaire appeared to be capable of discriminating among groups of patients according to their level of performance status, showing differences in the total scale and subscale scores across the criterion groups. The FACT-G total and the Functional Well-being subscale showed the best discriminative ability. Differences among known groups were also observed in the FACT-G total and subscale scores for inpatients vs. outpatients. As expected, the FACT-G scores were higher (better QOL) among outpatients, and differences were statistically significant on the Functional, Social and Emotional Well-being subscales.
Quality of life researchers have been concerned about the clinical significance of measures and have sought practical and comprehensive criteria that clinicians can interpret when research results are conveyed. Clinical status and disease characteristics were considered as clinical anchors. In our study, differences ranging from 5 to 10 points in the overall scale and of approximately 3 points in the physical and functional subscales can be considered clinically relevant, since they can discriminate between patients with locoregional and metastatic disease, a clear-cut criterion commonly used by oncologists. Accordingly, effect size calculations showed moderate values ranging from 0.30 to 0.60. These findings are consistent with those found in a longitudinal study using the FACT-G to assess treatment outcomes in a sample of advanced lung cancer patients.
In CTT, the most common form of construct validation of HRQL measures has been the study of convergent and discriminant validity. We included it in our study, along with the more recent IRT approach, because it provides relevant information on the relationship of the questionnaire with other measures of QOL and related constructs (i.e., performance status or psychological distress), once a priori hypotheses have been made about the magnitude and direction of the correlations. As evidence of convergent validity, moderate but significant correlations were found between the FACT-G and a set of instruments (ECOG PSR, QL-I and POMS-SF) expected a priori to be related to QOL assessments, while no correlation was found between the FACT-G and the MCSDS-10, supporting divergent validity.
An important issue to be considered is the technical equivalence of the FACT-G when used in a sample of cancer patients of a South American country. As mentioned earlier, most patients in the Uruguayan sample preferred the questionnaire to be read out loud by an interviewer instead of filling it out by themselves. This is not a common finding in studies with patients from the United States. This may raise the question of whether the two methods of assessment yield comparable data in each culture. In our study, the internal consistency of the FACT-G, as measured by the Cronbach alpha coefficients for the FACT-G total and subscale scores, did not vary when studied separately for the two groups of patients. In a study of the impact of socio-cultural and clinical factors on Hispanic and African American cancer patients' quality of life, Wan et al also found no significant effect of the mode of administration of the FACT-G on the reporting of overall QOL. Recently, audio-visual computerized assessments of QOL have provided an innovative way of gathering and using self-report data and may be feasible for individuals with limited literacy skills (Hahn et al. unpublished data).
Based on the CTT approach, we may conclude that the Spanish FACT-G Version 4 is a psychometrically sound instrument to assess QOL in the population being studied. However, in the present study, we went on to introduce IRT-based analyses of the data, considering the significant advantages shown by this method when used to evaluate health outcome measures. Despite the long history of CTT, there remain major limitations in some areas that have been summarized by Hambleton. First of all, the CTT-based statistics that describe test performance are sample dependent. IRT is more useful since it provides more robust item statistics that are independent and invariant over sample populations that vary in the trait measured by the test. Another limitation of CTT is that the scores commonly used as a measure of the examinees' ability are test dependent. Potential advantages of using IRT in health outcome assessments are: more comprehensive and accurate evaluation of item characteristics, assessment of group differences in item and scale functioning, evaluation of scales containing items with different response formats, improvement of existing measures, computer adaptive testing (CAT) applications, and evaluation of person fit. Thus, IRT can facilitate the development of new items and scales to improve existing measures. It may draw attention to redundant items or to locations along the trait continuum (in our case, quality of life) where the scale provides little information and needs to be improved.
An IRT analysis was included in the evaluation of FACT-G Spanish Version 4 because it provides information on the reliability of the scale beyond that provided by the CTT approach. A rating scale model (RSM) was used, which assumes that the logit-transformed measures of the item scores within each subscale vary along the latent trait level (quality of life) and are aligned according to how difficult it was for patients to endorse each item (its location), with negative values representing items that are easier to endorse and positive values those that are more difficult. In the present study, item fit statistics confirmed the unidimensionality of each subscale with two exceptions, GS7 and GE2. In the case of GS7 (I am satisfied with my sex life), many patients were reluctant to give information about their sexual life and this may be a cause of inconsistency in their responses. Another reason for discrepancy may be related to some problem in translation or comprehension. In both items (GS7 and GE2) the word "satisfaction" was translated into Spanish as "satisfacción", which implies in Spanish a degree of fulfillment that patients may not be inclined to express when answering such a question. Another possible explanation is that these items are the only ones phrased positively in their respective subscales, while the rest of the items refer to negative conditions for quality of life.
Other IRT models could have been used for the analysis of the data. For instance, the graded response model (GRM), an extension of the two-parameter logistic model, is also appropriate when item responses can be characterized as ordered categorical responses. However, scores obtained using several models were highly correlated, showing that these approaches yield comparable results.
IRT analyses demand large sample sizes in order to obtain stable and invariant item and latent trait estimates. However, several studies using this procedure to assess the psychometric properties of QOL measures relied on rather small samples of patients (range: 100 to 400 patients) [32–37].
Cella and Chang warned of the possible limitations of using IRT methods in the evaluation of health measures, since they were originally developed for and used with a fairly homogeneous educational assessment population. When these methods are applied to more heterogeneous clinical populations, it may be difficult to obtain item-free estimates of sample latent traits. They remark that the context, selection and sequence of questions, considering both item diversity and clinical diversity, may produce sample-dependent item difficulty estimates and therefore unreliable item-dependent estimates of patient ability. The continuous monitoring of item calibrations involved in the process of item banking will help to resolve these uncertainties.
The present study is the first in South America to report results on item functioning for a health-related quality of life measure. Future studies with larger samples of patients could lead to a better understanding of differences in item functioning across different South American countries and cultures and move forward to item banking and CAT technology suitable for developing countries.
We conclude that the FACT-G Spanish Version 4 showed, using classic psychometric and IRT approaches, good reliability and validity and is a valid instrument for setting clinically significant differences in longitudinal studies of cancer treatment. Thus, the FACT-G Spanish language version, as reported here, provides sufficient assurance of equivalence to its original English version to be used in future research on quality of life among South American Spanish-speaking patients.
This study was partially funded by the Comisión Honoraria de Lucha Contra el Cancer, Uruguay. Dr. Dapueto's research at the Center on Outcomes, Research and Education, Evanston Northwestern Healthcare was supported by an International Union Against Cancer ICRETT Fellowship. We gratefully acknowledge Professors Enrique Barrios, M.D., Ricardo Bernardi, M.D., David Cella, Ph.D., and Ignacio Muse, M.D. for their cooperation and support.
- Cella D, Tulsky D, Gray G, Sarafian B, Linn E, Bonomi A, et al.: The Functional Assessment of Cancer Therapy Scale: development and validation of the general measure. J Clin Oncol 1993, 11: 570–579.
- Cella D, Hernandez L, Bonomi AE, Corona M, Vaquero M, Shiomoto G, Baez L: Spanish language translation and initial validation of the functional assessment of cancer therapy quality of life instrument. Med Care 1998, 36: 1407–18. 10.1097/00005650-199809000-00012
- Bonomi AE, Cella DF, Hahn EA, Bjordal K, Sperner-Unterweger B, Gangeri L, Bergman B, Willems-Groot J, Hanquet P, Zittoun R: Multilingual translation of the Functional Assessment of Cancer Therapy (FACT) quality of life measurement system. Qual Life Res 1996, 5: 309–320.
- Winstead-Fry P, Schultz A: Psychometric analysis of the Functional Assessment of Cancer Therapy-General (FACT-G) scale in a rural sample. Cancer 1997, 15: 2446–52. 10.1002/(SICI)1097-0142(19970615)79:12<2446::AID-CNCR23>3.3.CO;2-I
- Yu CL, Fielding R, Chan CL, Tse VK, Choi PH, Lau WH, Choy DT, O SK, Lee AW, Sham JS: Measuring quality of life of Chinese cancer patients: A validation of the Chinese version of the Functional Assessment of Cancer Therapy-General (FACT-G) scale. Cancer 2000, 88: 1715–27. 10.1002/(SICI)1097-0142(20000401)88:7<1715::AID-CNCR28>3.3.CO;2-B
- Dapueto JJ, Francolino C, Gotta I, Levin R, Barrios E, Alonso I, Afonzo Y, Cambiasso S: Evaluation of the Functional Assessment of Cancer Therapy – General Questionnaire (FACT-G) Version 2 in a Spanish Speaking Population. Psycho-Oncology 2001, 10: 88–92.
- Cella D, Chang CH: A discussion of Item Response Theory and its applications in health status assessment. Med Care 2000, Suppl II: 66–72.
- Zubrod CG, Schneiderman M, Frei E, et al.: Appraisal of methods for the study of chemotherapy of cancer in man: comparative therapeutic trial of nitrogen mustard and triethylene thiophosphoramide. J Chron Dis 1960, 11: 7–33.
- Spitzer W, Dobson A, Hall J, Chesterman E, Levi J, et al.: Measuring the quality of life of cancer patients. A concise QL-Index for use by physicians. J Chron Dis 1981, 34: 585–597.
- McNair M, Lorr M, Droppleman L: Manual of the Profile of Mood States. San Diego: EdITS Educational and Industrial Testing Service 1992.
- Crowne D: A new scale of social desirability independent of psychopathology. Journal of Consulting Psychology 1960, 4: 349–354.
- Lara-Cantú M, Suzan-Reed M: La escala de Deseabilidad Social de Marlowe y Crowne: un estudio psicométrico. Salud Mental 1988, 11(3): 25–29.
- DeVellis RF: Scale Development: Theory and Applications. Newbury Park: Sage Publications 1991.
- Guyatt GH, Osoba D, Wu AW, Wyrwich KW, Norman GR: Clinical significance consensus meeting group. Methods to explain the clinical significance of health status measures. Mayo Clin Proc 2002, 77: 371–83.
- Cella D, Eton D, Fairclough DL, Bonomi P, Heyes AE, Silberman C, Wolf MK, Johnson DH: What is a clinically meaningful change on the Functional Assessment of Cancer Therapy – Lung (FACT-L) Questionnaire? Results from Eastern Cooperative Oncology Group (ECOG) Study 5592. J Clin Epidemiol 2002, 55: 285–295. 10.1016/S0895-4356(01)00477-2
- Sobin LH, Wittekind Ch, Eds: TNM Classification of Malignant Tumours. New York: Wiley-Liss Inc 1997.
- Osoba D, Rodrigues G, Myles J, Zee B, Pater J: Interpreting the significance of changes in health-related quality-of-life scores. J Clin Oncol 1998, 16: 139–44.
- Cohen J: Statistical Power Analysis for the Behavioral Sciences. Second Edition. Hillsdale, NJ: Lawrence Erlbaum Associates 1988.
- Andrich D: A binomial latent trait model for the study of Likert-style attitude questionnaires. British Journal of Mathematical and Statistical Psychology 1978, 31: 84–98.
- Andrich D: A rating formulation for ordered response categories. Psychometrika 1978, 43: 561–573.
- Andrich D: Scaling attitude items constructed and scored in the Likert tradition. Educational and Psychological Measurement 1978, 38: 665–680.
- Andrich D: Application of a psychometric rating model to ordered categories which are scored with successive integers. Applied Psychological Measurement 1978, 2: 581–594.
- WINSTEP – MINISTEP Rasch Model Computer Programs. Chicago: John M. Linacre 2002.
- Vasallo JA, Barrios E, De Stefani E, Ronco A: II Atlas de Incidencia del Cáncer en el Uruguay. Montevideo: Comisión Honoraria de Lucha contra el Cáncer 2001.
- Tulsky D: An Introduction to Test Theory. Oncology 1990, 4: 43–48.
- Flaherty JA, Gaviria FM, Pathak D, et al.: Developing instruments for cross-cultural psychiatry research. J Nerv Mental Dis 1988, 176: 257–263.
- Wan G, Counte M, Cella D: The Influence of Personal Expectations on Cancer Patients' Reports of Health Related Quality of Life. Psycho-Oncology 1997, 6: 1–11. 10.1002/(SICI)1099-1611(199703)6:1<1::AID-PON230>3.3.CO;2-3
- Hambleton RK: Emergence of Item Response Modeling in instrument development and data analysis. Med Care 2000, Suppl II: 60–64.
- Hays RD, Morales LS, Reise SP: Item Response Theory and health outcomes measurement in the 21st Century. Med Care 2000, Suppl II: 28–41.
- Samejima F: Graded Response Model. In Handbook of modern item response theory (Edited by: Van der Linden WJ, Hambleton RK). New York: Springer 1997.
- Chang CH, Cella D: One- versus two-parameter item response theory (IRT) measurement models applied to MOS PF-10 scores: How much does it really matter? [abstract]. Qual Life Res 2001, 10: 302.
- Young MA, Blodgett C, Reardon A: Measuring seasonality: psychometric properties of the Seasonal Variation. Psychiatric Res 2003, 117: 75–83. 10.1016/S0165-1781(02)00299-8
- Orlando M, Marshall GN: Differential item functioning in a Spanish translation of the PTSD checklist: detection and evaluation of impact. Psychol Assess 2002, 14: 50–9. 10.1037//1040-35220.127.116.11
- Ryser L, Wright BD, Aeschlimann A, Mariacher-Gehler S, Stuchi G: A new look at the Western Ontario and Mc Master Universities Osteoarthritis Index using Rasch analysis. Arthritis Care Res 1999, 12: 331–5. 10.1002/1529-0131(199910)12:5<331::AID-ART4>3.0.CO;2-W
- Leplege A, Ruede N, Ecosse E, Ceinos R, Dohin E, Pouchot J: Measuring Quality of life from view of HIV-positive subjects the HIV-QL31. Qual Life Res 1997, 6: 585–94. 10.1023/A:1018468301617
- Gibbons RD, Clark DC, Kupfer DJ: Exactly what does the Hamilton Depression Rating Scale measure? J Psychiatr Res 1993, 27: 259–73. 10.1016/0022-3956(93)90037-3
- Chang CH, Gehlert S: The Washington Psychosocial Seizure Inventory (WPSI): psychometric evaluation and future applications. Seizure 2003, 12: 261–267. 10.1016/S1059-1311(02)00275-3
This article is published under license to BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.