Comparison of numerical and verbal rating scales to measure pain exacerbations in patients with chronic cancer pain
© Brunelli et al; licensee BioMed Central Ltd. 2010
Received: 18 December 2009
Accepted: 22 April 2010
Published: 22 April 2010
Numerical rating scales (NRS), and verbal rating scales (VRS) showed to be reliable and valid tools for subjective cancer pain measurement, but no one of them consistently proved to be superior to the other. Aim of the present study is to compare NRS and VRS performance in assessing breakthrough or episodic pain (BP-EP) exacerbations.
In a cross sectional multicentre study carried out on a sample of 240 advanced cancer patients with pain, background pain and BP-EP intensity in the last 24 hours were measured using both a 6-point VRS and a 0-10 NRS. In order to evaluate the reproducibility of the two scales, a subsample of 60 patients was randomly selected and the questionnaire was administered for a second time three to four hours later. The proportion of "inconsistent" (background pain intensity higher than or equal to peak pain intensity) evaluations was calculated to compare the two scales capability in discriminating between background and peak pain intensity and Cohen's K was calculated to compare their reproducibility.
NRS revealed higher discriminatory capability than VRS in distinguishing between background and peak pain intensity with a lower proportion of patients giving inconsistent evaluations (14% vs. 25%). NRS also showed higher reproducibility when measuring pain exacerbations (Cohen's K of 0.86 for NRS vs. 0.53 for VRS) while the reproducibility of the two scales in evaluating background pain was similar (Cohen's K of 0.80 vs. 0.77).
Our results suggest that, in the measurement of cancer pain exacerbations, patients use NRS more appropriately than VRS and as such NRS should be preferred to VRS in this patient's population.
The importance of pain measurement in routine cancer patient assessment and in research is advocated by experts and scientific associations [1–5], and several efforts are being made to raise consensus on international recommendations in the choice of standardized measurement tools specific for cancer pain evaluation [3, 6–8] in both clinical practice and research.
Subjective pain intensity is the most often considered among the dimensions of pain that should be assessed , both in the clinic and in clinical trials. Among several subjective methods for pain intensity measurement, visual analogue scales (VAS), numerical rating scales (NRS), and verbal rating scales (VRS) proved to be reliable and valid, but no one of them consistently showed to be superior to the others [9–19]. The three scales are significantly different as to number of response categories, patient and clinician preference, likelihood of missing data and administration requirements . Research consistently shows that the use of VAS in elderly patients is associated with higher failure of completion rates than the use of NRS, and also that the elderly prefer to use NRS in respect to VAS [12, 20]. Similar difficulties were observed among patients on high doses of opioids . For these reasons VAS can be considered less suitable for pain evaluation in cancer patients, many of which are old and assume opioids. Yet VAS and NRS have shown a better sensitivity to change with respect to VRS  probably due to the usually smaller number of categories in VRS.
For these reasons VAS was not considered in our study, which instead focused on VRS and NRS; both scales are easy to use with most patients and have shown good psychometric properties  but no studies have been conducted to compare them for the evaluation of pain exacerbation.
In developing a new questionnaire for breakthrough or intense episodic pain (BP-EP) evaluation, both an 11-point NRS and a 6-level VRS were included in the questionnaire with the aim of comparing their performance in evaluating pain exacerbations in terms of reproducibility and of discriminatory capability to distinguish pain exacerbations over a background of less severe pain.
This analysis is based on data from 240 patients consecutively enrolled in a cross sectional Italian multicentre study aimed at estimating BP-EP prevalence in a population of advanced cancer patients with pain. The results on prevalence are going to be presented elsewhere. Patients were included if they had a diagnosis of cancer, had cancer-related chronic pain, were at least 18 years of age, and were able to provide written informed consent. Patients were excluded if their pain was exclusively due to a surgical procedure.
The questionnaire for BP-EP evaluation was administered as an interview to the patients by a nurse or a physician; patients were asked to assess their background pain intensity referring to the previous 24 hours and, if they reported to have also episodes of pain exacerbations (both spontaneous or due to volitional or non volitional actions such as movement or cough), they were asked to rate the intensity of their most severe episode during the previous 24. Only for the aims of the present study, the questionnaire for BP-EP evaluation contained a double evaluation both for background pain and for pain intensity exacerbations; one evaluation was performed using a 6-point VRS and patients were asked to rate their pain intensity choosing from the following descriptors: None, Very mild, Mild, Moderate, Severe, Very severe ; the second evaluation was performed by an 11 point NRS and patients were asked to rate their pain on a 0 to 10 scale where 0 indicates "No pain" and 10 "The worst possible pain" . This NRS version was chosen from the BPI  as the most diffused and validated in Italian language, while the 6-level VRS chosen is a widely used instrument validated across 15 languages  which fulfils the requirement of a sufficient number of levels to ensure scale sensitivity . In order to estimate the two scales reproducibility, a randomly selected subsample of 60 patients was administered the questionnaire a second time by a different nurse or physician, three to four hours after the first administration. For the second evaluation the patient was instructed to assess the same 24 hours period already evaluated in the first assessment, excluding the time period between the two administrations.
The sample size of 240 patients was calculated based on the main outcome of the study (prevalence of BP-EP, not reported here). 60 patients were enrolled in the retest phase to ensure a 0.18 precision for the estimates of the reproducibility indexes (where precision indicates the width of the 95% confidence interval). This last calculation was performed in the hypothesis that the reproducibility indexes to be estimated were 0.8 .
The capability of the two scales to discriminate between background pain and pain exacerbations intensities, was measured calculating the proportion of "consistent" and of "inconsistent" evaluations; the evaluation provided by a patient was defined as "consistent" if background pain intensity was lower than peak intensity, otherwise it was defined as "inconsistent" (background pain intensity higher than or equal to peak pain intensity). A higher percentage of inconsistent evaluations on one scale with respect to the other indicates that the former is less adequate for pain exacerbation measurement. The difference between the percentage of inconsistent evaluations obtained through NRS and through VRS, along with its 95% Confidence Interval (95% CI), was estimated to compare the two scales.
Scales reproducibility was evaluated through weighted Kappa (with quadratic weights) and its 95% CI, as a measure of agreement between the first and the second administration of the same scale in the subsample of 60 patients. The strength of the agreement was defined as poor (K < 0.40), moderate (0.41-0.60), substantial (0.61-0.80) and almost perfect (0.81-1.00) .
The study was approved by the ethics committees of each of the 8 participating centers. It was carried out in accordance with the Declaration of Helsinki, and with Italian laws regarding clinical research. All patients provided written informed consent.
Clinical characteristics of the study sample (N = 240)
Setting of visit
Pain therapy outpatients' department
Home palliative care
Oncology day hospital
Day hospital for pain therapy or palliative care
Oncology outpatients' department
Primary cancer site or type
Leukemia and lymphoma
Head and neck
Extent of disease
Background pain characteristics and analgesic therapy, on the whole sample (n = 240).
Pain duration (weeks)
Type of paina
Pain exacerbations in the previous 24 hours
Cause of pain
Other or unknown
Analgesic medication assumed in the previous 24 h
WHO grade 1 (NSAIDsc)
WHO grade 2
WHO grade 3
Comparison of the differences between background and peak pain intensities (Δ) for VRS and NRS on the 158 patients who reported to have had pain flares in the previous 24 hours.
Δ = 0
Δ = 0
TYPE OF PAIN EVALUATED
0.54 - 0.91
0.20 - 0.77
0.61 - 0.91
0.71 - 0.96
This study, comparing NRS and VRS psychometric properties in the assessment of pain exacerbations, reveals a significantly higher discriminatory capability of NRS in distinguishing between background and peak pain intensities referred to the pain experienced in the previous 24 hours; patients gave inconsistent evaluations in 23 cases with NRS (14%) versus 40 cases with VRS (25%). NRS also showed higher reproducibility when measuring pain exacerbations (Cohen's K of 0.86 for NRS vs. 0.53 for VRS) while the reproducibility of the two scales was similar in evaluating background pain (Cohen's K of 0.80 vs. 0.77).
In agreement with previous studies [11, 13, 26] NRS and VRS showed high positive correlation (Spearman's rho of 0.86 and 0.84 respectively for background and peak pain intensity measurements) although the comparison of the two scales revealed a rather high individual variability mainly for patients scoring "moderate" on the VRS (FIG 1 and FIG 2). This fact suggests that assuming a direct correspondence between VRS and NRS scores (as for example: 0 corresponding to "None", 1-4 to "Mild pain", 5-6 to "Moderate pain", and 7-10 to severe pain1 [14, 27–29]), should be interpreted cautiously in clinical practice due to relevant individual discrepancies.
Moreover the wider range of NRS scores at any value of VRS suggests that patients benefit from the greater sensitivity offered by the higher number of response levels possible with NRS. The possibility to increase the number of verbal descriptors in VRS scales has also limitations. A study by Rosier et al.  showed that among 15 adjectives offered to describe their pain, on average patients used only 6 of them, perhaps also because of a difficulty in distinguishing and ordering such high a number of verbal descriptors.
In this experience no data were missing for both scales. This is probably due to the fact that the pain evaluations were not self-completed by patients but administered by a trained nurse or physician, who could properly help patients in understanding questions. Some patients who did not give consent may have had physical or cognitive impairment and this could have contributed to increase the compliance with pain assessment. Although good compliance with the use of NRS is confirmed also in the clinical use, the two scales applicability should be verified in different conditions such as self-administration and repeated use in time.
One limit of the study could be that the two scales have different upper anchor descriptors: "The worst possible pain" for the NRS and "Very severe" for the VRS. The two scales formats have been chosen because they both have undergone specific validation studies in Italian and other languages [23, 31, 32] and fulfill the requirement of a sufficient number of levels to ensure scale sensitivity .
The ability of the patient to report his/her pain assessment over the same 24 hours period 3 to 4 hours after the first administration, could be questionable. This choice is aimed to avoid reproducibility overestimation due to memory effect of the first assessment. Furthermore the potential bias introduced by a 3 to 4 hours interval, should have resulted in an underestimation of reproducibility while the indexes obtained (Cohen's K of 0.80 and 0.77 respectively for baseline NRS and VRS) indicate substantial agreement.
In addition, these results should be considered within the limits of the study methods which required the assessment of previous 24 hours pain in a population of advanced cancer patients with no clinically evident cognitive impairment and in relatively good general conditions (38% of patients were out patients and only 14% were admitted to hospice or home care programs).
Previous studies have already compared various scales for pain measurement and gave different results [13–15, 18, 19, 22, 26, 33–35]. Various factors may have influenced the differences in the results of these studies such as patient's populations (chronic or acute pain, different ages, and different levels of cognitive impairment), types of pain (usual background pain, breakthrough pain), different settings of care (clinical or experimental) and administration methods (self-administration or interview). It s also possible that the lack of agreement on the core properties of the measurement scales and on the analysis methods used to evaluate them, lead to apparently different conclusions depending on the different priority given to various scales properties such as easiness of compilation, validity, sensitivity to change and reliability [11, 15, 26], appropriateness of linearity assumption  or stability of intra-individual assessment .
The data from the literature favoring the use of NRS for pain measurement are based on its intrinsic measurement properties , its cross-cultural validity [29, 37], and its good responsivity properties . Moreover, the high variability of VRS formulations both in the number of response categories and in the labels attached to these categories, support the use of NRS which is applied with more standardized formats (usually 11 levels from 0 to 10) across cultures and languages [3, 30, 39]. The 0-10 NRS has greater sensitivity than the VRS and achieves an adequate level of discrimination . The use of VRS is usually supported by its easy of administration, mainly in some patient's populations [1, 16].
Our results suggest that in the measurement of cancer pain exacerbations, patients use NRS more appropriately than VRS and as such NRS should be preferred to VRS in this patient's population.
Visual Analogue Scale
Numerical Rating Scale
Verbal Rating Scales
Breakthrough or intense Episodic Pain
We thank Emanuela Scarpi, Giovanni Zaninetta, Maria Grazia Rusconi, Patrizia Ferreri, Libero Ciuffreda, Franco Marinangeli and Cecilia Moro for their precious contribution to data collection. The study was sponsored by Dompé SpA, Milan, Italy. Additional analyses were supported by the European Palliative Research Collaborative (EPCRC) through the EU Sixth Framework Programme, contract no 037777 and by a research grant from Fondazione Floriani, Milano.
- Dworkin RH, Turk DC, Farrar JT, Haythornthwaite JA, Jensen MP, Katz NP, Kerns RD, Stucki G, Allen RR, Bellamy N, Carr DB, Chandler J, Cowan P, Dionne R, Galer BS, Hertz S, Jadad AR, Kramer LD, Manning DC, Martin S, McCormick CG, McDermott MP, McGrath P, Quessy S, Rappaport BA, Robbins W, Robinson JP, Rothman M, Royal MA, Simon L, Stauffer JW, Stein W, Tollett J, Wernicke J, Witter J, IMMPACT: Core outcome measures for chronic pain clinical trials: IMMPACT recommendations. Pain 2005,113(1–2):9–19. 10.1016/j.pain.2004.09.012PubMedView ArticleGoogle Scholar
- Turk DC, Dworkin RH, Burke LB, Gershon R, Rothman M, Scott J, Allen RR, Atkinson JH, Chandler J, Cleeland C, Cowan P, Dimitrova R, Dionne R, Farrar JT, Haythornthwaite JA, Hertz S, Jadad AR, Jensen MP, Kellstein D, Kerns RD, Manning DC, Martin S, Max MB, McDermott MP, McGrath P, Moulin DE, Nurmikko T, Quessy S, Raja S, Rappaport BA, Rauschkolb C, Robinson JP, Royal MA, Simon L, Stauffer JW, Stucki G, Tollett J, von Stein T, Wallace MS, Wernicke J, White RE, Williams AC, Witter J, Wyrwich KW, Initiative on Methods Measurement and Pain Assessment in Clinical Trials: Developing patient-reported outcome measures for pain clinical trials: IMMPACT recommendations. Pain 2006,125(3):208–215. 10.1016/j.pain.2006.09.028PubMedView ArticleGoogle Scholar
- Caraceni A, Cherny N, Fainsinger R, Kaasa S, Poulain P, Radbruch L, De Conno F: Pain measurement tools and methods in clinical research in palliative care: recommendations of an Expert Working Group of the European Association of Palliative Care. J Pain Symptom Manage 2002,23(3):239–255. 10.1016/S0885-3924(01)00409-2PubMedView ArticleGoogle Scholar
- Garcia SF, Cella D, Clauser SB, Flynn KE, Lad T, Lai JS, Reeve BB, Smith AW, Stone AA, Weinfurt K: Standardizing patient-reported outcomes assessment in cancer clinical trials: a patient-reported outcomes measurement information system initiative. J Clin Oncol 2007,25(32):5106–5112. 10.1200/JCO.2007.12.2341PubMedView ArticleGoogle Scholar
- Gordon DB, Dahl JL, Miaskowski C, McCarberg B, Todd KH, Paice JA, Lipman AG, Bookbinder M, Sanders SH, Turk DC, Carr DB: American pain society recommendations for improving the quality of acute and cancer pain management: American Pain Society Quality of Care Task Force. Arch Intern Med 2005,165(14):1574–1580. 10.1001/archinte.165.14.1574PubMedView ArticleGoogle Scholar
- Kaasa S, Loge JH, Fayers P, Caraceni A, Strasser F, Hjermstad MJ, Higginson I, Radbruch L, Haugen DF: Symptom assessment in palliative care: a need for international collaboration. J Clin Oncol 2008,26(23):3867–3873. 10.1200/JCO.2007.15.8881PubMedView ArticleGoogle Scholar
- Hjermstad MJ, Gibbins J, Haugen DF, Caraceni A, Loge JH, Kaasa S, EPCRC, European Palliative Care Research Collaborative: Pain assessment tools in palliative care: an urgent need for consensus. Palliat Med 2008,22(8):895–903. 10.1177/0269216308095701PubMedView ArticleGoogle Scholar
- Holen JC, Hjermstad MJ, Loge JH, Fayers PM, Caraceni A, De Conno F, Forbes K, Furst CJ, Radbruch L, Kaasa S: Pain assessment tools: is the content appropriate for use in palliative care? J Pain Symptom Manage 2006,32(6):567–580. 10.1016/j.jpainsymman.2006.05.025PubMedView ArticleGoogle Scholar
- Ohnhaus EE, Adler R: Methodological problems in the measurement of pain: a comparison between the verbal rating scale and the visual analogue scale. Pain 1975,1(4):379–384. 10.1016/0304-3959(75)90075-5PubMedView ArticleGoogle Scholar
- Kremer E, Atkinson JH, Ignelzi RJ: Measurement of pain: patient preference does not confound pain measurement. Pain 1981,10(2):241–248. 10.1016/0304-3959(81)90199-8PubMedView ArticleGoogle Scholar
- Jensen MP, Karoly P, Braver S: The measurement of clinical pain intensity: a comparison of six methods. Pain 1986,27(1):117–126. 10.1016/0304-3959(86)90228-9PubMedView ArticleGoogle Scholar
- Jensen MP, Karoly P: Self-report scales and procedures for assessing pain in adults. Handbook of pain assessment 2001, 2: 15–34.Google Scholar
- De Conno F, Caraceni A, Gamba A, Mariani L, Abbattista A, Brunelli C, La Mura A, Ventafridda V: Pain measurement in cancer patients: a comparison of six methods. Pain 1994,57(2):161–166. 10.1016/0304-3959(94)90219-4PubMedView ArticleGoogle Scholar
- Briggs M, Closs JS: A descriptive study of the use of visual analogue scales and verbal rating scales for the assessment of postoperative pain in orthopedic patients. J Pain Symptom Manage 1999,18(6):438–446. 10.1016/S0885-3924(99)00092-5PubMedView ArticleGoogle Scholar
- Breivik EK, Bjornsson GA, Skovlund E: A comparison of pain rating scales by sampling from clinical trial data. Clin J Pain 2000,16(1):22–28. 10.1097/00002508-200003000-00005PubMedView ArticleGoogle Scholar
- Radbruch L, Sabatowski R, Loick G, Jonen-Thielemann I, Kasper M, Gondek B, Lehmann KA, Thielemann I: Cognitive impairment and its influence on pain and symptom assessment in a palliative care unit: development of a Minimal Documentation System. Palliat Med 2000,14(4):266–276. 10.1191/026921600672986600PubMedView ArticleGoogle Scholar
- Lara-Munoz C, De Leon SP, Feinstein AR, Puente A, Wells CK: Comparison of three rating scales for measuring subjective phenomena in clinical research. I. Use of experimentally controlled auditory stimuli. Arch Med Res 2004,35(1):43–48. 10.1016/j.arcmed.2003.07.007PubMedView ArticleGoogle Scholar
- Hartrick CT, Kovan JP, Shapiro S: The numeric rating scale for clinical pain measurement: a ratio measure? Pain Pract 2003,3(4):310–316. 10.1111/j.1530-7085.2003.03034.xPubMedView ArticleGoogle Scholar
- Lund I, Lundeberg T, Sandberg L, Budh CN, Kowalski J, Svensson E: Lack of interchangeability between visual analogue and verbal rating pain scales: a cross sectional description of pain etiology groups. BMC Med Res Methodol 2005, 5: 31. 10.1186/1471-2288-5-31PubMed CentralPubMedView ArticleGoogle Scholar
- Gagliese L: Assessment of pain in elderly people. Handbook of pain assessment 2001, 7: 119–133.Google Scholar
- Walsh D: Practical problems in pain measurements. Pain 1984,19(1):96–98. 10.1016/0304-3959(84)90070-8View ArticleGoogle Scholar
- Jensen MP, Turner JA, Romano JM: What is the maximum number of levels needed in pain intensity measurement? Pain 1994,58(3):387–392. 10.1016/0304-3959(94)90133-3PubMedView ArticleGoogle Scholar
- Caraceni A, Mendoza TR, Mencaglia E, Baratella C, Edwards K, Forjaz MJ, Martini C, Serlin RC, de Conno F, Cleeland CS: A validation study of an Italian version of the Brief Pain Inventory (Breve Questionario per la Valutazione del Dolore). Pain 1996,65(1):87–92. 10.1016/0304-3959(95)00156-5PubMedView ArticleGoogle Scholar
- Bonett DG: Sample size requirements for estimating intraclass correlations with desired precision. Stat Med 2002,21(9):1331–1335. 10.1002/sim.1108PubMedView ArticleGoogle Scholar
- Landis JR, Koch GG: The measurement of observer agreement for categorical data. Biometrics 1977,33(1):159–174. 10.2307/2529310PubMedView ArticleGoogle Scholar
- Peters ML, Patijn J, Lame I: Pain assessment in younger and older pain patients: psychometric properties and patient preference of five commonly used measures of pain intensity. Pain Med 2007,8(7):601–610. 10.1111/j.1526-4637.2007.00311.xPubMedView ArticleGoogle Scholar
- Collins SL, Moore RA, McQuay HJ: The visual analogue pain intensity scale: what is moderate pain in millimetres? Pain 1997,72(1–2):95–97. 10.1016/S0304-3959(97)00005-5PubMedView ArticleGoogle Scholar
- Paul SM, Zelman DC, Smith M, Miaskowski C: Categorizing the severity of cancer pain: further exploration of the establishment of cutpoints. Pain 2005,113(1–2):37–44. 10.1016/j.pain.2004.09.014PubMedView ArticleGoogle Scholar
- Serlin RC, Mendoza TR, Nakamura Y, Edwards KR, Cleeland CS: When is cancer pain mild, moderate or severe? Grading pain severity by its interference with function. Pain 1995,61(2):277–284. 10.1016/0304-3959(94)00178-HPubMedView ArticleGoogle Scholar
- Rosier EM, Iadarola MJ, Coghill RC: Reproducibility of pain measurement and pain perception. Pain 2002,98(1–2):205–216. 10.1016/S0304-3959(02)00048-9PubMedView ArticleGoogle Scholar
- Cleeland CS, Ryan KM: Pain assessment: global use of the Brief Pain Inventory. Ann Acad Med Singapore 1994,23(2):129–138.PubMedGoogle Scholar
- Jensen MP: The validity and reliability of pain measures in adults with cancer. J Pain 2003,4(1):2–21. 10.1054/jpai.2003.1PubMedView ArticleGoogle Scholar
- Caraceni A, Galbiati A, Brunelli C, Gorni G, Martini C, Zecca E, De Conno F: Cancer patient compliance in the self-administration of a pain assessment tool. J Pain Symptom Manage 2004,27(5):417–424. 10.1016/j.jpainsymman.2004.01.002PubMedView ArticleGoogle Scholar
- Herr KA, Spratt K, Mobily PR, Richardson G: Pain intensity assessment in older adults: use of experimental pain to compare psychometric properties and usability of selected pain scales with younger adults. Clin J Pain 2004,20(4):207–219. 10.1097/00002508-200407000-00002PubMedView ArticleGoogle Scholar
- Ponce de Leon S, Lara-Munoz C, Feinstein AR, Wells CK: A comparison of three rating scales for measuring subjective phenomena in clinical research. II. Use of experimentally controlled visual stimuli. Arch Med Res 2004,35(2):157–162.PubMedView ArticleGoogle Scholar
- Price DD, Bush FM, Long S, Harkins SW: A comparison of pain measurement characteristics of mechanical visual analogue and simple numerical rating scales. Pain 1994,56(2):217–226. 10.1016/0304-3959(94)90097-3PubMedView ArticleGoogle Scholar
- Zelman DC, Dukes E, Brandenburg N, Bostrom A, Gore M: Identification of cut-points for mild, moderate and severe pain due to diabetic peripheral neuropathy. Pain 2005,115(1–2):29–36. 10.1016/j.pain.2005.01.028PubMedView ArticleGoogle Scholar
- Cepeda MS, Africano JM, Polo R, Alcala R, Carr DB: Agreement between percentage pain reductions calculated from numeric rating scores of pain intensity and those reported by patients with acute or cancer pain. Pain 2003,106(3):439–442. 10.1016/j.pain.2003.09.006PubMedView ArticleGoogle Scholar
- Gracely RH, McGrath F, Dubner R: Ratio scales of sensory and affective verbal pain descriptors. Pain 1978,5(1):5–18. 10.1016/0304-3959(78)90020-9PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.