- Open Access
Re-evaluation of the definition of remission on the 17-item Hamilton Depression Rating Scale based on recovery in health-related quality of life in an observational post-marketing study
Health and Quality of Life Outcomesvolume 16, Article number: 14 (2018)
Although a score of less than 7 for the 17-item Hamilton Depression Rating Scale (HAM-D17) has been widely adopted to define remission of depression, a full recovery from depression is closely related to the patient’s quality of life as well. Accordingly, we re-evaluated this definition of remission using HAM-D17 in comparison with the corresponding score for health-related quality of life (HRQOL) measured by the SF-36.
Using the data for depressive patients reported by GlaxoSmithKline K.K. (Study No. BRL29060A/863) in a post-marketing observational study of paroxetine, with a sample size of n = 722, multivariate logistic regression was performed with the HAM-D17 score as a dependent variable and with each of the eight domain scores of HRQOL (from the SF-36) transformed into a binominal form according to the national standard value for Japan. Then, area under curve of receiver operating characteristic analyses were conducted. Based on the obtained results, a multivariate analysis was performed using the HAM-D17 score in a binomial form with HAM-D17 as a dependent variable and with each of the eight HRQOL domain scores (SF-36) as binominalized independent variables.
A cutoff value for the HAM-D17 score of 5 provided the maximum ROC-AUC at “0.864.” The significantly associated scores of the eight HRQOL domains (SF-36) were identified for the HAM-D17 cutoff values of ≥5 and ≤4. The scores for physical functioning (odds ratio, 0.473), bodily pain (0.557), vitality (0.379), social functioning (0.540), role-emotion (0.265), and mental health (0.467) had a significant negative association with the HAM-D17 score (p < 0.05), and HRQOL domain scores for HAM-D17 ≥ 5 were significantly lower compared with those for HAM-D17 ≤ 4.
A cutoff value for HAM-D17 of less than or equal to 4 was the best candidate for indicating remission of depression when the recovery of HRQOL is considered. Restoration of social function and performance should be considered equally important in assessing the adequacy of treatment for patients with depression.
Guidelines for the treatment of major depressive disorders recommend that the primary therapeutic goal is to achieve remission in the acute phase of therapy [1, 2]. Traditionally, for the assessment of depression, the 17-item Hamilton Depression Rating Scale (HAM-D17) has been considered the gold standard [3, 4] with remission of depression indicated by a score of ≤7 [5,6,7]. However, depressive disorders also impair health-related quality of life (HRQOL) [8, 9]. Practically, it is widely recognized that both remission of depression symptoms and recovery of HRQOL are important in clinical treatment and research .
GlaxoSmithKline K.K. conducted a post-marketing clinical study to evaluate HRQOL using the Medical Outcome Study 36-Item Short-Form Health Survey scale (SF-36) in Japanese patients with depression treated with paroxetine. Overall, 12 weeks of treatment with paroxetine (20–40 mg/d) yield a significant improvement in both depression symptoms (HAM-D17) and HRQOL (SF-36). An investigation of the relationship between HAM-D17 scores and SF-36 scores was performed, and patients with higher degrees of improvement in depression symptoms tended to show higher degrees of improvement in HRQOL, as reported in an article related to the present study . In the two SF-36 mental health dimensions of social functioning (SF) and role-emotional (RE), obtained scores were close to the level of the Japanese national norm when patients achieved remission of depression symptoms , which suggests a strong association between the remission of depression symptoms and recovery in HRQOL.
A HAM-D17 score of ≤7 was used to define remission of depression symptoms in the above post-marketing clinical study. While a HAM-D17 score of ≤7 has been widely adopted in clinical trials and research, little data support this threshold score from the perspective of HRQOL. Therefore, it is important to confirm the validity of this threshold score for remission of depression symptoms with respect to HRQOL for alignment with the primary therapeutic goal in the acute phase of therapy.
In this study, we investigated which symptom severity (HAM-D17) score was equivalent to recovery in HRQOL (SF-36 score) as indicators of remission. Through the analysis of raw data obtained from a post-marketing clinical study, we re-evaluated the definition of remission of depression to examine the validity and consistency of the existing definition (i.e., HAM-D17 score of ≤7) with respect to HRQOL.
Our specific objectives were as follows: (1) to investigate this definition of remission by comparison with the recovery in HRQOL for validity and consistency, (2) to examine the association of HRQOL domain scores (SF-36) with the severity of depression symptoms (HAM-D17 scores), and (3) to compare the depression symptoms (HAM-D 17 scores) of remitters (responders) and non-remitters (non-responders) using the recalculated cutoff score obtained in the present trial.
Data source and data management
Data sources for analysis
Raw data were obtained from a post-marketing clinical study of paroxetine in patients with depression or those experiencing depressive episodes, which included an evaluation of improvement by paroxetine of HRQOL (Study No. BRL29060A/863) [11, 13].
Raw data were obtained as anonymized patient data from GlaxoSmithKline K.K. Data were used and managed in compliance with a “Datasharing Agreement” with and authorized by GlaxoSmithKline K.K.
This study is a reanalysis of the raw data from a post-marketing clinical study of paroxetine in patients with depression or those experiencing depressive episodes, which included an evaluation of improvement by paroxetine of HRQOL (Study No. BRL29060A/863).
Demographic and clinical characteristics of patients
Height, weight, age, gender, onset of first depression or depressive episode, duration of the current episode, number of depressive episodes, depression diagnosis , past medical history, concurrent illness, treatment history, concurrent use of drugs, concurrent use of non-drug treatment (i.e., electroconvulsive therapy and cognitive behavioral therapy), comorbid anxiety disorders assessed by the Mini International Neuropsychiatric Interview (panic disorder, social anxiety disorder, obsessive-compulsive disorder, generalized anxiety disorder, posttraumatic stress disorder), dosing, and duration of paroxetine were evaluated.
Total score on HAM-D17, score on each item of HAM-D17, state score and trait score on the State-Trait Anxiety Inventory (STAI) .
The physical component summary (PCS) score on the SF-36 physical health dimension is composed of physical functioning (PF), role physical (RP), bodily pain (BP), and general health perceptions (GH).
The mental component summary (MCS) score on the SF-36 mental health dimension is composed of vitality (VT), SF, RE, and mental health (MH).
1) The scatter plot of each individual patient’s pairwise co-ordinates between HAM-D17 score and score on each HRQOL (SF-36) dimension was drawn. Then, the ability of HAM-D17 to identify patients who are in remission according to the recovery in HRQOL across the range of HAM-D17 cutoff scores by conducting receiver operating characteristic (ROC) analyses was examined. The HAM-D17 score showing the maximal level of agreement with the recovery in HRQOL was determined as the cutoff score of HAM-D17 for remission.
2) The cutoff value of HAM-D17 providing the maximal ROC-AUC was converted to a binominal form of “0 or 1.” A logistic multivariate analysis with HAM-D17 as the dependent variable and each of the eight HRQOL domains (SF-36) as binominalized independent variables was conducted to identify the relationship between them being associated with remitters (responders) or non-remitters (non-responders) defined using the HAM-D17 scores.
3) Mann-Whitney U tests were conducted to compare the domain subscores of HRQOL (SF-36) (PF, RP, BP, GH, VT, SF, RE, MH, PCS, and MCS) and STAI (the State Anxiety Scale and the Trait Anxiety Scale) between remitters (responders) or non-remitters (non-responders) using the recalculated cutoff score obtained in the present trial.
Patients treated with paroxetine (maximum dose of 20–40 mg/d) were assessed over 12 weeks (0, 4, 8, and 12 weeks, where “0 week” means the subject dropped from the study before week 4 for any reason). The sample number (n = 722) for analysis included assessments at all time points for participating patients (n = 217). The scatter plots between HAM-D17 and PCS and MCS are shown in Figs. 1 and 2. For the sample of n = 722, a multivariate logistic regression was performed with the score of HAM-D17 as a dependent variable and each of the eight domain scores of HRQOL (SF-36) as independent variables (converted into binominal form as “0 or 1” according to the national standard value for Japan). By assigning the different integers of the HAM-D17 scores as cutoff values, ROC analyses were conducted and area under curve of ROC (ROC-AUC) was calculated (Table 1). A cutoff value of 5 for HAM-D17 provided the maximum ROC-AUC as “0.864.” After converting this cutoff score of HAM-D17 into the binominal form of “0 or 1” for HAM-D17 of ≥5 and ≤4, a multivariate logistic regression analysis with HAM-D17 as the dependent variable and each of the eight HRQOL domain scores (SF-36) as binominalized independent variables was performed. Then, the significantly associated scores of the eight HRQOL domains (SF-36) were identified (Table 1). Specifically, the six subscores of PF (odds ratio, 0.473), BP (0.557), VT (0.379), SF (0.540), RE (0.265), and MH (0.467) were negatively associated with HAM-D17. The Spearman’s ρ of the HRQOL subscores with HAM-D17 are presented in Table 2. All HRQOL subscores were negatively correlated with the HAM-D17 scores. On performing Mann-Whitney U tests, it was observed that all respective subscores of HRQOL at HAM-D17 ≤ 4 are significantly higher than those at HAM-D17 ≥ 5, and State/Trait-Anxiety of STAI for HAM-D17 ≤ 4 were significantly lower than those for HAM-D17 ≥ 5 (Tables 3 and 4).
As is well known, severe depression often causes severe impairment of quality of life in patients with depression. In the present study, a cutoff value of 4 or lower (≤4) on HAM-D17 provided the maximal ROC-AUC or maximal level of agreement with HRQOL recovery. At this cutoff value, six subscores of HRQOL (SF-36), namely PF and BP (in the physical health dimension) and VT, SF, RE, and MH (in the mental health dimension), were identified with significantly negative association with the HAM-D17 scores around 4–5 points. This result was consistent with an existing report, which states that SF and RE scores return to close to the level of the Japanese national norm in Japanese patients with depression who are remitters . The significant relation of BP with HAM-D17 was understandable because depression with pain often harms a patient’s QOL, and alleviation of pain is one of primary symptoms to be ameliorated in patients with depression.
Since 1991, a score of ≤7 has been a consensus in previous reports on the HAM-D17 cutoffs for remission . The American College of Neuropsychopharmacology (ACNP) recommended a score of ≤7 or ≤5 [7, 20]. On the contrary, Zimmerman et al. reported that the cutoff score of 7 on HAM-D17 could be high [21,22,23], and others supported similar results [24, 25]. Zimmerman also reported that remitters (HAM-D17 ≤ 7) had heterogeneity in their symptoms and that the lower cutoff could be more suitable for remission if patients’ social functionality was considered [21,22,23]. Similar results were observed in the Montgomery Åsberg Depression Rating Scale (MADRS) , where patients with depression having lower scores (≤4 vs 5–9) appeared to have better global functioning . Romera et al. proposed a similar suggestion based on the Social and Occupational Functioning Assessment Scale [28, 29]. A similar result was reported for functionality . Furukawa et al. reported that this gap was due to the fact that functional recovery occurs after clinical recovery . In addition, for the two scores of STAI, the differences based on the cutoff scores of HAM-D17 (≥5 and ≤4) indicated recovery from anxiety at this level as well. The present study investigated the relationship between depression and HRQOL which may have contributed as a reference for psychotherapeutic interventions. However, it revealed the need of further studies in this field.
Given this literature, the identification of a cutoff score of ≤4 in our study is consistent. A 3-point gap between the re-estimated cutoff score of ≤4 and the traditional cutoff score of ≤7 was observed. This might imply that the recovery of HRQOL lags behind that of depressive symptoms (HAM-D17), and that additional time is required for patients’ SF and QOL to reflect their HAM-D17 score. Moreover, the assessment of patients’ functional status is vital for precise estimation of QOL. In this regard, a “cutoff value of ≤4 for HAM-D17” is not inconsistent with this suggestion and also supports the propositions of Zimmerman and others.
Limitations in our approach should be noted. First, the analysis of the present study was performed retrospectively based on a specific population with clear limitations; however, the intent-to-treat method was used. Data might be biased because of our post hoc approach. In addition, the interpretation of the results of this study could not be applied to other conditions, countries, species, prescription, dietary, climate, or culture either simply or directly. These factors should be considered in the context of the respective treatment. We expect that these limitations will be remedied in a future study.
Second, the clinical course of patients with depression could have been influenced by the prescription of paroxetine. The data for the clinical course for assessment were not obtained in a series of patients undergoing a natural course of depression without drug (drug-naïve patients with depression). At the very least, a prospective observational study under prescription is needed.
Third, the sample (n = 722) in this study did not comprise completely independent data, because there are several observations for most patients (n = 217). The lack of independence of the sample data set might affect the value of the cutoff score for HAM-D17. However, randomly obtained data are considered less meaningful in determining the turning point in the clinical course of depression. Thus, this trade-off might be necessary.
Fourth, for the results of the logistic regression of HRQOL, the level of reproducibility for the selection of the six subscores of PF, BP, VT, SF, RE, and MH that were negatively correlated with HAM-D17 is unclear. In addition, to identify specifically associated symptoms with the HAM-D17 scores around 4–5 points (HAM-D17 cutoff values of ≥5 or ≤4), six symptoms cover almost the entire score of the HRQOL. Focusing on specific domain scores of QOL might not always provide consistent information.
Fifth, the entire data set is dependent on Japanese characteristics, which could be heterogeneous and could also introduce unexpected biases or tendencies in the results. Specific geographic locality and population are necessary tradeoffs in research. Therefore, any generalization from these results should be used with caution.
Additional investigations using more rigorous methodology in future studies to confirm these findings will be necessary.
The cutoff value for HAM-D17 score of less than or equal to 4 (≤4) was one of the candidates for the remission of depression if HRQOL recovery is considered. The recovery of social function and performance are equally important for patients with depression for the remission of depressive symptoms.
American College of Neuropsychopharmacology
General health perceptions
Hamilton Depression Rating Scale
17-item Hamilton Depression Rating Scale
Health-related quality of life
Montgomery Åsberg Depression Rating Scale
Mental component summary
Physical component summary
Receiver operating characteristic
Area under curve of receiver operating characteristic
36-Item Short-Form Health Survey scale
the Social and Occupational Functioning Assessment Scale
the State Anxiety Scale
the State-Trait Anxiety Inventory
the Trait Anxiety Scale
Work group on major depressive disorder. Practice guideline for the treatment of patients with major depressive disorder. 3rd ed. Washington D.C.: American Psychiatric Association; 2010. p. 15–22.
Bauer M, Bschor T, Pfennig A, Whybrow PC, Angst J, Versiani M, et al. World Federation of Societies of biological psychiatry (WFSBP) guidelines for biological treatment of unipolar depressive disorders in primary care. World J Biol Psychiatry. 2007;8:67–104.
Hamilton MA. Rating scale for depression. J Neurol Neurosurg Psychiatry. 1960;23:56–62.
Hamilton M. Development of a rating scale for primary depressive illness. Br J Soc Clin Psychol. 1967;6:278–96.
Bech P, Engelhardt N, Evans KR, Gibertini M, Kalali AH, Kobak KA, et al. Why the Hamilton depression rating scale endures. Am J Psychiatry. 2005;162:2396.
Carroll BJ. Why the Hamilton depression rating scale endures. Am J Psychiatry. 2005;162:2395–6.
Moller HJ. Outcomes in major depressive disorder: the evolving concept of remission and its implications for treatment. World J Biol Psychiatry. 2008;9:102–14.
Wells KH, Stewart A, Hays RD, Burnam MA, Rogers W, Daniels M, et al. The functioning and well-being of depressed patients. Results from the medical outcomes study. JAMA. 1989;262:914–9.
Hays RD, Wells KB, Sherbourne CD, Rogers W, Spritzer K. Functioning and well-being outcomes of patients with depression compared with chronic general medical illnesses. Arch Gen Psychiatry. 1995;52:11–9.
Weissman MM, Olfson M, Gameroff MJ, Feder A, Fuentes MA. Comparison of three scales for assessing social functioning in primary care. Am J Psychiatry. 2001;158:460–6.
Aoba A, Motoyama H, Inoshiri S, Uechi Y, Nagata T. Evaluation of health related quality of life (HRQOL) in patients with depression or depressive episodes treated with paroxetine hydrochloride hydrate: second report – what factors contributed to improvement of HRQOL? Jpn J Clin Psychopharmacol. 2005;8:1063–75. (in Japanese)
Ishigooka J. Current status and issues of pharmacotherapy with SSRIs for depression. Jpn. J Clin Psychopharmacol. 2006;9:1407–23.
Aoba A, Motoyama H, Kageyama S, Inoshiri S, Uechi Y, Nagata T. Evaluation of health related quality of life in patients with depression or depressive episodes treated with paroxetine hydrochloride hydrate – an open-label multi-center clinical study of paroxetine hydrochloride hydrate. Jpn J Clin Psychopharmacol. 2004;7:1169–92. (in Japanese)
American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 4th ed., text revision. New York: American Psychiatric Association; 2000.
Spielberger CD, Gorsuch RL, Lushene R, Vagg PR, Jacobs GA. Manual for the state-trait anxiety inventory. Palo Alto, CA: Consulting Psychologists Press; 1983.
Inc SPSS. SPSS for windows version 20. Armonk: SPSS Inc.; 2011.
Stata Corporation: Stata User’s Guide, Release 13.0. Stata Corporation., College Station, Texas, 2013.
Corporation M. Microsoft Excel 2003. Redmond, Washington: Microsoft Corp; 2003.
Frank E, Prien RF, Jarret RB, Keller MB, Kupfer DJ, Lavori PW, et al. Conceptualization and rationale for consensus definitions of terms in major depressive disorder. Remission, recovery, relapse, and recurrence. Arch Gen Psychiatry. 1991;48:851–5.
Rush AJ, Kraemer HC, Sackeim HA, Fava M, Trivedi MH, Frank E, et al. ACNP task force. Report by the ACNP task force on response and remission in major depressive disorder. Neuropsychopharmacology. 2006;31:1841–53.
Zimmerman M, Posternak MA, Chelminski II. The cutoff to define remission on the Hamilton rating scale for depression too high? J Nerv Ment Dis. 2005;193:170–5.
Zimmerman M, McGlinchey JB, Posternak MA, Friedman MD, Boerescu D, Attitullah N. Discordance between self-reported symptom severity and psychosocial functioning ratings in depressed outpatients: implications for how remission from depression should be defined. Psychiatry Res. 2006;141:185–91.
Zimmerman M, McGlinchey JB, Posternak MA, Friedman MD, Boerescu D, Attitullah N. Remission in depressed outpatients: more than just symptom resolution? J Psychiatr Res. 2008;42:797–801.
Riso LP, Thase ME, Howland RH, Friedman ES, Simons AD, Tu XM. A prospective test of criteria for response, remission, relapse, recovery, and recurrence in depressed patients treated with cognitive behavior therapy. J Affect Disord. 1997;43:131–42.
Jud LL, Paulus MJ, Schettler PJ, Akiskal HS, Endicott J, Leon AC, et al. Dose incomplete recovery from first lifetime major depressive episode herald a chronic course of illness? Am J Psychiatry. 1997;157:1501–4.
Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Brit J Psy. 1979;134:382–9.
Zimmerman M, Posternak MA, Chelminski I. Defining remission on the Montgomery Åsberg depression rating scale. J Clin Psychiatry. 2004;65:163–8.
Goldman HH, Skodol AE, Lave TR, Revision Axis V. For DSM-IV: a review of measures of social functioning. Am J Psychiatry. 1992;149:1148–56.
Romera I, Pérez V, Menchón JM, Polavieja P, Gilaberte I. Optimal cutoff point of the Hamilton rating scale for depression according to normal levels of social and occupational functioning. Psychiatry Res. 2011;186:133–7.
Mintz J, Mintz LI, Arruda MJ, Hwang SS. Treatments of depression and the functional capacity to work. Arch Gen Psychiatry. 1992;49:761–768 (Erratum published in Arch Gen Psychiatry. 1993;50:241.).
Furukawa TA, Takeuchi H, Hiroe T, Mashiko K, Kamei K, Kitamura T, et al. Symptomatic recovery and social functioning in major depression. Acta Psychiatr Scand. 2001;103:257–61.
The authors thank GlaxoSmithKline K.K. for providing information on the paroxetine study (BRL29060A/863). We also thank all principal investigators that participated in this original study (BRL29060A/863) for permission to use information from it.
No funding to declare.
Availability of data and materials
The data supporting the findings of this study are available upon request with GlaxoSmithKline K.K. (GSK), but some restrictions apply regarding the availability of these data. However, the data are available from the authors upon reasonable request and with permission from GSK.
Ethics approval and consent to participate
The original data were collected from a clinical study of paroxetine in patients with depression or those experiencing depressive episodes, which included an evaluation of improvement by paroxetine of HRQOL (Study No. BRL29060A/863). In that study, written informed consent was obtained from all participants upon trial enrollment. Raw data were obtained as anonymized patient data from GlaxoSmithKline K.K. (GSK), which was used and managed in compliance with a “Datasharing Agreement” with GSK. The present study was approved by the ethics committee of Tokyo Women’s Medical University prior to the start of this study.
Consent for publication
JI has received grants and/or speaker’s honoraria from Otsuka Pharmaceutical; Sumitomo Dainippon Pharma Co., Ltd.; Shionogi & Co., Ltd.; Takeda Pharmaceutical Company Limited; Eli Lilly Japan K.K.; Novartis Pharma K.K.; Pfizer Japan Inc.; MSD K.K. a subsidiary of Merck & Co., Inc.; Kenilworth, N.J., U.S.A.; and Eisai Co., Ltd. within the past 3 years.
KN has received grants and/or speaker’s honoraria from Eli Lilly Japan K.K.; Astellas Pharma Inc.; Meiji Seika Pharma Co., Ltd.; Janssen Pharmaceutical K.K.; MSD K.K. a subsidiary of Merck & Co., Inc.; Kenilworth, N.J., U.S.A.; Mochida Pharmaceutical Co., Ltd.; Yoshitomiyakuhin Corporation; Otsuka Pharmaceutical; Eisai Co., Ltd.; Pfizer Japan Inc.; Novartis Pharma K.K.; Takeda Pharmaceutical Company Limited.; Mitsubishi Tanabe Pharma; Shionogi & Co., Ltd.; Dainippon Sumitomo Pharma Co., Ltd.; Torii Pharmaceutical Co., Ltd.; Daiichi Sankyo Company, Ltd., Tsumura & Co.; and Mebix, Inc. within the past 3 years.
JS has none to declare. There was no specific funding for this study.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.