Sensitivity as outcome measure of androgen replacement: the AMS scale
Health and Quality of Life Outcomes volume 4, Article number: 23 (2006)
The capacity of the AMS scale as clinical utility and as outcome measure still needs validation.
An open post-marketing study was performed by office-based physicians in Germany in 2004. We analysed data of 1670 androgen-deficient males who were treated with testosterone gel. The AMS scale was applied prior to and after 3 months treatment.
The improvement of complaints under treatment relative to the baseline score was 30.7% (total score), 27.3% (psychological domain), 30.5% (somatic domain), and 30.7% (sexual domain), respectively. Patients with little or no symptoms before therapy improved by 9%, those with mild complaints at entry by 24%, with moderate by 32%, and with severe symptoms by 39% – compared with the baseline score. We showed that the distribution of complaints of testosterone deficient men before therapy almost returned to norm values after 12 weeks of testosterone treatment. Age, BMI, and total testosterone level at baseline did not modify the positive effect of androgen therapy. We also demonstrated that the AMS results can predict the independent (physician's) opinion about the individual treatment effect. Both, sensitivity (correct prediction of a positive assessment by the physician) and specificity (correct prediction of a negative assessment by the physician) were over 70%, if about 22% improvement of the AMS total score was used as cut-off point.
The AMS scale showed a convincing ability to measure treatment effects on quality of life across the full range of severity of complaints. Effect modification by other variables at baseline was not observed. In addition, results of the scale can predict the subjective clinical expert opinion on the treatment efficiency.
The Aging Males' Symptoms (AMS) scale was originally developed in Germany as a health-related quality of life scale (HRQoL) . The scale was designed as self-administered scale (a) to assess symptoms of aging (independent from those which are disease-related) between groups of males under different conditions, (b) to evaluate the severity of symptoms over time, and (c) to measure changes pre- and post androgen therapy . It was developed in response to the lack of fully standardized scales to measure the severity of aging symptoms and their impact on HRQoL in males, specifically [2, 3]. It was recently demonstrated by a French research group that the AMS scale measures HRQoL similarly in younger (even 20–30 years old) and older persons . Right from the beginning a possible screening potential of the AMS scale was controversially discussed. Therefore we compared the AMS scale with internationally well-known screening instruments for androgen deficiency in adult males (ADAM scale of Morley et al  and the Screener of Smith' et al . We found that the AMS has obviously similar test characteristics as both screening instruments . Later, the similarity of AMS and ADAM was confirmed in another study . In addition, Kratzik et al  observed in a population-based cross-sectional study in Vienna an impressive association between subscales of the AMS and free testosterone level when age and body mass index was multivariately taken into account. Recently, a Japanese research group under Itoh et al  and Soh et al  observed a correlation between the AMS scores and the testosterone level. A Polish research group found a similar but less clear result . Other studies, however, could not find associations of the AMS scores with testosterone level [13, 14].
The evaluation of the AMS scale is simple; the scheme has been published . Norm values to compare with were determined [1, 3]. Conventional psychometric requirements of test reliability and validity have been successfully achieved and published [17, 18]. A point was reached to demonstrate the capacity of the scale to reliably measure the effect of androgen treatment or to predict the magnitude of the therapeutic effect subjectively perceived by the treating physician. To this end, many clinicians use the term "validity" and mean high utility for clinical work or research.
It was shown in a previous publication that the AMS scale meets the requirements of a clinical utility and outcomes sensitivity . The focus of this paper however is to analyze if variables such as age, body mass index (BMI), severity of complaints, and testosterone level before treatment effect the outcome measured with the AMS scale.
An open post-marketing study was conducted together with office-based physicians in Germany in 2004. The study monitored the effect of androgen substitution on complaints as well as adverse reactions of a licensed testosterone gel product (Testogel JENAPHARM®) under routine conditions. The eligibility of male patients for androgen therapy was determined by the prescribing physician, i.e. following the recommendations of the International Society for The Study of the Aging Male (ISSAM)  for testosterone treatment in patients with androgen deficiency. No other inclusion or exclusion criteria were set up except the consent between the patient and his treating doctor. The needed sample size was not determined for this observational three months follow-up study.
The observation encompassed three visits which were documented in a short form: before treatment, after 4–6 weeks, and at the end of the observation period of 12 weeks. A short questionnaire was completed by the treating physician to characterize the patient at baseline, after 4–6 weeks androgen treatment, and at the end of the study (3 months). The physician subjectively assessed the treatment effect at each of the three visits, and listed also adverse reactions on a specific form.
We got for this methodological paper a database with the AMS scale completed before therapy and after 12 weeks, age, BMI and total testosterone (TT). However, only a sub-sample had TT values available.
The computerized data of this post-marketing study were analyzed with conventional statistics using the statistical package SAS 9.1®.
Results and discussion
Altogether, 1670 patients were available for analysis. A general description of the group analyzed in this paper is given in Table 1. The mean age was of 56.4 years, the mean BMI was 26.8, and the TT at baseline was 2.5 ng/ml. A great proportion of the participants had a medical history of one or more chronic diseases. In addition, alcohol consumption and smoking were quite frequent (see table 1).
The HRQoL improved after 12 weeks of testosterone-gel application as measured with the total score of the AMS scale. Relative to the scores at baseline, the total score, the scores for the psychological, somato-vegetative, and sexual scores improved 30.7%, 27.3%, 30.5%, and 30.7% compared with the baseline score, respectively. This is an almost identical relative improvement of the HRQoL as shown in an earlier study associated with injectable testosterone .
The higher severity of complaints at baseline the greater is the improvement as demonstrated for the AMS total score in Figure 1. This applies also for the three sub-scales (data not shown). This was expected but an important observation with impact on the methodological assessment (validity) of the AMS scale.
Compared with the distribution of complaints in the normal population (=norm values [1, 3]), the markedly altered HRQoL in this androgen deficient males shifted towards the "norm" of the male population over 40 years after androgen treatment (Figure 2). Patients under 40 were excluded for this comparison (n = 137). It seems to be important to underscore: There was a positive effect (taken numbers at face value) detectable even in males with mild or even little symptoms. Almost identical observations concerning improvement of complaints after androgen therapy were made for injectable testosterone in a previous study . Thus, two independent observational studies (with the same methodology) found the same pattern, but there is still a need for confirmation in a randomised clinical trial.
To answer the question if the treatment-related improvement depends on age, BMI, and testosterone level at baseline, we run a stratified analysis (one-way analysis of variance). Table 2 shows the relative improvement after therapy for the AMS total score, and also for the three sub-scores. There is neither much difference in relative improvement among age, BMI or testosterone groups nor among subscales. All significant changes of HRQoL range around 30% improvement compared with baseline (before therapy). There is no clinically consistent and relevant impact on the magnitude of improvement of the three variables at baseline (age, BMI, TT) considering the size of the standard deviation (in brackets), although the Tuckey-Test showed a few significant effects of age and BMI but with contradictory direction of the trend of the effect, i.e. random findings due to multiple testing cannot be excluded. It cannot be excluded either that the study group is too homogeneous to find small effects.
Another question was to what extend the evaluation with the AMS scale could "predict" the opinion of the treating physician regarding "success" of the androgen therapy. The treating physician subjectively assessed the "success" without knowing the results of the AMS he had no access to. Sensitivity (correct prediction of a positive assessment by the physician) and specificity (correct prediction of a negative assessment by the physician) are important characteristics for a test that intends to "diagnose" successful therapy – like AMS in this case (predictive validity).
As can be seen in Table 3, we plotted the sensitivity and specificity in a kind of ROC analysis against the degree of improvement of complaints found under androgen treatment (difference between the pre- and post-treatment score on AMS as percent of pre- treatment total score). More than 22% relative improvement of the total AMS score seems a suitable cut-off point for "diagnosing treatment success": both sensitivity and specificity were about 70% and thereby acceptably high. In other words, the AMS scale is able to assess the treatment success with sufficient good test characteristics, validated against the expert opinion of the treating physician. It could be an advantage of a "success diagnosis" with the AMS scale because it is directly based on patients' view, whereas the physician's assessment could be rather a varying mixture of theoretical expectation/experience and patient's report. Anyway, since the methodological characteristics of the AMS scale are pretty good it is worthwhile to get experience with this tool in clinical practice. It might be recommendable to apply a standardized "objective" scale like the AMS scale in clinical studies, and in addition – if necessary – the subjective, not standardizable judgment of a physician.
It seems important to underline that this paper has a sufficient basis to describe the validity of the scale, but there is no intention to discuss the efficacy of testosterone substitution. This would be the task of a double-blinded, placebo controlled trial. We could just demonstrate that the AMS as measure of health-related quality of life (HRQoL) is able to detect changes in quality of life following androgen substitution in androgen-deficient males – irrespective of the specific drug used as treatment or the specific underlying diagnosis in this observational follow up study. This confirms results of an earlier post-marketing study  that showed acceptably good validity.
The AMS scale can be applied in clinical trials both in young hypogonadel men as well as in late-onset hypogonadism according to French experience : The scale measures the same phenomenon in different age groups, and no obvious differences exist in reference values. We recommend the use of the total score as outcome measure, but domain scores can be used as well. From our current experience we would conclude that the same cut-off point should be used as "treatment success" when planning a trial. However, we had no access to data from randomized clinical trials to really investigate this issue.
The AMS scale showed a convincing ability to measure treatment effects on quality of life across the full range of severity of complaints. Effect modification by other variables at baseline (age, BMI, and testosterone level) on the "treatment associated effect on AMS score" was not observed. In addition, results of the scale can predict the subjective clinical expert opinion on the treatment efficiency.
Heinemann LA, Zimmermann T, Vermeulen A, Thiel C: A New 'Aging Male's Symptoms' (AMS) Rating Scale. Aging Male 1999, 2: 105–114.
Heinemann LA, Saad F, Thiele K, Wood-Dauphinee S: The Aging Males' Symptoms (AMS) rating scale. Cultural and linguistic validation into English. Aging Male 2001, 3: 14–22.
Heinemann LA, Saad F, Pöllänen P: Measurement of Quality of LifeSpecific for Aging Males. In Hormone Replacement Therapy and Quality of Life. Edited by: Schneider HPG. Parthenon Publishing Group. London, New York, Washington; 2002:63–83.
Myon E, Martin N, Taieb C: The French Aging Males' Symptoms (AMS) scale: Methodological review. Health Qual Life Outcomes 2005, 3: 20. 10.1186/1477-7525-3-20
Morley JE, Charlton E, Patrick P, Kaiser FE, Cadeau P, McCready D, Perry HM III: Validation of a screening questionnaire for androgen deficiency in aging males. Metabolism 2000, 49: 1239–1242. 10.1053/meta.2000.8625
Smith KW, Feldman HA, McKinlay JB: Construction and field validation of a self-administered screener for testosterone deficiency (hypogonadism) in ageing men. Clin Endocrinol 2000, 53: 703–711. 10.1046/j.1365-2265.2000.01152.x
Heinemann LA, Saad F, Heinemann K, DoMinh Thai: Can results of the AMS scale predict those of screening scales for androgen deficiency? Aging Male 2004, 7: 211–218. 10.1080/13685530400004223
Morley JE, Perry HM, Kevorkian RT, Patrick P: Comparison of screening questionnaires for the diagnosis of hypogonadism. Maturitas 2006, 53: 424–429. 10.1016/j.maturitas.2005.07.004
Kratzik CW, Reiter WJ, Riedl AM, Lunglmayr G, Brandstätter N, Rücklinger E, Metka M, Huber J: Hormone profiles, Body Maß Index and Aging Male Symptoms – results of the Androx Vienna Municipality study. Aging Male 2004, 7: 188–196. 10.1080/13685530412331284650
Itoh N, Hisasue S, Kato R, Tanaka T, Takahashi A, Masomori N, Tsukamoto T, abstract: Comaparison of Morley's ADAM questionnaire and Heinemann's Aging Male Symptoms (AMS) rating scale to screen andropause symptoms in Japanese males. Aging Male 2004, 7: 49.
Soh J, Ishida Y, Naito Y, Ochiai A, Naya Y, Mizutani Y, Fujuito A, Kawauchi A, Fujiwara T, Tschida H, Fukui K, Miki T, abstract: Correlations of AMS score, depression score and hormonal levels with the manifestation of partial androgen decline in the aging male (PADAM). Aging Male 2004, 7: 83.
Jankowska EJ, Szklarska A, Lopuszanska M, Medras M, abstract: Hormonal determinants of andropausal symptoms in Polish men. Aging Male 2004, 7: 21.
T'Sjoen G, Feyen E, Kuyper P, Comhaire F, Kaufman JM: Self -referred patients in an aging male clinic – much more than androgen deficiency alone. Aging Male 2003, 6: 157–165.
Dunbar N, Gruman C, Reisine S, Kenny A: Comparison of two health status measures and their associations with testosterone levels in older men. Aging Male 2001, 4: 1–7.
Heinemann LA, Saad F, Zimmermann T, Novak A, Myon E, Badia X, Potthoff P, T'Sjoen G, Pöllänen P, Goncharow NP, Kim S, Giroudet C: The Aging Males' Symptoms (AMS) scale: update and compilation of international versions. Health Qual Life Outcomes 2003, 1: 15. 10.1186/1477-7525-1-15
Convay K, Heinemann LA, Giroudet C, Johannes EJ, Myon E, Taieb C, Raynaud JP: Harmonized French version of the Aging Males' Symptoms Scale. Aging Male 2003, 6: 106–109.
Moore C, Huebler D, Zimmermann T, Heinemann LA, Saad F, DoMinh T: The Aging Males' Symptoms Scale (AMS) as outcome measure for treatment of androgen deficiency. Eur Urol 2004, 46: 80–87. 10.1016/j.eururo.2004.01.009
Daig I, Heinemann LA, Kim S, Leungwattanakij S, Badia X, Myon E, Moore C, Saad F, Potthoff P, Do Minh T: The Aging Males' Symptoms (AMS) scale: Review of its methodological characteristics. Health Qual Life Outcomes 2003, 1: 77. 10.1186/1477-7525-1-77
Morales A, Lunenfeld B: Investigation, treatment and monitoring of late-onset hypogonadism in males – official recommendations of ISSAM. Aging Male 2002, 5: 74–86.
The authors thank Horst Dietrich, Jenapharm for creating the initial database of the post-marketing study and for providing the limited dataset for this validation study.
CM is employee of the company that produces testosterone products. The authors, however, see no conflict of interest as far as the methodological aspects of validation of the AMS scale are concerned.
LAH: developer of the AMS scale, responsible for the design of this validation analysis, and involved in writing the manuscript; CM: responsible for design and execution of the PMS, contributed to writing of the paper; JD: contributed to the design of the study, and to writing the manuscript; DS: responsible for setting up and managing the database for different analyses, running all analyses, and contributes to revising the paper.
About this article
Cite this article
Heinemann, L.A., Moore, C., Dinger, J.C. et al. Sensitivity as outcome measure of androgen replacement: the AMS scale. Health Qual Life Outcomes 4, 23 (2006). https://doi.org/10.1186/1477-7525-4-23