Development and validation of the living with pulmonary hypertension questionnaire in pulmonary arterial hypertension patients
Health and Quality of Life Outcomes volume 11, Article number: 161 (2013)
The Living with Pulmonary Hypertension questionnaire (LPH) was adapted from the Minnesota Living with Heart Failure Questionnaire for use in patients with pulmonary arterial hypertension (PAH). Study objectives were to confirm the face and content validity, to assess the structure and psychometric properties, and provide guidance for the interpretation of the LPH.
A qualitative interview study was conducted with PAH patients in the US (n=12), Germany (n=14) and France (n=12) to evaluate the face and content validity of the LPH. Psychometric validation was performed using blinded data from a double blind, Phase III, clinical trial (n=196). Validation analyses were performed on baseline and week 12 (visit 6/last visit) data and included evaluation of: item response distributions, quality of completion, construct validity, reliability, clinical validity and responsiveness. Analyses to provide an estimation of the Minimal Important Difference (MID) for the LPH scores were performed.
Cognitive debriefing interviews with 38 PAH patients indicated that the most commonly reported PAH symptoms and impacts are covered by LPH items. Patients found the LPH questionnaire relevant and comprehensive to their experience. Some suggestions were made to enhance the face validity of the LPH. The content validity of the questionnaire was supported. Results of the psychometric validation analyses (n=190) indicated that the LPH Emotional and Physical scores met the criteria for convergent and discriminant validity; for the total score all but two items met the test for item convergent validity. Internal consistency reliability was demonstrated by Cronbach’s alpha values of >0.70 for all LPH scores. The LPH Physical and Total scores discriminated between World Health Organisation (WHO) Functional classes and 6 Minute walk test distances, indicating clinical validity and were also responsive to change in clinical severity, as measured by change in WHO functional class and Borg CR 10 Scale. Further investigation is required to confirm the responsiveness of the Emotional score. Estimation of MID using distribution-based methods indicated a change of 3 points for the sub-scales and 7 for the total score to be clinically meaningful.
The LPH is a valid and reliable instrument that meets FDA criteria.
Pulmonary arterial hypertension
Pulmonary arterial hypertension (PAH) is a rare lung disorder with current estimates suggesting the prevalence of PAH in the US is 109 per million individuals . In PAH, pulmonary vascular injury leads to vessel remodeling and subsequent narrowing of the pulmonary arterioles that in turn, increases afterload on the right ventricle. As this disease progresses and afterload increases, right ventricular failure ensues that ultimately leads to death . While the initial symptoms of PAH are non-specific, including shortness of breath (dyspnea)  and fatigue following physical exertion, these often progress to occur with minimal exertion or, in extreme cases, at rest . Additionally, patients often experience swelling in the ankles or legs (edema); bluish lips and skin (cyanosis); chest pain; and palpitations .
Such debilitating symptoms result in substantial impairments in patients’ Health-Related Quality of Life (HRQoL) . In particular, PAH results in significant impairment in physical functioning, ability to perform activities of daily living and social functioning, and many patients experience feelings of depression and anxiety and difficulties sleeping [7–9]. To evaluate treatment benefit, primary and key secondary endpoints in PAH clinical trials are typically clinical measures focused on sub-maximal exercise capacity (e.g. the six minute walked distance [6MWD]). However, to complement such trial endpoints, rigorous measurement of HRQoL related to PAH is also recommended to ensure treatments and interventions improve not just objective functional capacity, but also the day-to-day well-being of patients. To date, HRQoL measurement in trials has typically relied on generic measures which may not fully evaluate the specific impacts experienced by PAH patients [10, 11]. Few disease-specific measures of quality of life (QoL) in PAH exist and are limited by the lack of explicitly defined responsiveness and clinical utility . Thus, we sought to develop and assess a PAH-specific tool to measure HRQoL in this patient population by modifying an existing disease-specific metric of QoL in left heart failure, the Minnesota Living with Heart Failure questionnaire (MLHF). Further, we sought to define a minimal important difference for this new tool, the Living with Pulmonary Hypertension Questionnaire (LPH).
The objectives of this work were to select and adapt an existing disease- specific measure of HRQoL for use in PAH populations and then to confirm the face and content validity and scoring and psychometric properties of the selected instrument. Specific objectives of the face and content validity testing of the LPH questionnaire were: to explore the item coverage of key symptoms and impacts of PAH; to explore whether the items, instructions, recall period and response options are relevant, well-understood and interpreted in a consistent manner by patients with PAH; and to identify any changes to the wording of the LPH that are recommended as a result. Specific objectives of the assessment of the psychometric properties of the LPH were: to confirm the item-scale structure of the LPH; to assess the reliability, validity and responsiveness of the LPH; and to provide guidance on the interpretation of LPH scores and changes in scores.
Development of a conceptual model
Qualitative and quantitative PAH literature was reviewed to support the development of a conceptual model of PAH (Figure 1). The model includes a summary of clinical characteristics of PAH; symptoms experienced by patients and the resulting functional impairments, and available treatment options. The conceptual model was used as basis for a review of existing HRQoL instruments and to explore appropriate outcome measures to assess PAH. The instrument review identified only one PAH-specific instrument the Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR) . Although the CAMPHOR had been developed for specific use in PAH and there was evidence of the reliability and validity of the instrument  there were concerns about the concept coverage; there is no evidence that saturation analysis was conducted to confirm that all concepts important to patients were captured, mapping of the CAMPHOR concepts to the PAH conceptual model identified key concepts such as dizziness, chest pain and palpitations were missing from the CAMPHOR. There were also concerns about the length of the instrument (65 items) and the dichotomous format of some of the response options. Such factors would be a concern for regulatory agencies such as the FDA and may impact on the content validity and responsiveness of the instrument .
Given the concerns about the CAMPHOR, the MLHFQ  was identified as a stronger instrument, that provided better measurement and more comprehensive coverage of PAH symptoms and impacts. Moreover, there is evidence of previous use in PAH clinical trials [14, 15, 28, 29], and that it is responsive to changes following treatment in PAH [15, 29–31]. The MLHFQ however, is specific to heart failure. In order to make the instrument appropriate for use in patients with PAH, minor modifications were made to the MLHFQ. Modifications included: changes to the wording of some of the questions and instructions to be specific to PAH rather than heart failure, and a revision to the recall period from four weeks to one week. The reduction in the recall period to one week was considered an important modification in order to ensure the instrument met the FDA preference for “short recall periods” . This work was conducted in order to provide a suitable instrument for the assessment of PAH symptoms in a clinical trial to benefit PAH patients across the globe.
Qualitative patient interviews
After obtaining the relevant institutional review board approval (approval codes: 2009-P-001852/2, MAPI-10-242, B-F-2010-033, 09/2295), qualitative patient interviews were conducted to evaluate the content validity of the LPH. The qualitative interviews were conducted in the US, France and Germany and included patients with PAH as defined by current consensus guidelines  who were at least 18 years of age and had provided written informed consent. The MLHFQ had previously been translated and linguistically validated for use in France and Germany. Patients were either treatment naïve or had previously received treatment with an endothelin receptor antagonist (ERA), a phosphodiesterase type 5 inhibitor (PDE5I), or a prostacyclin analogue. Patients were required to have cognitive and linguistic capacities sufficient to allow them to actively participate in an interview, as determined by the recruiting physician. Patients with significant psychiatric disease were excluded from the study. Patients with a diagnosis of other relevant pulmonary diseases including pulmonary hypertension other than PAH, moderate to severe obstructive lung disease, or severe restrictive lung disease were excluded from the study .
Interviewers trained in qualitative research, native to each country, followed a semi-structured interview guide. The guide included both open-ended questions about patient experiences of PAH and a cognitive debriefing exercise to assess the patient’s understanding of the instructions, items, response options and recall period of the LPH. Examples of open-ended questions exploring the patient experience of PAH included:
“What is it like to have PAH?”
“Please tell me about any symptoms or problems that you experience, if any? When do you usually experience these symptoms? How long do these symptoms normally last for? In what ways do these symptoms affect you, if at all?”
“Does your PAH ever stop you from doing things? (e.g. housework, going out, hobbies) If yes please explain”
The cognitive debriefing part of the interview asked the patients general questions about the LPH questionnaire such as:
“What is your overall opinion of this questionnaire? Where there any questions that were unclear or difficult to answer? If yes, which? What made this/these questions difficult to answer?”
“What did you think about having to think about your symptoms over the past week?”
“Do you think it was relevant to think about your symptoms over the past week?”
Patients were also asked specific questions about each item or instruction, for example:
“What did you think of this question? What did this question mean to you, in your own words?”
“Was there anything that was unclear or hard to understand in this question? How would you reword it to make it clearer”
“How far back were you thinking when you answered this question? How long do you thinking this question was asking you to think back over?”
“Do you think the response options for this question are appropriate? Would you change them in any way?”
Procedures were implemented to capture any adverse events reported during the interviews; no adverse events were reported. Qualitative analysis of verbatim transcripts was performed using Atlas Ti software and Microsoft Excel and methods derived from Grounded Theory . French and German transcripts were translated into English prior to analysis. To determine whether all of the symptoms and impacts of importance to patients with PAH had been elicited during the interviews, the patient interview sampling strategy and analysis followed the principle of 'saturation’. Saturation is defined as the point where no 'new’ information on a particular item or topic is mentioned by patients . Interviews were analyzed in a stepwise manner to determine the point at which saturation was reached.
Psychometric validation study
Psychometric validation of the LPH was performed using blinded data from a double blind, Phase III clinical trial. As part of this trial patients were randomized at Visit 1 (V1), and were then followed for 12 weeks. The LPH was self-administered at V0 (baseline) and V6 (12 weeks or last visit), the V6 time point was used for cross-sectional analyses. Patients with PAH, defined as per consensus guidelines, aged between 18 and 75 years of age, whose six-minute walk distance (6MWD) was between 150 and 450m were enrolled . Patients were either treatment naïve or had previously received treatment with an ERA or a prostacyclin analogue. Patients were excluded from the trial if they were unable to perform the six-minute walk test (6MWT), had taken intravenous (IV) prostacyclin analogues, or PDE5I within the 90 days prior to visit 1.
The LPH derived from the MLHF questionnaire comprises 21 items, responded to on a 6-point Likert scale ranging from 0 'No’ to 5 ’Very much’. A total score ranging from 0 to 105 is calculated by summing the responses to all 21 questions. A physical dimension score (range 0–40, 8 items) and an emotional dimension score (range 0–25, 5 items) can also be calculated. For all LPH scores, a higher score indicates that patients are more affected by their medical condition. The MLHFQ from which the LPH was derived has shown to be highly reliable as demonstrated by the correlation between repeated baseline assessments (r=0.93) . The MLHFQ has high internal consistency validity, and an analysis of MLHFQ scores according to New York Health Association (NYHA) functional class supported the clinical validity of the MLHFQ . It should be noted that these properties are relevant to patients with heart failure, not PAH. The MLHFQ has been used previously in PAH clinical trials. Cenedese et al. used the MLHFQ in a study of patients with PAH and chronic thromboembolic pulmonary hypertension (CTEPH). This study supported the internal consistency and test-retest reliability of the instrument in PAH and CTEPH patients as well as the clinical and concurrent validity and responsiveness . In addition to the study by Cenedese et al  the MLHFQ has previously been used in other PAH clinical trials, where it has shown evidence of responsiveness to change following treatment [15, 29, 31].
The 6MWT was conducted according to American Thoracic Society guidelines . Specifically, the test was performed indoors, along a long, flat, straight, enclosed corridor of at least 30 meters in length. The 6MWT was conducted unencouraged by a person not involved in the titration of the study drug, who was unaware of the immediate reaction of the patient’s blood pressure and heart rate after dosing.
The Borg CR10 scale
The Borg Category Ratio 10 (CR10) Scale was measured in conjunction with the 6MWD Test during the clinical trial. Patients are asked to rank their exertion at the end of the 6MWD test on a scale with the lowest rating being '0 Nothing at all’ up to the patients being able to rate their exertion as a ’12 or still higher’ which represents “Absolute maximum”. The Borg CR 10 Scale has been shown to be a valid and reliable measure for the estimation of perceived intensity [36, 37].
World Health Organisation (WHO) functional class
Patient’s functional class was determined by the study investigator using WHO classification:  Class I: Patients with PH but without resulting limitation of physical activity; Class II: Patients with PH resulting in slight limitation of physical activity; Class III: Patients with PH resulting in marked limitation in physical activity; Class IV: Patients with PH with inability to carry out any physical activity.
The EQ-5D is a standardized, self-report measure of health status. Patients describe their health state within the domains of “Mobility”, “Self-Care”, “Pain/Discomfort” and “Anxiety/Depression” on a 3 level scales, with 1 reflecting the better health state, and rate their overall health status on a visual analogue scale (VAS) of 0 'Worst imaginable health state’ to 100 'Best imaginable health state’ . The test-retest reliability of the EQ-5D has been shown to be acceptable (ICC>0.7) across a range of disease areas , the clinical validity of the EQ-5D has been confirmed in patients in a variety of disorders.
Demographic, clinical, and functional data were summarized using means and standard deviations, medians and ranges, or proportions where appropriate. Table 1 presents a summary of the analyses performed as part of the validation of the LPH. Test-retest reliability could not be analysed as part of this study given the clinical trial context and the fact that most patients experienced a change in their condition due to receiving treatment. Thus there were limited numbers of stable patient data with which to perform test-retest reliability analysis. All data processing and analyses were performed with SAS software for Windows version 9.2 (SAS Institute, Cary, NC, USA).
Qualitative patient interviews
Interviews were conducted with 38 PAH patients (US n=12, Germany n=14 and France n=12). Demographic and clinical characteristics were broadly comparable across country samples, although patients in France had been diagnosed for longer than patients in the US and Germany (Table 2). Although, this may mean their symptoms were more under control, it provided a greater depth of symptom experience.
The symptoms and domains of impact reported by patients during the open-ended part of the interviews were mapped onto the LPH items to assess content validity of the LPH. Table 3 presents the symptoms reported by five or more patients during the interviews and links them to items in the LPH. The results indicate that the key symptoms of PAH are captured by the LPH, supporting the face and content validity of the LPH with respect to the measurement of PAH symptoms. A number of other symptoms were reported by fewer than five patients each that are not assessed by the LPH. The majority of these symptoms are considered 'signs’ rather than symptoms of PAH and therefore would not be appropriate to assess.
Table 4 presents the impacts reported by patients during the interviews and links them to items on the LPH. Patient interviews confirmed that the key impacts of PAH (those reported by five or more patients) are assessed by the LPH. For most impact concepts, there was sufficient coverage in the LPH; however, within the cognitive and emotional impact concepts there were some impacts reported that are not directly assessed. Within the cognitive concept, memory (n=26), concentration (n=24), focus (n=6) and motivation (n=5) were reported. Concentration and memory are assessed by the LPH. Focus is not directly assessed, although during the interviews patients described focus to be similar to concentration and therefore thus may be covered by the LPH in this patient population. Motivation, which was only reported by five of the 38 patients, is not assessed. A large number of emotional impacts were reported, the key impacts of worry (n=32), depression (n=28) and worry about the future (n=18) are assessed. Given these findings it can be concluded that the key impacts to patients with PAH are captured by the LPH thus supporting its content validity.
The second part of the patient interviews involved cognitive debriefing of the LPH to confirm the level of understanding and relevance of the questionnaire to PAH patients. The majority of patients found the questionnaire resonated with their experience of PAH. In terms of the response options, the majority of the patients felt that the range of response options was appropriate. Some patients suggested adapting the response options or making them more specific to certain questions. While these suggestions could be taken into consideration for future versions of the LPH, the issues raised did not detract from patients’ ability to actually complete the questionnaire. Some areas to improve understanding were raised; these included highlighting the instructions to the patient to ensure they do not miss them, adapting the response options to ease completion and splitting items assessing multiple concepts into separate items. Not all patients appeared to use the recall period of one week when completing the items. The interview setting may have been one cause for this, because patients did not have a clear timeframe in which to consider their symptoms. A number of patients specifically commented that they liked the questionnaire and thought it captured appropriate symptoms. Sixteen of the patients specifically reported that they found the questionnaire 'easy’ or 'simple’ to complete.
Psychometric validation study
The total psychometric validation population (patients who returned an LPH questionnaire at V0 or V6) included 196 patients, 190 of whom were included in the baseline population (patients who completed at least 90% of LPH items at V0), and were included in all analyses conducted on baseline data. The week 12 population (patients who completed at least 90% LPH items at V6) included 176 patients who were included in all analyses conducted on week 12 data. The responsiveness population (patients who completed at least 90% of LPH items at V0 and V6) included 171 patients who were included in responsiveness analyses. Table 5 presents patient demographic and clinical characteristics for the baseline LPH population.
Quality of completion of the LPH
Over 85% patients had no missing items on the LPH at both baseline and week 12. All LPH items had less than 3% missing data at baseline and week 12. Responses were well spread across the response scale. The results indicate a good level of completion for the LPH items and questionnaires consistent with the benefits of a single page instrument and reports from the qualitative work that the questionnaire was easy to complete.
Scaling properties and confirmation of the structure of the LPH
Multitrait analysis performed on the LPH scale scores and total score at baseline and visit 6/last visit indicated that for both the Emotional and Physical scores all items met the criteria for item convergent and item discriminant validity. Item-scale correlations ranged from 0.59-0.76 for the 'Emotional’ score and 0.43-0.78 for the 'Physical’ score. For the Total score all but two items met the test for item convergent validity (correlation range: 0.38-0.72). Two items correlated with the Total score at a level of r=0.38 which is just below the threshold of acceptability (r=0.40). There was a moderate correlation between the LPH Emotional and Physical scores (r=0.58) indicating that the scales are related but not redundant and high correlations between the LPH Total Score and both the LPH Emotional score (r=0.85) and the LPH Physical score (r=0.87), which indicates that the Total Score adequately covers both physical and emotional dimensions. At both baseline and week 12 the percentages of patients with the lowest or highest possible score was low (<3%) for all scales, indicating no floor or ceiling effect for the LPH scores. These results indicate a good ability for patients to both improve and worsen on the scales (Table 6). Finally, strong internal consistency reliability of the LPH scores was demonstrated by Cronbach’s alpha values of >0.70 for all LPH scores at baseline and week 12 (Table 6).
Confirmatory factor analysis (CFA) of the LPH Emotional and Physical scores at baseline indicated an average overall fit for the items on the Emotional and Physical scores. CFA of the LPH Total score at baseline indicated a poor overall fit of the model with poor factor loadings for most LPH items with only five items meeting the criteria of 0.70.
Results indicated that the LPH Physical and Total scores were able to discriminate among groups of patients of differing severity levels, as defined by World Health Organisation (WHO) functional class (Figure 2) and the six minute walking test (Figure 3). The LPH Emotional score did not discriminate between severity groups at a statistically significant level. Small sample sizes in WHO class I and IV should be considered when interpreting these results. However, the LPH Emotional, Physical and Total scores were broadly worse for those subjects with more severe disease across clinical criteria.
Correlations were examined between LPH scores with the Borg score at baseline and V6 (week 12). The highest Borg correlations were with the LPH Physical score, as expected (r=0.36 and r=0.34 respectively). Correlations with the LPH Emotional Score (r=0.11 and r=0.15), and the LPH Total Score (r=0.21 and r=0.23) were low.
The pattern of correlations between the LPH scores and the concurrent measures was consistent with the content of the different scales, and so supportive of the validity of the measure. Scales measuring similar concepts correlated more highly than scales measuring dissimilar concepts (Table 7). For example, the LPH Emotional score correlated moderately with the EQ-5D anxiety/depression item (0.59), but at a low level with EQ-5D self-care (0.24).
Table 8 presents a summary of the responsiveness analyses for the LPH. The results provide some support for the responsiveness of the Physical score and the Total score, but not the Emotional score. For the Physical score and the Total score, across all three methods of defining change groups, effect sizes suggest there were small to moderate improvements for the 'improved’ group, small improvements in the 'stable’ group and negligible change in the 'worsened’ group. However, for the Emotional score there were only small improvements in both the 'improved’ and 'stable’ groups, and negligible change in the worsened group. Moreover, the differences between change groups was only significant for the LPH Physical and Total Score according to change in Borg score (p=0.0073, p=0.0415 respectively).
Interpretation of the LPH Scores
Results to estimate the MID for the LPH scores using anchor-based methods indicate an MID with a range of 1.48-3.69 for the LPH Emotional Score, 1.88 to 4.71 for the LPH Physical Score and 4.41 to 11.02 for the LPH Total score (Table 9).
The results of this study provide evidence that the LPH has strong content validity and psychometric validity as a measure of symptoms and HRQoL in PAH. Mapping the symptoms and domains of impact elicited from the patient interviews on to the LPH items indicated that key symptom concepts relevant to PAH are assessed by the LPH. A number of other symptoms were reported by patients that are not directly assessed by the LPH, but none of them were felt to be primary symptoms. Moreover, although a wide range of additional impacts were also mentioned, they were very personal and reflected the individual’s living situation rather than their PAH experience and so would be unlikely to resonate with the wider PAH population. While patients discussed a number of emotional impacts that are not specifically addressed by the LPH, including items to assess all of the emotional impacts would add significant respondent burden. Furthermore, some of the concepts such as 'upset’ and 'feeling low’ are vague concepts, and ones which are to some extent covered by the items that are included in the LPH (e.g. feeling depressed). Considering patient burden and relevance, the instrument captures all core symptom and impact domains.
In addition to the concept elicitation results, when the LPH was debriefed with patients it was well understood and the questions were considered relevant, with no concerns raised that key concepts were missing, thus providing further evidence of content validity. The LPH response options were understood well as evidenced by the high proportion of patients who were able to rate the symptoms and impact of PAH on different aspects of their life as defined in the questionnaire. However, some patients suggested further response descriptors in addition to the existing 'No’, 'Very little’ and 'Very much’ descriptors. While additional descriptors might be useful in future versions of the LPH, their absence was unlikely to have affected the ability of patients to complete the current version of the LPH. Moreover, there is evidence that increasing the number of categories does not always provide a larger coverage of the target trait, and they concluded that rating scales with fewer response categories were more functional . Although many patients found the 'past week’ recall appropriate, some patients felt this was too short a time period over which to consider the impact of PAH on their lives. The FDA suggests the use of shorter recall periods over longer periods;  patients themselves note it would be harder for them to accurately recall their symptoms over a longer period of time. Thus, extending the recall period would not be appropriate.
The results of scaling tests supported the a priori structure of the LPH. However, as scale development is an iterative process, some modifications to the LPH may be warranted to improve the structure further for future studies. Nevertheless, a trade-off between the advantages and disadvantages of changing a well-validated and now-well used scale with interpretation aids should be considered before making any modifications. The results of a confirmatory factor analysis on the LPH Emotional and Physical scales were positive, with goodness of fit values indicating an average overall fit and moderate factor loadings on the factor analysis model. Further testing using exploratory factor analysis could be considered to explore whether an alternative structure of the LPH would produce a better fitting model. However, as noted above there is a trade-off to be considered when making modifications to a well-validated and well-used instrument. Internal consistency reliability results were very good for the LPH sub-scale scores and the Total score indicating that the LPH items included on each scale are measuring a single underlying concept, without being redundant. These results are consistent with those demonstrated by Rector and colleagues in the validation of the original MLHFQ in heart failure patients . It was not possible to evaluate test-retest reliability as part of this study – such testing is recommended as a priority in future evaluations of the LPH.
Clinical validity was evaluated by examining the ability of the instrument to discriminate between patients who differed on key clinical indicators. The results provided good evidence of clinical or known groups validity for the Physical score and the Total score (with statistically significant differences among groups), but limited support for the clinical validity of the Emotional score. While the pattern of mean scores for the emotional score was in line with expectations, the differences between groups were not statistically significant. However, low sample sizes for these analyses, particularly in the case of WHO functional class categories I and IV should be considered when interpreting these results. In addition, the emotional subscale contains fewer items than the physical subscale (5 vs. 8) and therefore differences in scores between clinical groups may not be as apparent. Concurrent validity tests demonstrated that in most cases there were moderate correlations between the LPH scores and EQ-5D items that would be expected to correlate. These results indicate that the two questionnaires are measuring similar concepts but are not redundant with each other and are thus supportive of the validity of the LPH. The clinical and concurrent validity properties were comparable to those for similar scales on the CAMPHOR, a PAH specific instrument .
The responsiveness results provided mixed support regarding the ability of the LPH to reflect changes over time, and suggest that the Physical score and the Total score are more responsive to change over time than the Emotional score. However, this is to be expected within the context of a short term trial, as emotional functioning is typically more distal than physical functioning, and is often less responsive to treatment. These findings are consistent with those presented by Gilbert et al. in reporting the MID for the 6MWD and Short Form-36 (SF-36); they similarly found that significant improvements were only demonstrated for the 6MWD and SF-36 physical functioning scale . Overall, the responsiveness of the LPH can be concluded to be acceptable. However, further investigations of responsiveness are arguably warranted, particularly to assess responsiveness to worsening and the responsiveness characteristics of the Emotional scale within the context of a larger scale study.
Analyses to provide guidance for interpretation of LPH scores using distribution-based methods indicated that for all LPH scores there was a small range of MID definition. Given this, based on an average of the results, a recommendation for a change of three points on the sub-scales, and a change of seven points on the Total score would be considered as meeting the MID. This would translate as a patient who experienced improvement in only a few areas of their condition perceiving that they had experienced a beneficial improvement in their quality of life. It is likely that the 'important’ or clinically important difference is higher –around 4 points for the sub-scales and 11 points for the Total score, as a 0.5 ES is easier to argue, given the literature available to back it up [49, 50]. Estimation of MID using anchor-based methods could not be established in this study due to correlations in LPH scores and change in clinical parameters indicating no clear linear relationship. Therefore, further investigation of MID using anchor-based methods is recommended.
This study provide evidence supporting the use of the LPH over other PAH specific instruments such as the CAMPHOR. The original MLHFQ and now the LPH have a strong history of published qualitative research supporting content validity and use in clinical trials. The evidence of these factors for the CAMPHOR is limited. Although the CAMPHOR was developed specifically for a PAH population, the development and validation samples were very homogenous, including only white English speaking patients, thus limiting the generalizability of the instrument . The LPH however, has been validated using a large sample of PAH patients across a range of cultures and languages. The CAMPHOR authors also acknowledge that a limitation of their instrument is lack of evidence of responsiveness to treatment within a clinical trial . This paper provides evidence for the responsiveness of the LPH in an interventional study with PAH patients. Finally, the LPH can be considered a more suitable instrument for use with PAH patients given its short length (21 items), compared to the longer CAMPHOR (65 items) and the fact it has clear, PAH-specific instructions, something the CAMPHOR lacks.
Although this study provides strong support for the face and content validity, and measurement properties of the LPH there are some limitations that should be acknowledged. The cut-points for the 6MWD and Borg Scale  as used for the responsiveness and MID analyses were selected based on evidence from published literature. For the 6MWD at the time of SAP design there was only one published study which presented a suggested MID for the 6MWD in PAH. This study recommended an MID of 41m and a range of 18.70-74.15  which the cut-point used in this study was within. However, the authors are aware that since the analyses were completed there have been cut-points published for the two instruments that could be considered more clinically relevant to distinguish groups for clinical validity testing. For future studies and further validation of the instruments such cut-points will be considered.
The authors also note that it would have been beneficial to collect haemodynamic characteristics in order to characterise the sample, but unfortunately this was not collected. However, it is still felt that the sample is well characterised and clinically relevant.
Overall, this study provides evidence that the LPH has good face and content validity in patients with PAH, has strong psychometric properties and is appropriate for use in clinical trials as a measure of core symptom and impact concepts.
Six-minute walk distance
Six-minute walk test
Cambridge pulmonary hypertension outcome review
Borg category ratio 10
Endothelin receptor antagonist
Health related quality of life
Living with pulmonary hypertension
Minimal important difference
Minnesota living with heart failure
Pulmonary arterial hypertension
Phosphodiesterase type 5 inhibitor
Quality of life
Visual analogue scale
World health organisation.
Kirson NY, Birnbaum HG, Ivanova JI, Waldman T, Joish V, Williamson T: Prevalence of pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension in the United States. Curr Med Res Opin 2011,27(9):1763–1768. 10.1185/03007995.2011.604310
Rich S: Right ventricular adaptation and maladaptation in chronic pulmonary arterial hypertension. Cardiol Clin 2012,30(2):257–269. 10.1016/j.ccl.2012.03.004
Rich S, Dantzker DR, Ayres SM, et al.: Primary pulmonary hypertension. A national prospective study. Ann Intern Med 1987, 107: 216–223. 10.7326/0003-4819-107-2-216
Humbert M, Gerry CJ, Khanna D: Early detection and management of pulmonary arterial hypertension. Eur Respir Rev 2012,21(126):306–312. 10.1183/09059180.00005112
Facts About Primary Pulmonary Hypertension from the National Heart, Lung, and Blood Institute (NHLBI). 2008. Available at: URL: http://www.nhlbi.nih.gov/health/dci/Diseases/pah/pah_what.html
Taichman DB, Shin J, Hud L, et al.: Health-related quality of life in patients with pulmonary arterial hypertension. Respiratory Research 2005, 6: 92–101. 10.1186/1465-9921-6-92
Doughty N, McKenna SP, Doward LC, Meads DM, Pepke-Zaba J: The Impact of Pulmonary Arterial Hypertension (PAH) on Patients: Results from Qualitative Patient Interviews. Poster Presented at 10th Annual Conference of the International Society for Quality of Life Research (ISOQOL); 2003.
Shafazand S, Goldstein MK, Doyle RM, Hlatky MA, Gould MK: Health-related quality of life in patients with pulmonary arterial hypertension. Chest 2004, 126: 1452–1459. 10.1378/chest.126.5.1452
Gilbert C, Brown MC, Cappelleri JC, Carlsson M, McKenna SP: Estimating a minimally important difference in pulmonary arterial hypertension following treatment with sildenafil. Chest 2009,135(1):137–142. 10.1378/chest.07-0275
Chen H, Taichman DB, Doyle RL: Health-related quality of life and patient-reported outcomes in pulmonary arterial hypertension. Proc Am Thorac Soc 2008,5(5):623–630. 10.1513/pats.200802-020SK
Chen H, De MT, Kobashigawa EA, Katz PP, Chang VW, Blanc PD: Comparison of cardiac and pulmonary-specific quality-of-life measures in pulmonary arterial hypertension. Eur Respir J 2011,38(3):608–616. 10.1183/09031936.00161410
McKenna SP, Doughty N, Meads DM, Doward LC, Pepke-Zaba J: The Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR): a measure of health-related quality of life and quality of life for patients with pulmonary hypertension. Quality of Life Research 2006, 15: 103–115. 10.1007/s11136-005-3513-4
U.S.Department of Health and Human Services FDA Center for Drug Evaluation and Research. Guidance For Industry: Patient Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims. 2009. http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/ucm071975.pdf
Barst RJ, McGoon M, McLaughlin V, et al.: Berapost therapy for pulmonary arterial hypertension. J Am Coll Cardiol 2003,41(12):2119–2125. 10.1016/S0735-1097(03)00463-7
Simmonneau G, Barst RJ, Galie N, et al.: Continuous subcutaneous infusion of treprostinil, a prostacyclin analogue, in patients with pulmonary arterial hypertension: a double-blind, randomized, placebo-controlled trial. Am J Respir Crit Care Med 2002,165(6):800–804. 10.1164/ajrccm.165.6.2106079
Snow JL, Kawut SM: Surrogate end points in pulmonary arterial hypertension: assessing the response to therapy. Clin Chest Med 2007, 28: 75-viii. 10.1016/j.ccm.2006.11.005
Barst RJ, Rubin LJ, Long WA, McGoon MD, Rich S, Badesch DB, et al.: A comparison of continuous intravenous epoprostenol (prostacyclin) with conventional therapy for primary pulmonary hypertension. The Primary Pulmonary Hypertension Study Group. N Engl J Med 1996, 334: 296–302. 10.1056/NEJM199602013340504
Wilkins MR, Paul GA, Strange JW, Tunariu N, Gin-Sing W, Banya WA, et al.: Sildenafil versus Endothelin Receptor Antagonist for Pulmonary Hypertension (SERAPH) study. Am J Respir Crit Care Med 2005, 171: 1292–1297. 10.1164/rccm.200410-1411OC
Galie N, Torbicki A, Barst R, Dartevelle P, Haworth S, Higenbottam T, et al.: Guidelines on diagnosis and treatment of pulmonary arterial hypertension. The Task Force on Diagnosis and Treatment of Pulmonary Arterial Hypertension of the European Society of Cardiology. Eur Heart J 2004, 25: 2243–2278.
Sastry BK, Narasimhan C, Reddy NK, Raju BS: Clinical efficacy of sildenafil in primary pulmonary hypertension: a randomized, placebo-controlled, double-blind, crossover study. J Am Coll Cardiol 2004, 43: 1149–1153. 10.1016/j.jacc.2003.10.056
Mereles D, Ehlken N, Kreuscher S, Ghofrani S, Hoeper MM, Halank M, et al.: Exercise and respiratory training improve exercise capacity and quality of life in patients with severe chronic pulmonary hypertension. Circulation 2006, 114: 1482–1489. 10.1161/CIRCULATIONAHA.106.618397
Galie N, Badesch D, Oudiz R, Simonneau G, McGoon MD, Keogh AM, et al.: Ambrisentan therapy for pulmonary arterial hypertension. J Am Coll Cardiol 2005, 46: 529–535. 10.1016/j.jacc.2005.04.050
Lowe B, Grafe K, Ufer C, Kroenke K, Grunig E, Herzog W, et al.: Anxiety and depression in patients with pulmonary hypertension. Psychosom Med 2004, 66: 831–836. 10.1097/01.psy.0000145593.37594.39
Cases of PAH in the U.S. 2008. http://en.wikipedia.org/wiki/Pulmonary_hypertension
Mucke HA: Pulmonary arterial hypertension: on the way to a manageable disease. Curr Opin Investig Drugs 2008, 9: 957–962.
Galie N, Manes A, Negro L, Palazzini M, Bacchi-Reggiani ML, Branzi A: A meta-analysis of randomized controlled trials in pulmonary arterial hypertension. Eur Heart J 2009, 30: 394–403.
Rector TS: Overview of the Minnesota Living with Heart Failure® Questionnaire. 2005. https://docs.google.com/a/umn.edu/viewer?a=v&pid=sites&srcid=dW1uLmVkdXx0bXN8Z3g6MjQxYWMzOTBkMWYwMjAwZA
Keogh AM, McNeil KD, Wlodarczyk J, Gabbay E, Williams TJ: Quality of life in pulmonary arterial hypertension: improvement and maintenance with bosentan. J Heart Lung Transplant 2007,26(2):181–187. 10.1016/j.healun.2006.11.009
Lewis GD, Shar R, Shahzad K: Sildenafil improves excercise capacity and quality of life in patients with systolic heart failure and secondary pulmonary hypertension. Circulation 2007,116(14):1555–1562. 10.1161/CIRCULATIONAHA.107.716373
Cenedese E, Speich R, Dorschner L, et al.: Measurement of quality of life in pulmonary hypertension and its significance. Eur Respir J 2006,28(4):808–815. 10.1183/09031936.06.00130405
Ulrich S, Speich R, Domenighetti G, et al.: Bosentan therapy for chronic thromboembolic pulmonary hypertension. A national open lable study assessing the effect of Bosentan on haemodynamocs, excercise capacity, quality of life, safety and tolerability in patients with chronic thromboembolic pulmonary hypertension (BOCTEPH-Study). Swiss Med Wkly 2007,137(41–42):573–580.
McLaughlin VV, Archer SL, Badesch DB, et al.: ACCF/AHA 2009 expert consensus document on pulmonary hypertension a report of the American College of Cardiology Foundation Task Force on Expert Consensus Documents and the American Heart Association developed in collaboration with the American College of Chest Physicians; American Thoracic Society, Inc.; and the Pulmonary Hypertension Association. J Am Coll Cardiol 2009,53(17):1573–1619. 10.1016/j.jacc.2009.01.004
Glaser B, Strauss AL: The Discovery of Grounded Theory: Strategies for Qualitative Research. New Jersey: Aldine de Gruyter; 1967.
Guest G, Bunce A, Johnson L: How many interviews are enough? An experiment with data saturation and variability. Field Methods 2006,18(1):59–82. Sage Publications 10.1177/1525822X05279903
American Thoracic Society: ATS statement: guidelines for the six-minute walk test. American Journal of respiratory and Critical Care Medicine 2002, 166: 111–117. http://www.atsjournals.org
Neely G, Ljunggren G, Sylven C, Borg G: Comparison between the Visual Analogue Scale (VAS) and the category ratio scale (CR-10) for the evaluation of leg exertion. Int J Sports Med 1992,13(2):133–136. 10.1055/s-2007-1021244
Borg E: On Percieved Exertion and its Measurement. Sweden: Doctoral Dissertation; 2007.
Rubin LJ: Diagnosis and management of pulmonary arterial hypertension: ACCP evidence-based clinical practice guidelines. Chest 2004,126(1 Suppl):7S-10S.
Kind P, Brooks R, Rabin R: EQ-5D concepts and methods: a developmental history. The Netherlands: Springer; 2005.
Wilke C, Pickard A: Test-Retest Reliability of the EQ-5D Visual Analog Scale across Populations and Conditions. Value In Health 2009,12(3):A30.
Cronbach LJ: Coefficient alpha and the internal structure of tests. Psychometrika 1951, 16: 297–334. 10.1007/BF02310555
Nunnally JC, Bernstein IH: Psychometric Theory. 3rd ed. ed. New York: McGraw-Hill; 1994.
Guyatt GH, Deyo RA, Charlson M, Levine MN, Mitchell A: Responsiveness and validity in health status measurements : a clarification. J Clin Epidemiol 1989,42(5):403–408. 10.1016/0895-4356(89)90128-5
Ries AL: Minimally clinically important difference for the UCSD shortness of breath questionnaire, Borg scale, and visual analog scale. COPD 2005,2(1):105–110. 10.1081/COPD-200050655
Jaeschke R, Singer J, Guyatt GH: Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials 1989,10(4):407–415. 10.1016/0197-2456(89)90005-6
Stockler MR, Osoba D, Goodwin P, Corey P, Tannock IF: Responsiveness to change in health-related quality of life in a randomized clinical trial: a comparison of the Prostate Cancer Specific Quality Of Life Instrument (PROSQOLI) with analogous scales from the EORTC QLQ-C30 and a trial specific module. European organization for research and treatment of cancer. J Clin Epidemiol 1998,51(2):137–145. 10.1016/S0895-4356(97)00269-2
Khadka J, Gothwal VK, McAlinden C, Lamoureux EL, Pesudovs K: The importance of rating scales in measuring patient-reported outcomes. Health Qual Life Outcomes 2012, 10: 80. 10.1186/1477-7525-10-80
Rector TS, Kubo SH, Cohn JN: Patients' self-assessment of the congestive heart failure. Part 2: content, reliability and validity of a new measure, the Minnesota living with heart failure questionnaire. Heart Failure 1987,114(1 pt 1):198–209.
Sloan J, Symonds T, Vargas-Chanes D, Fridley B: Practical guidelines for assessing the clinical significance of health-related quality of life changes within clinical trials. Drug Inf J 2003, 37: 23–31.
Cohen J: Statistical power analysis for the behavioral sciences. New York: Academic Press; 1977.
The Authors would like to thank Rob Arbuckle for reviewing the manuscript at different stages of development and providing expert input and comments on the content of the manuscript.
Bayer Pharma AG funded this study; Ms Bonner, Ms Abetz and Ms Meunier were contracted by Bayer Pharma AG as consultants to the study and develop the manuscript. Dr. Mathai was employed by Bayer Pharma AG as a scientific advisor. The authors declare there are no other competing interests.
NB and LA were involved in the conception, design and conduct of the qualitative and quantitative components of the study and the interpretation of all study findings. NB drafted the manuscript, LA reviewed the manuscript critically for scientific content. JM was involved in the design, conduct and analysis of the quantitative component of the study and reviewed the manuscript critically for content. MS was involved in the interpretation of study findings and drafting of the manuscript. SM critically reviewed the manuscript for scientific content and was involved in the revision of the manuscript. All authors read and approved the final manuscript.
About this article
Cite this article
Bonner, N., Abetz, L., Meunier, J. et al. Development and validation of the living with pulmonary hypertension questionnaire in pulmonary arterial hypertension patients. Health Qual Life Outcomes 11, 161 (2013). https://doi.org/10.1186/1477-7525-11-161