When patients and surgeons disagree about surgical outcome: investigating patient factors and chart note communication
© Schwartz et al. 2015
Received: 16 April 2015
Accepted: 10 September 2015
Published: 29 September 2015
Effective physician-patient communication is a critical component of a clinical practice and in order to achieve optimal patient outcomes. We aimed to investigate indirect effects of physician-patient communication by examining the relationship between a physician-patient mismatch in perceived outcomes and content in the medical record’s clinical note. We compared patient records whose perceived subjective assessment of surgery outcomes agreed or disagreed with the surgeon's perception of that outcome (Subjective Disagreement).
This study included 172 spine surgery patients at a teaching hospital. Patient-reported outcomes included the Oswestry Disability Index; the Short-Form 36; and a Visual Analogue Scale items for leg and back pain. We content-analyzed the clinical note in the medical record, and used logistic regression to evaluate predictors of Subjective Disagreement (n = 41 disagreed vs. 131 agreed).
Patient and surgeon agreed in 76 % of cases and disagreed in 24 % of cases. Patients who assessed their outcome worse than their surgeons tended to be less educated and involved in litigation. They also tended to report worsened mental health and leg pain. Content analysis revealed group differences in surgeon communication patterns in the chart notes related to how symptom change was emphasized, how follow-up was described, and a specific word reference. Specifically, disagreement was predicted by using “much” to emphasize the findings and noting long-term prognosis. Agreement was predicted by use of positive emphasis terms, having an “as-needed” follow-up plan, and using “happy” in the chart note.
The nature of measuring outcomes of surgery is based on patient perception. In surgeon-patient perspective mismatches, patient factors may serve as barriers to improvement. Worsened change on patient-reported mental health may be an independent factor which colors the patient’s general perceptions. This aspect of treatment may be missed by the spine surgeon. Chart note communication styles reflect the subjective disagreement. Investigating and/ or treating mental health deterioration may be valuable in resolving this mismatch and for overall outcome.
KeywordsCommunication Patient-reported outcomes Clinical outcomes Mismatch Response shift Medical record Disability
Central to effective communication is being able to convey orally to the patient realistic expectations about surgical outcome, as well as effective written information about a medical encounter . The purpose of this study was to examine predictors of physician-patient mismatch in perceived outcomes of spine surgery. We compared patient records whose perceived subjective assessment of surgery outcomes agreed or disagreed with the surgeon's perception of that outcome, and investigated demographic factors, patient-reported outcomes, and the clinical chart note content in the medical record.
Materials and methods
Sample and design
This prospective study included patients who had undergone one or two level decompression surgery; (discectomy or laminectomy) with leg-dominant pain. Multilevel decompressions and fusion procedures that require longer rehabilitation until maximum recovery were excluded. Excluding patients with more complicated operations was done to create a homogenous group of patients Study participants were consecutively recruited from the practices of three spine surgeons at a major teaching hospital in Canada. As standard protocol, surgeons engaged all patients in a transparent discussion about what to expect from the surgery. The surgeons explained what the surgery will improve and will not improve, i.e. leg pain will be better, back pain may or may not. Patients provided baseline data pre-surgically, and follow-up data post-surgery. The study protocol was approved by the Sunnybrook Health Sciences Centre Research Ethics Board, and all participants provided written informed consent. Participants completed self-report questionnaires pre-surgery and at first follow-up.
Three categories of measures were used in this study. Demographic characteristics were collected from the patients, including age, gender, duration of symptoms, employment status (working at present, retired, student, homemaker), smoking status (i.e., current smoker or not), and associated co-morbid health conditions and other musculoskeletal conditions. We also tracked having an incentive not to work. This variable was characterized as involvement in compensation or litigation that would serve as an incentive not to experience symptom improvement over time (e.g., currently on disability or worker’s compensation, or involved in litigation related to their illness or injury.
Standardized spine outcome measures were collected in this study: (1) the generic Short-Form-36 v1 (SF-36v1)  comprising eight domains assessing evaluative functional health, with higher scores reflecting better functional health; (2) two Likert-scaled visual analogue scale (VAS) items measuring back and leg pain on a 100-point scale, with higher scores indicating worse pain ; (3) The 10-item disease specific Oswestry Disability Index (ODI)  measuring perceived pain during activities of daily living.
Clinical chart notes were also utilized in this study. These notes included data from the neurological examination (e.g., straight leg raising, numbness, strength, walking distance); as well as the recorded subjective assessment of surgical outcome from patient and surgeon. Patients were asked how they would characterize their surgical outcome (poor, moderate, excellent). Surgeons were asked to note in the chart how they would characterize the patient’s outcome based on the objective examination, as well as their understanding of the patient’s change in function. Surgeons categorized this understanding as not improved, not fully improved, or fully improved. The chart note also provided information on whether there were complications from the surgery; and reported symptoms of leg or back pain. Additionally, written summaries of the patient’s follow-up appointment were captured verbatim. These included documentation for the patient’s medical record as well as communications to other health care providers.
Text from the clinical chart notes were content analyzed using QSR NVIVO 10 . Two independent raters read and coded all chart notes for terms or concepts that were identified after coding an initial 100 patients (see Additional file 1 for complete listing and explanation of nodes). After all records were coded by both raters, inter-rater reliability was computed using the kappa coefficient . It was greater than 90 % on most nodes. Adjudication then took place such that all differences in codes were discussed to determine the most appropriate coding for the record. This process resulted in 100 % inter-rater reliability.
Patients were characterized on the basis of whether their subjective assessment of surgery outcome was similar or disagreed with the surgeon’s assessment. We then used this grouping variable (Subjective Agreement vs Subjective Disagreement) as the dependent variable in a series of hierarchical logistic regression analyses. We began with univariate regressions within a class of variables (i.e., demographic, patient-reported outcome, clinical-chart nodes). We then computed multivariate models within a class of variables. We did not combine the three classes of variables into a single multivariate model, due to sample size constraints and the resulting limited power. We thus present the results of the three sets of models in terms of triangulating on the prediction of the grouping variable. The Type I error rate for the univariate models was p < 0.10, and p < 0.05 for the multivariable model, as per standard hierarchical modeling approaches. Stata 13  was used for logistic regression modeling.
Characterization of Patient-Surgeon Agreement on Surgery Outcomea
Not fully improved
N = 171 ( % or SD)
Mean Weeks of Follow-up (SD)
Gender: N (%)
Surgical Diagnosisa: N (%)
Co-morbidities: N (%)
Education: N (%)
Less than High School
Graduated From High School or GED
Some College or Technical School
Graduated from College
Postgraduate School or Degree
Employment Status at Pre-surgical Baselineb: N (%)
On leave of absence
Pain Medication Use at Pre-surgical Baseline: N (%)
Age: Mean Years (SD)
Pre-surgical Baseline Patient-Reported Outcome Scores: Mean (SD)
Demographic predictors of subjective disagreement
Logistic regression model of significant demographic factors predicting disagreement between doctor and patient
95 % Conf.
Demographic Factors Model
Quit over a year ago
Patient-reported outcome predictors of subjective disagreement
Logistic regression model of significant patient-reported outcomes predicting disagreement between doctor and patient
95 % Conf.
Patient-reported Outcomes Model
VAS Back Change
VAS Leg Change
Clinical-chart node predictors of subjective disagreement
Logistic regression model of significant clinical chart notes predicting disagreement between doctor and patient
95 % Conf.
Clinical Chart Notes Model
Positive Emphasis terms
Much Emphasis terms
All Emphasis terms
All Juxtaposition terms
Follow-up only if symptoms
Our results suggest that patient-surgeon perspective mismatches may relate to layers of factors, beginning with patient characteristics, continuing through patient-reported outcomes, and ending up as meta-messages (i.e., reading between the lines) transmitted in the medical record. The subtlety of these findings underscores the value of a qualitative analysis. Such patterns were detected only after careful coding and pattern analysis using a method that combined qualitative and quantitative techniques. This mixed-methods approach results in findings that would not be apparent from a simple reading of the text. They are only revealed by dint of careful data analysis.
Lower levels of education, which can reflect low health literacy, were predictive of disagreement with their surgeon. Being involved in any litigation related to their spinal disorder (i.e., worker’s compensation, disability, or other litigation) also served as a risk factor for disagreement. Our findings underscore the potential bias in the self-report of patients in secondary-gain situations [6, 7], such as worker’s compensation and litigation. Our data suggest that when people are in a situation where they benefit from not getting better, their answers to patient-reported outcome questionnaires may not be valid. No measure, no matter how well it has demonstrated reliability and validity, can counteract the influence of secondary gain. Our findings are reminiscent of early work by Hayes and colleagues documenting that psychometric test results are unreliable among patients with nonorganic signs .
An unexpected finding was that the patient’s reporting worsened mental health or worsened leg pain, (as opposed to no change in leg pain) after surgery were significant factors in subjective disagreement. It should be noted that the basis of the surgeon’s assessment is on both objective and subjective grounds; including findings on the examination, notably the presence or absence of pain on straight leg-raising.
Our content analysis revealed subtle differences in how emphasis-language was used that differentiated patients whose subjective assessment differed from their surgeons. In addition to these differing ways of emphasizing their clinical findings, the chart text differed in how long-term follow-up was described. Whereas patients whose subjective assessment agreed with their surgeons were more likely to have non-specific follow-up planned (i.e., on an as-needed basis), those who disagreed with their surgeons were more likely to have chart notes that suggested that symptoms would improve with time without mention of a specific plan for a medical encounter. Finally, those whose subjective assessment agreed with their surgeons were more likely to have the term “happy” in their medical record.
Since the clinical chart note is a largely codified document, there are many terms and content domains that must be mentioned. It is thus not surprising that many relevant content areas did not differentiate the patient groups. For example, re-engagement in activities of daily living, having an action plan, and utilizing physical therapy are all expected aspects of surgical follow-up. These were not terms or content that differentiated our groups.
The clinical record may be saying more than the written word expresses, and it is possible the surgeon may be ill-prepared to appreciate the causes of the mismatch. The deterioration in mental health may be a harbinger of other personal or social factors in the patient’s life. The worsening of mental health would not be an expected outcome from the surgery itself, but may be related to external influences (e.g., increased interpersonal conflict due to financial or marital strain related to being unwell). It may also reflect poor adaptation to the patient’s new status quo. Mental health deterioration can lead to a negative coloring on all the subjective parameters of outcome, including leg pain, which was a predictor of subjective mismatch. Our findings suggest that measuring mental health status over time is not only important for understanding the patient’s well-being, but may also help to elucidate subjective-mismatch situations. On the basis of change in patient-reported measures of mental health, an appropriate referral can be made.
The implications of our findings for improving clinical outcomes of spine surgery might focus on interventions that focus on improving health literacy and insight. Such an intervention might focus on adjusting patient expectations of surgical outcomes to be more consistent with the likely outcome, and on increasing their insight into the negative impact of litigation/compensation on their health and well-being. The impact of such interventions on the eventual chart-note and surgeon-patient communication would be a useful path for future research.
The limitations of the present work should be noted. Whereas we chose a short follow-up time period that would be reflective of the pathology and treatment studied, a longer follow-up would be of more value except that the generic nature of patient-reported outcomes can introduce other biases due to changing life events that are unrelated to the surgery. There is also a possible bias introduced by missing data. Indeed, missing data issues restricted our ability to consider all classes of factors in one multivariate model because the sample size was substantially reduced when we did so. Further, our study included patients from a small number of surgeons (3) and from a country with notable socialized healthcare (Canada), both factors that may limit the generalizability of our findings.
Future work might continue our study’s line of research by replicating its delineation of patterns associated with surgeon-patient mismatch using PROs, content analysis of clinical chart notes, and demographic factors along with collecting a information on nonorganic signs (e.g., the Waddell Nonorganic Signs Test ).
In summary, our findings underscore the multiple dimensions involved in surgeon-patient disagreement about subjective outcomes of spine surgery. In our study, this disagreement was apparent in about one quarter of the patients. Deterioration in the patient’s mental health score was a predictor of subjective disagreement, a context which may color the overall perception of the patient. The surgeon should be mindful of this, and may be in a position to facilitate other forms of support for their patients. This problem is worthy of further research to further characterize risk factors, and to investigate approaches to intervene at multiple levels to prevent disagreement and improve overall satisfaction with and outcomes of spine surgery.
We gratefully acknowledge Albert Yee, M.D., and Michael Ford, M.D., for providing access to patients; Rebecca MacDonald for her assistance with chart abstraction; Kimberly Mulvehill for her assistance with coding chart notes; and Victoria Powell, M.P.H., for assistance with data management.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Chou S, Cole G, McLaughlin K, Lockye J. CanMEDS evaluation in Canadian postgraduate training programmes: tools used and programme director satisfaction. Med Educ. 2008;42(9):879–86.View ArticlePubMedGoogle Scholar
- Fluit C, Bolhuis S, Grol R, Ham M, Feskens R, Laan R, et al. Evaluation and feedback for effective clinical teaching in postgraduate medical education: Validation of an assessment instrument incorporating the CanMEDS roles. Medical Teacher. 2012;34(11):893–901.View ArticlePubMedGoogle Scholar
- Yee A, Adjei N, Ford M, Finkelstein J. Do patient expectations of spinal surgery relate to functional outcome? Clinical Orthopaedics and Related Research. 2008;466(5):1154–61.PubMed CentralView ArticlePubMedGoogle Scholar
- Ware Jr JE, The SCD, MOS. 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30(6):473–83.View ArticlePubMedGoogle Scholar
- QSR NVIVO 10 [computer program]. Doncaster, Victoria, Australia: QSR International Pty Ltd.; 2013Google Scholar
- Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74.View ArticlePubMedGoogle Scholar
- Stata [computer program]. Version 13. 4905 Lakeway Drive, College Station, TX 778452013.Google Scholar
- Hayes B, Solyom CAE, Wing PC, Berkowitz J. Use of psychometric measures and nonorganic signs testing in detecting nomogenic disorders in low back pain patients. Spine. 1993;18(10):1254–62.View ArticlePubMedGoogle Scholar
- Waddell G, McCulloch JA, Kummel E, Venner RM. Nonorganic physical signs in low-back pain. SPINE. 1980;5(2):117–25.View ArticlePubMedGoogle Scholar