Elsevier

Spine Deformity

Volume 6, Issue 6, November–December 2018, Pages 762-770
Spine Deformity

Case Series
Predicting Surgical Complications in Patients Undergoing Elective Adult Spinal Deformity Procedures Using Machine Learning

https://doi.org/10.1016/j.jspd.2018.03.003Get rights and content

Abstract

Study Design

Cross-sectional database study.

Objective

To train and validate machine learning models to identify risk factors for complications following surgery for adult spinal deformity (ASD).

Summary of Background Data

Machine learning models such as logistic regression (LR) and artificial neural networks (ANNs) are valuable tools for analyzing and interpreting large and complex data sets. ANNs have yet to be used for risk factor analysis in orthopedic surgery.

Methods

The American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) database was queried for patients who underwent surgery for ASD. This query returned 4,073 patients, which data were used to train and evaluate our models. The predictive variables used included sex, age, ethnicity, diabetes, smoking, steroid use, coagulopathy, functional status, American Society of Anesthesiologists (ASA) class >3, body mass index (BMI), pulmonary comorbidities, and cardiac comorbidities. The models were used to predict cardiac complications, wound complications, venous thromboembolism (VTE), and mortality. Using ASA class as a benchmark for prediction, area under receiver operating characteristic curves (AUC) was used to determine the accuracy of our machine learning models.

Results

The mean age of patients was 59.5 years. Forty-one percent of patients were male whereas 59.0% of patients were female. ANN and LR outperformed ASA scoring in predicting every complication (p<.05). The ANN outperformed LR in predicting cardiac complication, wound complication, and mortality (p<.05).

Conclusions

Machine learning algorithms outperform ASA scoring for predicting individual risk prognosis. These algorithms also outperform LR in predicting individual risk for all complications except VTE. With the growing size of medical data, the training of machine learning on these large data sets promises to improve risk prognostication, with the ability of continuously learning making them excellent tools in complex clinical scenarios.

Level of Evidence

Level III.

Introduction

The advent of digital technology, machine learning and deep learning in particular, is increasingly making it possible to utilize big data to more precisely risk stratify and prognosticate how an individual patient will behave given a disease or intervention. Machine learning has already been used in other realms such as retail and search engines. However, healthcare has lagged in the uptake of newer techniques to leverage the rich information contained in electronic health records (EHRs).

The practice of evidence-based medicine has sustained the progress seen in modern care and diagnosis. Traditional statistical approaches have gleaned much about what is known regarding risk factors used for prognostication. Machine learning (ML) combines these fundamental statistical insights with modern high-performance computing to learn patterns that can be used for recognition and prediction. Importantly, machine learning often identifies patterns that are not readily apparent to human intuition, thus identifying otherwise unknown connections [1]. Multivariate logistic regression and artificial neural networks are the two most commonly used machine learning models employed in medicine [2]. Artificial neural networks were first developed to model the neural architecture of the brain. Harnessing the structure of biology, artificial neural networks (ANNs) are particularly well suited for modeling complex, nonlinear data when little is known regarding the underlying distribution of the data or colinearity among the variables [3]. Importantly, ANNs can perform these functions without prior assumptions, leading to a highly adaptable system less susceptible to anchoring biases [3]. However, similar to any machine learning algorithm, neural networks are susceptible to intrinsic limitations and biases of the underlying data set. Additionally, limitations of model design such as neural network architecture, feature selection, and optimization functions can lead to model biases and overfitting that decrease generalizability and prognostication value of neural networks on external data [4]. Advancements in neural network science and proper implementation with recognition of these limitations are important for future integration of machine learning in surgical practice, and the utility of machine learning in adult deformity surgery has not yet been explored.

Adult spinal deformity (ASD) is a spinal disorder defined as a complex spectrum of spinal diseases that present in adulthood including adult scoliosis (progression of childhood scoliosis), degenerative scoliosis, sagittal and coronal imbalance, and iatrogenic deformity (with or without spinal stenosis) [5]. Adult degenerative scoliosis is the most common cause of ASD and is commonly seen in elderly adults, particularly those older than 60 years, as degeneration of intervertebral discs and facet joints exacerbate scoliotic curvature [6]. With the aging baby boomer generation and overall population structure of the United States, it is not surprising that the demand and prevalence for ASD surgery continues to increase [7]. In the burgeoning era of rising healthcare costs and greater scrutiny over surgical outcomes, there has been increasing emphasis on understanding the risk factors and possible predictors to optimize perioperative planning and management. Data-driven clinical decision support tools have the potential to lead to cost savings by leveraging the information contained in large medical databases. Uptake of machine learning approaches in the realm of spinal surgery have lagged. However, the patient population and associated increased rate of postoperative complications renders ASD a prime target for quality improvement through the utilization of machine learning.

ML algorithms have the capability of “learning” using newly generated information to improve their predictive capability. Briefly explained, these algorithms work by utilizing a subset of the overall study data (70% in this case) to “train” and create an accurate predictive model. This established model is then validated using the remainder of the data to determine the accuracy of the post-training model. This study seeks to develop and validate ML algorithms to precisely predict complications following ASD using a national database, in order to compare ML algorithms with logistic regression (LR) or American Society of Anesthesiologists (ASA) classification.

Section snippets

Patient selection and preprocessing

The National Surgical Quality Improvement Program (NSQIP) database was used for the purpose of training and validating ANN and LR models. Adult patients (≥18 years) undergoing adult deformity surgery were identified based on Current Procedural Terminology (CPT) codes 22800, 22802, 22804, 22808, 22810, 22812, 22818, 22819. CPT codes 22843, 22844, 22846, or 22847 were also included to capture long, multilevel fusion constructs. Patients with CPT code 22842 and 22845 were included if they had an

Data and analysis pipeline

A total of 5,818 patients were identified as having undergone ASD surgery between 2010 and 2014. Among this cohort, 4,073 patients (70%) were included into the training set and 1,746 patients (30%) were used as a holdout training set for evaluating the trained machine learning models (Fig. 1). Following our exclusion criteria, 2,376 (41.0%) of patients were male, whereas 3,418 (59.0%) were female. The mean age was 59.5 years old and the cohort exhibited low rates of complications across all

Discussion

With the advent of large, prospective, multi-institutional clinical registries, physicians have access to large amounts of diverse, high-quality clinical data. This has given birth to ideas such as “precision medicine” with the goal of developing quantitative models that can be used to predict health status, prognosticate disease processes, prevent disease, and reduce complications. Previous groups have employed the use of ANNs and other ML models to these data sets [13], [14], [15], [16].

References (24)

  • J.A. Youssef et al.

    Current status of adult spinal deformity

    Global Spine J

    (2013)
  • J.K. Birknes et al.

    Adult degenerative scoliosis: a review

    Neurosurgery

    (2008)
  • Cited by (53)

    • Use of computer navigation and robotics in adult spinal deformity

      2022, Seminars in Spine Surgery
      Citation Excerpt :

      Between 2012 and 2015, the robotic-assisted technique was found to perform worse or similar to freehand technique.14,17,25-27 From 2016 – 2019, there were mixed results, with five studies finding the techniques to be equivalent28-32 and four separate studies showing superiority of the robotic-assisted techniques to the free hand technique.33-36 Some of these discrepancies may be the result of the evolution of these technologies over time or the steep learning curve that individual surgeons face when they adopt these systems into their clinical practice.

    • Current understanding on artificial intelligence and machine learning in orthopaedics – A scoping review

      2022, Journal of Orthopaedics
      Citation Excerpt :

      One of the major challenges faced during clinical practice is risk assessment and predicting outcomes, AI provides a different approach that helps in overcoming the challenges during clinical practice. ML can be used to guide the management of patients by providing a patient-specific post-operative complication following lumbar surgery.16 AI technique assists the treating physician in making a decision or diagnosis for the patients.

    View all citing articles on Scopus

    Author disclosures: JSK (none), VA (none), EKO (none), DK (none), WR (none), CU (none), AKH (none), JC (none), SKC (grants from Zimmer, Orthopaedic Research and Education Foundation, and Stryker, outside the submitted work).

    This study was approved by the Institutional Review Board of the Icahn School of Medicine at Mount Sinai, New York, NY.

    View full text