Predicting Surgical Complications in Patients Undergoing Elective Adult Spinal Deformity Procedures Using Machine Learning

doi:10.1016/j.jspd.2018.03.003

Spine Deformity

Volume 6, Issue 6, November–December 2018, Pages 762-770

https://doi.org/10.1016/j.jspd.2018.03.003 Get rights and content

Abstract

Study Design

Cross-sectional database study.

Objective

To train and validate machine learning models to identify risk factors for complications following surgery for adult spinal deformity (ASD).

Summary of Background Data

Machine learning models such as logistic regression (LR) and artificial neural networks (ANNs) are valuable tools for analyzing and interpreting large and complex data sets. ANNs have yet to be used for risk factor analysis in orthopedic surgery.

Methods

The American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) database was queried for patients who underwent surgery for ASD. This query returned 4,073 patients, which data were used to train and evaluate our models. The predictive variables used included sex, age, ethnicity, diabetes, smoking, steroid use, coagulopathy, functional status, American Society of Anesthesiologists (ASA) class >3, body mass index (BMI), pulmonary comorbidities, and cardiac comorbidities. The models were used to predict cardiac complications, wound complications, venous thromboembolism (VTE), and mortality. Using ASA class as a benchmark for prediction, area under receiver operating characteristic curves (AUC) was used to determine the accuracy of our machine learning models.

Results

The mean age of patients was 59.5 years. Forty-one percent of patients were male whereas 59.0% of patients were female. ANN and LR outperformed ASA scoring in predicting every complication (p<.05). The ANN outperformed LR in predicting cardiac complication, wound complication, and mortality (p<.05).

Conclusions

Machine learning algorithms outperform ASA scoring for predicting individual risk prognosis. These algorithms also outperform LR in predicting individual risk for all complications except VTE. With the growing size of medical data, the training of machine learning on these large data sets promises to improve risk prognostication, with the ability of continuously learning making them excellent tools in complex clinical scenarios.

Level of Evidence

Level III.

Introduction

The advent of digital technology, machine learning and deep learning in particular, is increasingly making it possible to utilize big data to more precisely risk stratify and prognosticate how an individual patient will behave given a disease or intervention. Machine learning has already been used in other realms such as retail and search engines. However, healthcare has lagged in the uptake of newer techniques to leverage the rich information contained in electronic health records (EHRs).

The practice of evidence-based medicine has sustained the progress seen in modern care and diagnosis. Traditional statistical approaches have gleaned much about what is known regarding risk factors used for prognostication. Machine learning (ML) combines these fundamental statistical insights with modern high-performance computing to learn patterns that can be used for recognition and prediction. Importantly, machine learning often identifies patterns that are not readily apparent to human intuition, thus identifying otherwise unknown connections [1]. Multivariate logistic regression and artificial neural networks are the two most commonly used machine learning models employed in medicine [2]. Artificial neural networks were first developed to model the neural architecture of the brain. Harnessing the structure of biology, artificial neural networks (ANNs) are particularly well suited for modeling complex, nonlinear data when little is known regarding the underlying distribution of the data or colinearity among the variables [3]. Importantly, ANNs can perform these functions without prior assumptions, leading to a highly adaptable system less susceptible to anchoring biases [3]. However, similar to any machine learning algorithm, neural networks are susceptible to intrinsic limitations and biases of the underlying data set. Additionally, limitations of model design such as neural network architecture, feature selection, and optimization functions can lead to model biases and overfitting that decrease generalizability and prognostication value of neural networks on external data [4]. Advancements in neural network science and proper implementation with recognition of these limitations are important for future integration of machine learning in surgical practice, and the utility of machine learning in adult deformity surgery has not yet been explored.

Adult spinal deformity (ASD) is a spinal disorder defined as a complex spectrum of spinal diseases that present in adulthood including adult scoliosis (progression of childhood scoliosis), degenerative scoliosis, sagittal and coronal imbalance, and iatrogenic deformity (with or without spinal stenosis) [5]. Adult degenerative scoliosis is the most common cause of ASD and is commonly seen in elderly adults, particularly those older than 60 years, as degeneration of intervertebral discs and facet joints exacerbate scoliotic curvature [6]. With the aging baby boomer generation and overall population structure of the United States, it is not surprising that the demand and prevalence for ASD surgery continues to increase [7]. In the burgeoning era of rising healthcare costs and greater scrutiny over surgical outcomes, there has been increasing emphasis on understanding the risk factors and possible predictors to optimize perioperative planning and management. Data-driven clinical decision support tools have the potential to lead to cost savings by leveraging the information contained in large medical databases. Uptake of machine learning approaches in the realm of spinal surgery have lagged. However, the patient population and associated increased rate of postoperative complications renders ASD a prime target for quality improvement through the utilization of machine learning.

ML algorithms have the capability of “learning” using newly generated information to improve their predictive capability. Briefly explained, these algorithms work by utilizing a subset of the overall study data (70% in this case) to “train” and create an accurate predictive model. This established model is then validated using the remainder of the data to determine the accuracy of the post-training model. This study seeks to develop and validate ML algorithms to precisely predict complications following ASD using a national database, in order to compare ML algorithms with logistic regression (LR) or American Society of Anesthesiologists (ASA) classification.

Section snippets

Patient selection and preprocessing

The National Surgical Quality Improvement Program (NSQIP) database was used for the purpose of training and validating ANN and LR models. Adult patients (≥18 years) undergoing adult deformity surgery were identified based on Current Procedural Terminology (CPT) codes 22800, 22802, 22804, 22808, 22810, 22812, 22818, 22819. CPT codes 22843, 22844, 22846, or 22847 were also included to capture long, multilevel fusion constructs. Patients with CPT code 22842 and 22845 were included if they had an

Data and analysis pipeline

A total of 5,818 patients were identified as having undergone ASD surgery between 2010 and 2014. Among this cohort, 4,073 patients (70%) were included into the training set and 1,746 patients (30%) were used as a holdout training set for evaluating the trained machine learning models (Fig. 1). Following our exclusion criteria, 2,376 (41.0%) of patients were male, whereas 3,418 (59.0%) were female. The mean age was 59.5 years old and the cohort exhibited low rates of complications across all

Discussion

With the advent of large, prospective, multi-institutional clinical registries, physicians have access to large amounts of diverse, high-quality clinical data. This has given birth to ideas such as “precision medicine” with the goal of developing quantitative models that can be used to predict health status, prognosticate disease processes, prevent disease, and reduce complications. Previous groups have employed the use of ANNs and other ML models to these data sets [13], [14], [15], [16].

References (24)

S. Dreiseitl et al.
Logistic regression and artificial neural network classification models: a methodology review
J Biomed Inform
(2002)
W.H. Crown
Potential application of machine learning in health outcomes research and some statistical cautions
Value Health
(2015)
A.L. Blum et al.
Selection of relevant features and examples in machine learning
Artif Intell
(1997)
R. Iorio et al.
Diabetes mellitus, hemoglobin A_1C, and the incidence of total joint arthroplasty infection
J Arthroplasty
(2012)
A. Van Esbroeck et al.
Quantifying surgical complexity with machine learning: looking beyond patient factors to improve surgical models
Surgery
(2014)
S. Sohn et al.
Detection of clinically important colorectal surgical site infection using Bayesian network
J Surg Res
(2017)
K.J. Cios et al.
Uniqueness of medical data mining
Artif Intell Med
(2002)
A.K. Jain et al.
39 Dimensionality and sample size considerations in pattern recognition practice
Handbook Stat
(1982)
J.A. Cruz et al.
Applications of machine learning in cancer prediction and prognosis
Cancer Inform
(2007)
R.C. Deo
Machine learning in medicine
Circulation
(2015)

J.A. Youssef et al.

Current status of adult spinal deformity

Global Spine J

(2013)

J.K. Birknes et al.

Adult degenerative scoliosis: a review

Neurosurgery

(2008)

Cited by (53)

Artificial intelligence and the orthopaedic surgeon: A review of the literature and potential applications for future practice: Current concepts
2024, Journal of ISAKOS
Recommended Requirements and Essential Elements for Proper Reporting of the Use of Artificial Intelligence Machine Learning Tools in Biomedical Research and Scientific Publications
2024, Arthroscopy - Journal of Arthroscopic and Related Surgery
Essential elements required for proper use of artificial intelligence machine learning tools in biomedical research and scientific publications include (1) explanation justifying why a machine learning approach contributes to the purpose of the study; (2) description of the adequacy of the data (input) to produce the desired results (output); (3) details of the algorithmic (i.e., computational) approach including methods for organizing the data (preprocessing); the machine learning computational algorithm(s) assessed; on what data the models were trained; the presence of bias and efforts to mitigate these effects; and the methods for quantifying the variables (features) most influential in determining the results (e.g., Shapley values); (4) description of methods, and reporting of results, quantitating performance in terms of both model accuracy and model calibration (level of confidence in the model’s predictions); (5) availability of the programming code (including a link to the code when available—ideally, the code should be available); (6) discussion of model internal validation (results applicable and sensitive to the population investigated and data on which the model was trained) and external validation (were the results investigated as to whether they are generalizable to different populations? If not, consideration of this limitation and discussion of plans for external validation, i.e., next steps). As biomedical research submissions using artificial intelligence technology increase, these requirements could facilitate purposeful use and comprehensive methodological reporting.
A bibliometric analysis of patient-reported outcome measures in adult spinal deformity, and the future of patient-centric outcome assessments in the era of predictive analytics
2023, Seminars in Spine Surgery
Surgical treatment of adult spinal deformity (ASD) is associated with high resource utilization, high costs, and potential complications. In order to generate consensus for treatment paradigms and to demonstrate value, it is crucial to accurately assess clinical outcomes. Historically, objective assessments in ASD were performed by providers, and in recent decades the use of patient-reported outcome measures (PROMs) have become widely incorporated. Here, we report results of a bibliometric analysis of PROMs in ASD, synthesizing a global view of the topic and mapping trends in the field. In the modern era, enabling advancements in predictive modeling and machine learning, along with technology within smartphones and wearables, may supplement traditional patient-centric outcomes assessments and overcome some of their limitations.
Use of computer navigation and robotics in adult spinal deformity
2022, Seminars in Spine Surgery
Citation Excerpt :
Between 2012 and 2015, the robotic-assisted technique was found to perform worse or similar to freehand technique.14,17,25-27 From 2016 – 2019, there were mixed results, with five studies finding the techniques to be equivalent28-32 and four separate studies showing superiority of the robotic-assisted techniques to the free hand technique.33-36 Some of these discrepancies may be the result of the evolution of these technologies over time or the steep learning curve that individual surgeons face when they adopt these systems into their clinical practice.
Recent years have seen significant advancements in the implementation of computer-assisted surgery in spine. Enabling technologies like robots and navigation have been refined to compliment the field's shift towards minimally invasive techniques and to fit more seamlessly into the existing workflow. Robotic-surgery and navigation in deformity can be particularly helpful in cases where the severe curves of the spinal column or the abnormal pedicle anatomy make pedicle screw placement challenging with the use of traditional anatomic landmarks. Furthermore, the ability to pre-plan patient specific rods has opened the door for greater precision in rod contouring. Drawbacks of robotic-assisted surgery include the steep upfront cost, the need for additional staff and training and the lack of tactile feedback. This review will discuss the current state of navigation and robotics, with a specific focus on their applications to deformity surgery.
Current understanding on artificial intelligence and machine learning in orthopaedics – A scoping review
2022, Journal of Orthopaedics
Citation Excerpt :
One of the major challenges faced during clinical practice is risk assessment and predicting outcomes, AI provides a different approach that helps in overcoming the challenges during clinical practice. ML can be used to guide the management of patients by providing a patient-specific post-operative complication following lumbar surgery.16 AI technique assists the treating physician in making a decision or diagnosis for the patients.
Artificial Intelligence (AI) has improved the way of looking at technological challenges. Today, we can afford to see many of the problems as just an input-output system rather than solving from the first principles. The field of Orthopaedics is not spared from this rapidly expanding technology. The recent surge in the use of AI can be attributed mainly to advancements in deep learning methodologies and computing resources. This review was conducted to draw an outline on the role of AI in orthopaedics.
We developed a search strategy and looked for articles on PubMed, Scopus, and EMBASE. A total of 40 articles were selected for this study, from tools for medical aid like imaging solutions, implant management, and robotic surgery to understanding scientific questions.
A total of 40 studies have been included in this review. The role of AI in the various subspecialties such as arthroplasty, trauma, orthopaedic oncology, foot and ankle etc. have been discussed in detail.
AI has touched most of the aspects of Orthopaedics. The increase in technological literacy, data management plans, and hardware systems, amalgamated with the access to hand-held devices like mobiles, and electronic pads, augur well for the exciting times ahead in this field. We have discussed various technological breakthroughs in AI that have been able to perform in Orthopaedics, and also the limitations and the problem with the black-box approach of modern AI algorithms. We advocate for better interpretable algorithms which can help both the patients and surgeons alike.
A scoping review of complication prediction models in spinal surgery: An analysis of model development, validation and impact
2022, North American Spine Society Journal
Predictive analytics are being used increasingly in the field of spinal surgery with the development of models to predict post-surgical complications. Predictive models should be valid, generalizable, and clinically useful. The purpose of this review was to identify existing post-surgical complication prediction models for spinal surgery and to determine if these models are being adequately investigated with internal/external validation, model updating and model impact studies.
This was a scoping review of studies pertaining to models for the prediction of post-surgical complication after spinal surgery published over 10 years (2010-2020). Qualitative data was extracted from the studies to include study classification, adherence to Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) guidelines and risk of bias (ROB) assessment using the Prediction model study Risk Of Bias Assessment Tool (PROBAST). Model evaluation was determined using area under the curve (AUC) when available. The Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement was used as a basis for the search methodology in four different databases.
Thirty studies were included in the scoping review and 80% (24/30) included model development with or without internal validation. Twenty percent (6/30) were exclusively external validation studies and only one study included an impact analysis in addition to model development and internal validation. Two studies referenced the TRIPOD guidelines and there was a high ROB in 100% of the studies using the PROBAST tool.
The majority of post-surgical complication prediction models in spinal surgery have not undergone standardized model development and internal validation or adequate external validation and impact evaluation. As such there is uncertainty as to their validity, generalizability, and clinical utility. Future efforts should be made to use existing tools to ensure standardization in development and rigorous evaluation of prediction models in spinal surgery.

View all citing articles on Scopus

: Author disclosures: JSK (none), VA (none), EKO (none), DK (none), WR (none), CU (none), AKH (none), JC (none), SKC (grants from Zimmer, Orthopaedic Research and Education Foundation, and Stryker, outside the submitted work).

: This study was approved by the Institutional Review Board of the Icahn School of Medicine at Mount Sinai, New York, NY.

View full text

Case SeriesPredicting Surgical Complications in Patients Undergoing Elective Adult Spinal Deformity Procedures Using Machine Learning

Abstract

Study Design

Objective

Summary of Background Data

Methods

Results

Conclusions

Level of Evidence

Introduction

Section snippets

Patient selection and preprocessing

Data and analysis pipeline

Discussion

J Biomed Inform

Value Health

Artif Intell

J Arthroplasty

Surgery

J Surg Res

Artif Intell Med

Handbook Stat

Applications of machine learning in cancer prediction and prognosis

Cancer Inform

Machine learning in medicine

Circulation

Current status of adult spinal deformity

Global Spine J

Adult degenerative scoliosis: a review

Neurosurgery

Case Series
Predicting Surgical Complications in Patients Undergoing Elective Adult Spinal Deformity Procedures Using Machine Learning