Introduction

Surgical management of symptomatic cervical degenerative disc disease (DDD) includes anterior cervical decompression and fusion (ACDF) or cervical total disc replacement (CTDR), and posterior foraminotomy [1]. These treatment modalities have been investigated extensively in clinical trials and are effective in reducing radicular pain, with CTDR having a lower revision and secondary surgery rate [2,3,4,5,6,7,8,9,10,11]. Data generated from multiple prospective randomised controlled studies comparing CTDR to ACDF have led to Food and Drug Administration (FDA) approval for eight devices [12]. These studies were industry sponsored, raising concerns of bias. A Cochrane systematic review comparing research sponsored by industry found that the treatment benefits were more likely to favour the sponsor’s products, and the authors’ conclusions were more favourable. Noting that the data collected were closely audited by the FDA, this potential issue should be mitigated. Meta-analyses have been carried out on these clinical trials to synthesise the results for comparison [13]. According to these meta-analyses, CTDR seems to be superior to ACDF with regard to most clinical parameters [14, 15]. Given the short-term clinical success of CTDR around the world, criticisms in the literature regarding long-term durability of these prostheses resulted in publications of midterm results [16,17,18].

The successful surgical treatment of DDD with CTDR requires collaboration between surgeons, engineers, and scientists to provide an evidence-based approach to optimise long-term outcomes. Each depends on the other for success, and the patient depends on all for success. Biomechanical simulations indicated high durability across the tested implants [19]. Multi-axis testing machines have combined in vivo kinematics and loading with in vitro testing in six degrees of freedom to offer more accurate predictions of the performance of new spinal instrumentation [20,21,22]. Initially there were no reported device failures among the identified studies. Implants were tested from 10 to 20 million cycles in axial loading, flexion and extension, and lateral bending. Optimal testing should be performed for up to 40 million cycles. During an average life, the spine may undergo 100 million cycles. No device failures were reported during these testing cycles and it has been suggested that this is equivalent to 50–100 years of in vivo wear [23, 24].

There are several biomechanical requirements for CTDR, perhaps the most important is maintenance of physiological motion. Studies have shown negative patient-reported outcome measures (PROMs) regarding hypomobility and hypermobility [25, 26]. Depending on the implant type, prostheses need to be able to tolerate load and minimise friction and wear. The device endplates must provide osteointegration qualities with bone endplates. These prostheses must be durable over a patient’s lifetime. Reeks and Liang state that total disc replacement (TDR) introduces unique problems that are separate from interbody fusion, including hypermobility, hypomobility, material wear, and particulate debris [27, 28].

The M6-C™ Artificial Cervical Disc (Orthofix, Lewisville, Texas) (M6-C) was designed as an innovative option for patients needing CTDR as an alternative to ACDF. The M6-C device incorporated a compressible polycarbonate urethane polymer (PCU) core (artificial nucleus). The annulus is mimicked by a woven fibre construct from ultra-high-molecular-weight-polyethylene (UHMWPE) fibres and is responsible for facilitating the semi-constrained six-degrees of freedom [29]. An additional design feature is the polymer sheath that surrounds the core and woven fibre construct to limit tissue ingrowth and contain possible wear debris. The M6-C artificial nucleus and annulus were designed to provide the same physiological motion characteristics of a natural disc, including compressive deformation to axial load [30]. The M6-C has two titanium outer plates coated with a titanium plasma spray to promote bony ingrowth. The initial stability was provided by two keels on each endplate. The device was implanted outside the USA before the completion of the FDA Investigational Device Exemption (IDE) study [31].

Methods

This study was a retrospective cohort analysis from a prospective study of 382 patients who received a standalone CTDR or as part of a hybrid procedure (ACDF/CTDR). This study included 95 PCM® (Nuvasive, San Diego, California), 34 Discover® (Depuy Synthes, Raynham, Massachusetts), 189 Prodisc® C Vivo (Centinel Spine, West Chester, Pennsylvania), 9 CP-ESP® (FH Orthopaedics, Heimsbrunn, Alsace), 2 Mobi-C® (Zimmer Biomet, Austin, Texas), and 53 M6-C. This analysis was prompted by the clinical and radiologic failure of patients receiving the M6-C CTDR, after an Australian Therapeutic Goods Administration (TGA) Implant Hazard Alert was received. The TGA Implant Hazard Alert is available as an Online Resource (Online Resource 1). A search of the Database of adverse event notifications (DAEN) revealed 20 notifications to the TGA up to 4 August 2021, with the majority of these notifications referencing osteolysis and/ or implant failure.

All patients were contacted, and further radiologic and clinical reviews were performed. Plain films, bone scans, and magnetic resonance imaging (MRI) are unreliable in detecting osteolysis. A computed topography (CT) scan is the investigation of choice with 3-dimensional reformats (Fig. 1). The retrieval analysis revealed macroscopic disc failure and histologic evidence of wear induced osteolysis around the revised M6-C prosthesis raising significant concerns regarding durability of this device. Bacteriological contamination was not detected [32].

Fig. 1
figure 1

Mean differences in pain from baseline and 95% confidence interval (CI) at 3-, 6- and 12-months post-surgery for prosthesis groups M6-C™ (Orthofix, Lewisville, Texas) and other cervical total disc replacements (CTDR). a Visual Analogue Score neck (VAS-N) improvement was statistically and clinically significant within both groups. b Visual Analogue Score arm (VAS-A) improvement was statistically significant within both groups, and clinically significant for ‘Other CTDR’ group. Differences between groups were neither statistically nor clinically significant for the neck and arm pain outcomes

PROMs, including Visual Analogue Score neck (VAS-N), and arm (VAS-A), and the Neck Disability Index (NDI), were collected preoperatively, then postoperatively at 3, 6, and 12 months, and annually thereafter. Descriptive statistics were reported as mean (SD) for normally distributed continuous variables or median (IQR) for skewed variables. Categorical variables were summarised as counts and percentages. Differences in VAS-N, VAS-A, and NDI scores from baseline to follow-up timepoint within groups were compared using the paired t-test. Differences in continuous variables between the groups M6-C and other CTDR were compared using the independent t-test or the Mann–Whitney U test, depending on data distribution. Bonferroni correction was applied to multiple comparisons for each outcome so that statistical significance was assessed at the 0.006 level. Graphical representations of the mean changes from baseline and 95% confidence interval (CI) were plotted for each prosthesis group, along with the minimum clinically important difference (MCID) of 25 points for VAS-N and VAS-A, and 15 points for the NDI. All statistical analyses were carried out using R version 4.0.2.

Indications for the index surgery included cervical radiculopathy, myelopathy, and radiculomyelopathy secondary soft tissue herniations. Contraindications to CTDR included osteoporosis, advanced spondylosis, retro vertebral compression, kyphotic deformity, and facet arthropathy. A right sided Smith-Robinson approach was utilized with the neck secured in place in a neutral position, this being verified by image intensifier. Distraction pins were placed into the midline of vertebral body above and below the disc to be replaced. A complete discectomy and removal of the complete cartilaginous endplate to the posterior annulus and uncovertebral joints was performed. Following this uncovertebral decompression was performed followed by resection of the posterior annulus and longitudinal ligament. Removal of the sequestered fragment and osteophytes completed cord and neuroforaminal decompression. Sizing trials were then performed to achieve the best fit in the height, width, and depth. The same surgical technique was performed on all CTDRs.

Prospectively collected PROMs of the M6-C and the other CTDR cohort (Tables 1 and 2) are tabulated and displayed in Figs. 2 and 3. Characteristics of all CTDR surgery patients (Table 3) were tabulated. Revision rates were compared with other prostheses (Table 4), considering that the same indications and surgical techniques were utilized, and the follow-up was longer than 5 years for most of the prostheses. Subsequent surgical interventions were classified as revisions, removals, reoperations, or supplemental fixations (Table 5).

Table 1 VAS raw and change score outcomes for neck and arm pain over time
Table 2 NDI raw and change score outcomes over time
Fig. 2
figure 2

Mean differences in Neck Disability Index (NDI) score from baseline and 95% confidence interval (CI) at 3-, 6- and 12-months post-surgery for the M6-C™ (Orthofix, Lewisville, Texas) and other cervical total disc replacement (CTDR) groups. NDI was statistically and clinically significant within both groups. Differences between groups were neither statistically nor clinically significant

Fig. 3
figure 3

Patient 13 radiological images pre- and post-revision surgery of the M6-C™ (Orthofix, Lewisville, Texas), a. Pre-operative plain X-ray film with anterior migration secondary to osteolysis. b. Pre-operative SPECT scan with no osteoblastic activity. c. Pre-operative MRI scan with an interference signal making interpretation difficult. d. Pre-operative CT scan showing extensive osteolysis and potential mechanical and neurological instability. e. Post-operative plain X-ray film with conversion to anterior cervical fusion

Table 3 Characteristics of 382 surgery patients
Table 4 Revision rates for patients following CTDR
Table 5 Classification of subsequent surgical interventions

Figures 2 and 3 show that all differences from baseline were clinically significant, except for VAS-A in the M6-C group. None of the differences between prosthesis groups were clinically significant.

Tables 2 and 3 show that the mean change-from-baseline scores were all statistically significant. However, there were no statistically significant differences in improvement between the M6-C and other CTDR groups. The largest difference was at 3 months with VAS-A improvement being lower in the M6-C group by a mean of 11.0 points (p = 0.049; not significant at the 0.006 level).

Results

Fifty-three patients received M6-C CTDR. Eighteen patients (34%) were revised at an average of 67 months postoperatively. The PROMs of the two groups were similar, indicating that the failure mode (wear-induced osteolysis) is often asymptomatic. The demographics of the two groups were also similar, with more women subjected to revision surgery than men. All surgeries (except for one case) were revised using a right anterior approach. Dissection superior to the omohyoid gained access to the cervical spine. The level was verified by an image intensifier, and the longus colli were mobilised to gain exposure to the device. Caspar distraction pins were inserted, and the anterior osteophytes and scar tissue were removed to visualise the entire implant. The disrupted polyethylene weaves and PCU core was removed. A small osteotome dislodged the titanium endplates. The osteolytic defects were then curetted, and a decision to replace, fuse, or perform vertebrectomy was made depending on the integrity of the bony endplates and facets, and bone stock. There were no intra-or post-operative complications.

The 18 cases of index-level revision of the M6-C prosthesis were all performed for polyethylene wear-induced osteolysis. There were three one-level CTDR, four two-level hybrids, seven three-level hybrids, and three four-level hybrids revised anteriorly. All patients underwent removal of the prosthesis and were treated according to the extent of osteolysis. There were four vertebrectomies, six revisions to ACDF, and seven revisions to another CTDR. One patient underwent supplemental fixation using a posterior approach. There was only one other mechanical failure in the other CTDR revision cohort. This was a CP-ESP that had mechanical failure of the internal coupling at 3 months postoperatively and presented as an asymptomatic subluxation. This has been revised to include another CTDR. The other 10 revisions occurred for combinations of subsidence, uncovertebral, and facet arthropathy causing recurrent neural compression and radiculopathy. These were all converted to the ACDF.

Discussion

There is limited data on the long-term effects of spinal loading on CTDR prostheses and the immune response that is initiated secondary to wear particles [33]. This presents significant challenges for developers of CTDR prostheses [34]. Understanding the properties of implantable biomaterials is important, considering the movement within the implanted biomaterials and between the implanted biomaterial and the natural tissues under normal in vivo physiological loads. Additionally, different surgeons, techniques, and pathologies [35] may result in an array of unique forces that create wear particles. These may be of different sizes and shapes and, consequently, a specific biological response may develop unique to that recipient, which can influence the immune response to a prosthesis debris and thereby the clinical course [36]. The mechanisms of osteolysis-induced aseptic loosening following joint arthroplasty and subsequent revision strategies are well understood [37, 38]. Revision surgery has been shown to be more technically demanding, associated with a higher complication rate, poorer outcomes, and considerable costs, both in direct and indirect terms.

Aseptic loosening, secondary to periprosthetic osteolysis, can have catastrophic secondary effects [39, 40]. Mechanical and neurological injuries have already been reported as case studies with the M6-C CTDR [41,42,43,44]. Blumenthal (2021) presented five cases of late osteolytic reaction after CTDR who required reoperation. Four of those cases involved the M6-C disc, while one involved the Prestige™ LP Cervical Disc System (Medtronic, Memphis, Tennessee) [45]. All M6-C disc retrievals showed evidence of wear and there was Propionibacterium acnes detected. A subsequent study by Mobbs suggest this may have been a contaminant [32].

There have been more than 55,000 M6-C prostheses inserted around the world as of 2019 [46]. The 2020 Demographics of Spinal Disc Arthroplasty Report is based on the analysis of 7,325 spinal disc procedures recorded by the Australian Orthopaedic Association National Joint Replacement Registry (AOANJR), with a procedure date up to and including 31 December 2019. In July 2020, the TGA issued an Implant Hazzard Alert about the M6-Cstating, “Routine long term clinical and radiographic monitoring of patients implanted with the M6 is suggested…. Changes in disc position, loss of height and periprosthetic bone loss may be indicative of onset of osteolysis”. The TGA alert initiated this study. Since 2016, 1,460 M6-C have been implanted in Australia. [6] At the current revision rate, approximately 496 patients may require revision for mechanical failure over the next 5–10 years [47].

The long-term success of a CTDR requires an optimised compromise among implant material, design, and biological performance [39]. Implants were initially designed as ball-and-socket CTDR devices and were mobile, fixed, or constrained. The mid- and long-term clinical success of most of these devices has been published [16,17,18]. A more recent design is the one-piece non-articulating disc implant (Monoblock), which replicates the physiologic movement of a normal disc and reduces the number of wears contributing articulations [48]. They were developed to address concerns about the ball-and-socket designs, which provided no axial cushioning and extra-physiological movement in rotation and lateral bending, leading to excessive stresses being placed on the facet joints [27]. The Monoblock’s nucleus/annulus couple in the new designs must be able to undergo repetitive elastic deformation without failure under physiological loads, raising new concerns regarding the choice of materials for the constitution of the viscoelastic cushion. An early example was the AcroFlex lumbar TDR (DePuy Acromed, Raynham, Massachusetts). This design used a polyolefin rubber to mimic the mechanical behaviour of a natural disc. Although the device was tested for biological and biomechanical compatibility prior to clinical trials, patients suffered core material tears and failure mechanisms associated with fatigue. Recently, PCU as a core material has been utilized which has a longer fatigue life [49,50,51]. The M6-C prosthesis is constructed of titanium plates with a PCU core with UHMWPE fibre encapsulation [30, 52]. This device has been classified as a (sandwiched endplate) mobile bumper design. The core is not bonded to the endplate; therefore, there are no peak stresses at the interface. The stresses are distributed between the PCU core and UHMWPE fibre [49, 50, 53].

The polymer sheath that envelops the PCU core and polyethylene- fibre construct was designed to limit soft tissue ingrowth and contain wear debris. This has clearly proven to be ineffective. It has been thought that the device, having a variable centre of rotation, was less susceptible to malpositioning [29, 64]. On the contrary, any malpositioning of the device in the coronal plane would result in lateral tilt if positioned eccentric from the midline and, in the sagittal plane, would result in kyphosis or excessive lordosis the implant. These both create altered mechanics and most probably a precursor to accelerate wear and subsequent osteolysis.

Presently, there are two different testing protocols for studying implant wear, the ISO/FDIS 18192 [54] and ASTM 2423 [55]. Wear debris analysis were performed per ASTM1877 [56], and kinematic analysis per ATSM 2423 [55]. Spinal kinetics Inc. tested 6 prostheses through 20 million cycles combined motion modes (2 Hertz). All samples were functional and met the acceptance criteria. The average height loss was 1.6 ± − 0,3 mm. The average mass loss was 16.9 ± − 12,0.0 mg plus an additional 25.9 mg trapped inside the device [57].

The ASTM applied kinematic pattern of movement testing is unidirectional with a curvilinear shape, whereas the ISO is multidirectional. Nechtow et al. reported that cross-shear loading significantly increased the wear rate compared to curvilinear motion [58]. Grupp et al., when analysing wear rates in the activeL® (B Braun, Tuttlingen, Germany), found that the alternative wear simulations result in a difference in the gravimetric wear amount of approximately 20-fold between the ISO and ASTM methods. The main explanation for the divergence between ISO- and ASTM-driven wear simulations is the multidirectional pattern of movement described in the ISO document, resulting in cross-shear stress on the polyethylene material. It would be reasonable to assume that the testing according to ASTM F2423-05 with pure unidirectional motion does not reflect the kinematics of CTDR patients “daily activities”[59]. Unidirectional testing can lead to strain hardening of the polyethylene [60]. In vivo, large asymmetric loads and multidirectional forces are continuously applied to the polyethylene weave and the cross-shear loading significantly increases the wear rate by an order of magnitude [61]. The generation of wear particles then activate a biological cascade in the periprosthetic tissues with phagocytosis of the particles releasing inflammatory proteins resulting in osteolysis and probable mechanical and neurological instability [62].

The standard testing methods for wear defined by ASTM and ISO are only appropriate for determining wear because of friction at the interface between hard layers, as in metal-on-metal or metal-on-polymer implants [30]. There must be a discrepancy when using these standardised testing methods and application to viscoelastic TDR prostheses. These methods may be unsuitable for soft, deformable, viscoelastic implants because motion is not provided based on friction but rather on deformation. Assessments of the deformation characteristics (physical relaxation, creep, and hysteresis) and interfacial micro-motion with Rheometric Solids Analyser like techniques may be more appropriate [63].

These viscoelastic devices were developed to better mimic the kinematics of a natural intervertebral disc by incorporating a “soft” viscoelastic component [29, 64]. As a result, they appear to be more prone to having a lower physical endurance compared with ball-and-socket designs.

Retrieval of the data was from an ongoing prospective case series of PROMs in relation to patients receiving CTDR. This prospective study allows comparison between different CTDR devices regarding PROMs, failure scenarios, revision surgeries, and their outcomes. Therefore, this is a retrospective cohort study that compared CTDR revision aetiologies and how the revisions were managed. The cohort of patients with a similar diagnosis and treatment, selected based on exposure to a CTDR at the present time, and outcome data, which were measured in the past, were reconstructed for analysis. The primary disadvantage of this type of study design is usually the limited control the investigator has over data collection and follow-up. Only one of the 53 patients was lost to follow-up.

Conclusions

There is a concerning failure rate midterm that is related to UHMWPE wear-induced osteolysis with the M6-C. Patients implanted with the M6-C prosthesis were contacted, informed, and clinically and radiologically assessed. New standardised testing protocols which are more applicable for the new generation of viscoelastic CTDRs are required to ensure clinical safety based on these tests. These testing protocols, which assess the efficacy of medical devices, need to keep pace with the innovation in the medical device industry so that a particular revised characteristic of a device can be properly evaluated.