INTRODUCTION
With the growing number of experienced minimally invasive surgery (MIS) surgeons, the influx of evidence in favor of MIS is rapidly increasing [
1,
2]. Due to the limited line-of-sight in MIS procedures, surgeons need to rely on imaging, navigation, and guidance technologies to operate in a safe and efficient manner [
3,
4]. Also, the use of instrumented spinal fusion has been increasing vastly in the past decade, and can lead to favorable outcomes for a range of degenerative, congenital, infectious, traumatic, and neoplastic pathologies [
5,
6]. Multiple hundred million dollars are spent on spinal fusion procedures every year in the United States alone, with an increasing trend [
6].
To cope with the increasing demand created by the rise of MIS, a plethora of new and ever-improving navigational systems have been developed for pedicle screw insertion. These systems allow a consistent level of safety and accuracy, on par with results achieved by very experienced freehand surgeons, with a reasonably short learning curve [
4,
7-
10]. Robotic assistance can guide the surgeon to a predefined point in the anatomy, at a specific trajectory, while providing stability of drilling [
4]. Through advanced planning, robotic navigation can be very useful in instrumenting pedicles, especially in minimally invasive procedures and in anatomically challenging cases such as dysplastic, twisted, and scoliotic pedicles, or when previous implants limit trajectories [
4,
10].
There is some evidence that robotic guidance can decrease the incidence of both radiological [
11,
12] as well as clinically relevant [
10] pedicle screw malposition. If these benefits alone, particularly in light of the high acquisition and maintenance costs, warrant the use of these computer assistance systems, is currently unclear [
13,
14]. More importantly, data on the effect of using robotic guidance on clinical outcomes other than screw malposition, such as patient-reported outcome measures (PROMs) are lacking [
15]. Similarly, there are no published comparisons among different robotic surgical approaches and their respective clinical outcome profiles. Therefore, we aimed to compare minimally invasive, robot-guided transforaminal lumbar interbody fusion (RG-TLIF) and mini-open robot-guided posterior lumbar interbody fusion (RG-PLIF) in a cohort of patients with either discogenic chronic low back pain (CLBP) or lumbar stenosis with spondylolisthesis [
16].
DISCUSSION
In an analysis of 76 patients from a prospective registry, we compared the RG-TLIF and RG-PLIF surgical approaches to identify any relevant differences in the outcome parameters and PROMs. The 2 surgical approaches compared in our study did not show a statistically significant difference either in the static 12-month postoperative PROM, nor in the ODI nor the dynamic PROM change scores. However, our study did show differences between the 2 groups in regards to the achievement of the MCID. The VAS leg pain seemed to be improved to a greater extent using the RG-PLIF approach, whereas the RG-TLIF approach achieved a higher rate of clinical success in terms of functional impairment. Concerning the MCID achievement of back pain severity, there was no difference between the 2 groups. The secondary endpoints surgical time, length of hospital stay, intraoperative radiation as measured by the DAP, and occurrence of complications showed no differences between the RGPLIF and RG-TLIF group. These results confirm that both techniques should be regarded as equally effective but suited for slightly different purposes. Thus, technique choice in robotic spine surgery should be tailored to the surgical goal (i.e., primarily fusion or primarily decompression of the neuroforamen) and patient.
A multitude of studies has shown a certain superiority of MIS for PLIF and TLIF approaches [
1,
2,
24,
25]. Perioperative outcomes like blood loss [
1,
2,
24], length of stay [
1,
2,
25], and operative time have shown to be improved in patients undergoing MIS compared to open procedures [
1]. Additionally in an analysis conducted by Mummaneni et al. [
2] the patients’ leg pain seemed to have significantly improved using a MIS approach compared to open surgery. But despite functional outcome scores being notably improved for the MIS cohort, a statistical significance was not shown for all of them [
2]. Zhang et al. [
26] report that complication rates as well as back and leg pain severity seem to be significantly lower for TLIF compared to PLIF [
26]. In other studies, the complication rates and the patient-reported outcomes showed no superiority of the MIS approach when comparing with an open approach [
1,
24]. Some studies also show a shorter length of stay and a longer operative time compared with the open procedure [
2,
25]. Noticeably many outcome measures such as 90-day return to work, patient-reported outcomes, and length of stay have been equal among MIS and open surgery groups [
2].
Robotic assistance for MIS potentially provides high precision during critical surgical procedures like PLIF and TLIF. The potential of robot-guided procedures to reduce revisions for screw malposition is supported by the current data found in the literature [
4,
10]. A short learning curve for robot-guided procedures, shorter operative time, lower complication rates, and shorter length of hospital stay are convincing benefits provided by robotic surgery [
9,
27]. Other systematic reviews state that while there are successful incorporations for robot-guided MIS and a convincing potential of the RG-MIS to improve the accuracy of pedicle screw insertion, the evidence is still insufficient to rank the robot-guided approach for pedicle screw placement superior to a non-robotic approach [
12,
27]. Furthermore it needs to be mentioned that there is still a limited number of high level of evidence studies evaluating robot-guided procedures, especially regarding the high associated acquisition and maintenance costs [
9].
Further evaluations and evidence are needed to state whether robot-guided surgical approaches are superior to freehand approaches. The promising trials MIS ReFRESH and EUROSPIN are researching this very issue to fill these remaining voids of evidence. The MIS ReFRESH trial is currently investigating the differences between robot-guided and freehand techniques and the EUROSPIN trial aims to produce evidence on the potential comparative clinical benefits of robot-guided, navigated, and freehand surgical approaches [
13,
14].
In our study, there were no statistically significant differences between the 2 surgical approaches about the static 12-month postoperative PROM, the ODI, and the dynamic PROM change scores. Our results are supported by the literature review and meta-analysis performed by Zhang et al. [
26]. They found that TLIF and PLIF were equal concerning radiographic fusion rate and clinical satisfaction [
26]. The findings of Liu et al. [
28] support this by showing an equal benefit in improving short-term functional outcomes and in decreasing VAS scores for patients with degenerative lumbar spondylolisthesis undergoing PLIF or TLIF [
28]. Asil et al. [
29], Fujimori et al. [
30], and de Kunder et al. [
31] did not find a significant difference between the 2 surgical methods regarding clinical improvement measured with the ODI [
29-
31]. In regard to our results, the RG-TLIF approach achieved a higher rate of MCID in terms of ODI. De Kunder et al. [
31] in a meta-analysis find a slightly lower ODI scores for TLIF, hence supporting our findings.
Yet while in Asil and Yaldiz [
29], patients undergoing the PLIF procedures reported a higher postoperative VAS value, and in Fujimori et al. [
30], VAS of leg and lower back pain was significantly improved in their TLIF group, our postoperative PROM findings including VAS leg, and back pain did not show a significant difference between our 2 cohorts. However the MCID in turn showed a better improvement of the VAS leg pain for the RG-PLIF group, therefore standing in contrast with Asil and Yaldiz [
29] and Fujimori et al. [
30] Concerning complication rates Fujimori et al. [
30] does not find a significant difference between the complication rates of PLIF and TLIF groups. However, in other studies, a higher rate of complications and longer time of surgery associated with the PLIF procedures are reported [
26,
28-
30]. This stands in contrast to our results demonstrating equal complications and surgical time among the 2 groups.
Hence, it needs to be mentioned that while those studies compared TLIF and PLIF approach, they did not include the robot-guided TLIF and PLIF, as there has not yet been a comparison between the 2 robot-guided approaches. As mentioned above there is a multitude of benefits already evidenced for MIS and RG approaches [
4,
9,
10,
27]. The different approaches could therefore explain the slightly different outcomes in our study compared to literature found on comparisons between PLIF and TLIF. Hence the RG approach may improve the complication rate and surgery duration in the PLIF group, reported in non-robot-guided PLIF procedures [
26,
28,
29,
31].
In our study, an experienced neurosurgeon decided on the procedures based on his extensive knowledge and experience with the pathologies of discogenic CLBP and lumbar stenosis with spondylolisthesis and the TLIF and PLIF procedures. The neurosurgeon hypothesized that robotic PLIF may offer a slight advantage in decompressing the neuroforamen bilaterally compared to robotic TLIF in patients with bilateral symptoms. In this way, the neurosurgeon regarded both techniques as equally effective but suited for slightly different purposes, and suggests that technique choice in robotic spine surgery should be tailored to the surgical goal (i.e., primarily fusion or primarily decompression of the neuroforamen) and patient. This very selection process and the matter of this study is a single-center study, therefore, presents a possibility of bias. This might be demonstrated in our cohorts, by showing that the RG-TLIF group included more patients with CLBP as surgical indication. The indication for RG-PLIF included more patients with preoperative leg pain due to bilateral compression of the neuroforamen. Therefore, the RG-TLIF cohort displayed higher preoperative back pain and lower preoperative leg pain scores than the RG-PLIF cohort. Consequently, the improved leg pain in the RG-PLIF group can to the greatest extent be explained by the different indications for RG-PLIF and RG-TLIF.
In consideration of our cohort assignment not being randomized, we conducted a propensity score matching procedure matching for age-, sex-, and baseline PROM.
It is nonetheless important to consider the nonrandomized groups while studying the PROM change scores, which did not show a significant difference between the 2 cohorts. The matter of the 2 cohorts showing a difference in preoperative back and leg pain scores likewise needs to be considered. Looking at the MCID, the RG-PLIF cohort displayed a significantly superior improvement of VAS leg pain while there seemed to be no significant difference in the VAS back pain between the 2 groups. Nonetheless, the baseline differences among groups that persisted due to selection bias likely almost fully explain this difference in improvement.
Although all data were collected in a prospective registry and events captured systematically, and all patients with sufficient data and meeting the inclusion criteria were included, the presence of selection bias cannot be ruled out. In addition, all data stems from a single center and all procedures were carried out by one senior surgeon, possibly creating center/surgeon bias. Therefore, our findings may not generalize well to other centers with differing patient demographics. The analysis was not predefined and must thus be considered as a retrospective statistical analysis. Therefore, the sample size was also rather low. Most of our patients were relatively low-surgical-risk patients. This means that our data and conclusions should not readily be expanded to high-risk and elderly patients. Group allocation was not randomly assigned. Although we applied propensity score matching and although the aim of this study was never to present an unbiased comparison of the 2 techniques for the same patients, some baseline differences persisted, which likely influenced the outcomes and which can partially be explained by selection bias according to clinical criteria, as discussed above. Although we did carry out matching as our strategy to reduce intergroup bias, due to the relatively low sample size, a multivariable analysis was not deemed biostatistically sensible. Lastly, we did not have reliably radiographic follow-up data, in most cases because no follow-up imaging was carried out in clinically stable patients.