Disuse Bone Loss in Fusion Constructs After Multilevel Lumbar Fusion: A Computed Tomography Hounsfield Unit Analysis
Article information
Abstract
Objective
To evaluate long-term bone quality changes within the fusion construct (FC) after 2- to 3-level lumbar fusion using computed tomography (CT)-derived Hounsfield units (HUs).
Methods
Among 520 screened patients, 222 who underwent 2- to 3-level posterior lumbar interbody fusion met the inclusion criteria. HU values were measured on CT scans preoperatively, at 1-year postoperative, and at final follow-up. The percentage change in HU (HU [final–pre]%) was calculated for each vertebral level.
Results
At the final follow-up, the FC demonstrated a significant decline in HU compared to preoperative values (median [10th–90th percentile], 132.0 [86.5–220.4]; 95% confidence interval [CI], 116.0–142.5 vs. 124.5 [71.0– 210.0]; 109.8–135.1; HU (final–pre)%: -11.0 [-62.0 to 48.5]; -19.9 to -6.1; p<0.001). In contrast, HU increased significantly at the uppermost instrumented vertebra (HU (final–pre)%: median [10th–90th percentile], 28.3 [-19.9 to 102.9]; 95% CI, 21.1–36.4; p<0.001), likely reflecting increased mechanical demands. Subgroup analysis revealed a more pronounced decline in HU in patients with longer follow-up durations, particularly in the FC group (p=0.003).
Conclusion
CT-derived HU revealed progressive trabecular bone loss within FC over time after lumbar fusion. In patients with longer postoperative intervals, clinicians should remain aware of the potential weakening of the FC, which has important implications when considering implant removal or planning revision surgery.
INTRODUCTION
Bone remodeling, which is essential for maintaining bone quality, requires mechanical stress [1]. Particularly in multilevel fusion surgeries, the segments between the uppermost and lowermost instrumented vertebrae (UIV and LIV, respectively) become immobilized, reducing mechanical stress and potentially degrading bone quality postsurgery [2,3].
Bone quality is conventionally measured using dual-energy x-ray absorptiometry (DEXA), the prevailing method in current practice [4]. However, in spines with degeneration, scoliosis, or osteophytes, bone mineral density (BMD) and T scores measured using DEXA can be inaccurately high, making it difficult to assess bone quality accurately [5,6]. Because these inaccuracies limit the reliability of DEXA in postoperative and degenerative spines, a more robust imaging-based method is required.
Computed tomography (CT) measurements of BMD within specified regions of interest (ROIs) using Hounsfield units (HUs) were developed as an alternative capable of overcoming metal artifacts and providing more direct assessment of trabecular bone density [7]. CT measurements of HU in the spinal ROI correlate with BMD measurements obtained via DEXA. Other studies have demonstrated the interobserver, intraobserver, and interexamination reliabilities of these HU measurements [7,8]. Wanderman et al. [9] introduced a method for measuring instrumented segments using HU on CT, thereby facilitating accurate assessments. Several studies have reported that HU in the spine can be used as a predictor of pathologic and osteoporotic fractures. Zhang et al. [10-12] indicated that HU has a higher predictive efficacy for frailty fractures than vertebral bone quality measured by DEXA or magnetic resonance imaging. Although the bone quality of adjacent segments increases, changes in bone quality within the fused segment are not well understood [13].
This study investigated the decline in bone quality within immobilized fusion segments over time, a crucial consideration for surgeons performing complex surgeries. Therefore, this research aims to explore trends in bone quality over time in each segment following fusion surgery during long-term follow-up, with a level-specific analysis that considered potential confounding factors, contributing significantly to the field.
MATERIALS AND METHODS
1. Study Population
This single-center retrospective study was approved by the Institutional Review Board of Gangnam Severance Hospital (3-2024-0192), which waived the requirement for informed consent. Conducted by 3 surgeons at a single institution between January 2012 and January 2022, this study included patients who underwent posterior lumbar interbody fusion using PEEK (polyetheretherketone) cages and pedicle screw fixation for degenerative lumbosacral diseases between L2 and S1.
The inclusion criteria were patients aged 20 years or older who underwent fusion surgery involving 2–3 segments and had preoperative CT scans as well as follow-up CT scans at least 2 years postsurgery. Exclusion criteria were patients with tumors, congenital disorders, deformities, or infections; those who had undergone revision surgery; patients who developed vertebral fractures during the follow-up period; and those diagnosed with osteoporosis (DEXA Tscore at the femoral neck ≤-2.5) [14]. Patients who used medications affecting bone formation, such as bisphosphonates, denosumab, romosozumab, or teriparatide, before surgery and during the follow-up period were excluded.
2. HU Assessment of the Instrumented and Adjacent Spine
HU assessment was performed using the technique described by Schreiber et al. [7] for noninstrumented vertebrae, where an ROI was drawn on axial cuts at the midpedicle, encapsulating the maximum cancellous bone volume while avoiding the cortical bone. All patients underwent DEXA and CT scans within 2 months prior to surgery. For instrumented vertebrae, the method described by Wanderman et al. [9] was used to measure the ROI in the first axial cut caudal to the instrumentation-induced halo, ensuring that measurements were taken at the same location on preoperative imaging. All measurements were calculated as the average HU values using a picture archiving and communication system (Centricity PACS, version 3.0, GE Healthcare, Chicago, IL, USA). The widest possible elliptical region was drawn while excluding the cortical margin in axial cuts.
The level between the UIV and LIV was referred to as the fusion construct (FC) in this study. In cases of 3- to 4-level surgeries, 2 to 3 vertebrae were considered the FC; therefore, the average HU values of these FCs were used to represent the HU of the FC. For example, in an L3-4-5-S1 fusion surgery, L4 and L5 were considered FCs, and their HU values were averaged (Fig. 1).
Example of Hounsfield unit (HU) assessment at instrumented and noninstrumented levels. This patient underwent L3–S1 fusion surgery. (A) Example of HU measurement at an instrumented vertebra. The region of interest (ROI) was drawn to encapsulate the maximum cancellous bone volume while avoiding cortical bone. The average HU measured was 80.16. (B) Sagittal computed tomography (CT) image corresponding to the axial slice shown in panel A. HU measurements at instrumented levels were obtained on the first axial cut caudal to the instrumentation-induced halo. (C) Example of HU measurement at a noninstrumented vertebra, using the same ROI method as in panel A. The average HU measured was 56.69. (D) Sagittal CT image corresponding to the axial slice shown in panel C. HU was measured on axial cuts taken at the midpedicle level for noninstrumented vertebrae.
The time interval between the date of surgery and the last CT scan was calculated. HU values were measured preoperatively, 1-year postoperative, and at the final follow-up. The percentage change in HU (HU [final–pre]%) was calculated as ([final follow-up HU–preoperative HU]/preoperative HU)×100. Additionally, patients were divided into subgroups based on the follow-up period to analyze changes in bone quality over longer follow-up durations. In addition, the reason for postoperative CT imaging was classified into 3 categories based on clinical context: (1) check-up, performed in patients without specific symptoms; (2) pain, obtained in cases with persistent low back or leg pain despite no abnormal findings on radiographs; and (3) adjacent segment disease (ASD), performed to evaluate adjacent segment degeneration and determine the need for revision surgery. HU measurements were independently performed by 2 board-certified spine neurosurgeons, each with more than 5 years of clinical experience, who were blinded to patient clinical information. Each observer repeated the measurements twice at a 1-month interval, and the average of the 4 measurements was used for analysis. The intra- and interobserver reliability was excellent across all levels, with intraclass correlation coefficients ranging from 0.823 to 0.973 (Supplementary Table 1).
3. Statistical Analysis
All statistical analyses were performed using IBM SPSS Statistics ver. 27.0 (IBM Co., USA). The normality of data distribution was assessed using the Shapiro-Wilk test. Normally distributed values were expressed as mean±standard deviation, and comparisons were made using the t-test or 1-way analysis of variance, as appropriate. Nonnormally distributed values were expressed as the median [10th percentile–90th percentile], and group comparisons were performed using nonparametric tests: the Mann-Whitney U-test for 2-group comparisons, the Kruskal-Wallis test for multiple-group comparisons, and the Wilcoxon rank-sum test (paired) with Bonferroni correction for Post hoc analysis. To identify independent factors affecting HU (final–pre)%, we performed multivariable linear regression analyses using robust (HC3) standard errors. The dependent variable was HU (final–pre)% for each vertebral level (UIV+2, UIV+1, UIV, FC, and LIV), and the following covariates were included: age, sex, baseline BMD, BMI, smoking, steroid use, number of fusion levels, mean cage height, presence of nonunion, and follow-up period. Statistical significance was defined as p<0.05.
RESULTS
We initially screened 520 patients who underwent 2- to 3-level lumbar fusion surgery. After applying the inclusion and exclusion criteria, our final study cohort consisted of 222 individuals (Fig. 2), of whom 64.4% were female (n=143). The median age at the time of surgery was 65 (61.0–70.0) years. The fusion surgery levels included 2 segments in 85 patients (38.3%) and 3 segments in 137 patients (61.7%), with a median follow-up period of 45 (range, 24-177) months. Preoperative femoral neck BMD was 0.7 (0.6–0.9) g/cm², and the T score was -0.8±1.1, while the preoperative total spine BMD was 1.0 (0.9–1.1) g/cm² with a T score of -0.2 (-1.1 to 1.0). The median HU values from L2 to S1 were 124.0, 113.8, 116.0, 132.0, and 154.5, respectively, with S1 being the highest. The correlation coefficient (r) between femoral neck BMD T score and HU at each level ranged from 0.42 to 0.53 (p<0.05), while for total spine BMD, it ranged from 0.52 to 0.59 (p<0.05). This indicates that comparing preoperative HU values with those obtained after instrumentation is reasonable (Supplement Table 2).
Fifty patients in the cohort had immediate postoperative CT scans (within 1 week postoperatively). We compared the preoperative and immediate postoperative HU from L1 to S1 to evaluate the validity of HU measurements obtained from postoperative CTs without instrumentation. Preoperative median HU values for L1–S1 were 131.0, 126.0, 122.5, 129.0, 153.0, and 179.5, respectively. Immediate postoperative values were 135.5, 113.5, 122.5, 128.5, 151.0, and 175.5. Wilcoxon signed-rank test p-values for preoperative versus immediate postoperative HUs were 0.348, 0.105, 0.185, 0.214, 0.260, and 0.348, respectively, indicating no statistically significant differences in HU values before and immediately after surgery at any vertebral level (Table 1).
Comparison of preoperative and immediate postoperative Hounsfield unit (HU) values at each level (n=50)
At final follow-up (124.5 [71.0–210.0]), FC showed a significant decrease in HU compared with the preoperative value (132.0 [86.5–220.4]; HU (final–pre)%=-11.0 [-62.0 to 48.5]; 95% confidence interval (CI), -19.9 to -6.1; p<0.001). Post hoc analysis revealed a significant decrease between 1-year postoperative and final follow-up (p=0.001). In contrast, the UIV level demonstrated a significant HU increase at final follow-up (143.0 [91.0– 262.0]) compared with the preoperative value (116.0 [71.1–196.0]; HU (final–pre)%=28.3 [-19.9 to 102.9]; 95% CI, 21.1–36.4; p<0.001). Post hoc comparisons showed a significant increase between 1-year postoperative and final follow-up (p=0.006) and between preoperative and final follow-up (p<0.001). No significant HU changes were observed at the UIV+1, UIV+2, or LIV levels (p=0.075, p=0.723, and p=0.549, respectively) (Table 2, Fig. 3).
Comparison of Hounsfield unit (HU) values and percentage change in HU (HU [final–pre]%) across vertebral levels
in Hounsfield unit (HU) over time at different vertebral levels. This graph illustrates changes in HU values at different vertebral levels (UIV+2, UIV+1, UIV, FC, and LIV) at 3 time points: preoperatively, 1 year postoperatively, and at the final follow-up. Significant differences in HU (final– pre)% were observed at UIV, and FC. In the post hoc analysis, HU at the UIV was significantly higher at the final follow-up compared with both the preoperative and 1-year postoperative measurements, while HU at the FC was significantly lower at the final follow-up compared with the 1-year postoperative value (*p<0.05). These values correspond to the median HU at each time point, as presented in Table 3. UIV, uppermost instrumented vertebra; FC, fusion construct; LIV, lowermost instrumented vertebra.
To assess differences in HU values at each level according to follow-up duration, we stratified patients into subgroups based on follow-up periods (2, 3, 4, 5, 6, 7, and ≥8 years). The 3-year follow-up group comprised the largest number of patients (n=51), whereas the 7-year follow-up group included the fewest (n=17). Significant differences in demographic characteristics were observed across these subgroups. Patients with longer follow-up durations were younger (p=0.005), with differences observed in fusion surgery levels (p=0.045) and baseline femoral neck BMD (p=0.008). The indication for CT acquisition also varied significantly (p<0.001), shifting from routine check-ups in early follow-up to ASD evaluation in later years (Table 3). Regarding HU (final–pre)%, only FC demonstrated a significant difference across follow-up durations (2 years: -3% [-13% to 22%] vs. ≥8 years: -19% [-38% to -2%]; 95% CI for between-group difference, -22.5 to -7.3; p=0.003). No significant changes were observed at the UIV+2, UIV+1, UIV, or LIV levels (Table 4, Fig. 4).
HU (final–pre)% across vertebral levels by follow-up duration. This figure illustrates the percentage change in Hounsfield unit (HU [final–pre]%) across different vertebral levels, categorized by follow-up duration (2, 3, 4, 5, 6, 7, and 8 years). The data represent the median HU (final–pre)% values for each level, demonstrating how bone quality changes between the preoperative and final follow-up periods. UIV, uppermost instrumented vertebra; FC, fusion construct; LIV, lowermost instrumented vertebra.
To account for potential confounding variables, multivariate linear regression analyses were conducted including age, BMD, body mass index, reason for CT acquisition, smoking, steroid use, fusion levels, average cage height, and nonunion. Results indicated that a longer follow-up period was significantly associated with HU reduction, especially at the FC (β=-0.12, p=0.004) and UIV (β=-0.08, p=0.017). At the FC level, nonunion was associated with less HU decline (β=0.09, p=0.031), while at the LIV, a greater number of fused levels showed a significant association with HU reduction (β=-0.11, p=0.022). No significant predictors were identified at the UIV+1 or UIV+2 levels. (Table 5).
DISCUSSION
This study primarily examined long-term changes in bone quality within the FC following multilevel lumbar fusion surgery. Our results indicated that at the final follow-up, the FC demonstrated significant bone loss compared with preoperative measurements (HU [final–pre]%: -11.0 [-62.0 to 48.5], p<0.001). Notably, while HU values at UIV increased—possibly reflecting increased mechanical loading and localized bone sclerosis due to adjacent segment degeneration—the FC exhibited a distinct decline [15,16]. In multivariable analyses including follow-up duration and other covariates, longer follow-up periods were independently associated with a greater HU reduction at the FC and UIV, but not at the LIV, UIV+1, and UIV+2. These results suggest that time-dependent remodeling and mechanical unloading within the fusion segment contribute to progressive bone density loss, particularly at the central and upper portions of the construct.
Although HU values at the UIV consistently increased compared with preoperative measurements, the magnitude of this increase diminished over time [17]. Prior studies have reported that lower HU at the UIV may predispose patients to adjacentlevel fractures; however, individuals who developed such fractures during follow-up were excluded from our cohort, which may partly explain the relatively elevated HU observed at the UIV [18]. In contrast, the FC showed a true decline in HU relative to baseline, suggesting a more clinically meaningful deterioration in bone quality over time. Interestingly, the presence of nonunion at the FC was associated with less HU decline (β=0.09, p=0.031). This may reflect localized sclerosis induced by micromotion or reduced stress-shielding effects in segments that had not fully fused [19]. At the LIV, a greater number of fused levels was significantly related to HU reduction, suggesting that longer constructs may impose greater mechanical unloading on distal vertebrae, although supporting biomechanical data remain limited.
The reduction of mechanical stress on bone leads to decreased osteoclast-mediated bone resorption and osteoblast-mediated bone formation [20,21]. Several previous studies have highlighted the influence of mechanical unloading on bone remodeling, and spinal instrumentation is known to contribute to such stress redistribution [22-24]. Consequently, it has been suggested that after implant removal, the FC may be at risk for fractures and that during fusion extension procedures, bridging between the original and new implants may be necessary [23]. For example, Lipscomb et al. [25] reported that bone quality initially declined due to postoperative bracing but gradually increased by the first postoperative year. In another study by Swart et al. [26], which analyzed HU values after an average of 35 years following Harrington rod placement, the mid-construct fused segment demonstrated the lowest HU compared to other spinal regions. However, studies directly measuring long-term BMD within the FC itself remain limited. Our findings expand on previous literature by providing level-specific, multivariable-adjusted analyses of HU changes over time, encompassing not only the FC but also the UIV, LIV, and adjacent levels (UIV+1 and UIV+2).
The subgroup analyses stratified by follow-up duration further highlighted the complexity of these changes. While caution is warranted due to demographic differences, such as variations in age, baseline BMD, and fusion surgery level, the data consistently show that patients with longer follow-up periods experience a more marked decline in FC HU [13,25]. In our study, based on median values, HU in the FC began to show negative changes compared to preoperative values starting from year 4; in UIV+1 and LIV, this decline began from year 5, and in UIV+2, from year 6. In the subgroup with the longest follow-up duration (≥8 years), the HU (final–pre)% at the FC had a median of -19%, indicating a substantially greater decline than in other segments (UIV+2, -6%; UIV+1, 0%; UIV, 14%; LIV, -11%). Over time, this resulted in a significant decline in bone quality, as reflected by our HU measurements. Conversely, increased mechanical demands at the UIV, may stimulate bone formation, resulting in transient increases in HU.
These findings have significant clinical implications. Traditional imaging modalities like DEXA often fail to capture localized changes in bone quality, particularly within the FC [7,15]. CT-derived HU, while not a replacement for DEXA, may serve as a valuable adjunct for evaluating bone quality at specific vertebral levels [9]. The similarity of HU measurements between preoperative and immediate postoperative CT scans suggests that postoperative CT-based HU assessment remains reliable even in the presence of instrumentation. Given the progressive bone loss observed within the FC, particularly in patients with extended follow-up durations, patients with long-standing FCs may benefit from closer surveillance and potentially earlier interventions to mitigate further bone loss [27]. Notably, from 4 years postoperatively onward, the decline in HU at the FC indicates that surgeons should exercise greater caution regarding bone quality when removing screws from the FC.
Our study has some limitations. The retrospective design and extended data acquisition period of 10 years introduced inherent biases, including potential variations in surgical techniques, patients activity level and instrumentation over time. In addition, our patient cohort was heterogeneous, with a wide age range and varying degrees of preoperative bone quality. The exclusion of patients with osteoporosis or those receiving bone-modifying agents, as well as the restriction of fusion levels to the lower lumbar segments, may further limit the generalizability of our findings. HU measurement provides a useful surrogate for bone quality, some variability is inevitable due to slice selection, region-of-interest placement, and metallic artifacts. Ideally, postoperative HU changes should be evaluated in noninstrumented (posterolateral fusion only) cases; however, such procedures are rarely performed in current clinical practice, making this approach impractical. In our study, the inter- and intraobserver reliability for HU measurement showed excellent agreement, consistent with previous reports, indicating minimal measurement bias [9]. Although the indications for CT acquisition differed among follow-up year subgroups, raising the possibility of selection bias, the multivariate linear regression analysis demonstrated that CT indication was not significantly associated with HU changes, suggesting that its effect on the main outcomes was minimal. While we focused on trabecular bone quality as assessed by HU measurements, cortical bone—which also plays a critical role in spinal stability—was not evaluated [28-30]. Longitudinal observation of HU changes within the same patient using multiple CT scans over time would be ideal. In clinical practice, however, follow-up periods are heterogeneous across patients, and it is possible that symptomatic individuals were more likely to undergo long-term follow-up. Nevertheless, due to the risks associated with radiation exposure from frequent CT scans, conducting such longitudinal observational studies is challenging [31,32]. Despite these limitations, a strength of our study is that it provides an indirect means of predicting changes in bone quality over time at different surgical sites, using HU on CT as a complementary assessment tool.
In summary, our study demonstrated that significant bone loss occurred in the FC at the final follow-up, with the degree of bone loss becoming more pronounced as the follow-up period increased. While changes at other vertebral levels provide additional context, the FC emerges as the primary site of concern. These findings not only reinforce the utility of CT-derived HU measurements in postoperative monitoring but also highlight the need for tailored long-term management strategies to preserve bone quality within the FC. When performing procedures such as screw removal or revision in patients with long-standing FCs, surgeons should be aware of the potential for substantial bone quality deterioration within the FC (Fig. 5). Preoperative efforts to evaluate and optimize bone quality may therefore be necessary to minimize intraoperative risks and improve surgical outcomes. Future prospective studies with more frequent imaging intervals and larger, homogeneous cohorts are necessary to further delineate these relationships and develop effective interventions to counteract stress shielding in patients with lumbar fusion.
Illustrative case of Hounsfield unit (HU) assessment over time in multilevel lumbar fusion surgery. (A) Immediate postoperative computed tomography (CT) scan showing HU measurements at different vertebral levels, including uppermost instrumented vertebra (UIV)+1, UIV, and the fusion construct (FC). (B) CT scan at the 1-year postoperative, demonstrating changes in HU values. (C) CT scan at the 6-year postoperative follow-up, showing further HU changes over a longer period. Each column displays axial cuts with HU measurements at various vertebral levels. Arrows indicate the percentage change in HU values compared to the immediate postoperative period. Upward arrows represent an increase in HU values, while downward arrows indicate a decrease. Notably, HU values increased at the 1-year follow-up but decreased significantly by the 6-year follow-up, particularly at the FC, suggesting bone quality degradation due to disuse osteoporosis.
CONCLUSION
Multilevel lumbar fusion is associated with progressive trabecular bone loss within the FC, as demonstrated by declining CT-derived HU values at long-term follow-up. This deterioration becomes more pronounced in patients with longer postoperative intervals, highlighting the need for careful assessment of fusion-site bone quality when considering implant removal or revision surgery.
Supplementary Materials
Supplementary Tables 1-2 are available at https://doi.org/10.14245/ns.2551272.636.
Intra- and interobserver reliability for Hounsfield unit (HU) measurements at each lumbar level
Correlation between Hounsfield units (HU) measured by computed tomography and bone mineral density (BMD) measured by dual-energy x-ray absorptiometry in each lumbar vertebra
Notes
Conflict of Interest
The authors have nothing to disclose.
Funding/Support
This study received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Author Contribution
Conceptualization: JYP, DKC; Data curation: HJJ, DK, KHK; Formal analysis: HJJ; Methodology: DK, SUK, KHK; Project administration: BJM, KHK; Visualization: BJM, SUK; Writing – original draft: HJJ; Writing – review & editing: JYP, KHK, DKC.
