Patient-Reported Outcomes Measurement Information System Physical Function Validation for Use in Anterior Cervical Discectomy and Fusion: A 2-Year Follow-up Study
Article information
Abstract
Objective
Our study aims to evaluate the correlation of Patient-Reported Outcomes Measurement Information System physical function (PROMIS PF) with legacy patient-reported outcome measures (PROMs) among patients undergoing anterior cervical discectomy and fusion (ACDF).
Methods
A prospectively maintained database was retrospectively reviewed for ACDF surgeries performed between May 2015 and September 2017. Inclusion criteria were primary elective, single- or multilevel ACDFs for degenerative spinal pathology. Patients lacking preoperative or 2-year PROMIS PF surveys were excluded. Mean scores were calculated for visual analogue scale (VAS) neck, VAS arm, Neck Disability Index (NDI), 12-Item Short Form Physical Component Score (SF-12 PCS), and PROMIS PF at preoperative and 6-week, 12-week, 6-month, 1-year, and 2-year postoperative timepoints. A t-test and Pearson correlation coefficient were utilized to evaluate score improvement and PROM relationships respectively.
Results
The 50 subject cohort was 60.0% male, 50% obese (body mass index ≥ 30 kg/m2) and had an average age of 50.9 years. Significant improvements were demonstrated for VAS neck and NDI at all postoperative timepoints (p < 0.001) and for SF-12 and PROMIS PF at all timepoints except 6 weeks (p ≤ 0.025). VAS arm improvement was seen up to 1 year (p ≤ 0.016). PROMIS PF demonstrated strong correlations with NDI and SF-12 PCS at all evaluated timepoints and with VAS neck at all postoperative timepoints except 6 weeks (all p < 0.01).
Conclusion
PROMIS PF was strongly correlated with pain, disability, and physical function up to 2 years for patients undergoing ACDF. Our results support the long-term validity of PROMIS PF for measurement of patient-reported physical function among ACDF cohorts.
INTRODUCTION
Anterior cervical discectomy and fusion (ACDF) is a common cervical spine procedure with an average of 137,000 ACDF performed per year from 2006–2013 [1,2]. ACDF is a successful treatment for cervical spine pathologies such as herniated intervertebral discs, cervical radiculopathy, spondylosis, and myelopathy with low complication rates and minimal risk [3,4]. In recent years, the number of cervical spine surgeries has increased and are likely to rise as our population becomes older [5,6].
Patient-reported outcome measures (PROMs) are an essential piece of the increased emphasis on cost effectiveness, quality of care, and in prioritizing the patient perspective as a part of the treatment plan [7-9]. Traditional or “legacy PROMs” of spinal surgery often fall into the following categories: general health questionnaires (e.g., Medical Outcomes Study Short Form [SF] or EuroQol Five Dimension), pain rating scales (e.g., Numeric Rating Scale or visual analogue scale [VAS]), and disease-specific questionnaires (e.g., Oswestry Disability Index [ODI], Roland Morris Disability Questionnaire, Neck Disability Index [NDI], Cervical Spine Outcomes Questionnaire, the Japanese Orthopaedic Association [JOA] myelopathy questionnaire, and the Myelopathy Disability Index) [10]. Despite their widespread use, these legacy PROMs have known limitations. With additional surveys, they can increase patient burden, decrease measurement precision, and can deliver results that are challenging to compare. Many of these attributes can hinder interpretability [9,11].
In order to address these shortcomings, a 2004 multicenter cooperative group initiative by the National Institute of Health developed the Patient-Reported Outcomes Measurement Information System (PROMIS). PROMIS is standardized across a broad range of medical conditions and patient populations [12]. The PROMIS system assesses numerous outcomes including physical function (PF), depression, anxiety, and pain interference [13]. Its use has demonstrated decreased patient burden, lessened ceiling/floor effects, increased precision and efficiency, and smaller sample size requirements [11,14-17]. As a response to these findings, there has been a substantial increase in correlational between PROMIS and legacy spine PROMs [7].
With respect to cervical spine procedures, studies have since confirmed moderate-to-strong correlations of PROMIS with legacy PROMs such as NDI, modified JOA, ODI, SF-36/SF-12, and VAS ranging from 6 to 12 months of follow-up [18-24]. Among ACDF patients, PROMIS PF exhibits strong correlations with SF-12 and NDI at both preoperative and short-term postoperative time points [20,25]. However, among cervical spine patients the literature is lacking in studies which assess longitudinal correlations between PROMIS and legacy PROMs, which extend past one year. The purpose of this study is to evaluate the longitudinal correlations between PROMIS PF and legacy PROMs over a 2-year period among ACDF patients. Through this investigation, we intend to further evaluate the concurrent validity of PROMIS PF with legacy measures at follow-up time periods of up to 2 years.
MATERIALS AND METHODS
1. Study Population
Following Institutional Review Board approval of Rush University Medical Center Research Compliance Department (ORA-#14051301), a prospectively recorded single surgeon, single-institution surgical registry was retrospectively reviewed for eligible patients between May 2015 and September 2017 for elective ACDF surgeries. Inclusion criteria were primary, single or multilevel ACDFs for degenerative spinal pathology. Patients were excluded from our analysis if they had not completed PROMIS PF surveys at preoperative and 2-year timepoints. Patients were also excluded if surgery was indicated due to malignancy, trauma, or infection.
2. Data Collection
We collected cohort characteristics such as demographic, clinical, and operative variables. Demographic data included age, sex, body mass index (BMI), tobacco use, comorbidity burden as evaluated by Charlson Comorbidity Index, and insurance status. Following clinical and radiographic evaluation preoperative spinal pathology diagnosis was determined and recorded. Operative variables were recorded including number of levels fused, operative duration, estimated blood loss (EBL), and inpatient stay following surgery. Patients were administered SF-12 Physical Component Score (PCS), VAS neck/arm, PROMIS PF, and NDI surveys at both preoperative and postoperative time points (e.g., 6 weeks, 12 weeks, 6 months, 1 year, and 2 years).
3. Statistical Analyses
All computations were completed using Stata 16.0 (StataCorp LP, College Station, TX, USA). Descriptive statistics were performed for subject demographics, baseline pathologies, and perioperative characteristics. Average PROM scores were evaluated for VAS neck, VAS arm, NDI, SF-12 PCS, and PROMIS PF at both preoperative and postoperative timepoints (e.g., 6 weeks, 12 weeks, 6 months, 1 year, and 2 years). Postoperative improvement in PROM scores from preoperative baseline was evaluated using a paired Student t-test. Pearson correlation coefficient was used to evaluate the relationship of PROMIS PF with VAS neck, VAS arm, SF-12 PCS, and NDI. Correlation strength was assessed according to Cohen standard with the following categories: 0.1 ≤|r|<0.3=low; 0.3≤|r|<0.5=moderate; |r|≥0.5=strong [26]. Scatter plots visually represented PROMIS PF’s relationship with NDI and SF-12 PCS at all evaluated timepoints. Statistical significance was determined using an alpha level of 0.05.
RESULTS
1. Patient Cohort
A total of 50 eligible patients underwent ACDF and were included in our analysis. The majority were male (60.0%). The cohort average age was 50.9 years, and 50% had a BMI ≥ 30 kg/m2 (Table 1). The most common comorbid medical condition was hypertension (28.0%). The most common preoperative spinal pathology was herniated nucleus pulposus (78.0%). The majority of patients underwent 1 level (60.0%), followed by 2 levels (30.0%), 3 levels (8.0%), and 4 levels (2.0%) (Table 2). The mean operative time was 57.4 minutes, with an EBL of 30.5 mL, and inpatient stay of 12.5 hours.
2. Outcome Measures
Average (mean ± standard deviation) baseline preoperative VAS neck was 5.78 ± 2.39, VAS arm was 5.61 ± 2.33, NDI was 36.21 ± 16.86, SF-12 PCS was 35.56 ± 9.21, and PROMIS PF was 40.20 ± 6.38. VAS neck and NDI demonstrated significant postoperative improvement at all evaluated time points following ACDF when compared to baseline scores (all p < 0.001) (Table 3). PF instruments (SF-12 PCS, PROMIS PF) demonstrated significant postoperative improvement at 12 weeks, 6 months, 1 year, and 2 years (all p ≤ 0.025) but failed to have a significant improvement at 6 weeks postoperatively. VAS arm had a significant postoperative improvement at 6 weeks to 1 year postoperatively (all p ≤ 0.016) but had no difference at 2 years. All evaluated timepoints for NDI and SF-12 PCS revealed strong correlation with PROMIS PF as assessed with Pearson correlation coefficients (Table 4) (Fig. 1). VAS neck was found to have a strong correlation with PROMIS PF at all time points except for 6 weeks (moderate) and preoperatively (low). VAS arm only demonstrated statistically significant moderate correlation with PROMIS PF at 6 months and 2 years (p ≤ 0.028).
DISCUSSION
PROMs often carry inherent bias, which makes it challenging to adequately gauge patient health [10]. To combat this, PROMIS was created to gather information about the state of a patient’s health and their self-perception of PF and pain perception. By individualizing each patient’s health, these metrics offer a way for physicians to prescribe more specific treatments [10]. PROMIS can be administered as a short form or as a computerized adaptive testing (CAT) that changes the course of the survey after the selection of each answer. This provides more detailed information to not only assist patients during recovery, but also decrease clinical burden and strengthen patient compliance [12]. PROMIS PF is most commonly used in spine surgery and is beneficial for understanding range of motion, mobility, and postoperative coordination [20].
PROMIS PF has been increasingly used to psychometrically evaluate pre and postoperative outcomes of spinal surgery [11,27]. Multiple investigations of multilevel ACDFs have shown that PROMIS PF correlates with NDI and SF-12 during preoperative and short-term postoperative time points [20]. One study further highlighted that PROMIS PF can accurately compare patient improvement from preoperative scores out to 1 year postoperatively [28]. Vaishnav et al. [29] found significant associations between NDI and PROMIS PF, as well as between SF-12 PCS and PROMIS PF. Moreover, positive correlations between VAS neck and arm and PROMIS PF scores were reported preoperatively as well as postoperatively in patients undergoing surgery for cervical disc herniations [18]. It is important to note that neither study assessed these scores past the 1-year postoperative timepoint.
In our study, PROMIS PF was shown to be strongly correlated with NDI and SF-12 PCS in accordance with previous investigations in cervical spine surgery [27,30]. Both PF instruments (SF-12 PCS and PROMIS PF) demonstrated significant postoperative improvement from 12 weeks out to 2 years. However, we did not find significant improvements in the shortest time point of 6 weeks, which may be explained by the large portion of our cohort associated with a diagnosis of cervical myeloradiculopathy, which may take a longer timeframe to resolve. Moreover, previous studies validating the use of PROMIS PF in ACDF patients demonstrated a similar lack of postoperative improvement at 6 weeks for both PROMIS PF and SF-12 [20]. Similarly, Khalifeh et al. [31] reported that PROMIS may be less attuned to measuring outcomes at short-term follow-up points than those at long-term follow-ups.
Our results demonstrated variable correlation strength between VAS arm and PROMIS throughout both the preoperative and postoperative time period. This result may be attributed to our patients being associated with a preoperative diagnosis of both herniated nucleus pulposus and foraminal stenosis with arm pain resolving at different rates than PF. Specifically, arm pain may resolve more immediately as peripheral radicular symptoms are relieved, compared to neck pain, disability, and overall PF which may take longer to resolve due to central compressive pathology, as well as recovery from the procedure itself. However, it should be noted that by the 2-year timepoint, VAS arm demonstrated significant correlations with PROMIS that were similar to those of other PROMs in our study and were in line with findings of previous research [32].
Our findings support the convergent validity of PROMIS PF as a tool to evaluate patient health status. Unlike NDI, PROMIS is a general health measure that can be utilized among various patient populations and conditions [29].
There are several limitations to our study. Because we conducted a retrospective review, selection biases are most likely present. Additionally, our study’s exclusion criteria left outpatients who did not complete preoperative or 2-year follow-up PROMIS PF surveys. These patients are lost to postoperative follow-up and introduce an evident limitation of conducting analyses on PROM surveys, as well as decreasing the size of our cohort. Further encouragement and possible incentives of patients to fill out these surveys will be beneficial to future investigations. Furthermore, the ACDF procedures included in this study varied in terms of the number of vertebral levels fused and which levels were fused, which may introduce an element of bias to our results. However, restricting our analysis to only one level of the spine would have reduced the size of our cohort in a way that substantially limited our ability to observe meaningful trends in PRO improvement. Our study also included procedures performed by 1 surgeon at one institution; therefore, to produce more generalizable results, a multicenter and multisurgeon study design must be included in future investigations.
Another important aspect to consider is that the CAT questionnaire utilized by PROMIS includes a wide variety of items that are nearly universally applicable to patients across the globe (e.g., “Are you able to stand for an hour?”). However, some questionnaire items include references to activities or practices that may be less applicable to patients of some cultural backgrounds. For example, questions such as “Are you able to cut your food using eating utensils?” or “Are you able to open a new milk carton?” may not directly translate to the experiences of some patients living in Eastern nations. Fortunately, PROMIS includes a large “bank” of questions and the limited number of questions with such questionable cultural relevance could potentially be edited or excluded when the questionnaire is translated for use in other geographical regions. However, this highlights the importance of follow-up studies to validate the use of PROMIS for spine patients in other cultures and regions.
CONCLUSION
We observed strong correlations of PROMIS PF scores with pain (VAS neck), disability (NDI), and PF (SF-12 PCS) through both short-term and long-term (2 years) follow-ups for patients undergoing ACDF. The long-term validity of PROMIS PF provides evidence for its use as an instrument to evaluate PF among ACDF patients. Moreover, PROMIS PF could be used as a complimentary survey, or even as a primary instrument in place of legacy measures at time periods of up to at least 2 years.
Notes
The authors have nothing to disclose.