这是indexloc提供的服务,不要输入任何密码
Skip to main content
BMC is moving to Springer Nature Link. Visit this journal in its new home.

Predicting pathological complete response following neoadjuvant chemoradiotherapy (nCRT) in patients with locally advanced rectal cancer using merged model integrating MRI-based radiomics and deep learning data

Abstract

Background

To construct and compare merged models integrating clinical factors, MRI-based radiomics features and deep learning (DL) models for predicting pathological complete response (pCR) to neoadjuvant chemoradiotherapy (nCRT) in patients with locally advanced rectal cancer (LARC).

Methods

Totally 197 patients with LARC administered surgical resection after nCRT were assigned to cohort 1 (training and test sets); meanwhile, 52 cases were assigned to cohort 2 as a validation set. Radscore and DL models were established for predicting pCR applying pre- and post-nCRT MRI data, respectively. Different merged models integrating clinical factors, Radscore and DL model were constituted. Their predictive performances were validated and compared by receiver operating characteristic (ROC) and decision curve analyses (DCA).

Results

Merged models were established integrating selected clinical factors, Radscore and DL model for pCR prediction. The areas under the ROC curves (AUCs) of the pre-nCRT merged model were 0.834 (95% CI: 0.737–0.931) and 0.742 (95% CI: 0.650–0.834) in test and validation sets, respectively. The AUCs of the post-nCRT merged model were 0.746 (95% CI: 0.636–0.856) and 0.737 (95% CI: 0.646–0.828) in test and validation sets, respectively. DCA showed that the pretreatment algorithm could yield enhanced clinically benefit than the post-nCRT approach.

Conclusions

The pre-nCRT merged model including clinical factors, Radscore and DL model constitutes an effective non-invasive tool for pCR prediction in LARC.

Peer Review reports

Background

Patients with locally advanced rectal cancer (LARC) are routinely administered neoadjuvant chemoradiotherapy (nCRT) combined with total mesorectal excision (TME) [1, 2]. Approximately 15–27% cases achieve pathological complete response (pCR) and are expected to undergo “Watch & Wait” or nonoperative management (NOM) [3]. The goal of an organ-preserving approach is to maintain quality of life while ensuring safety, without early recurrence or impact on overall survival. In comparison with surgery, total survival time shows no significant difference, while effectively reducing surgical complications and mortality [4, 5]. However, variable response to nCRT has been reported, with some LARC cases showing no obvious tumor regression but suffering from side effects, which hinders the administration of timely surgery or other systemic treatments. Therefore, accurate prediction of pCR may enhance treatment personalization and efficacy. However, the final confirmation of pCR status can solely be made by post-surgical histopathologic examination [6]. Accordingly, developing a noninvasive pretreatment tool to determine candidates for an organ-preserving approach is highly critical.

Nowadays, rectal high-resolution magnetic resonance imaging (mainly including T2WI) is routinely performed for clinical imaging to identify individuals showing a good response and suitability for NOM [6,7,8]. The Magnetic Resonance Imaging and Rectal Cancer European Equivalence (MERCURY) study group developed an MRI-based Tumor Regression Grading (mrTRG) tool according to the five-category pathological tumor regression grading (pTRG) scale [6]. However, currently reported findings demonstrated this approach is defective because of distinct categories and variable subjective judgements. Therefore, a robust noninvasive tool for accurate identification of patients with pCR before or after nCRT needs to be developed. MRI-based radiomics has many advantages in the assessment of treatment response versus traditional imaging evaluation [7], representing a high-value noninvasive tool for predicting pCR status in LARC [8,9,10,11,12]. Meanwhile, substantial evidence reveals a potential benefit for MRI-based deep learning (DL) models for assessing therapeutic response to nCRT, with higher significance than subjective evaluation [13,14,15,16]. Among them, previous findings demonstrated the clinical benefits of clinical factors, MRI radiomics or DL models based on pre- and/or post-nCRT T2WI sequences in assessing therapeutic response in LARC [17, 18].

To our knowledge, however, the radiomics and DL models based on pre- or post-nCRT MRI that can yield enhanced clinically benefits remain unknown. We hypothesized that a combined model applying MRI radiomics features, DL model and clinical parameters might constitute a more reliable tool. Therefore, the current work aimed to validate and compare pre-nCRT and post-nCRT merged models in the prediction of pCR following nCRT for LARC.

Methods

Patients

This study followed the Declaration of Helsinki and had approval from the ethics committee of Changhai Hospital (B2023-022), who waived the requirement for informed consent due to a retrospective design.

First, patients with clinical diagnosis of LARC and administered TME upon nCRT in Changhai Hospital from January 2018 to December 2023 were retrospectively analyzed for model building (cohort 1). Next, LARC cases with the same inclusion criteria treated from June 2019 to December 2023 in Ruijin Hospital Luwan branch were examined for validation (cohort 2).

Inclusion criteria were: (1) initial endoscopic biopsy revealing histologically confirmed rectal adenocarcinoma; (2) primary rectal cancer confirmed as stage II (T3-4N0M0) or stage III (T1-4N1-2 M0) by clinical assessment; (3) pre-nCRT MRI within 14 days before nCRT; (4) post-nCRT MRI were routinely performed within 7 presurgical days; (5) single focus tumor.

Exclusion criteria were: (1) previously diagnosed malignancy or pelvic surgery (n = 7); (2) chronic inflammatory bowel disease (n = 5); (3) poor image quality or no high-resolution T2WI (n = 11); (4) nCRT and rectal surgery separated by more than 12 weeks (n = 19); (5) preoperative concomitant intestinal obstruction or perforation (n = 7). Altogether, the current study eventually included 197 and 52 cases in cohorts 1 and 2, respectively. Cohort 1 cases with the corresponding pCR statuses were randomized into the training and test sets at 8:2.

Baseline clinical features, including age, gender, BMI, pre-nCRT serum carcinoembryonic antigen (CEA, ng/mL, within 1 month of MRI), interval between nCRT to surgery, and interval between post-nCRT MRI to surgery were retrieved from medical records.

Image acquisition and subjective MRI evaluation

Rectal MR images were obtained pre- and post-nCRT, respectively, on a 3.0T or 1.5T MR scanner with an abdominal phase array coil. The patients underwent a 4-h fasting prior to MRI. Before MRI, enema was administered with glycerin (20 ml) to clean the intestines. Rectal high-resolution axial oblique T2W sequences were applied to obtain images perpendicular to the tumor’s long axis and/or post-nCRT lesion area, and data were used for both radiomics analysis and DL modeling. Supplementary Table S1 summarizes the parameters used for high-resolution T2W sequence.

Pre-nCRT MRI factors [19], including tumor height to anal margin, MR-based T and N stages, mesorectal fascia (MRF) and extramural vascular invasion (EMVI), were assessed by two radiologists (Y.Y. and HD.L. with 12 and 10 years of experience in MRI evaluation, respectively) on a GE PACS RA1000 image archiving and communication system workstation. Each radiologist was blinded to histopathologic data and treatment outcomes, except for the diagnosis of rectal cancer. Any discrepancy between the above two radiologists was resolved by consensual discussion.

Neoadjuvant treatment

The cases were administered neoadjuvant treatment with long-course pelvic radiotherapy (50.4 Gy in 25 to 28 fractions) and oral capecitabine (825 mg/m2 b.i.d.). After 8–12 weeks, radical surgery was carried out.

Definition of tumor regression

Referring to the National Comprehensive Cancer Network and American Joint Committee on cancer staging system [20], tumor regression grades were recorded as proposed previously [21], and response description utilized the pTRG system. Pathological response grading employed a scale ranging from 0 for complete response (no viable cancer cells) to 3 for poor response (negligible or no cancer cell killing). Pathological complete response (pCR) was reflected by the absence of viable cancer cells in primary tumors or lymph nodes (ypT0N0M0).

Clinical factor analysis

Univariable logistic regression analysis was carried out in the training set for various clinical factors such as gender, age, BMI, tumor height, MR T and N stages, MRF, EMVI, and CEA to select indexes independently predicting pCR.

Radiomics analysis

High-resolution T2W DICOM images originally obtained before and after nCRT, respectively, were imported into the uAI Research Portal (United Imaging Intelligence, Shanghai, China).

One radiologist (HD.L., 10 years of experience in MRI evaluation) performed a manual segmentation of the region of interest (ROI) on each pre- and post-CRT T2WI scan (Fig. 1). For segmentation in baseline MRI, a ROI was delineated along the tumor margin not including the outer non-rectal tissue and normal rectum. For segmentation in post-CRT MRI, baseline MRI was employed to identify the initial tumor location, and a ROI was drawn covering the entire post-nCRT lesion including signal intensity demonstrating fibrosis or mucin.

Fig. 1
figure 1

Study flowchart, including model building, comparison, representative images for lesion contouring and the validation process. Center 1: Changhai Hospital; Center 2, Ruijin Hospital Luwan Branch. nCRT: neoadjuvant chemoradiotherapy; LARC: locally advanced rectal cancer; TME: Total mesorectal excision; DL: deep learning

Subsequently, 2264 radiomics features were automatically retrieved from the uAI Research Portal. The various features defined below comply with the Imaging Biomarker Standardization Initiative (IBSI), subdivided into the following classes: (1) first-order statistics; (2) 2D and 3D shape-based features; (3) texture properties (gray-level cooccurrence matrix, gray-level run length matrix, gray-level size zone and neighborhood gray-tone difference matrices, and gray-level dependence matrix features); (4) higher-order statistics (first-order statistics and texture features post-transformation) [22,23,24]. Before training the model, data preprocessing was performed to normalize the original radiomics data by Z–score scaling. In addition, both radiologists (Y.Y. and HD.L.) performed segmentations a second time for 30 cases randomly chosen from the pre- and post-nCRT datasets a month later. Inter- and intraclass correlation coefficients (ICCs) were determined to evaluate inter-observer reliability and intra-observer reproducibility for each feature. Features with inter- and intra-observer ICCs > 0.8 (good robustness) were further examined. Next, the variance threshold algorithm (variance threshold = 0.1) was applied for further reduction. At last, the least absolute shrinkage and selection operator (LASSO) algorithm was utilized to determine optimal parameters related to pCR. Leave-one-out cross-validation was employed for selecting the optimal regularization index alpha, as average mean square errors for various patients were small. Features with non-zero coefficients were employed to build Radscore-1 based on the training set of pre-nCRT MRI data and Radscore-2 trained using the post-nCRT dataset.

DL model construction

To facilitate pCR prediction, the simple and easily manageable ResdualNet was introduced to classify 3D ROIs [25,26,27]. The residual block was considered the basic unit for the construction of a neural network for classification in the training set. The details of DL model are provided in Supplementary Table S2. Segmentation data from radiomics analysis were trained as the input for classification. Two DL models were constructed, with DL model-1 trained on pre-nCRT MRI data and DL model-2 trained exclusively on the post-nCRT dataset.

Merged models

A multivariate logistic regression analysis combining the selected clinical features, Radscore and DL-model predicted probability was carried out to establish a prediction model for pCR. Merged model 1 encompassed clinical factors, Radscore-1 and DL model-1 in the pre-nCRT dataset. Merged model 2 included clinical factors, Radscore-2 and DL model-2 in the post-nCRT dataset. Figure 1 shows the study flowchart.

Statistical analysis

Continuous variates, expressed as mean ± standard deviation, were assessed for normality by the Kolmogorov-Smirnov test. Those with normal and skewed distributions were compared by the Student’s t-test and the Kruskal-Wallis H test, respectively. Categorical variates were compared by the Pearson Chi-square test or the Fisher’s exact test. Receiver operator characteristic (ROC) curves were generated to examine the performances of the obtained models by determining areas under the ROC curves (AUCs) in both datasets. The models were compared by the DeLong’s test. The diagnostic sensitivity, specificity, accuracy, precision and F1 score were assessed for each model. The confusion matrix and decision curve analysis (DCA) were performed to assess the models’ clinical potentials. Calibration curve analysis and the Brier score test were utilized for goodness of fit estimation. Two-sided P < 0.05 suggested statistical significance. MedCalc 19.8 (MedCalc Software, Mariakerke, Belgium) and R 4.1.3 (Vienna, Austria) were employed for data analysis.

Results

Patient characteristics

Totally 197 and 52 patients were assessed in cohorts 1 and 2, respectively. Both cohorts were comparable in clinicopathological indexes that are summarized in Table 1. According to postoperative pathological examination, 43 patients (21.8%) showed pCR in cohort 1, and 12 cases (23.1%) were classified as pCR in cohort 2. Univariable logistic regression analysis in the training set revealed CEA, MR T stage and N stage as independent predictive factors of pCR in LARC cases (Supplementary Table S3).

Table 1 Patient demographics

Radiomics and DL models for pCR prediction

Totally 2264 radiomics features were derived from rectal MRI data acquired pre- and post-nCRT, respectively. Then, 2037 (90.0%) and 2009 (88.7%) features had good robustness (inter- and intra-observer ICCs above 0.8), respectively, and were further examined. Finally, 5 and 5 vital parameters associated with pCR were selected using the LASSO algorithm (Supplementary Fig. 1) based on rectal MRI data acquired pre- and post- nCRT, respectively, to build Radscore-1 and Radscore-2 (Fig. 2and Supplementary Fig. 2). Significant differences were found between the pCR and non-pCR groups in Radscore-1 (P < 0.0001) and Radscore-2 (P = 0.021).

Fig. 2
figure 2

Selected radiomics features and Radscore differences. (a) Totally 5 features from Pre-nCRT MRI for pCR status were obtained. (b) The difference in Radscore-1 between the pCR and non-pCR groups was significant (P < 0.0001). (c) Totally 5 features from Post-nCRT MRI for pCR status were obtained. (d) The difference in Radscore-2 between the pCR and non-pCR groups was significant (P = 0.021)

ROC analysis of Radscore and DL model in the training set are shown in Supplementary Table S4. In the test set, pre-nCRT Radscore-1 and post-nCRT Radscore-2 had AUCs of 0.713 and 0.653, respectively. DL model-1 from baseline MRI had an AUC of 0.753, and DL model-2 driven from post-nCRT MRI showed an AUC of 0.663. Meanwhile, in the validation set, Radscore-1 and Radscore-2 had AUCs of 0.643 and 0.635, respectively. The AUCs of DL models 1 and 2 were 0.669 and 0.642, respectively. The DeLong’s test showed these models had no significant difference (P > 0.05). The models are described in detail in Table 2 and Supplementary Fig. 3. The confusion matrixes are shown in Supplementary Fig. 4.

Table 2 ROC curve analysis in the test and validation sets for Radscore and DL model

Merged models for pCR prediction

In the training set, merged models were established using the multivariate logistic regression analysis (Fig. 3). Merged model 1 including clinical factors, Radscore-1 and DL model-1 showed an AUC of 0.909 (95%CI: 0.853–0.949). Merged model 2 including clinical factors, Radscore-2 and DL model-2 showed an AUC of 0.884 (95%CI: 0.823–0.929). Both models were comparable in the DeLong’s test (P = 0.219).

Fig. 3
figure 3

Forest plot depicting multivariate logistic regression analyses for predicting pCR by Radscore, DL model and clinical factors in merged models 1 and 2, respectively. OR: odds ratio; CEA: carcinoembryonic antigen. AUC of Merged model 1: 0.909 (95%CI 0.853–0.949). AUC of Merged model 2: 0.884 (95%CI 0.823–0.929)

The AUC of merged model 1 was 0.834 (95%CI: 0.737–0.931), with high sensitivity of 97.8%, and an accuracy of 88.2% in the test set; these values were 0.742 (95%CI: 0.650–0.834), 96.7% and 80.9% in the validation set, respectively. Meanwhile, merged model 2 had an AUC of 0.746 (95%CI: 0.636–0.856), with a sensitivity of 97.8% and an accuracy of 81.5% in the test set; these values were 0.737 (95%CI: 0.646–0.828), 85.0% and 75.8% in the validation set, respectively. Compared to merged model 2, the net reclassification improvement index (NRI) of merged model 1 were 0.052 and 0.041 in the test and validation sets, respectively. These models are described in detail in Table 3; Fig. 4. Calibration curves were satisfactory in both sets (Supplementary Fig. 5). DCA demonstrated that merged model 1 had enhanced net benefit compared with merged model 2 at a large threshold probability in both test and validation sets (Fig. 5). In the test and validation sets, DCA showed enhanced net benefit for merged model 1 compared with merged model 2, the treat-all scheme and the treat-none scheme at a wide range of threshold probabilities.

Table 3 ROC curve analysis of merged models in the test and validation sets
Fig. 4
figure 4

Receiver operator characteristic (ROC) curve analyses of merged models for pCR status prediction. (a) In the test set. (b) In the validation set

Fig. 5
figure 5

Decision curve analysis (DCA) of the developed merged models. (a) In the test set. (b) In the validation set

Discussion

Radiomics features and DL models based on rectal high-resolution T2WI data pre- and post-nCRT were obtained in this study. Our results demonstrated the merged models combining selected clinical factors, Radscore and DL model predicted the odds of pCR, constituting an effective noninvasive diagnostic strategy for LARC. Furthermore, merged model 1 from pre-nCRT T2WI had AUCs of 0.834 (test set) and 0.746 (validation set), suggesting good predictive efficiency. Clinical decision-making curves demonstrated the preoperative approach conferred added clinical benefit in both test and validation sets.

After nCRT, 15–27% of LARC cases show no live cancer cells, which indicates pCR, and long-term prognosis in such individuals is remarkably improved compared with cases showing residual tumor cells [4]. However, pCR is solely confirmed by pathologic assessment of surgical specimens. Magnetic resonance imaging (MRI)-assessed tumor regression grade (mrTRG) derived from high-resolution T2WI data has been previously introduced prior to surgery, which is a 5-point clinical tumor regression grading system derived from the MERCURY group [4,5,6]. However, subjective assessment of mrTRG is challenging because of uncertain correlation with pathologic tumor response and the scarcity of consensual treatment response assessment based on MRI [28]. mrTRG 1 shows sensitivity and specificity of only 32.3% and 93.5% for pCR, respectively, while mrTRG 1/2 had values of 69.9% and 62.2%, respectively, as reported in a meta-analysis [28]. Such sensitivity is obviously unsatisfactory.

Multiple reports have demonstrated radiomics represents a promising noninvasive tool with high potential in the prediction of pathological response to nCRT in LARC cases [8,9,10,11,12]. Indeed, a radiomics model showed a higher pooled AUC compared with the clinical radiological model in identifying pCR [9, 16]. Antunes and coworkers [10] developed a pre-nCRT MRI-based radiomics model with an AUC of 0.712 and an accuracy of 70.5%. Peng and collaborators [11] reported AUCs for radiomics models built with pre-treatment, post-treatment, and delta features of 0.771, 0.681, and 0.871, respectively. Wen et al. [12] established a radiomics nomogram with pre-nCRT cN stage, pre-nCRT Radscore and post-nCRT Radscore that achieved an AUC of 0.852, a higher value than obtained with single radiomics models.

Meanwhile, in the field of deep learning (DL), studies have reported DL models show elevated predictive accuracy compared with the radiomics approach, and combined models integrating clinical features have enhanced diagnostic potential compared with radiomics models alone [13,14,15,16]. The residual network (ResdualNet) attracts increasing attention from computer vision experts because of high performance. Recently, the residual module is considered the basic module for multiple deep neural networks in the field of biomedical imaging [25,26,27]. A systematic review [15] reported the algorithm with highest performance had an AUC of 0.845 (range of 0.71–0.99) and an accuracy of 0.85 (range of 0.83–0.98) in imaging prediction of pCR post-nCRT for LARC. Although the DL approach plays an important role in the prediction task, there is still a lack of a more multidimensional and robust preoperative prediction model based on MRI images from pre- or post-nCRT to compare the predictive performance.

It is worth noting that we here compared different radiomics features, DL models and the merged models before or after nCRT. The results indicated the merged model combining baseline clinical factors, pretreatment MRI Radscore and DL model could ameliorate pCR prediction in LARC following nCRT in the test and validation sets. The AUC of merged model 1 was 0.742–0.834, with relatively high sensitivity (0.967–0.978) and accuracy (0.809–0.882). This is consistent with previous findings, and predicting pCR with elevated sensitivity could help noninvasively identify individuals in whom surgical treatment is avoidable, since long-term survival rates are similar in cases with pCR post-rectum resection and the watch & wait group. Consequently, a pretreatment tool was designed effectively to provide an enhanced and objective understanding of predictive models before clinical decision-making.

By comparison, merged model 2 had a relatively low AUC of 0.737–0.746, as well as a relatively low accuracy (0.758–0.815). DCA also revealed enhanced clinical benefits for the pretreatment approach versus the posttreatment model in pCR prediction. These findings indicated that a tumor may exhibit varying degrees of necrosis, edema, scars and fibrosis, which may hamper the accurate segmentation of lesions [29]. Another factor affecting the stability and standardization of results is the variable interval between nCRT and surgery, leading to an inaccurate assessment of tumor heterogeneity, so posttreatment radiomics features and DL model are not sufficient to effectively predict pCR.

Inclusion of an external validation cohort was also notable in this study. When only a few samples are available, especially in the medical field, the model is prone to overfitting. However, because of uniform sample distribution between the training and validation sets, the external validation cohort showed good discrimination, calibration, and performance in the evaluation of pCR. These findings suggest including an external dataset may help mitigate the potential overfitting in a newly developed model. In addition, we adopted data augmentation for guiding and teaching the network to be general and robust. Therefore, the merged model is promising in enhancing diagnostic confidence for radiologists and clinicians in management decisions considering preoperative MRI data.

The current study had limitations. First, manual VOI delineation was employed. Despite its unsuitability for large data, manual segmentation is a more accurate method, which avoids the effect of intestinal wall deformation and potential automatic recognition errors [30,31,32]. Secondly, although we enrolled an external validation cohort, a retrospective design and a small sample size were used, indicating potential selection bias. Consequently, large multicenter studies are warranted to decrease the effects of data bias on the generalization of the model [33, 34]. Finally, only HR-T2WI for RC analysis was used since other relevant MR sequences, such as CE-MRI, were not available for all patients in our institution. Therefore, we did not apply multiple sequences for analysis to diminish selection bias. Another important aspect was avoiding a large amount of computational burden. Therefore, incorporation of transformer-based model with better suitability for handling large datasets effectively deserves further investigation.

Conclusions

In conclusion, this study developed pCR prediction models including radiomics and deep learning based on pre- and post- nCRT MRI. Additionally, the pre-nCRT merged model yielded enhanced clinical benefit compared with the post-nCRT model, indicating the model might boost confidence for pCR prediction in clinical management decisions in LARC cases before nCRT.

Data availability

The datasets used and/or analyzed in the current study are available from the corresponding author on reasonable request.

Abbreviations

LARC:

Locally advanced rectal cancer

TME:

Total mesorectal excision

nCRT:

Neoadjuvant chemoradiotherapy

TRG:

Tumor regression grade

pCR:

Pathological complete response

MRI:

Magnetic resonance imaging

T2WI:

T2 weighted imaging

VOI:

Volume of interest

LASSO:

Least absolute shrinkage and selection operator

DL:

Deep learning

DCA:

Decision curve analysis

References

  1. van Gijn W, Marijnen CA, Nagtegaal ID, et al. Preoperative radiotherapy combined with total mesorectal excision for resectable rectal cancer: 12-year follow-up of the multicentre, randomised controlled TME trial. Lancet Oncol. 2011;12:575–82.

    Article  PubMed  Google Scholar 

  2. Oronsky B, Reid T, Larson C, Knox SJ. Locally advanced rectal cancer: the past, present, and future. Semin Oncol. 2020;47:85–92.

    Article  PubMed  Google Scholar 

  3. Benson AB, Venook AP, Al-Hawary MM, et al. Rectal cancer, version 2.2018, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw. 2018;16:874–901.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Maas M, Nelemans PJ, Valentini V, et al. Long-term outcome in patients with a pathological complete response after chemoradiation for rectal cancer: a pooled analysis of individual patient data. Lancet Oncol. 2010;11:835–44.

    Article  PubMed  Google Scholar 

  5. van der Valk MJM, Hilling DE, Bastiaannet E, et al. Long-term outcomes of clinical complete responders after neoadjuvant treatment for rectal cancer in the International Watch & Wait Database (IWWD): an international multicentre registry study. Lancet. 2018;391(10139):2537–45.

    Article  PubMed  Google Scholar 

  6. Patel UB, Taylor F, Blomqvist L, et al. Magnetic resonance imaging-detected tumor response for locally advanced rectal cancer predicts survival outcomes: MERCURY experience. J Clin Oncol. 2011;29:3753–60.

    Article  PubMed  Google Scholar 

  7. Lambin P, Leijenaar RTH, Deist TM, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14(12):749–62.

    Article  PubMed  Google Scholar 

  8. Shin J, Seo N, Baek SE, et al. MRI radiomics model predicts pathologic complete response of rectal cancer following chemoradiotherapy. Radiology. 2022;303(2):351–8.

    Article  PubMed  Google Scholar 

  9. Wan L, Sun Z, Peng W, et al. Selecting candidates for organ-preserving strategies after neoadjuvant chemoradiotherapy for rectal cancer: development and validation of a model integrating MRI radiomics and pathomics. J Magn Reson Imaging. 2022;56(4):1130–42.

    Article  PubMed  Google Scholar 

  10. Antunes JT, Ofshteyn A, Bera K, et al. Radiomic features of primary rectal cancers on baseline T2 -weighted MRI are associated with pathologic complete response to neoadjuvant chemoradiation: a multisite study. J Magn Reson Imaging. 2020;52(5):1531–41.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Peng J, Wang W, Jin H, et al. Develop and validate a radiomics space-time model to predict the pathological complete response in patients undergoing neoadjuvant treatment of rectal cancer: an artificial intelligence model study based on machine learning. BMC Cancer. 2023;23(1):365.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Wen L, Liu J, Hu P, et al. MRI-based radiomic models outperform radiologists in predicting pathological complete response to neoadjuvant chemoradiotherapy in locally advanced rectal cancer. Acad Radiol. 2023;30(Suppl 1):S176–84.

    Article  PubMed  Google Scholar 

  13. Zhang XY, Wang L, Zhu HT, et al. Predicting rectal cancer response to neoadjuvant chemoradiotherapy using deep learning of diffusion kurtosis MRI. Radiology. 2020;296(1):56–64.

    Article  PubMed  Google Scholar 

  14. Ke J, Jin C, Tang J, et al. A longitudinal MRI-based artificial intelligence system to predict pathological complete response after neoadjuvant therapy in rectal cancer: a multicenter validation study. Dis Colon Rectum. 2023;66(12):e1195–206.

    PubMed  Google Scholar 

  15. Prabhakaran S, Choong KWK, Prabhakaran S, Choy KT, Kong JC. Accuracy of deep neural learning models in the imaging prediction of pathological complete response after neoadjuvant chemoradiotherapy for locally advanced rectal cancer: a systematic review. Langenbecks Arch Surg. 2023;408(1):321.

    Article  PubMed  Google Scholar 

  16. Jia LL, Zheng QY, Tian JH, et al. Artificial intelligence with magnetic resonance imaging for prediction of pathological complete response to neoadjuvant chemoradiotherapy in rectal cancer: a systematic review and meta-analysis. Front Oncol. 2022;12:1026216.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Miranda J, Tan GXV, Fernandes MC, et al. Rectal MRI radiomics for predicting pathological complete response: where we are. Clin Imaging. 2022;82:141–9.

    Article  PubMed  Google Scholar 

  18. Di Re AM, Sun Y, Sundaresan P, Hau E, et al. MRI radiomics in the prediction of therapeutic response to neoadjuvant therapy for locoregionally advanced rectal cancer: a systematic review. Expert Rev Anticancer Ther. 2021;21:425–49.

    Article  PubMed  Google Scholar 

  19. Nougaret S, Reinhold C, Mikhael HW, Rouanet P, Bibeau F, Brown G. The use of MR imaging in treatment planning for patients with rectal carcinoma: have you checked the DISTANCE? Radiology. 2013;268:330–44.

    Article  PubMed  Google Scholar 

  20. Amin MB, Greene FL, Edge SB, et al. The eighth edition AJCC cancer staging manual: continuing to build a bridge from a population-based to a more personalized approach to cancer staging. CA Cancer J Clin. 2017;67(2):93–9.

    Article  PubMed  Google Scholar 

  21. Quirke P, Durdey P, Dixon MF, Williams NS. Local recurrence of rectal adenocarcinoma due to inadequate surgical resection. Histopathological study of lateral tumour spread and surgical excision. Lancet. 1986;2(8514):996–9.

    Article  CAS  PubMed  Google Scholar 

  22. Chen F, Ma X, Li S, et al. MRI-based radiomics of rectal cancer: assessment of the local recurrence at the site of anastomosis. Acad Radiol. 2021;28(Suppl 1):S87–94.

    Article  PubMed  Google Scholar 

  23. Ma X, Shen F, Jia Y, Xia Y, Li Q, Lu J. MRI-based radiomics of rectal cancer: preoperative assessment of the pathological features. BMC Med Imaging. 2019;19(1):86.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Zwanenburg A, Vallières M, Abdalah MA, et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. 2020;295(2):328–38.

    Article  PubMed  Google Scholar 

  25. Cui J, Liu S, Tian Z, Zhong Z, Jia J. ResLT: residual learning for long-tailed recognition. IEEE Trans Pattern Anal Mach Intell. 2023;45(3):3695–706.

    PubMed  Google Scholar 

  26. Wu J, Xuan S. Scale-aware dense residual retinal vessel segmentation network with multi-output weighted loss. BMC Med Imaging. 2023;23(1):100.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Ibtehaz N, Rahman MS. MultiResUNet: rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 2020;121:74–87.

    Article  PubMed  Google Scholar 

  28. Jang JK, Choi SH, Park SH, et al. MR tumor regression grade for pathological complete response in rectal cancer post neoadjuvant chemoradiotherapy: a systematic review and meta-analysis for accuracy. Eur Radiol. 2020;30:2312–23.

    Article  PubMed  Google Scholar 

  29. Yuan Y, Zheng K, Zhou L, et al. Predictive value of modified MRI-based split scar sign (mrSSS) score for pathological complete response after neoadjuvant chemoradiotherapy for patients with rectal cancer. Int J Colorectal Dis. 2023;38(1):40.

    Article  PubMed  Google Scholar 

  30. Yip SS, Aerts HJ. Applications and limitations of radiomics. Phys Med Biol. 2016;61(13):R150–66.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Trebeschi S, van Griethuysen JJM, Lambregts DMJ, et al. Deep learning for fully-automated localization and segmentation of rectal cancer on multiparametric MR. Sci Rep. 2017;7(1):5301.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Tian C, Ma X, Lu H, et al. Deep learning models for preoperative T-stage assessment in rectal cancer using MRI: exploring the impact of rectal filling. Front Med (Lausanne). 2023;10:1326324.

    Article  PubMed  Google Scholar 

  33. Price WN 2nd, Cohen IG. Privacy in the age of medical big data. Nat Med. 2019;25:37–43.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat Rev Cancer. 2018;18:500–10.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We are especially grateful to all the patients and their families involved in this study. We thank all the team members at the Department of Colorectal Surgery, the Department of Radiology, and the Department of Pathology in all the participant institutions. We also thank Dr. Qi Chen in The Navy Medical University, Shanghai, China, for providing statistical consultation.

Funding

The study was supported by The National Natural Science Foundation of China (82371955, 82171891); Foundation of Shanghai Municipal Health Commission (202240204); “Yi Yuan Xin Xing” young medical talents funding project of Shanghai and the Guhai Project of Changhai hospital (GH145-09). The funder of the study had no role in design, data collection, data analysis, data interpretation, or writing of the report.

Author information

Authors and Affiliations

Contributions

All authors read and approved the final manuscript. Guarantors of integrity of entire study: Y.L., JP.L., F.S.; study concepts/interpretation: Y.L., JP.L., F.S.; agrees to ensure any questions related to the work are appropriately resolved: F.S.; data acquisition: XL.M.; deep learning algorithm: YW.X. and Feng Shi; clinical studies: ZH.L.,Y.Y., HD.L, and ML.L.; literature research, Y.Y., HD.L.; statistical analysis: F.S., Y.Y., YW.X. and Feng Shi; manuscript editing: Y.Y., HD.L., and ML.L.; JP.L., Y.L., and F.S. were responsible for manuscript revision.

Corresponding authors

Correspondence to Yong Lu, Jianping Lu or Fu Shen.

Ethics declarations

Ethics approval and consent to participate

All methods of the present research were carried out in accordance with the Declaration of Helsinki and were approved by the local Institutional Review Board (Committee on Ethics of Biomedicine, Changhai Hospital, Shanghai, China). Informed consent was waived for this retrospective study.

Consent for publication

NA.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lu, H., Yuan, Y., Liu, M. et al. Predicting pathological complete response following neoadjuvant chemoradiotherapy (nCRT) in patients with locally advanced rectal cancer using merged model integrating MRI-based radiomics and deep learning data. BMC Med Imaging 24, 289 (2024). https://doi.org/10.1186/s12880-024-01474-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1186/s12880-024-01474-3

Keywords