Abstract
Aim: Anastomotic leakage (AL) in left-sided colorectal cancer is a serious complication, with an incidence rate of 6-18%. We developed a novel predictive model for AL in colorectal surgery with double-stapling technique (DST) anastomosis using auto-artificial intelligence (AI). Patients and Methods: A total of 256 patients who underwent curative surgery for left-sided colorectal cancer between 2017 and 2021 were included. In addition to conventional clinicopathological factors, we included the type of circular stapler using DST, conventional double-row circular stapler (DCS) or EEA™ circular stapler with Tri-Staple™ technology, 28 mm Medium/Thick (Covidien, New Haven, CT, USA) which had triple-row circular stapler (TCS) as a covariate. Auto-AI software Prediction One (Sony Network Communications Inc.) was used to predict AL with 5-fold cross validation. Predictive accuracy was assessed using the area under the receiver operating characteristic curve. Prediction One also evaluated the ‘importance of variables’ (IOV) using a method based on permutation feature importance. Results: The area under the curve of the AI model was 0.766. The type of circular stapler used was the most influential factor contributing to AL (IOV=0.551). Conclusion: This auto-AI predictive model demonstrated an improvement in accuracy compared to the conventional model. It was suggested that use of a TCS may contribute to a reduction in the AL rate.
Colorectal cancer is the most common cancer globally and the second leading cause of cancer-related deaths, especially in Japan. Although for colorectal cancer anastomosis with the double-stapling technique (DST) using circular staplers is one of the most frequently employed standard methods (1–5), anastomotic leakage (AL) is a serious complication with an incidence rate of 6-18% (6–10). AL affects not only short-term outcomes, but also long-term oncological prognosis and functional outcomes (11). Successful anastomosis depends on several factors, including the tension between the two connected portions of the gastrointestinal tract, a healthy blood supply to the surrounding tissues, and the mechanical strength of the anastomosis (12–14).
Although many risk factors for AL have been reported, it remains difficult to accurately predict. Previous reports were analyzed using conventional statistical methods, such as multivariate analysis or nomograms (15–18). The accuracy of these retrospective reports was somewhat low at 0.75. Recently, artificial intelligence (AI) has rapidly developed worldwide. With the development of sequencing technologies and computational methods to facilitate big data analysis, AI has been developed as a prognostic tool to refine precision and accuracy (19). Although it is difficult for clinical surgeons to develop machine-learning algorithms, the new auto machine-learning software Prediction One (Sony Network Communications Inc.) enables predictions to be evaluated quickly and easily.
The choice of surgical instruments for anastomosis greatly affects the safety of the operation. Conventionally, DST has been performed using a double-row circular stapler (DCS) but a new circular stapler, EEA™ circular stapler with Tri-Staple™ technology, 28 mm Medium/Thick (Covidien, New Haven, CT, USA) introduced recently has a triple row of staples and is expected to improve pressure resistance. However, the effectiveness of this new triple-row circular stapler (TCS) has not been fully verified or compared with conventional DST.
In this study, we aimed to develop a novel predictive model for AL of DST for colorectal cancer using auto-AI software Prediction One easily and quickly.
Patients and Methods
Patients. Data from 256 patients who underwent curative surgery for left-sided colorectal surgery with DST anastomosis between January 2017 and October 2021 at Tokyo Medical University Hospital were acquired. The following 18 variables were extracted from this database: Age (≤75/>75 years), sex, body mass index (≤25/>25 kg/m2), American Society of Anesthesiologists score (0/≥1), preoperative hemoglobin (Hb) (<10/≥10 g/dl), surgical procedure (open/minimum invasive surgery), diverting stoma (yes/no), normal/emergency surgery, operative time (<400/≥400 min), hemorrhage (<70/≥70 ml), anastomotic level from anal verge (middle rectum, 5≤height ≤10cm/upper rectum, 10<height ≤15 cm), circular stapler used (DCS/TCS), postoperative complications (Clavien–Dindo classification >2), histological type (tub1/mucinous or signet/other), pathological T-status (pT), pathological N-status (pN) (negative/positive), lymphatic invasion (negative/positive), and venous invasion (negative/positive). We excluded patients who had anastomosis <5 cm from the anal verge and who underwent preoperative therapy. Generally, patients were admitted to our hospital a few days before surgery, and we obtained laboratory values on the day of admission. This study was conducted in accordance with the guidelines of the Declaration of Helsinki. The Review Board of the Tokyo Medical University Hospital approved the study (T2019-0060), and informed consent was obtained from all patients.
Surgical treatment. We performed curative surgeries for all the patients. The same group of dedicated colorectal surgeons performed all surgeries. The anal side of the colorectum was incised using a linear stapler. The anvil of the circular stapler was secured in place, and end-to-end anastomosis was performed using the DST. The rod of the circular stapler was inserted from the opposite side of the linear staple line, piercing the rectal stump near the linear staple line.
Postoperative care for AL. The 30-day AL rate was recorded and cases were diagnosed according to the International Study Group of Rectal Cancer AL definition (20) as a defect of the intestinal wall at the anastomotic site, which leads to a communication between the intra- and extraluminal compartments, or as an abscess adjacent to the anastomosis. A computed tomographic (CT) scan with rectal contrast was performed in patients with suspected AL in the absence of unquestionable clinical signs of peritonitis, which would indicate urgent surgery. On postoperative days 3, 5, and 7, the C-reactive protein level was determined in all cases. For patients with a C-reactive protein level >200 mg/l, a CT scan was performed. The following CT findings were considered suggestive of anastomotic failure: Contrast leakage from inside the bowel to the pelvis or abdominal cavity, abscess, or perianastomotic collection associated with or without localized pneumoperitoneum.
Statistical analysis. Descriptive statistics are provided for all variables. Regarding anastomotic leakage, we set the cut-off values of each covariate using receiver operating curves. The relationship between the qualitative variables was established using the chi-square test. All statistical analyses were performed using EZR (Saitama Medical Center, Jichi Medical University, Saitama, Japan), which is a graphical user interface for R (The R Foundation for Statistical Computing, Vienna, Austria). The level of statistical significance was set at p<0.05.
Artificial intelligence. We used Prediction One (https://predictionone.sony.biz; Sony Network Communications Inc.) machine-learning software to predict AL. The software generates feature vectors from the dataset using standard preprocessing methods, such as one-hot encoding for categorical variables and normalization for numerical variables. A gradient-boosting tree and a neural network are used as supervised machine-learning models, each trained with hyperparameter tuning. An ensemble model of both trained models was constructed. Missing values are automatically handled with common machine-learning techniques, such as using a gradient-boosting tree, which can handle missing values. The area under the receiver operating characteristics curve (AUC) was calculated using internal validation to evaluate the accuracy of the AI model. Prediction One made the best predictive model using an artificial neural network with 5-fold cross-validation. Prediction One also evaluated the ‘importance of variables’ (IOV) using a method based on permutation feature importance. This method was used to calculate the difference in the model output when a single variable was removed. The value of the difference in the model output indicates how much the model depends on the variable. The value of the difference was computed for each covariate and then averaged over those in the dataset.
Results
Patient and tumor characteristics. Baseline patient and tumor characteristics of the entire cohort, divided by the presence or absence of AL are shown in Table I. The rate of AL for the entire cohort was 6%. The rates of AL in the DCS and TCS subgroups were 7.7% and 2.7%, respectively (p=0.226).
AI analysis. We used Prediction One to analyze the model for predicting AL. Data from 156 and 100 patients were used for the learning and validation models, respectively. The receiver operating characteristics curve of the AI model is shown in Figure 1. The AUC was 0.766. Prediction One was also used to calculate the IOV of each factor for anastomotic leakage. Factors with IOV ≥0.040, in decreasing order, were the type of circular stapler used (0.055), pT status (0.053), pN status (0.051), and venous invasion (0.043). The total IOV and IOV for AL are shown in Table II, with the distribution according to the presence and absence of AL. The contributions of DCS and pT status to AL were the highest among all covariates.
Discussion
The AL rate is on a downward trend compared to before, but it has not yet reached zero (21, 22). The strength of this study is that a clinical surgeon not familiar with the use of AI was able to construct a highly accurate predictive model using simple auto-AI. Prediction One can automatically produce the predictive model within a few minutes, as long as the dataset is provided. We developed a novel auto-AI-generated model for predicting AL in colorectal cancer surgery with high accuracy. We also included a new type of circular stapler as covariate to be considered as an important factor in AL.
Regarding the accuracy of previously reported models using conventional statistical analyses, reported values of the AUC were all under 0.75 (15–18). In the present study, the AUC of the AI model was 0.766, which is an acceptable value. Moreover, this model, which consists of simple clinicopathological factors that are easy to obtain, can be constructed at a low cost. The use of machine learning, which includes supervised or unsupervised approaches, is becoming more and more necessary in the field of clinical research. Conventional statistical analysis uses a (generalized) linear model that does not combine features. However, Prediction One uses a non-linear model (gradient-boosting tree and a neural network) that can handle combinations of features. This may be the reason for the improved accuracy.
Previously, we used a DCS for the DST in colorectal surgery. Since the EEA™ circular stapler, a new circular stapler with TCS was introduced in Japan in January 2021, this device has been used to perform DST at our Institute. TCS with a triple row of staples is expected to improve pressure resistance, as compared to conventional DCS. However, the effectiveness of TCS in reducing the risk of AL has not been fully examined. In the present study, we included the type of circular stapler used in the DST in AI analysis. Among the many factors, in addition to the oncological factors such as pT and pN status predicted before the analysis, we found that the type of circular stapler was the superior factor related to AL. This suggests that changing the circular stapler from DCS to TCS may result in a reduction in AL.
There are some limitations to the present study. Firstly, we used a single-center, retrospective design. Secondly, it is difficult to understand the internal mechanism of AI analysis (‘black-box problem’) (23), which is problematic because entrusting important decisions to a system that is difficult to explain itself presents obvious dangers. Although Prediction One partly resolved the ‘black-box problem’ by calculating IOVs, we were not able to completely exclude this problem in the present analyses. Thirdly, external validation was not performed. Although external validation was not enforced, a 5-fold cross-validation was enforced. Finally, the number of patients was small for the deep learning analysis. A prospective study with a large number of patients and external validation will be necessary to further improve the AI model.
In conclusion, we developed a novel AI-generated predictive model with improved accuracy compared to the conventional statistical analysis model for predicting AL in patients with colorectal cancer. TCS may contribute to a reduction in the AL rate.
Footnotes
Authors’ Contributions
J.M. wrote the main article and R.U., T.T., K.K., H.K., M.E. and T.I. acquired data for the work. KK, YN and AT drafted the work or revised it critically for important intellectual content. All Authors reviewed the final article.
This article is freely accessible online.
Conflicts of Interest
The Authors declare that there are no conflicts of interest.
- Received August 7, 2021.
- Revision received October 7, 2021.
- Accepted October 12, 2021.
- Copyright © 2021 International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.