Abstract
Background/Aim: This study aimed to determine appropriate deformable image registration (DIR) algorithms for pancreatic cancer patients undergoing carbon ion radiotherapy (CIRT). Patients and Methods: The performance of three types of DIR algorithms, including intensity-based DIR (iDIR), contour-based DIR (cDIR), and hybrid DIR (hDIR) were evaluated using seventy-one CT images from eight pancreatic cancer patients. Both the geometry of the CTV and subsequent dose warping discrepancies were evaluated using the dice similarity coefficient (DSC) and the difference in V95. Results: cDIR and hDIR had superior performance than iDIR in both DSC and V95 (p<0.0001). iDIR caused accumulated dose to be underestimated by 5% in the median CTV V95 compared to the other methods. Conclusion: hDIR and cDIR can be utilized to assess the accumulated dose in CIRT for pancreatic cancer. iDIR can be considered when the obtained DSC is greater than 0.89.
- Deformable image registration (DIR)
- accuracy evaluation
- intensity-based DIR
- contour-based DIR
- hybrid DIR
- pancreatic cancer
- carbon ion radiotherapy
- dose accumulation
Carbon ion radiotherapy (CIRT) is an effective technology for use in patients with pancreatic cancer due to its unique physical and biological characteristics. The Bragg peak and sharp lateral dose penumbra allows for an excellent dose distribution in the target, while sparing healthy tissues (1). However, this property also makes it highly sensitive to the position and anatomical uncertainties of the tumor and normal tissues over the course of treatment. A three-dimensional imaging verification system is currently available in some carbon ion facilities (2); this system makes it possible to conduct dosimetric assessment based on daily CT images, and provides a possibility to perform adaptive radiotherapy with carbon ion beams. To trigger re-planning during adaptive RT, estimation of the cumulated dose is necessary to obtain the actual delivered dose over fractions. Therefore, it is important to select a tool with a sufficiently high accuracy to accumulate the dose.
A satisfactory image registration between moving and reference CT images is essential to evaluate the accumulated dose. In recent years, many image registration software and tools have been developed and implemented in clinical practice, in particular, deformable image registration (DIR). In contrast to rigid image registration (RIR), which is effective in cases where no anatomical change is expected, DIR can manage local distortion between two image sets (i.e., organ deformation and tumor movement) (3). Therefore, RIR is not considered to be appropriate for pancreatic cancer, where a large organ deformation in the gastrointestinal tract is common (4). Furthermore, the accuracy of image registration in dose warping is much lower in RIR than in DIR methods (5). Intensity-based DIR (iDIR) has been widely utilized for contour propagation, dose warping, and accumulation for various cancers (6). iDIR uses a variety of image intensity metrics, such as gray scale, mutual information, and cross correlation, to match the images. Therefore, iDIR can achieve high accuracy for organs with ample distinctive image features or high internal contrast (7, 8). Indeed, iDIR has been used to examine the accumulated doses for lung and liver cancer in CIRT (9, 10). However, for pancreatic cancer, it is often difficult to identify the boundary between the tumor and surrounding soft tissues on CT images due to the low contrast, which may reduce the accuracy of iDIR. Additionally, the iDIR algorithm has limitations in the extent to which it can accurately match large heterogeneous deformations in regions without the aid of features between images (11-13). To reduce these uncertainties, contour-based DIR (cDIR) and hybrid DIR (hDIR) have been recently developed, which use only contours in images (cDIR), or both the contour and image intensity metrics (hDIR) to guide the registration. cDIR and hDIR appear to have the potential to improve the accuracy of DIR for pancreatic cancer. However, few studies have focused on testing the accuracy of DIR algorithms for pancreatic cancer, especially those based on CIRT (5, 14, 15). Considering the steeper dose gradients of carbon ion beams, the uncertainties of DIR in anatomical structure registration may have greater effects on dose warping than conventional RT. Therefore, evaluation of the accuracy of DIR in CIRT is necessary. In our previous study, we demonstrated that cDIR and hDIR can obtain a better accuracy in dose warping than iDIR for CT image registration between different patient positions (supine and prone) (5), while a similar performance was observed between the same patient position. However, the most appropriate DIR method for dose accumulation was unclear due to the limited number of CT images studied.
In the current study, we evaluated the performance of iDIR, cDIR, and hDIR methods in eight patients with pancreatic cancer. To this end, we examined adaptive CIRT based on daily CT sets, and proposed a cut-off value for clinical DIR selection for dose accumulation by quantifying the discrepancy between the DIR algorithms.
Patients and Methods
Patient material. A total of eight pancreatic cancer patients treated with CIRT with a passive irradiation method were studied; the data were generated from the same patient group used in our previous study (5). In contrast to our previous study, where few CT sets were used, in the current study, all daily CT scans acquired on the treatment day were analyzed. However, to avoid uncertainties from the different positions, only the cases with supine position were selected; the details are shown in Table I. This study was approved by the Institutional Review Board (1564) of our facility, and the study was registered at the University Hospital Medical Information Network Clinical Trials Registry (UMIN-CTR trial number: 000029495).
Patient characteristics.
Image acquisition. The planning CT images were acquired in the simulation room in the 2 weeks before treatment. Daily CT data sets were acquired by a separate CT (CT-on-rails) system on the treatment day under the same settings and conditions as the planning CT in the treatment room (Aquilion LB, Self-Propelled, Canon Medical Systems, Otawara, Japan). Only the CT images with supine position (the first nine fractions) were selected in this study. Thus, a total of eight planning CT sets was obtained as fixed (reference) CT, and 71 daily CT sets were obtained as moving CT (one CT set was missing in case 7 due to equipment fault). The median (range) interval time between treatment days was 1 day (range=1-5 days).
Treatment planning. The contours of the clinical target volume (CTV) and gross tumor volume (GTV) on the planning CT images were used as the reference. To reduce uncertainties from the contour delineation, one radiation oncologist manually delineated the target volumes on each of the daily CT sets, which was subsequently checked by another experienced radiation oncologist. The GTV was determined via contrast-enhanced CT before treatment. To protect the gastrointestinal (GI) organs, additional margins were defined as planning organ at risk volume (PRV)-GI (GI boundary + 2 mm - GTV) at our facility. The CTV was defined as the GTV plus a 5 mm margin, including locoregional lymph nodes and neural plexus regions, minus the PRV-GI margin. The planning target volume (PTV) was determined by adding a 3 mm margin to the CTV, excluding the PRV-GI margin. The treatment plan of four beam fields with a passive scattering technique was created using the XiO-N system, which employs a pencil-beam algorithm (Elekta, Stockholm, Sweden and Mitsubishi Electric, Tokyo, Japan), and only one beam delivered on each treatment day. Our facility uses Gy as the clinical dose unit, which was calculated based on the physical dose and the relative biological effectiveness (RBE) (16). The prescribed dose for each fraction was 4.6 Gy (RBE), and a total of 12 fractions were applied to each patient.
New dose distributions with the same angle as the irradiated beam in the first 9 fractions (angles: 0°, 90°, and 270°) were generated independently with the same criteria as the treatment planning and used for the analysis. The last 3 fractions with prone position were excluded. The doses on daily CT images were used as the ground-truth doses.
Deformable image registration algorithm and evaluation parameters. The DIR algorithms were implemented using the VoxAlign Deformation Engine in MIM maestro (MIM Software Inc., Cleveland, OH, USA). Three types of DIR algorithms, including iDIR, cDIR, and hDIR, were applied to deform the daily CT images to the planning CT images. The iDIR algorithm is a free-form based deformable registration that assumes that voxel values (image intensity) of the same anatomical area are similar between CT images. The cDIR algorithm focuses solely on the contour information without depending on the image intensity, assuming that the target-volume changes are small. However, the hDIR algorithm, considers both the target structure and image intensity information (outside the contour). The deformation vector fields (DVFs) obtained after DIR between the planning CT and daily CT images were applied to the target contours and doses on the daily CT to validate the accuracy of the DIR. Delineated CTV and GTV were deformed separately for cDIR and hDIR, in order to avoid potential uncertainties caused by the deformation of multiple contours. In addition, before performing DIR, all daily CT images were rigidly registered in the corresponding planning CT image with manual tumor matching to improve the accuracy of DIR (17).
Structure correspondence was quantified with metrics of the dice similarity coefficient (DSC), which was calculated to evaluate the amount of volume overlap between each pair of contours using the following formula: DSC=2 (V1 ∩ V2)/(V1 + V2), where V1 and V2 are the volumes of the contour on the planning CT, and the contour deformed from the daily CT to the planning CT, respectively. In this study, we only evaluated the CTV and GTV, while other contours, such as the stomach and duodenum, were not considered. Dose correspondence is another important index to validate the accuracy of DIR, and the dose distribution on each daily CT set was warped and transferred to the corresponding planning CT images using the DVF obtained from each registration. Then, dose differences in the daily and accumulated doses among different DIR algorithms were evaluated by dose-volume parameters such as CTV V95 and GTV V95 (the volume receiving greater than 95% of the prescription dose). In principle, the dose parameters cannot be preserved when the contour volumes are different between the CT images. However, they will be close when the volume changes are very small, and in this situation, a large parameter difference should indicate large errors. Because the CTV and GTV did not change significantly in this study, we considered that smaller dose parameter differences indicated a higher accuracy of DIR.
We used Wilcoxon signed rank test to evaluate the difference in the DSC and dose distributions between the DIR methods. A simple linear regression was used to examine the relationship between the DSC and the dose distributions for the three DIR methods, the dependence between the percentage of cases with a dose difference greater than 2% (5%) after iDIR, and the difference in the accumulated dose in respect to hDIR and cDIR. Receiver operating characteristic (ROC) curve analysis was performed for the DSC with a dose difference greater than 2%, and 5% for iDIR. The cut-off value was determined when the sensitivity was 90%. p<0.05 was considered statistically significant, and p<0.0001 was considered as highly statistically significant.
Results
Figure 1 shows an example of CTV dose distribution deformed via iDIR. The average±standard deviation of the DSC of the CTV for all cases with iDIR, hDIR, and cDIR were 0.88±0.06, 0.94±0.01, and 0.96±0.01, respectively (Table II). hDIR and cDIR had a superior CTV overlap than iDIR (p<0.0001); although hDIR presented satisfactory DSCs, better DSCs were obtained in cDIR (p<0.0001). However, decreased DSCs were observed for the GTV compared to the CTV, with iDIR, hDIR, and cDIR values of 0.82±0.07, 0.89±0.03, and 0.92±0.02, respectively (Table III). iDIR had the worst DSC performance among the DIR methods. The dose differences in the CTV and GTV for all cases are shown in Figure 2. For the CTV V95, the dose changes were much smaller in cases with hDIR and cDIR than in those with iDIR (Figure 2A) (p<0.0001). Most cases had a dose change of less than 2% when using cDIR and hDIR, especially for cDIR, and there was only one case in which the dose change was greater than 1%. However, using iDIR, approximately 62% of cases had a change greater than 2%, while 27% of cases had a change greater than 5% (Table II). It is important to note that an underestimated dose may be obtained following the application of iDIR (Figure 2B). For the GTV, the dose differences were small, and similar in all cases between DIR methods (Figure 2D). However, there were more extreme values with larger dose difference in cases using iDIR (Figure 2C), the details of which are shown in Tables II and III.
Example of dose distributions deformed via iDIR in four cases. The left panel in each column shows the dose distributions on one daily set of CT images (A, C, E, G), and the right panel shows the deformed doses via iDIR on planning CT images (B, D, F, H). The filled white, red, purple, and yellow show the clinical target volume (CTV), gross tumor volume (GTV), stomach, and duodenum, respectively. Change: Dose reduction in CTV receiving greater than 95% of the prescription dose (V95).
Absolute dose difference in clinical target volume (CTV) receiving greater than 95% of the prescription dose (V95) and dice similarity coefficient (DSC) of the CTV with different deformable image registration (DIR) algorithms.
Violin plots for dosimetric difference in clinical target volume (CTV) and gross tumor volume (GTV) receiving greater than 95% of the prescription dose (V95). (A) and (C) show the absolute dose difference in CTV and GTV. The actual dose differences for CTV and GTV are shown in (B) and (D), respectively. The dotted red lines indicate the median dose changes, and the dotted black lines show the data at the 25th and 75th percentile. *p<0.0001; †p<0.05.
Absolute dose difference in gross tumor volume (GTV) receiving greater than 95% of the prescription dose (V95) and dice similarity coefficient (DSC) of the GTV with different deformable image registration (DIR) algorithms.
A negative dependence between the DSC and dose difference was observed in the CTV with iDIR (Pearson correlation coefficient (R): −0.64) (Figure 3A). However, the other two DIR methods had no such dependence. For the GTV, there was no dependence between the DSC and the dose difference in any of the DIR methods (Figure 3B). ROC was performed for DSC with greater than 2% and 5% dose differences in the CTV V95 after iDIR (Figure 4). The area under the curve was 0.82 and 0.84, respectively, and the cut-off value was 0.91 and 0.89 (the sensitivity was 90%), respectively; these results indicated that the dose changes would be greater than 2% if the DSC value is smaller than 0.91, and the dose changes would be greater than 5% if the DSC is smaller than 0.89.
Dependence of dosimetric difference in clinical target volume (CTV) (A) and gross tumor volume (GTV) (B) on dice similarity coefficient (DSC).
Receiver operating characteristic curves for dice similarity coefficients (DSCs) as predictors of dose differences for iDIR. The receiver operating characteristic curve for DSC with 2% (black line) and 5% (gray line) dose difference yielded an area under the curve of 0.817 and 0.843, respectively. These were statistically significant (p<0.0001).
Using iDIR, the median accumulated dose was reduced by approximately 5% in the CTV V95 compared to the other methods in this study (Table II). The dose difference was very small between hDIR and cDIR, and there was no significant difference in GTV between the DIR methods (Table III). Furthermore, a strong dependence was obtained between the percentage of cases with a dose difference greater than 5%, and the accumulated dose difference in respect to the hDIR and cDIR (R=0.90) (Figure 5).
Dependence of the difference in accumulated dose in respect to the hDIR and cDIR on the percentage of cases with dosimetric changes greater than 2% and 5%, respectively.
Discussion
Limited studies have evaluated the performance of DIR algorithms based on CIRT. To the best of our knowledge, this is the first study to determine a cut-off DSC value for iDIR for an accumulating dose in radiotherapy for pancreatic cancer.
Previous studies have reported that a poor DSC may be obtained using iDIR in the abdominal and pelvic region because of the large volume changes in the organs (5, 6, 11, 18). Indeed, the DSC is only about 0.7 for the rectum and bladder in prostate cancer (11, 18). In this study, using iDIR, a median DSC value of 0.89 and 0.82 was obtained for the CTV and GTV, respectively, which were higher than the abovementioned studies. This is mainly because the CTV and GTV changes are relatively small in pancreatic cancer (Table I). Furthermore, satisfactory DSCs were obtained using hDIR and cDIR, especially for the CTV, in which the median values were 0.94 and 0.96, respectively (Table II). In the current study, possible causes for a lower DSC for iDIR may be because the CTV margin was not defined based on an anatomical boundary, meaning that there was no clear intensity-based boundary. In addition, unlike the almost isotropic CTV boundary available in lung and liver cancer, the CTV margin of pancreatic cancer may vary with the anatomical changes surrounding the tumor. However, iDIR has limited capacity to estimate these changes in a low-contrast area (6), which may result in a poor deformed contour in some cases. hDIR and cDIR deform each point based on structures of the target and have no such limitation. For the GTV, the DSCs decreased in all DIR methods tested (Table III), suggesting that DIR accuracy is subject to the target volume. In general, DIR had a better performance in organs with larger volumes in the contour propagation process (19). Many improved algorithms and tools have been developed to reduce these inherent uncertainties and improve the accuracy of contour propagation for adaptive RT (20-22). In addition, the errors from inter- and intra-observer variation in target delineation should be considered (23). In this study, the contours on each CT set were examined by two radiation oncologists to reduce errors, and the average change in CTV was only 3.9 ml, and 2.4 ml in GTV (Table I). However, case 4 has a relatively large change in CTV (25 ml on average), which may have influenced the DIR accuracy.
We also investigated the discrepancy of warped dose among the DIR methods for CTV and GTV. Corroborating a prior study, a minimal dosimetry difference was observed in the CTV using cDIR and hDIR (5); this is mainly attributed to the satisfactory volume overlap of targets. However, a significant difference was observed between hDIR and cDIR (p<0.05) (Table II). Therefore, cDIR appears to be the best DIR method for image registration in pancreatic cancer. In contrast, approximately 62% of cases had a difference greater than 2% in CTV V95 using iDIR (Figure 1). In other words, CT values should not be used for DIR of the CTV in pancreatic cancer, as they can cause a decrease in accuracy. Although we obtained a moderate negative dependence between the DSC and dose variation, it is important to note that a poor DSC does not always indicate a great change in dose parameter when the deformed volume is within the beam field. Thus, this dose deformation cannot represent the actual dose distribution and may cause overestimation of the dose to the surrounding organs. Therefore, we proposed a cut-off value to predict the dosimetry difference for iDIR, in which the dose changes would be greater than 2% (5%) in CTV V95 if the DSC was smaller than 0.91 (0.89). This finding might be helpful for physicians to estimate the dose prior to the warping dose, as well as to help them select an adequate DIR method. For example, for case 5, cDIR and hDIR are recommended because the ratio over 2% (5%) of dose difference was 77.8% (66.7%) when using iDIR; however, for case 8, iDIR seems a better choice since it has a similar DSC as the other DIR methods (the ratio over 2% and 5% of the dose difference was only 11.1% and 0 with iDIR, respectively), but the registration process is much simpler. However, DSC can only be obtained after DIR, and other parameters that can provide guidance before applying DIR seem to be more effective, which will be examined in the future. For the GTV, decreased DSCs seem to have limited effects on the dose variations compared to the CTV because of the relatively large dose coverage. However, considering the steeper lateral dose penumbra applied in CIRT, large variations may be observed when the GTV margin is close to the CTV margin, so it merits close attention. In addition, to quantify the dosimetric difference based on the entire course of treatment, we calculated the accumulated dose using three types of DIR methods. From our results, using iDIR, the accumulated dose in the CTV V95 was underestimated by approximately 5% compared to hDIR and cDIR. Reducing the percentage of cases with a dose difference greater than 5% appears to be crucial for the accumulated dose; therefore, ensuring a DSC of more than 0.89 is important. The results above can be used as a reference for dose accumulation for pancreatic cancer. However, considering that the impact of DIR methods on dose warping is patient and organ-specific, and the fact that only a few cases were analyzed in this study, a validation study is necessary in the future.
The present study has several limitations. First, we evaluated the DIR accuracy for the CTV and GTV only, and organs at risk, such as the duodenum and stomach, were not considered, which are also areas of concern in adaptive radiotherapy. However, DIR has limited capacity to achieve satisfactory accuracy for dose warping for organs with a high degree of deformation, especially for those that have a sharp dose gradient inside (12, 24); while biomechanical model-based DIR may improve the accuracy, but currently only for bladder, rectum and parotid (24, 25). Second, we used DSC to evaluate the accuracy of volume registration, which has limitations in ensuring spatial accuracy. Unfortunately, it appears difficult to define anatomical landmarks for CTV and GTV in pancreatic cancer to validate the anatomical accuracy. Another limitation is that our results are based on single software. There may be variations in accuracy when employing different software (7, 26). However, similar results would be obtained when using a method based on the pixel values to evaluate the dose for pancreatic cancer.
Considering the inherent uncertainty of DIR, DIR algorithms should be selected carefully. In this study, we evaluated and quantified the DIR accuracy for three DIR methods based on adaptive CIRT. Compared to iDIR, both hDIR and cDIR have a superior accuracy in target structure and dose warping, especially in CTV. The proposed cut-off value may provide a reference for the selection of DIR method. iDIR may present a similar performance when the DSCs are greater than 0.89. For the accumulated dose, all DIR methods presented a good performance for GTV; however, it may be underestimated by 5% for the CTV V95 using iDIR. Therefore, hDIR and cDIR (especially cDIR), are recommended for assessing the accumulated dose for pancreatic cancer with CIRT. Further study with a large sample and more evaluation metrics is necessary.
Acknowledgements
The Authors thank the radiology technologists, medical doctors, nurses, and medical physicists at GHMC for their valuable insights.
Footnotes
Authors’ Contributions
Conceptualization, YK.; methodology, YK, YL, and SO; software, YK, YL.; validation, YK. and YL.; formal analysis, YL.; investigation, YK, YL, MO, SS, SO, TM, and SK; resources, YK, YL; data curation, YK and YL; writing-original draft preparation, YL; writing-review and editing, YL, YK, SS, SO, TM, and MO; visualization, YK; supervision, YK, MO, and TO; project administration, YK, MO, and TO.
This article is freely accessible online.
Conflicts of Interest
The Authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
- Received December 11, 2020.
- Revision received December 28, 2020.
- Accepted December 29, 2020.
- Copyright © 2021 International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.