Abstract
Aim: To perform a quantitative, volumetric analysis of therapeutic effects of trans-arterial chemoembolization (TACE) in hepatocellular carcinoma (HCC) patients. Patients and Methods: Entire tumor volume and a subset of hypervascular tumor portions were analyzed pre- and post-TACE in magnetic resonance imaging datasets of 22 HCC patients using a semi-automated segmentation and evaluation tool from the Medical Imaging Interaction Toolkit. Results were compared to mRECIST measurements and inter-reader variability was assessed. Results: Mean total tumor volume increased statistical significantly after TACE (84.6 ml pre- vs. 97.1 ml post-TACE, p=0.03) while hypervascular tumor volume decreased from 9.1 ml pre- to 3.7 ml post-TACE (p=0.0001). Likewise, mRECIST diameters decreased significantly after therapy (44.2 vs. 15.4 mm). In the inter-reader assessment, overlap errors were 12.3-17.7% for entire and 36.3-64.2% for the enhancing tumor volume. Conclusion: Quantification of therapeutic changes after TACE therapy is feasible using a semi-automated segmentation and evaluation tool. Following TACE, hypervascular tumor volume decreases significantly.
Hepatocellular carcinoma (HCC) is the third most common cause of death from cancer worldwide and the sixth most common cancer overall (1). Less than 30% of HCC patients are eligible for definitive resection and/or liver transplantation (2). As such, minimally-invasive techniques such as percutaneous ablation and transarterial chemoembolization (TACE) have become an important option in the management of HCC (3). TACE is now considered the treatment of choice in patients with Barcelona Clinic Liver Cancer (BCLC) intermediate-stage HCC (4) and has shown promising results in the treatment of advanced HCC (5).
With the development of minimal-invasive techniques for the treatment of HCC, monitoring of therapeutic response has become increasingly important. The Response Evaluation Criteria of Solid Tumors (RECIST) criteria for tumor response evaluation was established in 2000 and proposed response assessment through measurement of the largest tumor diameter (6). However, a 2009 study by Forner et al. showed that the RECIST criteria were ineffective for monitoring response to loco-regional HCC therapy, as RECIST does not account for the extent of tumor necrosis (7). Modified RECIST (mRECIST) criteria, specific to HCC, were proposed in 2010 focusing on assessment of the extent of arterial phase contrast enhancement (8). The mRECIST and European Association for the Study of the Liver (EASL) criteria have subsequently been shown to be more effective in predicting survival than assessments incorporating only measurements of tumor size (9).
Although the mRECIST criteria represent an improvement over prior standards, mRECIST evaluations still rely upon a one-dimensional measurement performed on three-dimensional MR or CT datasets. It is reasonable to assume, and has been shown in other applications, that volumetric measurements are more accurate than one-dimensional size assessments in determination of therapeutic response (10-12). In this context, 3D segmentation is the most accurate approach to obtain volumetric data. With respect to the above discussion, this consists of distinguishing a liver mass from the hepatic parenchyma on a CT or MR image or an arterially-enhancing portion of tumor from surrounding areas of non-enhancement. While segmentation can be readily performed by radiologists, technologists, or other observers, manual segmentation of a mass or organ on numerous axial 2D slices, as required for volumetric measurements, is cumbersome and time-consuming (13). Semi-automated techniques have shown to be less time-intensive and generally more reliable than manual or fully-automated segmentations (13-16). The Medical Imaging Interaction Toolkit (MITK) is free, open-source imaging software (17, 18). A voxel-based segmentation tool with support of manual delineation such as region growing and slice interpolation is provided as part of this software package; the implementation of which is described in detail elsewhere in the literature (17).
The aim of this study was to evaluate therapy response after TACE therapy in HCC patients using the segmentation MITK workflow for the analysis of MR examinations. The results were compared to mRECIST measurements.
Patients and Methods
The requirement of informed consent for this retrospective analysis was waived by the local institutional research board (Medizinische Ethikkommission II, 2008-338N-MA). Twenty-two consecutive patients (17 men and 5 women) with known hepatocellular carcinoma who were set to undergo TACE therapy and received a pre- and post-interventional magnetic resonance imaging (MRI) examination in-house were included from November 28th 2005 to March 3rd 2011. Mean patient age was 68 years (range=51 to 80 years). Mean time of MRI examination prior to TACE therapy was 2.5 weeks (range=0-13 weeks). Mean follow-up interval after TACE therapy was 5.5 weeks (range=2-12 weeks).
MR Imaging. Prior and after TACE therapy, all patients underwent an MRI examination with a standardized protocol on a single 1.5T MR-system (MAGNETOM AVANTO 32×76, Siemens Healthcare Sector, Erlangen, Germany). The protocol included coronal and axial T2 weighted half Fourier acquisition single-shot turbo spin-echo sequences as well as diffusion weighted imaging. Pre- and post-contrast T1-weighted sequences following gadolinium administration (0.1 mmol/kg body weight) were obtained. A 6-channel body-array coil was utilized in combination with 6 elements of the spine matrix coil for signal reception. 3D Volume-Interpolated Breathhold Examination (VIBE) sequences (TR/TE 5.5/1.93 ms, FA 30°, acquisition time 21.1 sec, matrix 384×188, FoV 370×265, slice thickness 3 mm, voxel size 1.0×1.4×3 mm3, parallel imaging factor 2) were used to acquire pre-contrast and multi-phase dynamic images. For multi-phase dynamic imaging, VIBE sequences were obtained at 25 sec (arterial phase), 59 sec (venous phase), and 80 sec (portal-venous phase) after injection.
Transarterial chemoembolization (TACE). All TACE procedures were performed by one interventional radiologist (SJD) with 15 years of experience in trans-arterial therapy of liver lesions. The details of the technique utilized for these procedures have been previously described (19). In all cases, TACE was performed with drug-eluting beads (DC Bead, Biocompatibles, United Kingdom).
Image analysis. For each of the 22 patients, Digital Imaging and Communications in Medicine (DICOM) files from the pre- and post-treatment MR examinations were exported to an offline workstation where they were analyzed with the Medical Imaging Interaction Toolkit (MITK; German Cancer Research Center Division of Medical and Biological Informatics; www.mitk.org). The primary index lesion was selected in this assessment. The primary index lesion was defined as the lesion first targeted during the TACE therapy, often corresponding to the largest lesion (20). Lesion assessment was performed by a MD candidate who was directly supervised by an experienced user of this software package. The semi-automated segmentation tool was used to outline the tumor margins on each available slice of the arterial phase T1 weighted images as described previously (17). In an analogous manner, the enhancing portions of the dominant tumor on all available slices of the arterial phase images were selected (Figure 1). Care was taken to exclude hemorrhagic areas without arterial enhancement by comparing native and arterial phase T1 images visually. Selection of hyperintense areas was supported by an interactive threshold tool. Additionally, the primary index lesions were measured pre- and post-therapy by the experienced interventional radiologist (SJD) according to mRECIST criteria.
Inter-reader assessment. For the inter-reader variability assessment, the pre- and post-treatment tumor volumes and volumes of maximal enhancement were additionally assessed by 4 independent radiologists using the MITK software in a single, randomly chosen patient. The radiologists had 4, 6, 6, and 10 years of experience in abdominal imaging. The radiologists had no previous experience in using the MITK software and were instructed by the same MD candidate who performed the lesion assessment for all patients. Overlap error was calculated between each pair of radiologists as well as the MD candidate on a voxel-by-voxel basis. Overlap error is derived from the Tanimoto coefficient and quantifies overlap between volumes A and B on a per voxel basis as CT=|A ∩ B|/|A ∪ B| where (1-CT) corresponds to the overlap error (21). A volumetric overlap error of 0 represents a perfect segmentation while a value of 100 indicates no overlap between the segmentation and reference.
Statistical analysis. Data were collected in Microsoft Excel and statistical analyses were performed with the statistic tool SAS 9.2, 2012 (SAS Institute Inc., Cary, NC, USA). Means and standard deviations and medians of liver tumor volumes and volumes of maximal tumor enhancement were calculated pre- and post-treatment for each reader. Ratios between post- and pre-treatment values were calculated. Means are given±standard deviations (S.D.). Since most of the distributions of the small samples were skewed, Wilcoxon tests were utilized to compare these values. Level of significance was set to α=0.05.
Results
TACE procedure was technically successful in all 22 patients and without major complications. Segmentation analyses were performed successfully by all readers using the acquired datasets. Total time for the segmentation of both total tumor and enhancing tumor volumes pre- and post-TACE ranged from 8-14 min per patient.
Volumetric measurements and inter-reader assessment. The mean pre-treatment tumor volume (±StDev) was 84.6±128.3 mL. The mean post-treatment tumor volume was 97.1±175.2 ml (median 27.4 ml, skewed distribution). This was statistically significantly higher than the pre-contrast volume (p=0.03) (Figure 2a).The mean pre-treatment volume of arterial-phase enhancing tumor was 9.1±8.8 ml. The mean post-treatment volume of arterial-phase enhancing tumor was 3.7±11.7 ml (median 1.0 ml), which was significantly lower than the mean pre-contrast volume (p=0.0001, Figure 2b). The mean ratio of the total tumor volume before/after TACE was 1.03±1.38. The mean ratio of the hypervascular volume of the tumor before/after TACE was 0.38±0.83.
mRECIST. As determined by the experienced interventional radiologist, the mean pre-treatment diameter of arterial enhancing viable tumor was 44.2±23.6 mm, median 38 mm. The mean post-treatment viable tumor diameter was 15.4±24.6 mm, median 2 mm (skewed distribution). In the statistical analysis, a significant difference between the pre- and post-TACE diameter of the viable tumor diameter was found (p<0.0001, Figure 2c). The mean mRECIST diameter ratio before/after TACE was 0.31±0.38. In the comparison of hypervascular tumor volume and mRECIST diameter ratios, no statistically significant difference was found.
Inter-reader assessment. In the inter-reader assessment, overlap errors between the 4 radiologists ranged from 12.3-17.7% for the entire tumor volume and from 36.3-54.0% for the enhancing tumor volume pre TACE. Post-TACE, overlap errors ranged between 12.6-13.8% for the entire tumor volume and 47.8-64.2% for the enhancing tumor volume.
Discussion
Loco-regional therapies such as percutaneous ablation and TACE have emerged as important treatment options in the management of HCC (4). As the goal with these techniques is tumor necrosis, that may result in no immediate size reduction of the neoplasm, new methods of identifying therapeutic response on medical images have emerged (8). The present work extends such techniques into volumetric measurements implementing and demonstrating the feasibility of using a semi-automated segmentation and analysis tool to quantify changes in tumor and viable, enhancing tumor volume on MRI following TACE therapy. By applying mRECIST criteria, statistically significant changes in tumor diameter after therapy were found. In the volumetric assessment, only hypervascular tumor proportions showed a significant decrease after TACE therapy, while overall tumor volume even increased. Similar discrepancies regarding changes in total tumor versus viable, enhancing tumor diameters were found in previous studies that used two-dimensional measurement techniques (7). This finally led to the implementation of mRECIST criteria for the response assessment of TACE therapy. mRECIST or EASL criteria are currently the methods of choice for evaluating therapeutic response in HCC patients (9, 22). However, they rely on evaluation of residual arterial tumor enhancement utilizing only 1 (mRECIST) or 2 dimensions (EASL). Several prior studies have examined volumetric versus single-greatest and two-dimensional measurements in the setting of lung cancer (10, 11, 23) as well as liver (24, 25) and lymph node metastases (26, 27), finding higher accuracy for volumetric measurements. It would stand to reason that volumetric measurements would also be preferable for the assessment of HCC.
One of the primary limitations of volumetric image assessments is the time-requirement necessary to perform manual segmentations. For example, one study found manual liver segmentation to take an average of 25 min per case (13). In that particular study, Hermoye et al. examined manual and semi-automated hepatic volumetric MR measurements in living transplant patients with correlation to actual intraoperative graft weight. The semi-automated measurements not only reduced the time required for segmentation to 5 min per case but also improved both accuracy relative to true graft volume and measurement reproducibility. Similar decreases in interpretation time have been shown with semi-automated versus manual techniques in other applications (27). Also, the diminished inter-observer variability with semi-automated versus manual techniques has been previously demonstrated in metastatic hepatic lesions (28). Kreil et al., likewise, examined semi-automated versus manual volumetric measurements in post-radiofrequency ablation (RFA) metastatic hepatic lesions, finding semi-automated techniques to correlate relatively-well with manual volumetric measurements for pre-therapy volume calculations and for determination of ablation cavity size following treatment (29). While manual and semi-automated segmentations were not directly compared in our study, the availability of free, open-source semi-automated segmentation tools such as those discussed herein makes the eventual incorporation of volumetric measurements into the daily clinical routine more likely.
In comparison to prior studies, our data suggest a relatively high reproducibility of volumetric hepatic tumor measurements. Calculations of overlap error are one of the most common ways to measure the reproducibility of segmentations (21). For measurements of total tumor volume, the 4 radiologists exhibited a relatively low overlap error of 12-18%, values on the order of those seen with segmentation of the entire liver by CT (21) and lower than those previously seen in a segmentation of hepatic metastatic lesions on CT (30). Measurements of enhancing tumor volume were less reproducible, with overlap errors reaching over 50% pre- and over 60% post-TACE. This higher variability might partly be explainable by differences in software and user differentiation of hemorrhagic and truly arterial hyper-enhancing areas. Additionally, truly arterial enhancing volumes were 10-fold lower than total tumor volumes, leading to relatively high overlap errors despite low absolute differences in volumes. Further improvements in segmentation software will presumably lead to more uniform delineation, especially when further reducing the user's influence in the selection of enhancing tumor portions.
The limitations of the present study include the relatively small sample size. Thus, results herein should be examined in a larger patient cohort, ideally in a prospective study design. This study attempts to translate mRECIST criteria into volumetric measurements; however, mRECIST criteria require identification of several target lesions and summation of the unidimensional diameters of enhancing tumor (8). Likewise, EASL criteria require summation of the two-dimensional diameter products of several target lesions (22). To simplify the computations in the present work, we rely upon measurements of only a primary index lesion. This target-lesion approach has been shown to correlate with disease progression and survival in a large retrospective study (20). Additionally, a recent study found no improvement in survival prediction when using multiple lesion assessment compared to evaluation of a single dominant lesion alone (31). In general, as in the present study, a minority of hepatocellular carcinomas are multifocal. Thus, for a large percentage of patients, multi-target lesion computations would have not been necessary or possible. The interaction time to perform semi-automated volumetric measurements is also a relevant parameter, and is undoubtedly longer than single or two-dimensional measurements. However, this variable was not directly assessed in our work.
Ultimately if volumetric measurements can provide highly relevant clinical data, interaction time with the software becomes a less relevant consideration. Semi-automated segmentation algorithms should also continue to improve, reducing the number of manual corrections and thus interaction time needed.
In summary, this study demonstrates the feasibility of a semi-automated segmentation and analysis tool to quantify therapeutic effects of TACE utilizing volumetric tumor assessments and measurements of enhancing tumor volume. Notably, while total tumor volume slightly increased after TACE therapy, the volume of arterial enhancing tumor areas and mRECIST diameter decreased significantly.
Footnotes
↵* These Authors contributed equally to this study.
- Received June 10, 2016.
- Revision received July 7, 2016.
- Accepted July 11, 2016.
- Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved