Abstract
Aim: To determine the reproducibility of histogram and texture parameters derived from intravoxel incoherent motion (IVIM) diffusion-weighted magnetic resonance imaging (MRI) of FN13762 rat breast carcinomas. Materials and Methods: IVIM diffusion-weighted MRI was performed twice, nine days after tumor implantation in 11 rats. At each session, histogram and texture parameters of entire tumors were extracted from apparent diffusion coefficient (ADC), true-diffusion coefficient (Dt), pseudo-diffusion coefficient (Dp), and perfusion fraction (Pf) maps. Intraobserver and interscan measurement reproducibilities were evaluated using intraclass correlation coefficients (ICC). Results: Mean, entropy, 5th, 10th, 25th percentiles from ADC and Dt maps revealed good intra-observer and interscan agreements [lower limits of 95% confidence interval (CI) for ICC≥0.75]. However, all parameters from Dp and Pf maps gave relatively poor intra-observer and interscan agreements (lower limits of 95% CI for ICC<0.75). Conclusion: Histogram and texture parameters derived from ADC and Dt maps were more reproducible than those from Dp and Pf maps.
- Reproducibility
- histogram
- texture
- intravoxel incoherent motion
- diffusion-weighted MRI
- FN13762 rat breast carcinoma
Quantitative analysis of apparent diffusion coefficient (ADC) maps provided by diffusion-weighted magnetic resonance imaging (MRI) has been shown to be a promising tool in the field of oncology in differentiating malignant from benign tumors, monitoring treatment response, detecting residual tumor or recurrence, and in predicting treatment outcome (1-3). ADC measurements can provide valuable information supplementary to conventional anatomic imaging as it reflects the mobility of water molecules within tissue and can sensitively detect increased cellularity, restriction of cellular membrane permeability, as well as disruption of cellular membrane depolarization, which are all common pathological changes observed in malignancies (4, 5). Recently, however, it has been shown that ADC values are affected by capillary perfusion, and thus, intravoxel incoherent motion (IVIM) was proposed to separate ‘true-diffusion’ from ‘pseudo-diffusion’ caused by microcirculatory perfusion of blood within capillaries (6). Thereafter, the clinical potential of quantitative analysis of parametric maps derived from IVIM diffusion-weighted MRI has been demonstrated in patients with several diseases including liver cirrhosis, focal hepatic and pancreatic lesions (7-10).
To date, quantitative analysis of ADC and parametric maps derived from IVIM diffusion-weighted MRI has been limited to the measurement of the mean pixel value within a region of interest (ROI) on the maps. Recently, however, more sophisticated analyses, such as histogram or texture analyses, have been applied to these maps and have shown comparable or superior performance as imaging biomarkers in the evaluation of the fibrosis stage and inflammatory activity of chronic hepatitis, differentiation of glioma grade, and in monitoring treatment response in glioblastoma patients (11-13). Further studies validating variable histogram or texture parameters may uncover additional biomarkers that can provide even more information through non-invasive assessment of target lesions, however, prior to its clinical application, reproducibility of the measurements of eligible imaging biomarkers should first be examined (14).
To our knowledge, however, there have been few studies on the reproducibility of measurements of histogram and texture parameters derived from ADC maps (11, 15), and nor of those parameters derived from IVIM MRI. Thus, the purpose of our study was to determine the reproducibility of histogram and texture parameters derived from IVIM diffusion-weighted MRI of FN13762 rat breast carcinomas.
Materials and Methods
This study was approved by the Animal Care and Use Committee of our Hospital [#13-0322-C0A0 (1)].
Tumor cell line, experimental animals and tumor implantation. The FN13762 murine mammary carcinoma cell line (American Type Culture Collection, Manassas, VA, USA) was used as the experimental animal tumor model in our study. FN13762 carcinoma has been shown to be a spontaneously metastatic syngeneic rat tumor, which has been extensively characterized for both its in vitro and in vivo growth (16).
Eleven female Fischer 344 rats (weight range, 250-300 g; Charles River, Sulzbach, Germany) were anesthetized for tumor cell implantation by injecting a solution of zolazepam (5 mg/kg, Zoletil®; Virbac, Carros, France) and xylazine (10 mg/kg, Rompun®; Bayer-Schering Pharma, Berlin, Germany) into their hindlimbs. After anesthesia, FN13762 cells were inoculated into the muscular layer of the right upper thigh of the rats with a 24-guage needle by injecting 4×105 cells suspended in 0.1 ml of Dulbecco's modified Eagle's medium (Cambrex Biosciences, Verviers, France). Cellular viability was tested prior to tumor implantation using trypan blue, which yielded a result greater than 90%. All procedures were performed using the aseptic technique.
MRI acquisition. After a 9-day feeding period, MRI scans were performed with a 3-T MRI system with a 6-channel rat body coil (Stark Contrast, Erlangen, Germany). Firstly, we acquired axial T2-weighted turbo spin-echo images (TR/TE, 8930/101 ms; bandwidth, 196 Hz/pixel; flip angle, 120°; field of view, 100×81 mm; matrix, 192×115; slice thickness, 0.8 mm; number of signals, 8) of the lower-half of the rat's body under anesthesia using zolazepam and xylazine. Thereafter, for IVIM diffusion-weighted MRI, one radiologist (YSS with three years of experience in body MRI) reviewed the T2-weighted images and determined the scan range to cover the entire volume of the tumor. IVIM diffusion-weighted MRIs were performed using single-shot echo-planar prototype imaging pulse sequence (Siemens Healthcare, Erlangen, Germany) with diffusion gradients applied in three orthogonal directions, data acquisition in free breathing, with the following parameters: TR/TE, 4400/73 ms; bandwidth, 798 Hz/pixel; field of view, 100×100 mm; matrix, 128×128; section thickness, 3 mm; number of signals, 8; and multiple b-values, 0, 25, 50, 75, 100, 200, 400, 800 and 1000 s/mm2. To shorten the echo train length, the parallel imaging technique (generalized autocalibrating partially parallel acquisitions; GRAPPA) with a two-fold acceleration factor was used. To assess the interscan reproducibility of the parameters derived from IVIM diffusion-weighted MRI, all rats underwent IVIM diffusion-weighted MRI twice, with the examinations set 5 minutes apart.
IVIM parametric map acquisition. Diffusion-weighted MRI data were post-processed using prototype software (Siemens Healthcare) to extract ADC and IVIM parameters composed of the true-diffusion coefficient (Dt), representing pure molecular diffusibility; pseudo-diffusion coefficient (Dp), representing perfusion-related incoherent microcirculation; and perfusion fraction (Pf), which has been linked to blood volume (17). ADC values were automatically calculated using all b values with a monoexponential fit using the following equation: SI/SI0=exp(−b×ADC), where SI0 is the mean signal intensity of the ROI for the b value of 0 sec/mm2 and SI is the signal intensity for the higher b values. On the basis of the IVIM concept, Dt, Dp and Pf values were calculated using a nonlinear biexponential fit according to the following equation (18): SI/SI0=(1 − Pf) · exp(−b · Dt) + Pf · exp(−b · Dp). Four parametric maps of ADC, Dt, Dp and Pf were created on a pixel-by-pixel basis for each animal (Figure 1).
Image analysis. One radiologist (SML with eight years of experience in body MRI) manually drew the ROI to encompass as much of the whole tumor as possible on all ADC maps from the first IVIM MRI using in-house software (Figure 2). Care was taken to avoid non-tumor tissues on the ADC maps. To evaluate intra-observer measurement reproducibility, the observer drew ROIs of whole tumors again on the ADC maps obtained from the first IVIM MRI four weeks later. As for interscan measurement reproducibility, the observer separately segmented the same tumors on ADC maps obtained from the first IVIM MRIs and those from the second IVIM MRIs. These segmentations were conducted separately with at least a 4-week interval. ROIs were then copied from the ADC map and placed on the other corresponding parametric maps for Dt, Dp and Pf. After the ROIs were placed, ROI volume, histogram and texture parameters were calculated and automatically extracted. Histogram parameters included: (a) mean, (b) standard deviation (SD), (c) skewness, (d) kurtosis and (e) percentile values (5th, 10th, 25th, 50th, 75th and 90th). Texture parameters included: (a) entropy, (b) homogeneity, (c) gray-level co-occurrence matrix (GLCM) inverse difference moment (IDM) and (d) GLCM contrast.
Calculation of histogram and texture parameters. Skewness represents the distribution pattern of the degree of pixel values on histograms. Negative and positive skewness indicates that the pixel values are more spread to the left and right of the mean, respectively (19). Kurtosis represents the position of the peak height, which indicates the pixel value of the maximum frequency on histograms. A normal distribution has a kurtosis of 0, a leptokurtic one is indicated by a sharper peak and has a kurtosis of more than 0, and a platykurtic distribution is indicated by a flatter peak with a kurtosis of less than 0 (19). Parameters are defined mathematically below, where N is the number of data points in a region-of-interest, is the mean and s is the standard deviation.

Heterogeneity within the tumor was assessed with entropy and homogeneity. Entropy is a parameter reflecting the unpredictability of any information content of an image and has been widely used in information theory (20). Homogeneity is the measure that increases with less contrast in the window, and was calculated using a two-dimensional image histogram for the purposes of this study (21). High entropy and low homogeneity values indicate increased heterogeneity of the tumor. Parameters are defined mathematically below, where G is the number of gray levels in a ROI and P(I) is the probability of the occurrence of a gray level I on a histogram.

GLCM is a matrix where element P(I, J) is relative frequency with which a combination of two pixels with intensity I and J occur in an image, separated by a given distance. A number of texture features may be extracted from the GLCM and two texture features calculated from the GLCM were used in this study. A GLCM inverse difference moment (IDM) is influenced by the homogeneity of the image and GLCM contrast reflects the local intensity variation (22).

Statistical analysis. We compared all parameters between the first and second measurements on the first IVIM MRI for the intra-observer reproducibility test, and between the first measurements on the first IVIM MRI and the second IVIM MRI for the interscan reproducibility test.
The paired t-test was used to assess the presence of any systemic bias in intra-observer and interscan comparisons for all parameters. Results with p-values of less than 0.05 were considered to indicate a statistically significant difference. Reproducibilities of all parameters were then evaluated using intraclass correlation coefficients (ICC). The lower limit of the 95% confidence interval (CI) for ICC ≥0.75 was considered to represent good agreement (23). We designated each parameter as a reproducible parameter when all the following conditions were satisfied in both intra-observer and interscan comparisons: (a) no systemic bias according to the paired t-test and (b) good agreement according to the ICC. To estimate the magnitude of measurement variability in all parameters, 95% limits of agreement in intraobserver and interscan comparisons were obtained according to the Bland-Altman method. Results of the Bland-Altman method were expressed as absolute values for skewness, kurtosis, entropy, homogeneity, GLCM IDM and GLCM contrast, and were transformed as percentages for mean, SD and percentile values as variabilities of those parameters were not independent of the magnitude of the measurement.
All statistical analyses were performed using Medcalc (version 12.7.2.0, MedCalc Software, Mariakerke, Belgium).
Results
The mean diameter of all 11 tumors was 14.36±3.00 mm (range=10.63-19.72 mm). As for volume measurements, there were no significant mean differences in terms of intra-observer variability nor interscan variability (p>0.05). Their ICCs were 0.99 (95% CI=0.97-1.00) and 0.99 (95% CI=0.95-1.00) for intra-observer and interscan comparisons, respectively. The 95% limits of agreement in intra-observer and interscan comparisons ranged from −18.2% to 18.0% (mean=−0.1%) and −27.5% to 25.7% (mean, −0.9%) of the ROI volumes, respectively.
ADC map. There were no significant differences in the mean values, SD, skewness, kurtosis, all percentile values (5th, 10th, 25th, 50th, 75th and 90th), entropy, homogeneity, GLCM IDM and GLCM contrast of ADC values from ADC maps in terms of both intra-observer and interscan comparisons (p>0.05). Table I summarizes the results of ICCs and Bland-Altman methods in intra-observer and interscan comparisons for all parameters from the ADC maps. For intra-observer comparison, mean, entropy, GLCM contrast and all percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.85 to 1.00. For interscan comparison, mean, entropy, 5th, 10th, 25th and 90th percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.77 to 0.94.
Dt map. There was a significance difference in kurtosis in terms of interscan comparison (p=0.034). Table II summarizes the results of ICCs and Bland-Altman methods in intraobserver and interscan comparisons for all parameters from the Dt maps. For intra-observer comparison, mean, entropy, GLCM contrast and all percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.87 to 1.00. For interscan comparison, mean, entropy, GLCM contrast, 5th, 10th, 25th and 50th percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.77 to 0.96.
Dp map. There were no significant differences in the mean values, SD, skewness, kurtosis, all percentile values (5th, 10th, 25th, 50th, 75th and 90th), entropy, homogeneity, GLCM IDM and GLCM contrast of Dp values from Dp maps in terms of both intra-observer and interscan comparisons (p>0.05). Table III summarizes the results of ICCs and Bland-Altman methods in intra-observer and interscan comparisons for all parameters from the Dp maps. For intra-observer comparison, mean, kurtosis, entropy, homogeneity, GLCM IDM, 25th, 50th and 75th percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.77 to 0.99. For interscan comparison, however, no parameter showed good agreement.
An FN13762 rat breast cancer in a rat model. Axial diffusion-weighted image of the tumor in the right hindlimb (b=200 s/mm2) (A) and corresponding apparent diffusion coefficient map (B). Intravoxel incoherent motion diffusion-weighted magnetic resonance imaging parametric maps of true-diffusion coefficient, pseudo-diffusion coefficient and perfusion fraction are demonstrated in C-E, respectively.
Pf map. There was a significant difference in the Pf values in terms of intra-observer comparison (p=0.049). Table IV summarizes the results of ICCs and Bland-Altman methods in intraobserver and interscan comparisons for all parameters from Pf maps. For intra-observer comparison, mean, 10th, 25th, 50th, 75th and 90th percentile values showed good agreement, with lower limits of 95% CI for ICC ranging from 0.87 to 1.00. For interscan comparison, however, no parameter showed good agreement.
Discussion
In our study, we evaluated the reproducibility of various parameters of ADC maps and parametric maps derived from IVIM diffusion-weighted MRI in terms of both intraobserver and interscan variabilities. Diffusion-weighted MRI was repeated in a single imaging session in a very short time interval, so that changes in the tumor microenvironment or in the physiological status of the animals would have been negligible, reflecting the measurement error that can result from image noise, image distortion and motion artifacts, as well as intra-observer variability in drawing an ROI. Indeed, all parameters that failed to attain good intra-observer agreement in our study also failed to attain good interscan agreements. Acceptable interscan reproducibility is essential for any imaging biomarker, as at least two sets of images scanned at different time points are required to evaluate changes in the imaging biomarker. Our study showed that several histogram and texture parameters can be reproducibly-measured from ADC and Dt maps of FN13762 rat breast carcinomas. Those parameters were the mean, entropy, 5th, 10th, 25th and 90th percentile values from ADC maps, and mean, entropy, GLCM contrast, 5th, 10th, 25th and 50th percentile values from Dt maps.
In-house software program for histogram and texture parameter measurements. This software program provides two screens of loaded images with or without magnification. A: Loaded apparent diffusion coefficient map without magnification. The marked area was selected for magnification. B: 1.3-Fold magnified image of the area shown in A. The region of interest (shown as polymorphic lines; the square on the line indicates the starting point for drawing the region-of-interest) was performed on this magnified image. The number on the image indicates the current image number of the ADC map stack.
The mean ADC and Dt values have been widely used as representative values of a target lesion in oncological imaging (24), and their good reproducibility has been reported in both animal and human studies (11, 17, 25). Another fact that may have positively influenced the reproducibility of the mean value in our study was the method of ROI drawing employed, namely that we calculated the mean value by averaging all pixel values from whole-tumor volumes. Indeed, Lambregts et al. reported that ADC measurements obtained from whole-tumor volumes are more reproducible than those obtained from single-slice or small sample ROIs (26). A previous clinical trial also reported the good reproducibility of ADC and perfusion-insensitive ADC values from ROIs encasing whole-tumor volumes (27).
Entropy describes the variation in a volume histogram of pixel values and has been shown to be a promising indicator of tumor heterogeneity (28). In addition, SD, homogeneity, GLCM IDM and GLCM contrast values also measure a particular characteristic of the distribution between pixels in the ROI (29), and can thus reflect tumor heterogeneity. However, as those variables are inter-dependent, the selection of fewer parameters, with the advantage of measurement reproducibility, may be a more practical approach in discovering new imaging biomarkers. According to our results, the entropy from both ADC and Dt maps, and GLCM contrast from the Dt map showed better reproducibility than other parameters. Fujimoto et al. also observed good interobserver reproducibility of entropy from ADC maps of the liver parenchyma in patients with chronic hepatitis C and reported that the coefficient of repeatability (i.e. 1.96-times the SD of the difference between the two measurements) was 0.072 (11). In our entropy measurement from FN13762 rat breast carcinomas, coefficients of repeatability were 0.096 and 0.118 for intraobserver and interscan comparisons, respectively. Jensen et al. also showed good interscan reproducibility of ADC entropy, for which the ICC was 0.84 (CI was not presented there) in patients with breast cancer (25).
Lower percentile values (i.e., 5th, 10th and 25th percentile values) of ADC and Dt histograms were chosen to account for tumor heterogeneity in our study and are regarded as a promising measurement method, which may sensitively detect focal regions of higher cellularity (12, 13, 30, 31). Our results showed that those parameters can be consistently measured in both ADC and Dt histograms of small animal tumors. The 50th percentile value, as well as the 10th and 25th percentile values, of ADC histograms was also suggested as a useful parameter in the prediction of the low-grade glioma subtype (32), but showed relatively worse reproducibility than the lower percentile values (i.e. 5th, 10th and 25th percentile values) in our study. On the other hand, the 50th percentile value of Dt histograms showed slightly better reproducibility than that of ADC histograms. In addition, the 90th percentile value of ADC and Dt histograms has been shown to reflect low cellularity regions within the tumor such as necrosis or cystic change (24, 33), and was slightly more reproducible in ADC histograms than Dt histograms in our study.
Results of intraclass correlation coefficients (ICC) and the Bland–Altman method for all parameters from apparent diffusion coefficient maps.
Results of intraclass correlation coefficients (ICC) and the Bland–Altman method for all parameters from true-diffusion coefficient maps.
Results of intraclass correlation coefficients (ICC) and the Bland–Altman method for all parameters from pseudo-diffusion coefficient maps.
Results of intraclass correlation coefficients (ICC) and the Bland–Altman method for all parameters from perfusion fraction maps
We also found that kurtosis and skewness measurements from ADC histograms exhibited relatively worse reproducibility in FN13762 rat breast carcinomas. Those parameters from ADC maps had previously been suggested as potential imaging biomarkers for prediction of chemotherapy response and progression-free survival in various malignancies (25, 31). However, according to Jensen et al. (25) who investigated the interscan reliability of ADC mean, skewness, and entropy in patients with breast cancer, the interscan ICC of ADC skewness (0.75; 95% CI was not presented there) was shown to be relatively worse than that of ADC mean and entropy (0.84), which is compatible with our results.
All parameters in Dp and Pf maps in our study had relatively worse reproducibility than ADC or Dt maps. Some parameters showed good intraobserver agreement, whereas all parameters failed to attain good interscan agreement on ICC analyses. Both perfusion-sensitive parameters were calculated from biexponential fitting of IVIM diffusion-weighted MRI data at multiple lower b values. However, signal measurements at low b-values, especially those ≤100 s/mm2, are known to be more prone to measurement errors and highly sensitive to signal-to-noise variations, thus hindering consistent model fitting (33). Andreou et al. also reported the poor measurement reproducibility of mean Dp and Pf in liver metastases, with a very wide 95% CI of differences (−89% to 2,120% and −75.3% to 241% for Dp and Pf, respectively) (34), and Koh et al. demonstrated the poor measurement reproducibility of ADC values calculated using low b values (27). They also found a large SD in the estimation of Pf (35). In this context, they described that voxel-by-voxel analysis may be inappropriate for perfusion-sensitive parameters (33), which is also supported by our results. In spite of the poor reproducibility, however, several studies reported significant differences in perfusion-sensitive parameters according to the target lesions (35-37). In colorectal liver metastases, the estimated Pf was significantly lower than that in normal liver parenchyma (35), in keeping with the hypovascular nature of these lesions. Shinmoto et al. reported that both fast (pseudo-diffusion) and slow (true-diffusion) ADCs were lower in prostate cancer than normal peripheral or transitional zones (36), and Lemke et al. demonstrated that the Pf of pancreatic carcinoma was significantly lower than normal pancreatic parenchyma (37). However, at present, there is no standard vendor software for biexponential model fitting of IVIM diffusion-weighted MRI data and there is no consensus or an established IVIM diffusion-weighted MRI protocol such as the number and choice of b-values or the number of signal averages. Thus, as previous studies (7-10, 35-37) on IVIM have used variable software and imaging protocols with a variable number of b-values, our results do not indicate inferiority of perfusion-sensitive parameters from any kind of IVIM technique and do not contradict the previously reported usefulness of those parameters (35-37). Further studies optimizing and individualizing the IVIM technique may improve the aforementioned errors in the measurement of perfusion-sensitive parameters. Furthermore, although the error of measurement of a certain parameter may be substantial, the parameter may still be used as an imaging biomarker when the change of scale of the parameter under certain clinical situations exceeds the measurement error. Therefore, histogram and texture parameters from perfusion-sensitive parametric maps still have potential clinical application, despite their relatively poor reproducibility.
Our study has several limitations. Firstly, there were a relatively small number of subjects in the evaluation of measurement reproducibility of the parameters from ADC and parametric maps derived from IVIM MRI. Second by, all MRI scans for tumors were performed at the same time point of tumor growth, which might have caused relatively uniform pathological states of the tumors. This may also have led to overestimation of intraobserver and interscan reproducibilities of the parameters. Finally, all parameters were measured by one observer, and hence measurements may be significantly influenced by the observer.
In conclusion, histogram and texture parameters derived from ADC and Dt maps were more reproducible than those from Dp and Pf maps. In addition, mean, entropy, 5th, 10th, 25th percentile values from ADC and Dt maps were reproducible.
Acknowledgements
This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (grant number: 2011-0022379).
- Received February 23, 2014.
- Revision received March 11, 2014.
- Accepted March 12, 2014.
- Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved