Abstract
Background/Aim: Proliferation biomarkers such as MIB-1 are strong predictors of clinical outcome and response to therapy in patients with non-small-cell lung cancer, but they require histological examination. In this work, we present a classification model to predict MIB-1 expression based on clinical parameters from positron emission tomography. Patients and Methods: We retrospectively evaluated 78 patients with histology-proven non-small-cell lung cancer (NSCLC) who underwent 18F-FDG-PET/CT for clinical examination. We stratified the population into a low and high proliferation group using MIB-1=25% as cut-off value. We built a predictive model based on binary classification trees to estimate the group label from the maximum standardized uptake value (SUVmax) and lesion diameter. Results: The proposed model showed ability to predict the correct proliferation group with overall accuracy >82% (78% and 86% for the low- and high-proliferation group, respectively). Conclusion: Our results indicate that radiotracer activity evaluated via SUVmax and lesion diameter are correlated with tumour proliferation index MIB-1.
Lung cancer is currently estimated as the first and third most common form of cancer in men and women worldwide, respectively (1). In the United States, lung & bronchus cancer is by far the leading cause of cancer-related deaths in both genders, accounting for ~24% of all cancer-related casualties [2019 estimates (2)]. Histologically, NSCLC is the most common variation, with an estimated relative incidence of ~83% (3). Tobacco smoking is the major risk factor for lung cancer, but other elements such as genetics, chronic inflammation, occupational exposure, air pollution and poor diet have also been investigated as potential determinants (4). Despite the steady (but slow) increase experienced during the last 40 years, the average five-year survival rate for NSCLC is still rather appalling [~25% (3)]. Timely detection and personalised care are currently considered as key factors for improving the clinical outcome of patients affected by this disease (5).
Fluorine 18 (18F) fluorodeoxyglucose positron emission tomography/computed tomography (PET/CT in the remainder) plays a major role in a number of tasks related to the management of patients with NSCLC such as staging, re-staging, detection of recurrence and evaluation of response to therapy (6-8). PET/CT has also proved useful in screening programmes for detecting lung cancer at an early stage (9). Standardized uptake value (SUV) is a semi-quantitative parameter representing the ratio between the activity in the tissue and the whole injected dose relative to body weight. The maximum value of SUV (SUVmax henceforth) is possibly the most commonly used PET-derived parameter in oncological clinical practice (7, 10). High values of SUVmax are considered as predictors of higher risk of recurrence and generally worse prognosis in patients with NSCLC (11-14). Likewise, SUVmax has been reported to be useful for discriminating between malignant and benign lung nodules, mainly as a consequence of the higher values exhibited by malignant lesions compared with benign ones (10, 15, 16). A moderate positive correlation between SUVmax and tumour size in patients with NSCLC has also been reported (17), thus implicitly confirming the correlation between SUVmax and tumour aggressiveness. In some studies, the ratio between SUVmax and primary tumour size has been reported as a more important indicator of prognosis than SUVmax alone (18, 19).
Proliferation biomarkers such as MIB-1 have long been recognised as strong predictors of clinical outcome and response to therapy in patients with NSCLC (20- 22). Since tumours showing higher SUVmax are also considered more aggressive and highly proliferative, much research in recent years has focussed on the potential correlations between SUVmax and proliferation activity markers. Among them, the relationship between the Ki-67 expression index and the degree of radiotracer uptake in NSCLC evaluated through SUVmax has been investigated in several studies (10, 23, 24). High SUVmax is also considered as an independent predictor of programmed cell death ligand 1 (PD-L1) expression (25). Direct analysis of molecular biomarkers, however, is only possible after biopsy and/or surgical resection of the lesion, none of which options is always a viable approach in clinical practice. Therefore, there is much interest in determining whether it is possible to estimate tumour proliferation markers from radiomic data. In this study we present an approach for estimating MIB-1 on primary lesions in patients with NSCLC using PET/CT data – specifically SUVmax and lesion diameter. The method is based on classification trees (ClT in the remainder), an artificial intelligence technique which enables automatic classification through a set of logical rules defined as binary options on sets of discriminative cut-off values of the input parameters.
Patients and Methods
Patient population. We retrospectively evaluated 78 patients with histology-proven NSCLC who were referred for PET/CT examination and treated at the Thoracic Surgery Unit of the Perugia University Medical School (Hospital ‘S. Maria della Misericordia’, Perugia, Italy). The clinical characteristics of the patient series are detailed in Table I. All patients gave written informed consent to undergo PET/CT for clinical purposes and to accept that their data could be used in anonymous form for scientific studies.
Examination protocol. PET/CT examination was carried out on a hybrid PET/CT scanner (Discovery ST integrated PET/CTMS system, General Electric Medical Systems, Waukesha, WI, USA) consisting of a combination of a four-detector row (912 detectors per row) CT unit and one PET scanner with bismuth germanate crystals in 24 rings. The patients were fasted for at least 12h before the scan and serum glucose levels were measured to confirm values below 120mg/dL (6.66 mmol/l). Weight and height of all the patients were also recorded. Fluorine 18 fluorodeoxyglucose was administered intravenously to each patient in a single bolus (340-450 MBq) 45-60 min before the PET scan, followed by 250-500 ml of saline solution. Subsequently, the patients were invited to stretch out within the gantry in a supine position with their arms raised to decrease beam-hardening artifacts. Unenhanced low-dose (80 mA) CT was carried out for attenuation correction with the following parameters: section thickness 3.75 mm; reconstruction index 1.25 mm; gantry rotation speed <1 s; pitch, 1-1.5 and tube voltage 140 kVp. For a more faithful reproduction of the anatomic conditions the entire CT volume was obtained during one single breath hold. After unenhanced computed-tomography, PET examination was performed with a two-dimensional technique in caudo-cranial direction from the proximal one-third of the femour to the skull and the duration of each scan was approximately 4 min. The other parameters were: matrix size 128px × 128px, zoom factor 1 and full-width at half maximum 4.69. The images were reconstructed by using axial-, coronal-, and sagittal-plane views and were corrected for attenuation through the CT data by ordered subsets expectation maximization algorithm. The average PET examination time was 24-28 min.
The resulting PET/CT scans were read independently by two nuclear medicine specialists (B.P. and S.G.M. – both with >15 yrs experience). SUVmax values were obtained by drawing the regions of interest over the most intense slice of the primary after correcting for the injected dose and the patient's weight (19). The tumour diameter in the primary site was also measured.
Immunohistochemistry. Immunohistochemistry (IHC) was performed on 4 μm formalin-fixed paraffin-embedded (FFPE) sections from the most representative area of each lesion using a monoclonal antibody targeting Ki67 (Clone Mib1, DAKO, dilution 1:100). The signal was detected using a biotin-free polymeric-horseradish peroxidase (HRP)- linker antibody conjugate system (Bond Polymer Refine Detection, Leica BioSystems, Newcastle, UK) with heat-induced epitope retrieval (pH 6 for 30 min). The slides were stained using the Bond III automated immunostainer (Leica BioSystems Pty Ltd., Melbourne, Australia) and semi-quantitative evaluation of the staining was performed. Only nuclear staining of the tumour cells was scored. The results were expressed as the fraction (in percentage) between the number of positive nuclei and the total number of nuclei examined.
Standard of reference. The standard of reference for diagnosis of all lesions was histological examination and clinical results and findings of follow-up imaging [conventional CT and/or magnetic resonance (MR) imaging and/or ultrasonography (US)] for at least 6 months. The evaluation of the clinical records and the clinical follow-up of the patients were performed and discussed in consensus at the multidisciplinary meetings of our hospital, and no conflicting results emerged.
Classification trees. Classification trees (also referred to as decision trees) are a type of classifier that reaches the decision through a set of questions applied individually to each feature of the input pattern. In this work we used binary classification trees, which are based on questions of the kind ‘is fi ≤ ti?’, being fi the i-th feature and ti the decision value (also referred to as threshold) for that feature (26).
The set of questions and the corresponding thresholds are determined by training, which is carried out on a set of pre-classified cases (train set). The training process involves the following steps: a) defining a splitting criterion to divide the data into two subsets while maximising the intra-subset homogeneity; b) at each node deciding the candidate questions to be asked; c) establishing a stop-splitting rule to control the growth of the tree and d) establishing a rule to assign each leaf to one of the possible classes. Further refinement of the tree can be obtained by pruning, which consists of removing the levels (decision rules) that are irrelevant for the classification. At the end of the training process the ClT will consist of a set of logical rules which can be used to make predictions about the clinical conditions of the patient under evaluation.
Classification trees are particularly appealing in that they are fast, relatively easy to train and able to provide performance similar to that obtainable via more sophisticated methods. Furthermore, the inference mechanism of ClT [the ‘gathered knowledge’ (27)] is – unlike other methods – human-readable, and, as such, can be encoded into rules than can be easily translated into clinical practice.
Estimation of MIB-1 expression. We used classification trees to estimate MIB-1 from SUVmax and lesion diameter. To this end we evaluated three different ClT structures based on the following combinations of the input features: 1) SUVmax and lesion diameter; 2) SUVmax only and 3) lesion diameter only. For each of the above combinations we tested ClTs with different degree of complexity, the latter being determined by the amount of pruning on the original ClT.
Experiments. We cast the MIB-1 estimation process into a binary classification problem in the following way: cases with MIB-1 ≤25% were labelled as class ‘A’, the others (MIb-1 >25%) as class ‘B’. Accuracy estimation was based on leave-one-out cross-validation (26) – i.e. each classification model was trained using the data of all the patients but one and tested on the left-out case. The overall accuracy was the average of the results obtained over all the subdivisions into train and test set. The implementation was based on Matlab's Neural Networks Toolbox (28).
Results
Table II summarises the classification results. As can be seen, the best overall performance was achieved by using both SUVmax and lesion diameter as input features, which gave 82.05% overall accuracy (per-class accuracy 78.05% and 86.49% for ‘A’ and ‘B’, respectively). The models that used only one of the two features had worse performance: SUVmax alone achieved 79.49% overall accuracy (92.68% and 64.86% for class ‘1’ and ‘2’, respectively); lesion diameter alone provided 65.38% (70.73% and 59.46%).
Table III reports the mean values and standard deviation of MIB-1, SUVmax and lesion diameter in the patient population. Looking at the structure of the ClT that achieved the best accuracy (Figure 1), we can see that a) SUVmax below 3.95 alone was indicative of low proliferation index (MIB-1 ≤25%); b), lesion diameter above 41.5mm alone or below that threshold but with SUVmax ≥17.8 was indicative of high proliferation index (MIB-1 >25%) and c) lesion diameter below 41.5mm and SUVmax in the range 6.6-17.8 indicated low proliferation index.
Discussion
Tumour proliferation biomarkers play a pivotal role in predicting the overall outcome and response to treatment in patients with NSCLC. However, the determination of such biomarkers is only possible after tissue biopsy and/or surgical resection, none of which options is free of risk and always necessarily requested in clinical practice. The possibility to estimate such data non-invasively can be of great help in the process of lesion characterisation.
Many authors have set into evidence the relationship between pathological type of NSCLC, lesion size and SUVmax as measured by PET/CT (29-31). Özgül et al. (29) for instance have found that SUVmax was significantly associated with tumour size. Specifically, they divided patients with NSCLC in three groups, according to tumor size (group 1, ≤3 cm; group 2, between 3 cm and 5 cm; group 3, >5 cm) and showed that SUVmax was significantly lower in groups 1 and 2 compared with group 3. Hsu et al. (30) have shown that patients with a reference lung tumor ≤3 cm and SUV ≤3.1 had an expected five-year survival of 100%, while subjects with a reference tumour >3 cm and SUV >3.1 had an expected five-year survival rate of 53.3%. Karam et al. (31) have reported a significant correlation between SUVmax, tumour differentiation and tumour size in patients with adenocarcinoma and between SUVmax and tumour size of individuals with squamous cell carcinoma. The authors also concluded that the overall SUVmax of patients with NSCLC could be predicted by tumour size value, the prediction being more accurate for patients with adenocarcinoma compared with those with squamous cell carcinoma.
In this work we investigated the correlation between the proliferation biomarker MIB-1 (Ki 67) and radiomic data – specifically SUVmax and lesion diameter from PET/CT. With this aim we split the patient population into a low and a high proliferation group using MIB-1=25% as threshold, then trained an automatic classifier to determine whether an unknown case could be correctly classified as low or high group based on SUVmax and lesion diameter from PET/CT. Our method could correctly predict the class label with overall accuracy >82% by using both radiomic parameters (SUVmax and lesion diameter) as input features. The results indicate that radiotracer activity evaluated via SUVmax and lesion diameter is correlated with the tumour proliferation index MIB-1. This is of clinical interest as the tumour proliferation marker MIB-1 reflects biological aggression. Furthermore, the use of an artificial intelligence technique like ClT enables the classification model to be encoded into a set of rules easily interpreted by humans.
This work is not exempt from limitations. Among them are its retrospective nature and the relatively limited sample size (n=78). The results found here are encouraging and should be validated in larger studies. Interesting directions for future work include the investigation of other radiomic parameters (e.g. shape and texture features from PET/CT) and their potential correlation with tumour proliferation index.
Acknowledgements
This work was partially supported by the Fondazione Cassa di Risparmio di Perugia (Perugia, Italy), under grant no. 2015.0389013 – “Application of Artificial Intelligence methods to PET/CT for computer-assisted diagnosis”, and by the Department of Engineering, Università degli Studi di Perugia (Perugia, Italy) under project “Shape, colour and texture features for the analysis of two- and three-dimensional images: methods and applications” (Fundamental Research Grants Scheme 2019).
Footnotes
Authors' Contributions
Conceptualization: B.P., F.P. and M.R.; Data collection and preparation: B.P., R.C. and S.C., S.G.M., G.B. and M.R.; Mathematical modelling and data analysis: M.L.F., S.C., and F.B.; Validation: B.P., R.C., M.L.F., G.B., S.G.M., A.S. and M.R.; Supervision, B.P., F.B., M.L.F., A.S., F.P and M.R.; Writing – preparation of the original draft, B.P., F.B., M.L.F., G.B. and M.R.; Writing – review and editing: all authors; Funding acquisition, B.P. and F.B. All Authors have read and agreed to the submitted version of the manuscript.
Conflicts of Interest
The Authors declare no conflicts of interest regarding this study.
- Received April 16, 2020.
- Revision received April 30, 2020.
- Accepted May 4, 2020.
- Copyright© 2020, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved