Abstract
Background/Aim: Diagnostic scores (DSs) for confirmation of acute renal colic (ARCO) have rarely been evaluated. Patients and Methods: A cohort of 1,333 patients with acute abdominal pain (AAP) were studied, including 59 patients with confirmed ARCO. The most significant diagnostic findings (in multivariate logistic regression analysis) were used to construct DS formulas for the diagnosis of ARCO. Meta-analytical techniques were used to detect the summary sensitivity and specificity estimates for each data set (clinical symptoms, signs and tests, as well as DS formulas). Results: In hierarchical summary receiver operating characteristic analysis (HSROC), the values for area under the curve (95% confidence interval) for i) clinical symptoms ii) signs and tests, and iii) DS were 0.650 (0.612-0.688), 0.724 (0.680-0.768) and 0.962 (0.940-0.984), respectively. In HSROC analysis of the area under the curve values, differences were significant between i) and iii) (p<0.0001) and between ii) and iii) (p<0.0001). Conclusion: The present study is the first to provide evidence suggesting that the DS can be used for clinical confirmation of ARCO in patients with AAP, with a high diagnostic accuracy without radiological or laboratory analyses.
Acute renal colic (ARCO) is the most common urological emergency and in the USA over one million patients per year with ARCO visit emergency units (1). Similarly, the incidence of ARCO is also increasing globally, as shown by a i) steady rise in the emergency visits, ii) use of imaging, and iii) use of drugs for ARCO (2, 3). The clinical findings of ARCO include flank or lower abdominal pain, characterized by loin radiation. These symptoms are non-specific, however, and computerized tomography (CT) has become the standard diagnostic modality for ARCO (2). Although CT is useful for the detection of kidney stones, the radiation exposure from CT is a concern because ARCO is a recurrent disease, with relapses in 30-50% of patients (4, 5). To our knowledge, there are very few data on the testing of diagnostic scoring (DS) for ARCO, which prompted us to re-evaluate the accuracy of the clinical diagnosis of ARCO among patients with acute abdominal pain (AAP). The present study evaluated the relative accuracy of i) clinical symptoms, ii) signs and test, as well as iii) DS in confirming ARCO.
Patients and Methods
A cohort of 1,333 patients with AAP were included, of whom 59 had ARCO. The clinical symptoms (n=22), signs (n=14) and laboratory tests (n=3) were recorded for each patient. The diagnosis of ARCO was confirmed by considering all clinical symptoms, clinical signs and results of the laboratory tests together and following the diagnostic criteria of AAP as previously described (6-11).
Identifying DS models. A multivariate logistic (stepwise) regression analysis (SPSS Statistics 26.0.0.1; IBM, Armonk, NY, USA) was used to disclose variables with an independent predictive value. All the variables of clinical history and diagnostic findings presented in Tables I and II were included in the analysis as binary data e.g., ARCO=1 and other diagnosis of AAP=0, where was a positive (PE) and negative (NE) endpoints as defined therein. Using the coefficients of the regression model, a DS was built and its predictive value for ARCO was studied. The coefficient of the multivariate analysis shows the relative risk of a patient with a given clinical symptom, sign or test of having ARCO.
The DS formula derived for ARCO was: −3.26 × gender (female=1, male=0) + 2.60 × duration of pain (PE=1, NE=0) + 1.54 × appetite (PE=1, NE=0) + 6.89 × tenderness (PE=1, NE=0) + 3.23 × renal tenderness (PE=1, NE=0) + 7.56 × urine (PE=1, NE=0) − 8.06.
Statistical analysis. STATA/SE version 16.1 (StataCorp, College Station, TX, USA) was used for further statistical analyses. The statistical tests presented were two-sided, and p-values under 0.05 were considered statistically significant. Using 2×2 tables, sensitivity and specificity with 95% confidence intervals (95% CI) for clinical symptoms, signs and tests were determined. Meta-analytical technique (metaprop) was used to create separate forest plots for sensitivity and specificity for each set of data, including each diagnostic variables. We calculated the summary estimates of sensitivity and specificity, positive and negative likelihood ratios, and diagnostic odds ratio using a random-effects bivariate model and fitted the summary hierarchical receiver operating characteristic (HSROC) curves, including all diagnostic variables in the DS model, using ARCO as an endpoint.
Results
Basic patient data. In ARCO study group there were 59 patients (14 females and 45 males) versus 1,274 patients in the non-ARCO group (683 females and 591 males) including the following patients with AAP: 616 with non-specific abdominal pain, 271 with acute appendicitis, 124 with acute cholecystitis, 53 with acute small bowel obstruction, 50 with non-organic dyspepsia and 160 other patients with AAP, with a mean±SD age of 37.5±21.7 years.
The clinical symptoms in ARCO. The sensitivity of the clinical symptoms for detecting ARCO was 52% (95% CI=37-66%). The sensitivity was higher than 52% for eight of the symptoms. The five best clinical symptoms (progression of pain from the onset to diagnosis, jaundice, bowel function, drugs for abdominal pain and consumption of alcohol) had 71-98% sensitivity in ARCO (Figure 1). The specificity of the clinical symptoms in confirming ARCO was 64% (95% CI=47-79%) (Figure 2). Altogether, 13 symptoms had specificity higher than 64%. The five most specific symptoms of ARCO (location of initial pain, location of pain at diagnosis, intensity of abdominal pain, vertigo, and micturition) had 85-99% specificity (Figure 2).
The clinical signs and tests in ARCO. The sensitivity of the diagnostic signs and tests for ARCO was 79% (95% CI=65-90%) (Figure 3), and 10 findings had sensitivity exceeding 79%. The five highly accurate findings (skin colour, abdominal movement, distension, mass, positive for Murphy’s sign) had 93-100% sensitivity (Figure 3). The specificity of the signs was only 42% (95% CI=23-62%) (Figure 4), while nine signs had individual specificity higher than 42%. The five most accurate signs (mood, scar, tenderness, renal tenderness and urine), however, had 74-100% specificity (Figure 4).
DS in confirming ARCO. The most important predictors of ARCO were gender, duration of pain, appetite, tenderness, renal tenderness and urine. The most accurate level for the DS model (DS V; sensitivity=96%, specificity=99%) was reached when the patients with a DS value of between −3.29 and 0.69 were considered as “undefined” patients with ARCO for whom follow-up was required (n=46) (Figure 5). The DS formula was tested at five different cut-off levels to disclose the best diagnostic performance (Figure 5). The overall sensitivity and specificity of these DS models was 91% (95% CI=84-97%) and 98% (95% CI=96-99%), respectively (Figures 5 and 6). Three of these models had sensitivity >91% and three models had specificity >98%. The best diagnostic DS model in these patients with ARCO (DS V, Figures 5 and 6) had sensitivity of 96% (95% CI=86-100%) and specificity of 99% (95% CI=98-99%).
HSROC and AUC. STATA (metandiplot) was used to draw the HSROC curves to visualise the pooled overall diagnostic accuracy of symptoms, signs and tests, and different DS models in detecting ARCO (Figures 7, 8 and 9). In HSROC analysis, the AUC (95% CI) values for i) clinical symptoms, ii) signs and tests, and iii) DS were 0.650 (0.612-0.688), 0.724 (0.680-0.768) and 0.962 (0.940-0.984), respectively. The differences between these AUC values (roccomp analysis) were as follows: Between i) and ii), p=0.199; between i) and iii), p<0.0001; between ii) and iii), p<0.0001.
Discussion
The DS models for ARCO published so far are based on clinico-radiological prediction, which necessitates ultrasonography (US) or CT imaging in scoring. Moore et al. studied a retrospective cohort of patients undergoing CT for suspected ureteral stone. In multivariate analysis, five factors were predictive of a ureteral stone: male gender, duration of pain <6 hours, Caucasian, nausea or vomiting, and erythrocytes in the urine analysis, yielding a score of 0-13 (the STONE score) (12). The STONE score classifies patients into categories with different risk for ARCO: Low, intermediate and high risk. Moore et al. suggested the STONE score for ARCO screening and to reduce the need for unnecessary CT studies. Although the STONE may be a comprehensive test for ARCO, Fukuhara et al. pointed out that the race factor (black versus non-black patients) may not be a suitable predictor for ARCO (13). In addition, in the study of Moore et al. (12), no ROC analysis with AUC values for the patients with ARCO were reported. Another bias of that study (12) is that the pretest probability for ARCO or the doctors’ estimate for likelihood of ARCO could not have been investigated because only 2.9% of the patients had a non-ARCO diagnosis, which is likely to have affected the clinical prediction rule.
Fukuhara et al. investigated 124 patients with ARCO, and the evaluation of hydronephrosis (HN) with US was found to be a key factor in their DS because HN was weighted 4/13 points (13). Although some studies have pointed out the importance of HN in the diagnosis of ARCO (14), US may lead to false-positive/negative findings because US is a user-dependent modality and its accuracy in HN may vary according to the experience level of the test performer (15). Fukuhara et al. correlated US scoring with the final diagnosis, reporting the diagnostic performance of their DS as high as AUC=0.95 (13). They also validated the STONE score using ROC analysis, reaching an AUC of 0.88 (95% CI=0.82-0.94) in their cohort of patients with ARCO. However, there are several limitations in their study: i) Small sample size, ii) selected cohort, iii) their study was conducted only on weekdays and during working hours. There were only 40 non-ARCO patients and therefore the diagnostic efficiency of their DS among patients with AAP is unclear. In addition, the reliable diagnosis of HN by US needs much training (15).
Al-Terki et al. reviewed a cohort of 200 patients with ARCO, showing that in multivariate analysis, serum creatinine, leucocyte count, the largest diameter of the stone in CT and stone place in CT were significantly associated with ARCO, with their DS model reaching AUC=0.945 (16). Their patient cohort was fully selected, however, including only patients with suspicion of ARCO and no other patients with AAP were included.
Rana et al. conducted a prospective study including 132 patients with ARCO. They validated the diagnostic performance of the Al-Terki DS including serum creatinine, leucocyte count, the largest diameter of the stone in CT and stone place in CT. The AUC value of the clinico-radiological DS was 0.93 (17). However, this study was also limited by the highly selected patient cohort, only including patients with ARCO.
Conclusion
Only few trials with US or CT imaging have studied the applicability of DS models in patients with ARCO. We were unable to perform direct comparisons to previous DS trials because the present study is the first to provide evidence that DS can be used for diagnosis of ARCO among patients presenting with AAP. Taken together, as determined by the AUC values of the HSROC tests, the DS for ARCO is superior to both i) symptoms and ii) signs and tests. The major advantages of our DS is that this model does not need US, CT or laboratory analyses to reach high accuracy for ARCO.
Acknowledgements
The study was funded by the Päivikki and Sakari Sohlberg Foundation.
Footnotes
↵* These Authors contributed equally to this study.
Authors’ Contributions
All Authors contributed to the collection and analysis of data, drafting and revising the manuscript, read and approved the final article.
Conflicts of Interest
The Authors report no conflicts of interest or financial ties to disclose.
- Received March 20, 2021.
- Revision received April 28, 2021.
- Accepted April 29, 2021.
- Copyright © 2021 International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.