Abstract
Background/Aim: Although hormone receptor–positive (HR+) invasive ductal carcinoma (IDC) is the most common breast cancer subtype, there is limited evidence describing how demographic and clinical features vary across U.S. regions. Understanding geographic disparities is essential for improving screening and treatment planning. To examine regional variations in demographic, socioeconomic status (SES), and stage-at-diagnosis characteristics among U.S. patients with HR+ IDC.
Patients and Methods: This cross-sectional study used data from the National Cancer Database (NCDB) for patients diagnosed with HR+ IDC between 2004 and 2020. Patients were categorized into 6 U.S. geographic regions: Northeast, Southeast, Midwest, Southwest, Mountain, and Pacific, based on the Commission on Cancer facility location. Descriptive and comparative analyses evaluated age, sex, race and ethnicity, insurance type, income, urban-rural residence, and American Joint Committee on Cancer stage.
Results: Among 136,280 patients (mean age, 64.4 years; 98.8% female), racial and SES composition differed significantly across regions. Black patients comprised 19.5% of the Southeast cohort and 18.1% of the Southwest cohort, compared with 2.9% in the Mountain region. The Asian population was highest in the Pacific (13.1%). Low-income households (<$63,000) were most prevalent in the Southwest (74.7%) and Southeast (69.5%), while the Pacific region had the highest proportion of higher-income households (46.4%) and metropolitan residents (94.3%). Stage III-IV disease at diagnosis occurred most often in the Southwest (17.6%) and least in the Northeast (14.0%).
Conclusion: Significant variation exists in the demographic and SES profile of patients with HR+ IDC, corresponding to differences in stage at diagnosis, and BC-related overall outcomes. These disparities likely reflect inequities in screening access, SES, and healthcare infrastructure, underscoring the need for region-specific public health strategies. Targeted regional interventions and equitable screening expansion are warranted to reduce geographic disparities and improve overall BC-related outcomes.
Introduction
Breast cancer (BC) remains the most commonly diagnosed malignancy among women in the United States and the second leading cause of cancer-related death overall. It is the leading cause of cancer mortality among Black and Hispanic women (1). More than 95% of BCs arise from the terminal ductal lobular unit, with invasive ductal carcinoma (IDC) being the most common histological subtype. The hormone receptor–positive (HR+) IDC represents the most prevalent subtype, with nearly 70% of all BCs, and typically has favorable outcomes when detected early (1, 2). Within HR+ disease, HER2 negativity is also associated with more indolent biology. Because this subtype is both common and comparatively uniform, it provides an optimal model for evaluating geographic differences without confounding more aggressive subtypes (3, 4). Despite national improvements in screening and therapy, geographic differences persist in stage at diagnosis and mortality, which may reflect regional disparities in socioeconomic status (SES), healthcare access, and population composition.
Annual reports from the National Cancer Institute’s (NCI’s) Surveillance, Epidemiology, and End Results (SEER) program and the Centers for Disease Control (CDC) describe national incidence and mortality trends, but few analyses have delineated demographic and clinical variation in HR+ IDC by region using contemporary nationwide data (5). Understanding these patterns may reveal structural determinants of late-stage presentation and guide equitable screening allocation.
This cross-sectional study used data from the National Cancer Database (NCDB) to characterize demographic, SES, and stage distributions among HR+ IDC patients across six U.S. geographic regions. To reduce biological heterogeneity and better isolate geographic effects, patients with invasive lobular carcinoma were excluded because this subtype exhibits distinct metastatic patterns and poorer outcomes in advanced HR+ disease (6). We hypothesized that areas with lower income and limited healthcare access would have higher proportions of advanced-stage disease at diagnosis.
Patients and Methods
Data were obtained from the NCDB, in collaboration with the American College of Surgeons Commission on Cancer (CoC) and the American Cancer Society, which captures approximately 70% of newly diagnosed malignancies nationwide. We included patients aged 18 years or older with histologically confirmed International Classification of Diseases for Oncology, Third Edition [ICD-O-3] code 8500 and were restricted to HR+ (ER and/or PR positive, HER2 status negative or unknown) status diagnosed between 2004 and 2020.
Geographic classification. Each patient was assigned to one of six regions: Northeast, Southeast, Midwest, Southwest, Mountain, or Pacific, based on reporting facility location following U.S. Census Bureau regional definitions. Variables analyzed included demographic characteristics [age, sex, race (White, Black, Asian, Native American), and Hispanic ethnicity]; socioeconomic indicators [insurance type (private, government, uninsured), median household income (<$63,000 vs. ≥$63,000), and urban-rural residence (metropolitan, urban, rural)]; and clinical characteristics, including American Joint Committee on Cancer (AJCC) stage 0-IV at diagnosis.
Statistical analysis. Demographic, SES, and clinical characteristics of patients were summarized across six regions. Continuous variables were reported as means (SDs) and categorical variables as frequencies and percentages. Regional distributions of age, race, income, insurance status, urban–rural residence, and AJCC stage were presented with 95% CIs to assess variation across regions. Analyses were stratified by sex, race, and urban–rural classification to explore demographic and population-density effects on stage at presentation.
Temporal changes in stage III-IV presentation from 2004 through 2020 were examined using the Joinpoint Regression Program, version 4.9.1.0 (National Cancer Institute), to estimate the average annual percent change (AAPC), with parallelism tests assessing regional differences in trend direction and magnitude. Missing data were excluded. Analyses were conducted with SPSS, version 29.0 (IBM Corp., Armonk, NY, USA) and GraphPad Prism, version 10.0 (GraphPad Inc., San Diego, CA, USA).
Ethical considerations. The study was deemed exempt by the Mayo Clinic Institutional Review Board and conformed to Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines. Because the NCDB contains deidentified data, informed consent was waived.
Results
Patient characteristics. A total of 136,280 patients with HR+ IDC were identified from the NCDB between 2004 and 2020. The median (range) age was 64 (40-90) years, and the mean (SD) age was 64.4 (13.0) years. Regional cohort sizes ranged from 5,613 patients (4.1%) in the Mountain region to 35,140 (25.8%) in the Midwest. Female patients comprised 98.8% of the cohort, with minimal regional variation.
Race and ethnicity. White patients constituted 83.3% (n=111,442) of all cases but ranged from 78.3% in the Southeast, 79.8% in the Southwest, and up to 92.6% in the Mountain region. Black patients accounted for 13.1% (n=17,459) overall, most prevalent in the Southeast (19.5%) and Southwest (18.1%), while Asian patients represented 3.3% (n=4,393), predominantly in the Pacific region (13.1%). Native American patients were rare (0.3%; n=456), highest in the Mountain region (1.9%). Hispanic ethnicity was recorded at 5.8% (n=7,713), most frequently in the Pacific (12.3%) and Southwest (9.1%) regions (Table I).
Summary of demographic data for patients diagnosed with hormone receptor-positive (HR+) invasive ductal carcinoma (IDC) by geographic location.
Socioeconomic status. Insurance data (n=133,506) showed that government insurance covered 53.4% (n=71,339), private insurance 44.0% (n=58,806), and 2.5% (3,361) were uninsured. Uninsured rates were highest in the Southwest (3.7%) and Southeast (3.2%). Income data were available for 132,473 patients. Lower-income households (<$63,000) accounted for 64.3% (n=85,193) overall, with the highest proportions in the Southwest (74.7%) and Southeast (69.5%), compared with 49.9% in the Northeast region.
Urban-rural classification. Urbanicity data (n=133,080) showed that 87.0% (n=115,796) of patients resided in metropolitan areas, 11.6% (15,480) in urban areas, and 1.4% (1,804) in rural communities. Metropolitan residence exceeded 90% in most regions, reaching a peak in the Pacific (94.3%), whereas the Midwest had the lowest metropolitan proportion (81.1%) and the highest rural share (2.3%).
Stage distribution. Stage information was available for 123,110 patients (92.7%). Stage I disease was the most common presentation (39.9%; n=49,113), followed by stage II (23.1%; n=28,448), stage 0 (19.8%; n=24,400), stage III (7.2%; n=8,914), and stage IV (9.9%; n=12,235). The proportion of early-stage disease (stage 0-I) was highest in the Northeast (62.3%) and lowest in the Southwest (56.6%), whereas advanced-stage disease (stage III-IV) was most prevalent in the Midwest (17.7%) and Mountain (18.2%) regions (Figure 1).
Stage distribution by US region among patients with hormone receptor-positive invasive ductal carcinoma, NCDB 2004-2020. Bars represent the proportion of each AJCC stage within the region. Early-stage (0-I) disease was most frequent in the Northeast, while later-stage (III-IV) presentations predominated in the Midwest and Mountain regions.
Discussion
In this national cross-sectional analysis of more than 136,000 patients with HR+ IDC, substantial geographic heterogeneity was identified in demographic, SES, and stage distributions across U.S. regions. The Southeast and Southwest regions exhibited higher proportions of Black and Hispanic patients, lower median household income, greater uninsured rates, and higher prevalence of advanced-stage (stage III-IV) disease. Similarly, patients in the Midwest and Mountain regions demonstrated a higher prevalence of advanced-stage presentation, potentially caused by barriers related to geographic access and regional healthcare disparities. These findings align with prior analyses showing that socioeconomic disadvantage, rural residence, and limited healthcare infrastructure are associated with delayed diagnosis of BC (7, 8).
Conversely, patients in the Pacific region demonstrated greater socioeconomic advantage and the highest proportion of urban residency, which corresponded with a higher possibility of earlier-stage detection. A recent study by Huang et al. demonstrated that patients with better healthcare access, higher area-level education, and more prior screening mammography had 54% lower odds of advanced-stage BC diagnosis and a 36% lower hazard of BC-specific death. However, racial and ethnic disparities were not correlated with screening history in that cohort (9). There are also similar studies with different age groups of patients that highlighted that the prior screening programs are a major determinant of early detection (10, 11). These findings suggest that healthcare access, education, and participation in screening are critical drivers of earlier-stage presentation in BC. Geographic variation in mammography facility distribution, oncology providers’ availability, and insurance coverage may explain the regional disparity.
Beyond individual and behavioral factors, structural and healthcare-system determinants play a pivotal role in improving BC outcomes overall. Regional differences in mammography facility density, oncology provider distribution, and travel distance to care centers have been strongly associated with treatment adherence and early detection rates. Webster et al. identified pronounced geographic inequities, finding that areas with higher proportions of Black residents had substantially fewer mammography facilities, 68% fewer statewide and up to 89% fewer in New Castle County, Delaware (12). Similarly, Hashtarkhani et al. utilized machine learning and geospatial modeling to demonstrate that mammography facility density and availability were among the most influential predictors (10). Further, consuming nutrient-poor diets may contribute to poor BC outcomes, especially in dense low-SES patient areas and ethnic minority areas (13). Another multi-ethnic study found that neighborhood-level disadvantages, such as overcrowding or food insecurity, were significantly associated with screening adherence (14). Collectively, ensuring equitable geographic access rather than patient awareness alone remains a major factor in closing breast cancer disparities observed by race and rurality.
Future efforts to mitigate these regional and racial disparities should address the structural and policy barriers and integrate geographic data into predictive models of screening and diagnosis. Expanding insurance coverage, particularly through Medicaid expansion and affordability initiatives, has been linked to higher early-stage diagnosis rates, greater use of preventive services, and reduced cancer mortality. Randomized trials showed that patient navigation programs increase BC screening rates by up to 50% and improve the timeliness of diagnosis and treatment completion. Another, deploying mobile and geographically targeted screening programs can overcome access barriers. Removing financial barriers to advanced imaging modalities and integrating social-needs navigation to address transportation or food insecurity may enhance participation among socioeconomically vulnerable groups. Finally, integrating social determinants of health into national cancer surveillance systems will help health agencies to identify high-risk regions, optimize resource allocation, and monitor the real-world impact of these interventions.
Study limitations. The NCDB captures approximately 70% of newly diagnosed malignancies from CoC-accredited hospitals but does not include non-CoC facilities, which tend to be smaller, more rural, and underserved communities. Most of the non-CoC hospitals have fewer oncology resources, reduced access to screening programs, and greater socioeconomic barriers. As a result, patients may present with more advanced disease than those included in the NCDB. This exclusion may lead to underrepresentation of rural, low-income, uninsured, and minority populations and may underestimate the true magnitude of geographic disparities. The cross-sectional design limits our capacity to establish causality; temporal changes are not available, so survival or treatment outcomes were not assessed. Certain variables, including income and insurance, contained missing data, and geographic categorization was based on reporting facility location, which may not always reflect patients’ true residential environment. Additionally, temporal trends could be influenced by changes in diagnostic or reporting practices during the study period.
Conclusion
In this nationwide cohort, regional differences in income, insurance coverage, and healthcare access were associated with a later stage at diagnosis in HR+IDC patients. The influential factors were identified to offer valuable information for policy makers, healthcare providers, and researchers to expand equitable, community-based screening programs, improve insurance coverage and oncology services in underserved areas, and reduce inequities in BC outcomes nationwide.
Footnotes
Authors’ Contributions
Berkay Demirors conceptualized the study, performed data collection and statistical analysis, and drafted the manuscript. Syeda Hoorulain Ahmed, Karen Grace, Paola Berrios Jimenez, Jade C. Bowers, Vishal Abhimutt Mahesh, Anjali Yadav, Harsheen K. Manaise, Guido Chiriboga, Lola Fuentes Brock, Reed Popp, and Angel Aguayo Merly contributed to data verification, literature review, and manuscript editing. Emmanuel Gabriel supervised the study, provided critical revisions, and approved the final version of the manuscript. All Authors reviewed and approved the final manuscript.
Conflicts of Interest
The Authors declare no conflicts of interest related to this study.
Artificial Intelligence (AI) Disclosure
No artificial intelligence (AI) tools, including large language models or machine learning software, were used in the preparation, analysis, or presentation of this manuscript.
- Received October 30, 2025.
- Revision received November 10, 2025.
- Accepted November 19, 2025.
- Copyright © 2026 The Author(s). Published by the International Institute of Anticancer Research.
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.







