<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Public Health</journal-id>
<journal-title>Frontiers in Public Health</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Public Health</abbrev-journal-title>
<issn pub-type="epub">2296-2565</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpubh.2023.1332277</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Public Health</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Intersectional analysis of inequalities in self-reported breast cancer screening attendance using supervised machine learning and PROGRESS-Plus framework</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Pedr&#x00F3;s Barnils</surname>
<given-names>N&#x00FA;ria</given-names>
</name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2565333/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sch&#x00FC;z</surname>
<given-names>Benjamin</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/844326/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff><institution>Institute for Public Health and Nursing Research, University of Bremen</institution>, <addr-line>Bremen</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by" id="fn0001"><p>Edited by: Rakesh PS, The Union South East Asia Office, India</p></fn>
<fn fn-type="edited-by" id="fn0002"><p>Reviewed by: Swathi Krishna Njarekkattuvalappil, KEM Hospital Research Centre, India</p><p>Tony Lawrence, Government of Kerala, India</p><p>Zinia Thajudeen Nujum, Government Medical College, India</p></fn>
<corresp id="c001">&#x002A;Correspondence: N&#x00FA;ria Pedr&#x00F3;s Barnils, <email>nupedros@uni-bremen.de</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>01</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>11</volume>
<elocation-id>1332277</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>11</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>12</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2024 Pedr&#x00F3;s Barnils and Sch&#x00FC;z.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Pedr&#x00F3;s Barnils and Sch&#x00FC;z</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<sec id="sec1">
<title>Background</title>
<p>Breast cancer is a critical public health concern in Spain, and organized screening programs have been in place since the 1990s to reduce its incidence. However, despite the bi-annual invitation for breast cancer screening (BCS) for women aged 45&#x2013;69, significant attendance inequalities persist among different population groups. This study employs a quantitative intersectional perspective to identify intersectional positions at risk of not undergoing breast cancer screening in Spain.</p>
</sec>
<sec id="sec2">
<title>Methods</title>
<p>Women were selected from the 2020 European Health Interview Survey in Spain, which surveyed the adult population (&#x003E;&#x2009;15&#x2009;years old) living in private households (<italic>N</italic>&#x2009;=&#x2009;22,072; 59% response rate). Inequality indicators based on the PROGRESS-Plus framework were used to disentangle existing social intersections. To identify intersectional groups, decision tree models, including classification and regression trees (CARTs), chi-squared automatic interaction detector (CHAID), conditional inference rees (CITs), and C5.0, along with an ensemble algorithm, extreme gradient boosting (XGBoost), were applied.</p>
</sec>
<sec id="sec3">
<title>Results</title>
<p>XGBoost (AUC 78.8%) identified regional differences (Autonomous Community) as the most important factor for classifying BCS attendance, followed by education, age, and marital status. The C5.0 model (balanced accuracy 81.1%) highlighted that the relative importance of individual characteristics, such as education, marital status, or age, for attendance differs based on women&#x2019;s place of residence and their degree of interaction. The highest risk of not attending BCS was observed among illiterate older women in lower social classes who were born in Spain, were residing in Asturias, Cantabria, Basque Country, Castile and Le&#x00F3;n, Extremadura, Galicia, Madrid, Murcia, La Rioja, or Valencian Community, and were married, divorced, or widowed. Subsequently, the risk of not attending BCS extends to three other groups of women: women living in Ceuta and Melilla; single or legally separated women living in the rest of Spain; and women not born in Spain who were married, divorced, or widowed and not residing in Ceuta or Melilla.</p>
</sec>
<sec id="sec4">
<title>Conclusion</title>
<p>The combined use of decision trees and ensemble algorithms can be a valuable tool in identifying intersectional positions at a higher risk of not utilizing public resources and, thus, can aid substantially in developing targeted interventions to increase BCS attendance.</p>
</sec>
</abstract>
<kwd-group>
<kwd>healthcare access</kwd>
<kwd>breast cancer screening</kwd>
<kwd>health inequalities</kwd>
<kwd>intersectionality</kwd>
<kwd>decision trees</kwd>
<kwd>supervised machine learning</kwd>
<kwd>PROGRESS-Plus framework</kwd>
<kwd>Spain</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="62"/>
<page-count count="12"/>
<word-count count="8927"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Life-Course Epidemiology and Social Inequalities in Health</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec5">
<label>1</label>
<title>Introduction</title>
<p>In the majority of EU countries, universal health insurance and, in theory, universal access to healthcare are in place. However, while many have achieved universal health coverage for a broad range of services, access to these services is not equally experienced by all citizens (<xref ref-type="bibr" rid="ref1">1</xref>).</p>
<p>The Conceptual Framework of Access to Healthcare developed by Levesque et al. presents a comprehensive multidimensional approach to understanding and improving health service accessibility that combines individual- and system-related variables (<xref ref-type="bibr" rid="ref2">2</xref>). In this article, we focus on an individual-level accessibility outcome, such as attendance at organized breast cancer screening programs, and examine groups at the intersections of individual-, system-, and environment-level inequality dimensions to identify those at the highest risk for non-attendance.</p>
<p>In 2020, breast cancer was the most diagnosed cancer among women, with 2.3 million cases, and the most prevalent, with 7.8 million women still living after a diagnosis within the past 5 years, globally (<xref ref-type="bibr" rid="ref3">3</xref>). According to the European Cancer Information System (ECIS), breast cancer accounted for 28.7% of all cancers in women in the EU-27 countries in 2020, with an estimated 355,457 new cases and an age-standardized mortality rate (ASR) of 142.8 per 100,000 inhabitants (<xref ref-type="bibr" rid="ref4">4</xref>). In Spain, the context for this study, breast cancer constitutes 30.7% of all cancers in women, with 110,946 new cases detected in 2020. The ASR for breast cancer is 132 per 100,000, followed by colorectal cancer with an ASR of 58.4 per 100,000 (<xref ref-type="bibr" rid="ref4">4</xref>).</p>
<p>The importance of early breast cancer detection has been on the European agenda since 1980. The European Commission has urged Member States to establish preventive organized screening programs (OSPs) that bi-annually invite women aged 50&#x2013;69&#x2009;years for breast cancer screening (BCS). Additionally, women aged 45&#x2013;49&#x2009;years and 70&#x2013;74&#x2009;years should be conditionally screened every 2&#x2013;3&#x2009;years and 3&#x2009;years, respectively (<xref ref-type="bibr" rid="ref5">5</xref>). Over time, Member States have implemented OSPs with age ranges ranging from 40 to 74&#x2009;years (<xref ref-type="bibr" rid="ref6">6</xref>).</p>
<p>In 1990, Spain established OSPs, making it one of the first European countries to do so. However, due to the fact that some competencies of the Spanish healthcare system are allocated at the level of Autonomous Communities (regions), there were substantial differences between regions in invitation schedules: Some regions implemented a bi-annual regimen for women aged 45&#x2013;69, some only for age groups 50&#x2013;69, and there were differences with regards to the starting points as well&#x2014;for example, Navarre was the first region implementing the program in 1990, followed by Catalonia, Castilla-La Mancha, Castile and Le&#x00F3;n, Catalonia, Valencian Community, and Galicia in 1992. Ceuta and Melilla, on the other hand, only started their program in 2005 (<xref ref-type="bibr" rid="ref7">7</xref>). The national attendance rate has steadily increased over the years, with 72.6% of targeted women attending their last medical appointment and 93.13% undergoing screening at least once in their lifetime in 2020. (<xref ref-type="bibr" rid="ref8">8</xref>). Although this is a small proportion (6.87%), there is a large number of invited women who did not seek care, despite having knowledge of their age-related risks and available healthcare services.</p>
<p>Studies have shown that attendance inequality exists in BCS, with non-attendance not being uniformly distributed among the targeted population. In a recent international systematic review, Mottram et al. found that migrant women, women of lower socioeconomic status, non-homeowners, and those who previously experienced false-positive results had the lowest attendance rate (<xref ref-type="bibr" rid="ref9">9</xref>). Recent studies in Spain have found that being married, having Spanish nationality, and having higher education are positive predictors of screening participation (<xref ref-type="bibr" rid="ref10">10</xref>). Similarly, Serral et al. stated that socioeconomic status and country of origin are the strongest discriminating factors for screening attendance in Spain (<xref ref-type="bibr" rid="ref11">11</xref>). Moreover, Mart&#x00ED;n-L&#x00F3;pez et al. reported that holding Spanish nationality, being married, and having a high educational and income level are positively associated with BCS attendance (<xref ref-type="bibr" rid="ref12">12</xref>). However, so far and to the authors&#x2019; knowledge, no study has assessed these sociodemographic variables alongside the regional importance of non-attendance rates. In decentralized healthcare systems such as the Spanish system, where the concentration of health professionals in certain autonomous regions is approximately twice that of others, it is crucial to assess the influence of regional disparities on healthcare accessibility (<xref ref-type="bibr" rid="ref13">13</xref>, <xref ref-type="bibr" rid="ref14">14</xref>).</p>
<p>Moreover, previous studies have examined dimensions of inequality as independent predictors of non-attendance. However, adopting an <italic>intersectional</italic> framework is necessary to understand the complexity of attending breast cancer screening by analyzing social and demographic risk factors beyond independent predictors.</p>
<p>The theory of intersectionality, developed in 1991 by Kimberl&#x00E9; Crenshaw in the context of unfair legal processes for Black women, posits that discrimination and oppression result from the intersection of multiple aspects of identity (e.g., gender, ethnicity) and related experiences (<xref ref-type="bibr" rid="ref15">15</xref>). This theoretical framework is used in several disciplines today, with a broader range of variables that embody social power structures (e.g., sexuality, age, and income) (<xref ref-type="bibr" rid="ref16">16</xref>). Inequalities in health-related outcomes, such as screening behavior, can be disentangled by adopting a non-additive, intersecting approach (<xref ref-type="bibr" rid="ref17">17</xref>). In other words, experiences of discrimination by individuals who lay at the intersection of several axes of inequalities cannot be assessed and understood only by summing up the oppression suffered by these two variables independently.</p>
<p>Although traditionally examined in qualitative studies, recent years have witnessed a significant increase in the use of quantitative methods for intersectionality research (<xref ref-type="bibr" rid="ref18">18</xref>). In this study, relatively simple regression analyses with interaction terms or intersection variables are more frequently used; more complex and less frequently used approaches include analyses such as decomposition analysis or decision trees (<xref ref-type="bibr" rid="ref18">18</xref>). The latter refers to machine learning techniques that allow a non-parametric, data-driven exploration of intersectional heterogeneity in the data. In this study, variables potentially discriminating between groups varying in their estimated prevalence of a health outcome, such as PROGRESS-Plus characteristics, can be included in the analysis without the risk of overfitting the model (<xref ref-type="bibr" rid="ref19">19</xref>). Even at the risk of non-replicative arbitrary data splits, decision trees allow for any interaction, which helps, on the one hand, with variable selection and reduction and, on the other hand, identifying subgroups for potential interventions (<xref ref-type="bibr" rid="ref20">20</xref>, <xref ref-type="bibr" rid="ref21">21</xref>).</p>
<p>Only a few public health studies have used decision trees to explore sociodemographic inequalities. Mena et al. used classification and regression trees (CARTs) and conditional inference trees (CITs) to identify groups with a higher prevalence of non-daily vegetable intake (<xref ref-type="bibr" rid="ref22">22</xref>). Eagle et al. employed chi-squared automatic interaction detection trees (CHAID) to explore the associations between race and ethnicity, sex, depression, and concussion history with reported suicide attempts among adolescents in the US (<xref ref-type="bibr" rid="ref23">23</xref>). Delgado-Gallegos et al. used C5.0 to assess perceived stress in healthcare professionals attending to COVID-19 cases in Mexico (<xref ref-type="bibr" rid="ref24">24</xref>). These studies suggest that decision trees are a viable and helpful tool to identify the intersectional groups at high risk for specific outcomes, but so far, only a few studies have employed decision trees for studying sociodemographic inequalities in attending breast cancer screening.</p>
<p>Freitas et al. used CHAID to examine individual and environmental factors that predict breast cancer screening practices among women aged 45&#x2013;69 in Portugal (<xref ref-type="bibr" rid="ref25">25</xref>), but have not adopted a decidedly intersectional perspective. To the author&#x2019;s knowledge, no study in Spain has addressed inequality in breast cancer screening from an intersectional perspective or using machine learning techniques. Thus, according to national and regional breast cancer screening guidelines and based on the intersections of PROGRESS-Plus characteristics, this study aims to identify the intersectional positions of women more at risk of not undergoing breast cancer screening in Spain.</p>
</sec>
<sec sec-type="materials|methods" id="sec6">
<label>2</label>
<title>Materials and methods</title>
<sec id="sec7">
<label>2.1</label>
<title>European health interview survey</title>
<p>We used the cross-sectional data from the third wave of the European Health Interview Survey (EHIS) conducted in Spain in 2019 and 2020 to identify intersectional groups. The sample size of the survey was 22,072 respondents, corresponding to 59% of the invited participants (<italic>N</italic>&#x2009;=&#x2009;22,072; 59% response rate). The survey, legally binding from the second wave onwards, is conducted every 5&#x2009;years (every 6&#x2009;years since 2019) and targets the population over 15&#x2009;years old living in private households. Its main objective is to gather data on the population&#x2019;s health status, the utilization of health services, and determinants of health in a harmonized and comparable manner at the European level (b). The survey is framed within the European Commission&#x2019;s (EC) Regulation 1338/2008, and the current legally binding EC Regulation describing the survey&#x2019;s definitions is EC 2018/255 of 19 February 2018 (<xref ref-type="bibr" rid="ref26">26</xref>, <xref ref-type="bibr" rid="ref27">27</xref>).</p>
</sec>
<sec id="sec8">
<label>2.2</label>
<title>Ethical consideration</title>
<p>No ethical approval is required for this study as it is a secondary analysis of de-identified publicly available data.</p>
</sec>
<sec id="sec9">
<label>2.3</label>
<title>Primary outcome</title>
<p>The primary outcome of this study was self-reported breast cancer screening attendance at some point during their lifetime via mammography for women aged 45&#x2013;69 in 7 out of 19 Autonomous Communities (Castilla-La Mancha, Castile and Le&#x00F3;n, Valencian Community, La Rioja, Navarre, Ceuta, and Melilla) and for women aged 50&#x2013;69 in the rest of Autonomous Communities. Possible answers to these questions were: yes, no, unknown, or unanswered. Answers were dichotomized, removing those few who responded unknown or unanswered to avoid incorporating uncertainty on whether the respondent reported never attending BCS (no&#x2009;=&#x2009;0, yes&#x2009;=&#x2009;1).</p>
</sec>
<sec id="sec10">
<label>2.4</label>
<title>Explanatory variables</title>
<p>The explanatory variables used in this research are based on the PROGRESS-Plus framework (<xref ref-type="bibr" rid="ref28">28</xref>). Their use to disentangle social inequalities has been extensively discussed (<xref ref-type="bibr" rid="ref29">29</xref>) and used (<xref ref-type="bibr" rid="ref30">30</xref>). For this study, we categorized existing indicators of the survey into the PROGRESS-Plus categories and used them as potential predictors of non-attendance at BCS.</p>
<p><italic>Place of Residence</italic> was determined through two different variables: the size of the municipality and the specific Autonomous Community. The first variable was composed of the following seven categories: less than 10,000 inhabitants, 10,000&#x2013;20,000 inhabitants, 20,000&#x2013;50,000 inhabitants, 50,000&#x2013;100,000 inhabitants, more than 100,000 inhabitants, capital of the province, or more than 500,000 inhabitants. As mentioned above, the specific Autonomous Community also serves as a proxy for the time period of the implementation of BCS programs and the density of health professionals in the region.</p>
<p><italic>Race, ethnicity, culture, and language</italic> were measured by the respondents&#x2019; country of origin. The information provided by this variable in the <italic>EHIS</italic> was whether the participants were born in Spain or another country. Thus, the variable was employed as a binary variable (Spain&#x2009;=&#x2009;1 and other&#x2009;=&#x2009;2).</p>
<p><italic>Occupation</italic> was operationalized through the respondents&#x2019; current working situation. The derived categorical variable had the following categories: paid employment, unemployed, retired, unable to work, (unpaid) household work, and others. The &#x201C;others&#x201D; category is a merged category that includes <italic>EHIS</italic> classification such as unable to work, studying, and others, as the two first categories had very few participants, 17 and 1, respectively.</p>
<p><italic>Sex</italic> was not included as an explanatory variable but rather as a requirement needed for including participants in the analysis: to be a female. <italic>Gender</italic> and <italic>religion</italic> dimensions were not included in the present research, as the <italic>EHIS</italic> did not comprise any questions covering gender identification or religious preferences.</p>
<p><italic>Education</italic> was measured following the CNED14 classification, the Spanish adaptation of ISCED-2011 (<xref ref-type="bibr" rid="ref31">31</xref>). The variables employed had the following categories: illiterate, uncompleted primary education, completed primary education, first-stage high school, finished high school, intermediate vocational training, superior vocational training, and university degree.</p>
<p><italic>Socioeconomic status</italic> was operationalized by the type of occupation. The variable&#x2019;s classification was obtained based on the Spanish Society of Epidemiology classification developed in 1995 and revised in 2012. This grouping comprises 6 groups, with 1 being the highest social class and 6 being the lowest (<xref ref-type="bibr" rid="ref32">32</xref>).</p>
<p><italic>Social capital</italic> was not explicitly included in the dataset; however, we used two proxy variables: marital status (single, married, legally separated/divorced, or widowed; indicating the availability of spousal support) and type of household (alone, with a partner, with a partner and children, alone with children, or others; indicating the availability of family support).</p>
<p>The <italic>Plus</italic> dimension of the PROGRESS-Plus framework was proposed to extend the social determinants of health that could potentially be discriminating features. Among these contextually dependent factors, characteristics that attract discrimination, features of relationships, and time-dependent relationships have been highlighted (<xref ref-type="bibr" rid="ref33">33</xref>). In this study, we assessed characteristics that attract discrimination through the Global Activity Limitations Indicator (GALI) and age (<xref ref-type="bibr" rid="ref34">34</xref>). The GALI self-reported measure is conceived by assessing the degree of limitation experienced in the last 6&#x2009;months, with the possible answers being severely limited, mildly limited, or not limited. Age is a categorical variable comprising women aged 45&#x2013;69 in quintiles (45&#x2013;49&#x2009;years old, 50&#x2013;54&#x2009;years old, 55&#x2013;59&#x2009;years old, 60&#x2013;54&#x2009;years old, and 65&#x2013;69&#x2009;years old). While one would anticipate a proportional relationship between age and BCS attendance (i.e., older women being invited to BCS more times in their lives than younger women), this variable was included to assess its potential interactions with other PROGRESS-Plus variables. Specifically, it aimed to examine whether distinct social determinants of health intersect in different age groups.</p>
<p>As a final point, none of the categories of the explanatory variables were collapsed (e.g., illiterate and uncompleted primary education); Instead, they were included in the model individually with all categories. This approach was chosen because decision tree algorithms permit a high level of granularity and the ability to capture interactions and potential intersections among the various categories (<xref ref-type="bibr" rid="ref35">35</xref>).</p>
</sec>
<sec id="sec11">
<label>2.5</label>
<title>Analytic approach</title>
<p>Descriptive analytics, frequencies, and percentages were calculated for all variables. The statistical significance between participants who attended BCS and those who did not was tested using a chi-squared test for all variables.</p>
<p>For this study, a complete case analysis was conducted when missing data were present. The total sample size was restricted to women aged 45&#x2013;69 in 7 out of 19 Autonomous Communities (Castilla-La Mancha, Castile and Le&#x00F3;n, Valencian Community, La Rioja, Navarre, Ceuta, and Melilla) and to 50&#x2013;69&#x2009;years in the rest of Autonomous Communities (<italic>n</italic>&#x2009;=&#x2009;4,180). Among these women, those who did not provide information on whether they ever underwent mammography (<italic>n</italic>&#x2009;=&#x2009;8) were excluded, and those who did not provide their marital status (<italic>n</italic>&#x2009;=&#x2009;21) were excluded. Hence, the final total sample size of the study was 4,151 participants.</p>
<p>We used decision trees to identify the groups of women most at risk of not attending BCS. There is scarce literature that suggests which decision trees better operate on diverse health outcomes. CART bases its splitting decision on the lowest Gini impurity (or entropy) coefficient of every possible split (<xref ref-type="bibr" rid="ref21">21</xref>). It is often criticized for not providing statistical significance measures and being biased toward variables with many categories. CIT was developed to overcome some of CART&#x2019;s limitations. CIT employs a formal statistical hypothesis for building the decision trees and avoids variable selection bias by dividing the selection process into two steps (<xref ref-type="bibr" rid="ref36">36</xref>). CHAID was the first decision tree developed based on statistical significance tests, in which all covariates are tested against the outcome, and the one with the highest association is selected as the splitting variable. The limitation of CHAID is that it computes only categorical variables and is based on the chi-squared test (<xref ref-type="bibr" rid="ref37">37</xref>). C5.0 is a non-parametric method that uses the entropy of the imputed variables to generate splits. Therefore, nodes are generated based on the data split that produces a higher information gain (i.e., the entropy of the data split is the lowest, meaning the homogeneity among the included cases is the highest) and therefore appears to be more advantageous for classification (<xref ref-type="bibr" rid="ref38">38</xref>). After C5.0 generates a vast tree, it applies the binomial confidence limit method to every subtree for pruning through a high predicted error rate. Moreover, C5.0 uses adaptive boosting (<xref ref-type="bibr" rid="ref39">39</xref>) and winnowing in the growing process (<xref ref-type="bibr" rid="ref40">40</xref>, <xref ref-type="bibr" rid="ref41">41</xref>).</p>
<p>In the present study, the four most commonly used decision trees were implemented using R packages &#x201C;caret&#x201D; (<xref ref-type="bibr" rid="ref42">42</xref>) for exploratively hypertuning the models and &#x201C;rpart&#x201D; (CART) (<xref ref-type="bibr" rid="ref43">43</xref>), &#x201C;chaid&#x201D; (CHAID) (<xref ref-type="bibr" rid="ref44">44</xref>), &#x201C;partykit&#x201D; (CIT) (<xref ref-type="bibr" rid="ref20">20</xref>), and &#x201C;C50&#x201D; (C5.0) (<xref ref-type="bibr" rid="ref40">40</xref>) for outputting the final model in R version 4.2.3.</p>
<p>Decision trees that use classification algorithms employ several metrics derived from a confusion matrix to evaluate their performance. In the case of a binary outcome, the confusion matrix is a 2&#x2009;&#x00D7;&#x2009;2 table that illustrates the total number of true positives (TP &#x2013; hit), true negatives (TN &#x2013; correct rejection), false negatives (FN &#x2013; type II error), and false positives (FP &#x2013; type I error) when comparing the predicted positive (PP) and predicted negative (PN) cases to the actual positive (P) and actual negative (N) cases. Measures such as balanced accuracy, recall, precision, and F1 score (<xref ref-type="table" rid="tab1">Table 1</xref>) can then be calculated and used for model performance evaluation (<xref ref-type="bibr" rid="ref45">45</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>Informative metrics derived from the confusion matrix for model&#x2019;s performance evaluation.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top">Evaluation metrics</th>
<th align="center" valign="top">Equation</th>
</tr>
</thead>
<tbody>
<tr>
<td align="center" valign="top">Balanced accuracy</td>
<td align="center" valign="top">
<inline-formula><mml:math id="M1"><mml:mfrac><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mi>P</mml:mi></mml:mfrac><mml:mo>+</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mi>N</mml:mi></mml:mfrac></mml:mrow><mml:mn>2</mml:mn></mml:mfrac></mml:math></inline-formula>
</td>
</tr>
<tr>
<td align="center" valign="top">Recall (sensitivity)</td>
<td align="center" valign="top">
<inline-formula><mml:math id="M2"><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mi>P</mml:mi></mml:mfrac></mml:math></inline-formula>
</td>
</tr>
<tr>
<td align="center" valign="top">Precision</td>
<td align="center" valign="top">
<inline-formula><mml:math id="M3"><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac></mml:math></inline-formula>
</td>
</tr>
<tr>
<td align="center" valign="top">F1 score</td>
<td align="center" valign="top">
<inline-formula><mml:math id="M4"><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mspace width="0.25em"/><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mspace width="0.25em"/><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:math></inline-formula>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Despite decision trees being reliable tools for intersectional subgroup identification, they are unstable given their dependence on the data used to train them and, thus, posing the risk of overfitting. Small changes in the training data could lead to differences in decision tree construction; hence, the implementation of ensemble algorithms is recommended (<xref ref-type="bibr" rid="ref46">46</xref>). There are several ensemble algorithms that employ different procedures, such as bagging (Random Forest), boosting (AdaBoost), and bagging and boosting (extreme gradient boosting). Following the results of previous research (<xref ref-type="bibr" rid="ref47 ref48 ref49">47&#x2013;49</xref>), we decided to use extreme gradient boosting (XGBoost) to increase the internal validation and robustness of the decision trees. The algorithm was implemented using the package &#x201C;xgboost&#x201D; (<xref ref-type="bibr" rid="ref49">49</xref>) in R version 4.2.3.</p>
</sec>
</sec>
<sec sec-type="results" id="sec12">
<label>3</label>
<title>Results</title>
<sec id="sec13">
<label>3.1</label>
<title>Descriptive statistics of the sample</title>
<p>A summary of the descriptive statistics of the sample can be found in <xref ref-type="table" rid="tab2">Table 2</xref>. Association chi-squared tests suggest marginal associations between each independent variable and the outcome, except for municipality size. The total sample size is 4,151. Of those, 3,886 attended BCS during the last 2&#x2009;years, while 285 did not. As expected, the youngest group, women aged 45&#x2013;49&#x2009;years, attended less in their lifetime (18.34% &#x2013; never attended) than older women (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;76.93, df&#x2009;=&#x2009;4, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001). Women not born in Spain (14.78%) attended less BCS compared to those born in Spain (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;35.54, df&#x2009;=&#x2009;1, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001). Illiterate women (43.75%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;74.94, df&#x2009;=&#x2009;7, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001) and women in the lowest social class (10.51%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;38.00, df&#x2009;=&#x2009;6, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001) had the smallest number of attendances at BCS. Regarding the place of residence, women living in municipalities with 10,000&#x2013;20,000 inhabitants (8.82%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;16.95, df&#x2009;=&#x2009;6, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.151) and women living in Melilla (37.14%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;157.26, df&#x2009;=&#x2009;18, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001) had the lowest attendance at BCS. Finally, legally separated women (13.39%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;68.11, df&#x2009;=&#x2009;4, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001), women living alone with children (8.21%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;27.8, df&#x2009;=&#x2009;4, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001), women in a working situation labeled as other (10.90%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;26.92, df&#x2009;=&#x2009;4, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001), and severely limited women (12.44%) (<italic>&#x03C7;</italic><sup>2</sup>&#x2009;=&#x2009;13.15, df&#x2009;=&#x2009;2, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.0106) participated in BCS the least among their PROGRESS-Plus dimension.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Descriptive PROGRESS-Plus characteristics on the attendance to BCS among targeted women in Spain based on the 2020 EHIS.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top">Attended BCS (<italic>N</italic>&#x2009;=&#x2009;3,866)</th>
<th align="center" valign="top">Never attended BCS (<italic>N</italic>&#x2009;=&#x2009;285)</th>
<th align="center" valign="top">Total (<italic>N</italic>&#x2009;=&#x2009;4,151)</th>
<th align="center" valign="top"><italic>p</italic>-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" colspan="5"><bold>Age quintiles (years)</bold></td>
</tr>
<tr>
<td align="left" valign="top">45&#x2013;49</td>
<td align="center" valign="top">187 (4.8%)</td>
<td align="center" valign="top">42 (14.7%)</td>
<td align="center" valign="top">229 (5.5%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">50&#x2013;54</td>
<td align="center" valign="top">865 (22.4%)</td>
<td align="center" valign="top">93 (32.6%)</td>
<td align="center" valign="top">958 (23.1%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">55&#x2013;59</td>
<td align="center" valign="top">982 (25.4%)</td>
<td align="center" valign="top">63 (22.1%)</td>
<td align="center" valign="top">1,045 (25.2%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">60&#x2013;64</td>
<td align="center" valign="top">956 (24.7%)</td>
<td align="center" valign="top">48 (16.8%)</td>
<td align="center" valign="top">1,004 (24.2%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">65&#x2013;69</td>
<td align="center" valign="top">876 (22.7%)</td>
<td align="center" valign="top">39 (13.7%)</td>
<td align="center" valign="top">915 (22.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Country of origin</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Spain</td>
<td align="center" valign="top">3,572 (92.4%)</td>
<td align="center" valign="top">234 (82.1%)</td>
<td align="center" valign="top">3,806 (91.7%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">Other</td>
<td align="center" valign="top">294 (7.6%)</td>
<td align="center" valign="top">51 (17.9%)</td>
<td align="center" valign="top">345 (8.3%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Educational group</bold></td>
</tr>
<tr>
<td align="left" valign="top">Illiterate</td>
<td align="center" valign="top">19 (0.5%)</td>
<td align="center" valign="top">13 (4.6%)</td>
<td align="center" valign="top">32 (0.8%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">Uncompleted primary education</td>
<td align="center" valign="top">186 (4.8%)</td>
<td align="center" valign="top">23 (8.1%)</td>
<td align="center" valign="top">209 (5.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Completed primary education</td>
<td align="center" valign="top">720 (18.6%)</td>
<td align="center" valign="top">63 (22.1%)</td>
<td align="center" valign="top">783 (18.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">First-stage high school</td>
<td align="center" valign="top">976 (25.2%)</td>
<td align="center" valign="top">75 (26.3%)</td>
<td align="center" valign="top">1,051 (25.3%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Completed high school</td>
<td align="center" valign="top">533 (13.8%)</td>
<td align="center" valign="top">30 (10.5%)</td>
<td align="center" valign="top">563 (13.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Intermediate vocational training</td>
<td align="center" valign="top">331 (8.6%)</td>
<td align="center" valign="top">18 (6.3%)</td>
<td align="center" valign="top">349 (8.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Superior vocational training</td>
<td align="center" valign="top">266 (6.9%)</td>
<td align="center" valign="top">22 (7.7%)</td>
<td align="center" valign="top">288 (6.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">University degree</td>
<td align="center" valign="top">835 (21.6%)</td>
<td align="center" valign="top">41 (14.4%)</td>
<td align="center" valign="top">876 (21.1%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Socioeconomic position</bold></td>
</tr>
<tr>
<td align="left" valign="top">1 (high)</td>
<td align="center" valign="top">427 (11.0%)</td>
<td align="center" valign="top">17 (6.0%)</td>
<td align="center" valign="top">444 (10.7%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">2</td>
<td align="center" valign="top">333 (8.6%)</td>
<td align="center" valign="top">14 (4.9%)</td>
<td align="center" valign="top">347 (8.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">3</td>
<td align="center" valign="top">851 (22.0%)</td>
<td align="center" valign="top">47 (16.5%)</td>
<td align="center" valign="top">898 (21.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">4</td>
<td align="center" valign="top">420 (10.9%)</td>
<td align="center" valign="top">21 (7.4%)</td>
<td align="center" valign="top">441 (10.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">5</td>
<td align="center" valign="top">1,104 (28.6%)</td>
<td align="center" valign="top">101 (35.4%)</td>
<td align="center" valign="top">1,205 (29.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">6 (low)</td>
<td align="center" valign="top">545 (14.1%)</td>
<td align="center" valign="top">64 (22.5%)</td>
<td align="center" valign="top">609 (14.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Not classifiable</td>
<td align="center" valign="top">186 (4.8%)</td>
<td align="center" valign="top">21 (7.4%)</td>
<td align="center" valign="top">207 (5.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Size of the municipality (people)</bold></td>
</tr>
<tr>
<td align="left" valign="top">&#x003C;10,000</td>
<td align="center" valign="top">484 (12.5%)</td>
<td align="center" valign="top">34 (11.9%)</td>
<td align="center" valign="top">518 (12.5%)</td>
<td align="center" valign="top">0.151</td>
</tr>
<tr>
<td align="left" valign="top">10,000&#x2013;20,000</td>
<td align="center" valign="top">920 (23.8%)</td>
<td align="center" valign="top">89 (31.2%)</td>
<td align="center" valign="top">1,009 (24.3%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">20,000&#x2013;50,000</td>
<td align="center" valign="top">314 (8.1%)</td>
<td align="center" valign="top">15 (5.3%)</td>
<td align="center" valign="top">329 (7.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">50,000&#x2013;100,000</td>
<td align="center" valign="top">350 (9.1%)</td>
<td align="center" valign="top">19 (6.7%)</td>
<td align="center" valign="top">369 (8.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x003E;100,000</td>
<td align="center" valign="top">590 (15.3%)</td>
<td align="center" valign="top">56 (19.6%)</td>
<td align="center" valign="top">646 (15.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Province Capital</td>
<td align="center" valign="top">436 (11.3%)</td>
<td align="center" valign="top">25 (8.8%)</td>
<td align="center" valign="top">461 (11.1%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x003E;500,000</td>
<td align="center" valign="top">772 (20.0%)</td>
<td align="center" valign="top">47 (16.5%)</td>
<td align="center" valign="top">819 (19.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Autonomous community</bold></td>
</tr>
<tr>
<td align="left" valign="top">Andalusia</td>
<td align="center" valign="top">443 (11.5%)</td>
<td align="center" valign="top">52 (18.2%)</td>
<td align="center" valign="top">495 (11.9%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">Aragon</td>
<td align="center" valign="top">154 (4.0%)</td>
<td align="center" valign="top">2 (0.7%)</td>
<td align="center" valign="top">156 (3.8%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Asturias</td>
<td align="center" valign="top">176 (4.6%)</td>
<td align="center" valign="top">13 (4.6%)</td>
<td align="center" valign="top">189 (4.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Balearic Islands</td>
<td align="center" valign="top">61 (1.6%)</td>
<td align="center" valign="top">8 (2.8%)</td>
<td align="center" valign="top">69 (1.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Canary Islands</td>
<td align="center" valign="top">212 (5.5%)</td>
<td align="center" valign="top">12 (4.2%)</td>
<td align="center" valign="top">224 (5.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Cantabria</td>
<td align="center" valign="top">165 (4.3%)</td>
<td align="center" valign="top">13 (4.6%)</td>
<td align="center" valign="top">178 (4.3%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Castile and Le&#x00F3;n</td>
<td align="center" valign="top">188 (4.9%)</td>
<td align="center" valign="top">14 (4.9%)</td>
<td align="center" valign="top">202 (4.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Castilla-La Mancha</td>
<td align="center" valign="top">228 (5.9%)</td>
<td align="center" valign="top">19 (6.7%)</td>
<td align="center" valign="top">247 (6.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Catalonia</td>
<td align="center" valign="top">356 (9.2%)</td>
<td align="center" valign="top">7 (2.5%)</td>
<td align="center" valign="top">363 (8.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Valencian Community</td>
<td align="center" valign="top">393 (10.2%)</td>
<td align="center" valign="top">28 (9.8%)</td>
<td align="center" valign="top">421 (10.1%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Extremadura</td>
<td align="center" valign="top">149 (3.9%)</td>
<td align="center" valign="top">8 (2.8%)</td>
<td align="center" valign="top">157 (3.8%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Galicia</td>
<td align="center" valign="top">217 (5.6%)</td>
<td align="center" valign="top">15 (5.3%)</td>
<td align="center" valign="top">232 (5.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Madrid</td>
<td align="center" valign="top">384 (9.9%)</td>
<td align="center" valign="top">20 (7.0%)</td>
<td align="center" valign="top">404 (9.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Murcia</td>
<td align="center" valign="top">144 (3.7%)</td>
<td align="center" valign="top">9 (3.2%)</td>
<td align="center" valign="top">153 (3.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Navarre</td>
<td align="center" valign="top">170 (4.4%)</td>
<td align="center" valign="top">7 (2.5%)</td>
<td align="center" valign="top">177 (4.3%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Basque Country</td>
<td align="center" valign="top">192 (5.0%)</td>
<td align="center" valign="top">12 (4.2%)</td>
<td align="center" valign="top">204 (4.9%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">La Rioja</td>
<td align="center" valign="top">140 (3.6%)</td>
<td align="center" valign="top">8 (2.8%)</td>
<td align="center" valign="top">148 (3.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Ceuta</td>
<td align="center" valign="top">50 (1.3%)</td>
<td align="center" valign="top">12 (4.2%)</td>
<td align="center" valign="top">62 (1.5%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Melilla</td>
<td align="center" valign="top">44 (1.1%)</td>
<td align="center" valign="top">26 (9.1%)</td>
<td align="center" valign="top">70 (1.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Marital status</bold></td>
</tr>
<tr>
<td align="left" valign="top">Single</td>
<td align="center" valign="top">548 (14.2%)</td>
<td align="center" valign="top">83 (29.1%)</td>
<td align="center" valign="top">631 (15.2%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">Married</td>
<td align="center" valign="top">2,269 (58.7%)</td>
<td align="center" valign="top">122 (42.8%)</td>
<td align="center" valign="top">2,391 (57.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Widowed</td>
<td align="center" valign="top">460 (11.9%)</td>
<td align="center" valign="top">27 (9.5%)</td>
<td align="center" valign="top">487 (11.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Legally separated</td>
<td align="center" valign="top">194 (5.0%)</td>
<td align="center" valign="top">30 (10.5%)</td>
<td align="center" valign="top">224 (5.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Divorced</td>
<td align="center" valign="top">395 (10.2%)</td>
<td align="center" valign="top">23 (8.1%)</td>
<td align="center" valign="top">418 (10.1%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Type of household</bold></td>
</tr>
<tr>
<td align="left" valign="top">Alone</td>
<td align="center" valign="top">1,024 (26.5%)</td>
<td align="center" valign="top">87 (30.5%)</td>
<td align="center" valign="top">1,111 (26.8%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">With partner</td>
<td align="center" valign="top">1,130 (29.2%)</td>
<td align="center" valign="top">49 (17.2%)</td>
<td align="center" valign="top">1,179 (28.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">With a partner and children</td>
<td align="center" valign="top">918 (23.7%)</td>
<td align="center" valign="top">66 (23.2%)</td>
<td align="center" valign="top">984 (23.7%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Alone with children</td>
<td align="center" valign="top">556 (14.4%)</td>
<td align="center" valign="top">65 (22.8%)</td>
<td align="center" valign="top">621 (15.0%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Other</td>
<td align="center" valign="top">238 (6.2%)</td>
<td align="center" valign="top">18 (6.3%)</td>
<td align="center" valign="top">256 (6.2%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Working situation</bold></td>
</tr>
<tr>
<td align="left" valign="top">In paid employment</td>
<td align="center" valign="top">1742 (45.1%)</td>
<td align="center" valign="top">129 (45.3%)</td>
<td align="center" valign="top">1,871 (45.1%)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">Unemployed</td>
<td align="center" valign="top">355 (9.2%)</td>
<td align="center" valign="top">37 (13.0%)</td>
<td align="center" valign="top">392 (9.4%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Retired</td>
<td align="center" valign="top">903 (23.4%)</td>
<td align="center" valign="top">34 (11.9%)</td>
<td align="center" valign="top">937 (22.6%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Household work (unpaid)</td>
<td align="center" valign="top">727 (18.8%)</td>
<td align="center" valign="top">68 (23.9%)</td>
<td align="center" valign="top">795 (19.2%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Others</td>
<td align="center" valign="top">139 (3.6%)</td>
<td align="center" valign="top">17 (6.0%)</td>
<td align="center" valign="top">156 (3.8%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experienced limitation</bold></td>
</tr>
<tr>
<td align="left" valign="top">Severely limited</td>
<td align="center" valign="top">176 (4.6%)</td>
<td align="center" valign="top">25 (8.8%)</td>
<td align="center" valign="top">201 (4.8%)</td>
<td align="center" valign="top">0.0106</td>
</tr>
<tr>
<td align="left" valign="top">Mildly limited</td>
<td align="center" valign="top">960 (24.8%)</td>
<td align="center" valign="top">55 (19.3%)</td>
<td align="center" valign="top">1,015 (24.5%)</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">Not limited</td>
<td align="center" valign="top">2,730 (70.6%)</td>
<td align="center" valign="top">205 (71.9%)</td>
<td align="center" valign="top">2,935 (70.7%)</td>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>&#x002A; The outcome is a binarized variable that measures whether the respondent never attended breast cancer screening (yes&#x2009;=&#x2009;0, no&#x2009;=&#x2009;1).</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="sec14">
<label>3.2</label>
<title>Data analysis: pre-processing steps</title>
<p>Several pre-processing steps were taken to prepare the data to be analyzed. First, a complete case analysis was conducted, as only 29 cases had missing data on any variable. Second, to improve external validity, data were randomly split into a training subset (<italic>N</italic>&#x2009;=&#x2009;3,321, 80% of the entire dataset) and a testing subset (<italic>N</italic>&#x2009;=&#x2009;830, 20% of the entire dataset), with stratification of the outcome (i.e., to respect the distribution of classes in both subsets). Furthermore, balance sampling methods were applied to the training data given the imbalanced nature of the dependent variable (93.13% ever attended BCS, 6.87% never attended). Several studies have demonstrated that model performance improves when resampling the training data in highly imbalanced datasets (<xref ref-type="bibr" rid="ref50 ref51 ref52">50&#x2013;52</xref>). Four balanced sampling techniques that minimize the majority class (undersampling), maximize the minority class (oversampling), or minimize the majority class and maximize the minority class (SMOTE and ROSE) were tested (<xref ref-type="table" rid="tab3">Table 3</xref>). Oversampling outperformed the other sampling techniques, estimated through the area under the curve (AUC) of logistic regression (LR).</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>Performance of balance sampling techniques on the training dataset.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Balancing technique</th>
<th align="center" valign="top">Never attended BCS</th>
<th align="center" valign="top">Attended BCS</th>
<th align="center" valign="top">Total</th>
<th align="center" valign="top">AUC (LR)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Oversampling</td>
<td align="center" valign="top">3,101</td>
<td align="center" valign="top">3,087</td>
<td align="center" valign="top">6,188</td>
<td align="center" valign="top">0.714</td>
</tr>
<tr>
<td align="left" valign="top">Undersampling</td>
<td align="center" valign="top">234</td>
<td align="center" valign="top">236</td>
<td align="center" valign="top">470</td>
<td align="center" valign="top">0.691</td>
</tr>
<tr>
<td align="left" valign="top">ROSE sampling</td>
<td align="center" valign="top">1,677</td>
<td align="center" valign="top">1,644</td>
<td align="center" valign="top">3,321</td>
<td align="center" valign="top">0.701</td>
</tr>
<tr>
<td align="left" valign="top">SMOTE sampling</td>
<td align="center" valign="top">3,087</td>
<td align="center" valign="top">3,087</td>
<td align="center" valign="top">6,174</td>
<td align="center" valign="top">0.708</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="sec15">
<label>3.3</label>
<title>Data analysis: explanation of the tree</title>
<p>We tested several decision trees, CART, CIT, CHAID, and C5.0, to the oversampled training data. C5.0 outperformed the others with a balanced accuracy of 81.1%, recall of 80.7%, precision of 24.3%, and F1 score of 0.374. See Supplementary material 1 for more details and <xref ref-type="fig" rid="fig1">Figure 1</xref>; <xref ref-type="table" rid="tab4">Table 4</xref> for the final C5.0 decision tree.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>C5.0 decision tree assessing intersectional social positions most at risk of not attending BCS.</p>
</caption>
<graphic xlink:href="fpubh-11-1332277-g001.tif"/>
</fig>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption>
<p>Description of the intersectional social positions (nodes) at risk of not attending BCS from the C5.0 decision tree.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Rank</th>
<th align="left" valign="top">Intersectional social positions</th>
<th align="left" valign="top">Node number</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">1</td>
<td align="left" valign="top">Illiterate women living in Asturias, Cantabria, Castile and Le&#x00F3;n, Valencian Community, Extremadura, Galicia, Madrid, Murcia Basque Country or La Rioja, from middle and low social classes, aged 55 or older, born in Spain, married, divorced, or widowed</td>
<td align="left" valign="middle">10</td>
</tr>
<tr>
<td align="left" valign="middle">2</td>
<td align="left" valign="top">Women living in Ceuta or Melilla</td>
<td align="left" valign="middle">1</td>
</tr>
<tr>
<td align="left" valign="middle">3</td>
<td align="left" valign="top">Single or separated women not living in Ceuta or Melilla</td>
<td align="left" valign="middle">2</td>
</tr>
<tr>
<td align="left" valign="middle">4</td>
<td align="left" valign="top">Married, divorced, or widowed women born outside Spain and not living in Ceuta or Melilla</td>
<td align="left" valign="middle">3</td>
</tr>
<tr>
<td align="left" valign="middle">5</td>
<td align="left" valign="top">Married, divorced, or widowed women, born outside Spain and not living in Ceuta or Melilla, younger than 55&#x2009;years of age with less than high school education or professional education</td>
<td align="left" valign="middle">4</td>
</tr>
<tr>
<td align="left" valign="middle">6</td>
<td align="left" valign="top">Married, divorced, or widowed women, living alone, with a partner, or with a partner and children, born outside Spain, older than 55&#x2009;years of age, belonging to social classes 3&#x2013;6, living in the Autonomous Communities of Andalusia, Balearic Islands, or Castilla-La Mancha</td>
<td align="left" valign="middle">8</td>
</tr>
<tr>
<td align="left" valign="middle">7</td>
<td align="left" valign="top">Married, divorced, or widowed women, born outside Spain and not living in Ceuta or Melilla, younger than 55&#x2009;years of age with more than high school education (except professional education)</td>
<td align="left" valign="middle">5</td>
</tr>
<tr>
<td align="left" valign="middle">8</td>
<td align="left" valign="top">Women living in Asturias, Cantabria, Castile and Le&#x00F3;n, Valencian Community, Extremadura, Galicia, Madrid, Murcia Basque Country or La Rioja, from middle and low social classes, with primary education or greater, aged 55&#x2009;years or older, born in Spain, married, divorced, or widowed</td>
<td align="left" valign="middle">11</td>
</tr>
<tr>
<td align="left" valign="middle">9</td>
<td align="left" valign="top">Married, divorced, or widowed women, living alone with children or other household constellations, born outside Spain, older than 55&#x2009;years of age, belonging to social classes 3&#x2013;6, living in the Autonomous Communities of Andalusia, Balearic Islands, or Castilla-La Mancha</td>
<td align="left" valign="middle">9</td>
</tr>
<tr>
<td align="left" valign="middle">10</td>
<td align="left" valign="top">Married, divorced, or widowed women, born outside Spain and not living in Ceuta or Melilla, older than 55&#x2009;years of age, belonging to the highest two social class groups</td>
<td align="left" valign="middle">6</td>
</tr>
<tr>
<td align="left" valign="middle">11</td>
<td align="left" valign="top">Married, divorced, or widowed women born outside Spain, older than 55&#x2009;years of age, belonging to social classes 3&#x2013;6, living in the Autonomous Communities of Navarre, Aragon, Canary Islands, or Catalonia</td>
<td align="left" valign="middle">7</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The best tree identified the variables Autonomous Communities, origin, marital status, age, education, socioeconomic class, origin, and type of household as relevant in explaining attendance and non-attendance to BCS. The first splitting point, the root node, is found in Autonomous Communities, where women living in Ceuta or Melilla form a subgroup (Node 1, 464 cases /71 no-cases), and the rest of Spain further split. The second split is found in marital status, where single or separated women create a subgroup (Node 2, 1,112/585), and married, divorced, and widowed women further split. The third node relates to the respondent&#x2019;s origin country: women born outside Spain form a subgroup (Node 3, 280/163), and women born in Spain proceed to split. Women born in Spain, married, divorced, or widowed, and not residing in Ceuta or Melilla are additionally divided based on being 55&#x2009;years of age or older. The youngest group is further split based on the level of education, forming two final leaves: less than high school education or professional education (Node 4, 496/334) and greater than high school education except for professional education (Node 5, 93/221). The oldest group is fragmented based on the socioeconomic class of the respondents, where those belonging to the highest two social classes form a subgroup (Node 6, 30/312), and the rest is additionally divided based on the place of residence. The sixth split divides the abovementioned group into those living in the Autonomous Communities of Navarre, Aragon, Canary Islands, or Catalonia (Node 7, 0/316), those living in Andalusia, Balearic Islands, or Castilla-La Mancha, and the rest of Spain. Women living in Andalusia, Balearic Islands, or Castilla-La Mancha are further separated based on the type of household they live in: alone, with a partner, or with a partner and children (Node 8, 307/243), and those living alone with children or other household constellations (Node 9, 7/48). Finally, women living in Asturias, Cantabria, Castile and Le&#x00F3;n, Valencian Community, Extremadura, Galicia, Madrid, Murcia, Basque Country, or La Rioja, from middle and low social classes, aged 55&#x2009;years or older, born in Spain, and married, divorced, or widowed, are lastly split based on their level of education: illiterate (Node 10, 42/2) and primary education or greater (Node 11, 270/792).</p>
<p>For more robust information and internal validation of the decision tree, extreme gradient boosting was performed. XGBoost sequentially trains multiple decision trees (bagging), where each tree endeavors to correct the classification errors of the precious one by assigning specific weights to each tree and its leaves (boosting) (<xref ref-type="bibr" rid="ref49">49</xref>). The parameters used to build the ensemble algorithm were a learning rate of 0.1 (low learning rate to be more robust to overfitting), a maximum number of boosting iterations of 53, and a learning objective of logistic regression for binary classification. XGBoost assessed the importance of the variables when building the boosted trees through the information gain criteria. In essence, this algorithm informs which variables were more significant in constructing the decision trees and therefore has a higher predictive power in the model. With an AUC of 78.80%, the algorithm resolved that Autonomous Community followed by education, age, and marital status are the most important variables for predicting whether targeted women in Spain will attend BCS or not. Following with less than 50% relative importance are origin, social class, GALI, population density, working situation, and type of household (<xref ref-type="fig" rid="fig2">Figure 2</xref>).</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Variables&#x2019; relative importance in the XGBoost model for predicting BCS attendance.</p>
</caption>
<graphic xlink:href="fpubh-11-1332277-g002.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="sec16">
<label>4</label>
<title>Discussion</title>
<p>In this study, we employed decision trees to identify intersectional groups of women at the highest risk for never attending breast cancer screening and ensemble algorithms to give more robustness to the classification model. Autonomous Community, education, age, and marital status, were reported as the most critical variables for classifying attendance at BCS among women aged 45&#x2013;69 in Spain. While all Autonomous Communities have now adopted the OSP, it is worth noting that Ceuta and Melilla were the last to implement in 2006, and these were the regions where the lowest number of targeted women attended screenings (<xref ref-type="bibr" rid="ref53">53</xref>). In addition, these regions are those with the lowest density of health professionals (2.5/1,000 inhabitants) in Spain (<xref ref-type="bibr" rid="ref14">14</xref>). Moreover, given the enclave of these cities in Morocco, many inhabitants define themselves as Muslim, and several studies have observed that Muslim women have a lower attendance at BCS (<xref ref-type="bibr" rid="ref54">54</xref>). Finally, the average level of education of women in these regions is significantly lower than in the rest of Spain (<xref ref-type="bibr" rid="ref8">8</xref>).</p>
<p>Autonomous Communities, such as Navarre (1990) and Catalonia (1992), where OSPs were first implemented sustain higher screening attendance (<xref ref-type="bibr" rid="ref7">7</xref>, <xref ref-type="bibr" rid="ref53">53</xref>). Indeed, our analysis enhances the importance of the intersections of individual and system levels of inequality. While women in higher social classes tend to attend BCS more than those in low and middle social classes (<xref ref-type="bibr" rid="ref9">9</xref>), our analysis shows that women born in Spain, married, divorced, or widowed, not living in Ceuta or Melilla from the two highest social classes, have a slightly higher risk of not attending BCS compared to women born in Spain, married, divorced, or widowed, with middle and low social class, living specifically in Navarre, Aragon, Canary Islands, and Catalonia. These Spanish regions are thus strongly protective factors for BCS attendance, potentially over individual-level variables such as social class.</p>
<p>Among women born in Spain, married, divorced, or widowed, not living in Ceuta or Melilla, and younger than 55&#x2009;years of age, their education level determines their risk of not attending BCS. Women with lower education have been at higher risk of not attending BCS compared to those with higher education (<xref ref-type="bibr" rid="ref12">12</xref>, <xref ref-type="bibr" rid="ref55">55</xref>). However, among women older than 55&#x2009;years of age, social class is the determinant variable. In this case, those in the two highest social classes are a subgroup with the second lowest predicted risk of not attending BCS. The non-linear nature of the interactions permitted in decision trees allows for the emergence of such divergences.</p>
<p>Another unexpected intersectional result of this analysis is the intersection between marital status and origin. Women not living in Ceuta and Melilla, single or separated, and who are married, divorced, or widowed with a migration background have a similarly high risk of not attending BCS. Single and separated women have a higher risk of not attending BCS than women with migration backgrounds who are married, divorced, or widowed. In other words, in this study, the origin of women plays a decisive role only for those who are not single or separated. Although contradictory to previous studies (<xref ref-type="bibr" rid="ref11">11</xref>, <xref ref-type="bibr" rid="ref56">56</xref>, <xref ref-type="bibr" rid="ref57">57</xref>), marital status plays a more prominent role than migration background. These results are reinforced by the variable importance rank for building the model, where marital status has double the importance of origin.</p>
<p>Finally, the intersectional subgroup with the highest risk of not attending BCS, Node 10, encompassed women with very low educational levels in the Autonomous Communities of Asturias, Cantabria, Basque Country, Castile and Le&#x00F3;n, Extremadura, Galicia, Madrid, Murcia, La Rioja, and Valencian Community. Although this subgroup intersects with the privileged position of other variables, such as marital status, age, and origin, being illiterate strongly determines their risk of not attending BCS. These results align with an earlier study showing that education is essential for BCS attendance (<xref ref-type="bibr" rid="ref55">55</xref>). The present study contributes to and strengthens this statement showing that, although lying at the intersection of several protective factors, intersecting with low levels of education is pernicious. Furthermore, such social inequalities will persist if the responsible public health institutions do not address them. Molina-Barcel&#x00F3; et al. found that most of the Autonomous Communities of this subgroup, Asturias, Castile and Le&#x00F3;n, Extremadura, Madrid, and Murcia, did not develop interventions to decrease inequalities in the uptake of BCS in the last decade (<xref ref-type="bibr" rid="ref58">58</xref>).</p>
<p>Although our results are specific to the Spanish healthcare system, some aspects might have implications beyond Spain. When healthcare competencies such as the allocation of financial resources, organization of territorial coverage, or development and implementation of preventive programs are delegated to regional authorities, disparities in access to healthcare services may emerge (<xref ref-type="bibr" rid="ref13">13</xref>). This study identified the Autonomous Community as the most pivotal variable for predicting whether a woman aged 45&#x2013;69&#x2009;years old in Spain will attend BCS. Indeed, the Spanish Federation of Breast Cancer has, for several years, been advocating for greater interregional political cohesion to enhance equitable access to breast cancer prevention and treatment (<xref ref-type="bibr" rid="ref59">59</xref>). Regions with lower economic means, such as Ceuta and Melilla, have invested less in addressing social inequalities compared to wealthier regions, such as Navarre or Catalonia, ultimately enlarging inequities within and between regions (<xref ref-type="bibr" rid="ref58">58</xref>). Regional differences in the use of health services and their correlation to the socioeconomic level of the regions have been previously found in Spain (<xref ref-type="bibr" rid="ref60">60</xref>). While a trend toward narrowing this gap has been observed, Autonomous Communities in the south and northwest of the country continue to exhibit worse health indicators than those in the north and northeast. This is particularly notable in regions such as Ceuta, Melilla, Canary Islands, and Andalusia (<xref ref-type="bibr" rid="ref61">61</xref>).</p>
<sec id="sec17">
<label>4.1</label>
<title>Strengths and limitations</title>
<p>Supervised machine learning techniques, specifically decision trees and the presented ensemble algorithm, allow for a non-parametric and non-linear explanation of the relationship between explanatory variables and the outcome. Thus, unlike the classic statistical analysis, where including many potential predictors could lead to overfitting the model, machine learning techniques allow for a more comprehensive inclusion of all available PROGRESS-Plus variables in the dataset. Multicollinearity deriving from potentially correlated predictors is not an issue in machine learning approaches since decision trees split based on an individual variable at each node, and ensemble algorithms further mitigate the risk by outputting the average importance of each variable. In this study, machine learning algorithms allowed an inductive exploration of intersecting social determinants of health in women. At last, no predictive value is tested; instead, variables are evaluated as potential predictors for building the classification models (<xref ref-type="bibr" rid="ref35">35</xref>).</p>
<p>The study is not without limitations. No causal inference can be drawn due to the cross-sectional design of the survey. Moreover, response and common-method bias may occur due to the nature of the self-reported survey. In fact, the response rate of <italic>EHiS</italic> wave 3 was 59%, emphasizing the need for caution when drawing conclusions from studies using this dataset. Previous researchers have speculated that the <italic>EHiS</italic> suffers from underestimating inequalities in access to screening programs (<xref ref-type="bibr" rid="ref58">58</xref>). Along these lines, the nature of the target population in the survey&#x2014;people living in private households&#x2014;could potentially reveal details about individuals living in less advantaged housing situations, such as public or institutional housing. Furthermore, although the presented decision tree outperformed others, it is relatively large and outputs both simple and highly dimensional intersectional positions. Therefore, the large dimensionality of some subgroups might challenge the usability of their content. Indeed, the present analysis ought to enhance the understanding of the intersectional importance of different variables and describe those most at risk of not attending the BCS. Finding the balance between algorithm performance and public health interpretability and usability is essential in health prevention.</p>
<p>Finally, secondary data intrinsically limit the included variables and their categorization. The dimension of race/ethnicity could only be estimated based on the respondent&#x2019;s country of origin, which does not fully capture the experience of discrimination that women might experience. An extensive body of literature shows how race/ethnicity is often a proxy for convergence factors such as experiences of discrimination, social mobility, and a lack of financial opportunities (<xref ref-type="bibr" rid="ref62">62</xref>).</p>
</sec>
</sec>
<sec sec-type="conclusions" id="sec18">
<label>5</label>
<title>Conclusion</title>
<p>The present study pinpointed the intersectional positions of women at risk of not undergoing breast cancer screening in Spain. In our analyses, decision trees combined with an ensemble algorithm show that the importance of individual characteristics, such as education, marital status, or age, diverges based on the place of residence and their interaction. Particular attention ought to be placed on women living in specific regions: Ceuta and Melilla, single or legally separated women living in the rest of Spain, women not born in Spain, not residing in Ceuta or Melilla who are married, divorced, or widowed, and finally, illiterate women in lower social classes who are living in Asturias, Cantabria, Basque Country, Castile and Le&#x00F3;n, Extremadura, Galicia, Madrid, Murcia, La Rioja, or Valencian Community at the protective intersection with marital status, age, and origin.</p>
<p>Beyond the Spanish context, applying a quantitative intersectionality framework that applies a comprehensive set of individual, system, and environmental indicators of inequality can greatly assist in the creation of health prevention programs that strive for equitable access to healthcare services.</p>
</sec>
<sec sec-type="data-availability" id="sec19">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec sec-type="ethics-statement" id="sec20">
<title>Ethics statement</title>
<p>Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants&#x2019; legal guardians/next of kin in accordance with the national legislation and the institutional requirements.</p>
</sec>
<sec sec-type="author-contributions" id="sec21">
<title>Author contributions</title>
<p>NP: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing &#x2013; original draft, Writing &#x2013; review &#x0026; editing. BS: Conceptualization, Supervision, Validation, Writing &#x2013; review &#x0026; editing.</p>
</sec>
</body>
<back>
<sec sec-type="funding-information" id="sec22">
<title>Funding</title>
<p>The author(s) declare financial support was received for the research, authorship, and/or publication of this article. Open access funding is provided by Bremen University.</p>
</sec>
<ack>
<p>The authors thank Emily Mena and Marc Arias for their insightful comments.</p>
</ack>
<sec sec-type="COI-statement" id="sec23">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><label>1.</label><citation citation-type="book"><person-group person-group-type="author"><collab id="coll1">OECD</collab></person-group>. <source>Health for everyone?: social inequalities in health and health systems. OECD health policy studies</source>. <publisher-loc>Paris</publisher-loc>: <publisher-name>OECD</publisher-name> (<year>2019</year>).</citation></ref>
<ref id="ref2"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levesque</surname><given-names>J-F</given-names></name> <name><surname>Harris</surname><given-names>MF</given-names></name> <name><surname>Russell</surname><given-names>G</given-names></name></person-group>. <article-title>Patient-centred access to health care: conceptualising access at the interface of health systems and populations</article-title>. <source>Int J Equity Health</source>. (<year>2013</year>) <volume>12</volume>:<fpage>18</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1475-9276-12-18</pub-id>, PMID: <pub-id pub-id-type="pmid">23496984</pub-id></citation></ref>
<ref id="ref3"><label>3.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll2">WHO</collab></person-group>. Breast cancer. (<year>2021</year>). <comment>Available at:</comment> <ext-link xlink:href="https://www.who.int/news-room/fact-sheets/detail/breast-cancer" ext-link-type="uri">https://www.who.int/news-room/fact-sheets/detail/breast-cancer</ext-link>.</citation></ref>
<ref id="ref4"><label>4.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll3">ECIS</collab></person-group>. European Cancer Information System. (<year>2020</year>). <comment>Available at:</comment> <ext-link xlink:href="https://ecis.jrc.ec.europa.eu/" ext-link-type="uri">https://ecis.jrc.ec.europa.eu/</ext-link>.</citation></ref>
<ref id="ref5"><label>5.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll4">EC</collab></person-group>. Screening ages and frequencies. (<year>2022</year>). <comment>Available at:</comment> <ext-link xlink:href="https://healthcare-quality.jrc.ec.europa.eu/ecibc/european-breast-cancer-guidelines/screening-ages-and-frequencies" ext-link-type="uri">https://healthcare-quality.jrc.ec.europa.eu/ecibc/european-breast-cancer-guidelines/screening-ages-and-frequencies</ext-link>.</citation></ref>
<ref id="ref6"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altobelli</surname><given-names>E</given-names></name> <name><surname>Rapacchietta</surname><given-names>L</given-names></name> <name><surname>Angeletti</surname><given-names>P</given-names></name> <name><surname>Barbante</surname><given-names>L</given-names></name> <name><surname>Profeta</surname><given-names>F</given-names></name> <name><surname>Fagnano</surname><given-names>R</given-names></name></person-group>. <article-title>Breast cancer screening programmes across the WHO European region: differences among countries based on national income level</article-title>. <source>Int J Environ Res Public Health</source>. (<year>2017</year>) <volume>14</volume>:<fpage>452</fpage>. doi: <pub-id pub-id-type="doi">10.3390/ijerph14040452</pub-id>, PMID: <pub-id pub-id-type="pmid">28441745</pub-id></citation></ref>
<ref id="ref7"><label>7.</label><citation citation-type="book"><person-group person-group-type="author"><collab id="coll5">Ministerio de Sanidad y Consumo</collab></person-group>. <source>Cribado poblacional de cancer de mama en Espa&#x00F1;a</source>. <publisher-loc>Madrid</publisher-loc>: <publisher-name>Direcci&#x00F3;n General de Salud P&#x00FA;blica</publisher-name> (<year>1996</year>).</citation></ref>
<ref id="ref8"><label>8.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll6">INE</collab></person-group>. <source>Encuesta europea de salud en Espa&#x00F1;a. In: Estad&#x00ED;stica INd</source>. <comment>editor</comment>. Instituto Nacional de Estad&#x00ED;stica (<year>2020</year>).</citation></ref>
<ref id="ref9"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mottram</surname><given-names>R</given-names></name> <name><surname>Knerr</surname><given-names>WL</given-names></name> <name><surname>Gallacher</surname><given-names>D</given-names></name> <name><surname>Fraser</surname><given-names>H</given-names></name> <name><surname>Al-Khudairy</surname><given-names>L</given-names></name> <name><surname>Ayorinde</surname><given-names>A</given-names></name> <etal/></person-group>. <article-title>Factors associated with attendance at screening for breast cancer: a systematic review and meta-analysis</article-title>. <source>BMJ Open</source>. (<year>2021</year>) <volume>11</volume>:<fpage>e046660</fpage>. doi: <pub-id pub-id-type="doi">10.1136/bmjopen-2020-046660</pub-id>, PMID: <pub-id pub-id-type="pmid">34848507</pub-id></citation></ref>
<ref id="ref10"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zamorano-Leon</surname><given-names>JJ</given-names></name> <name><surname>L&#x00F3;pez-de-Andres</surname><given-names>A</given-names></name> <name><surname>&#x00C1;lvarez-Gonz&#x00E1;lez</surname><given-names>A</given-names></name> <name><surname>Astasio-Arbiza</surname><given-names>P</given-names></name> <name><surname>L&#x00F3;pez-Farr&#x00E9;</surname><given-names>AJ</given-names></name> <name><surname>de Miguel-Diez</surname><given-names>J</given-names></name> <etal/></person-group>. <article-title>Reduction from 2011 to 2017 in adherence to breast cancer screening and non-improvement in the uptake of cervical cancer screening among women living in Spain</article-title>. <source>Maturitas</source>. (<year>2020</year>) <volume>135</volume>:<fpage>27</fpage>&#x2013;<lpage>33</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.maturitas.2020.02.007</pub-id></citation></ref>
<ref id="ref11"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serral</surname><given-names>G</given-names></name> <name><surname>Borrell</surname><given-names>C</given-names></name> <name><surname>Puigpinos</surname><given-names>IRR</given-names></name></person-group>. <article-title>Socioeconomic inequalities in mammography screening in Spanish women aged 45 to 69</article-title>. <source>Gac Sanit</source>. (<year>2018</year>) <volume>32</volume>:<fpage>61</fpage>&#x2013;<lpage>7</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gaceta.2016.12.010</pub-id>, PMID: <pub-id pub-id-type="pmid">28274621</pub-id></citation></ref>
<ref id="ref12"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mart&#x00ED;n-L&#x00F3;pez</surname><given-names>R</given-names></name> <name><surname>Jim&#x00E9;nez-Garc&#x00ED;a</surname><given-names>R</given-names></name> <name><surname>Lopez-de-Andres</surname><given-names>A</given-names></name> <name><surname>Hern&#x00E1;ndez-Barrera</surname><given-names>V</given-names></name> <name><surname>Jim&#x00E9;nez-Trujillo</surname><given-names>I</given-names></name> <name><surname>Gil-de-Miguel</surname><given-names>A</given-names></name> <etal/></person-group>. <article-title>Inequalities in uptake of breast cancer screening in Spain: analysis of a cross-sectional national survey</article-title>. <source>Public Health</source>. (<year>2013</year>) <volume>127</volume>:<fpage>822</fpage>&#x2013;<lpage>7</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.puhe.2013.03.006</pub-id>, PMID: <pub-id pub-id-type="pmid">23773337</pub-id></citation></ref>
<ref id="ref13"><label>13.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>del Castillo</surname><given-names>JR</given-names></name></person-group>. <comment>Descentralizaci&#x00F3;n de los servicios sanitarios. Aspectos generales y an&#x00E1;lisis del caso espanol. P&#x00FA;blica EAdS, editor: Copartgraf S.C.A. Granada: Escuela Andaluza de Salud P&#x00FA;blica</comment> (<year>1998</year>).</citation></ref>
<ref id="ref14"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Md</surname><given-names>S</given-names></name></person-group>. <article-title>N&#x00FA;mero de profesionales de la medicina que trabajan en el Sistema Nacional de Salud (SNS) en Atenci&#x00F3;n Primaria, Atenci&#x00F3;n Hospitalaria, Servicios de urgencias y emergencias (112/061) y Especialistas en formaci&#x00F3;n seg&#x00FA;n comunidad aut&#x00F3;noma</article-title>. <source>Sanidad en un vistazo</source>. (<year>2021</year>). Available from: <ext-link xlink:href="https://www.sanidad.gob.es/estadEstudios/sanidadDatos/tablas/tabla13.htm" ext-link-type="uri">https://www.sanidad.gob.es/estadEstudios/sanidadDatos/tablas/tabla13.htm</ext-link></citation></ref>
<ref id="ref15"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crenshaw</surname><given-names>K</given-names></name></person-group>. <article-title>Mapping the margins: intersectionality, identity politics, and violence against women of color</article-title>. <source>Stanford Law Rev</source>. (<year>1991</year>) <volume>43</volume>:<fpage>1241</fpage>. doi: <pub-id pub-id-type="doi">10.2307/1229039</pub-id></citation></ref>
<ref id="ref16"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowleg</surname><given-names>L</given-names></name></person-group>. <article-title>The problem with the phrase women and minorities: intersectionality-an important theoretical framework for public health</article-title>. <source>Am J Public Health</source>. (<year>2012</year>) <volume>102</volume>:<fpage>1267</fpage>&#x2013;<lpage>73</lpage>. doi: <pub-id pub-id-type="doi">10.2105/AJPH.2012.300750</pub-id>, PMID: <pub-id pub-id-type="pmid">22594719</pub-id></citation></ref>
<ref id="ref17"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spierings</surname><given-names>N</given-names></name></person-group>. <article-title>The inclusion of quantitative techniques and diversity in the mainstream of feminist research</article-title>. <source>Eur J Women's Stud</source>. (<year>2012</year>) <volume>19</volume>:<fpage>331</fpage>&#x2013;<lpage>47</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1350506812443621</pub-id></citation></ref>
<ref id="ref18"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bauer</surname><given-names>GR</given-names></name> <name><surname>Churchill</surname><given-names>SM</given-names></name> <name><surname>Mahendran</surname><given-names>M</given-names></name> <name><surname>Walwyn</surname><given-names>C</given-names></name> <name><surname>Lizotte</surname><given-names>D</given-names></name> <name><surname>Villa-Rueda</surname><given-names>AA</given-names></name></person-group>. <article-title>Intersectionality in quantitative research: a systematic review of its emergence and applications of theory and methods</article-title>. <source>SSM Popul Health</source>. (<year>2021</year>) <volume>14</volume>:<fpage>100798</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ssmph.2021.100798</pub-id>, PMID: <pub-id pub-id-type="pmid">33997247</pub-id></citation></ref>
<ref id="ref19"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cairney</surname><given-names>J</given-names></name> <name><surname>Veldhuizen</surname><given-names>S</given-names></name> <name><surname>Vigod</surname><given-names>S</given-names></name> <name><surname>Streiner</surname><given-names>DL</given-names></name> <name><surname>Wade</surname><given-names>TJ</given-names></name> <name><surname>Kurdyak</surname><given-names>P</given-names></name></person-group>. <article-title>Exploring the social determinants of mental health service use using intersectionality theory and CART analysis</article-title>. <source>J Epidemiol Community Health</source>. (<year>2014</year>) <volume>68</volume>:<fpage>145</fpage>&#x2013;<lpage>50</lpage>. doi: <pub-id pub-id-type="doi">10.1136/jech-2013-203120</pub-id>, PMID: <pub-id pub-id-type="pmid">24098046</pub-id></citation></ref>
<ref id="ref20"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hothorn</surname><given-names>T</given-names></name> <name><surname>Hornik</surname><given-names>K</given-names></name> <name><surname>Zeileis</surname><given-names>A</given-names></name></person-group>. <article-title>Unbiased recursive partitioning: a conditional inference framework</article-title>. <source>J Comput Graph Stat</source>. (<year>2006</year>) <volume>15</volume>:<fpage>651</fpage>&#x2013;<lpage>74</lpage>. doi: <pub-id pub-id-type="doi">10.1198/106186006X133933</pub-id></citation></ref>
<ref id="ref21"><label>21.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Breiman</surname><given-names>L</given-names></name> <name><surname>Friedman</surname><given-names>JH</given-names></name> <name><surname>Olshen</surname><given-names>RA</given-names></name> <name><surname>Stone</surname><given-names>CJ</given-names></name></person-group>. <source>Classification and regression trees</source>. <publisher-loc>Belmont</publisher-loc>: <publisher-name>Wadsworth</publisher-name>. (<year>1984</year>). <fpage>1</fpage>&#x2013;<lpage>358</lpage> p.</citation></ref>
<ref id="ref22"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mena</surname><given-names>E</given-names></name> <name><surname>Bolte</surname><given-names>G</given-names></name></person-group>. <article-title>Advance gender study g. Classification tree analysis for an intersectionality-informed identification of population groups with non-daily vegetable intake</article-title>. <source>BMC Public Health</source>. (<year>2021</year>) <volume>21</volume>:<fpage>2007</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12889-021-12043-6</pub-id></citation></ref>
<ref id="ref23"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eagle</surname><given-names>SR</given-names></name> <name><surname>Brent</surname><given-names>D</given-names></name> <name><surname>Covassin</surname><given-names>T</given-names></name> <name><surname>Elbin</surname><given-names>RJ</given-names></name> <name><surname>Wallace</surname><given-names>J</given-names></name> <name><surname>Ortega</surname><given-names>J</given-names></name> <etal/></person-group>. <article-title>Exploration of race and ethnicity, sex, sport-related concussion, depression history, and suicide attempts in US youth</article-title>. <source>JAMA Netw Open</source>. (<year>2022</year>) <volume>5</volume>:<fpage>e2219934</fpage>. doi: <pub-id pub-id-type="doi">10.1001/jamanetworkopen.2022.19934</pub-id>, PMID: <pub-id pub-id-type="pmid">35796154</pub-id></citation></ref>
<ref id="ref24"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delgado-Gallegos</surname><given-names>JL</given-names></name> <name><surname>Aviles-Rodriguez</surname><given-names>G</given-names></name> <name><surname>Padilla-Rivas</surname><given-names>GR</given-names></name> <name><surname>De Los Angeles Cosio-Leon</surname><given-names>M</given-names></name> <name><surname>Franco-Villareal</surname><given-names>H</given-names></name> <name><surname>Nieto-Hipolito</surname><given-names>JI</given-names></name> <etal/></person-group>. <article-title>Application of C5.0 algorithm for the assessment of perceived stress in healthcare professionals attending COVID-19</article-title>. <source>Brain Sci</source>. (<year>2023</year>) <volume>13</volume>:<fpage>513</fpage>. doi: <pub-id pub-id-type="doi">10.3390/brainsci13030513</pub-id>, PMID: <pub-id pub-id-type="pmid">36979323</pub-id></citation></ref>
<ref id="ref25"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freitas</surname><given-names>C</given-names></name> <name><surname>Tura</surname><given-names>LF</given-names></name> <name><surname>Costa</surname><given-names>N</given-names></name> <name><surname>Duarte</surname><given-names>J</given-names></name></person-group>. <article-title>A population-based breast cancer screening programme: conducting a comprehensive survey to explore adherence determinants</article-title>. <source>Eur J Cancer Care</source>. (<year>2012</year>) <volume>21</volume>:<fpage>349</fpage>&#x2013;<lpage>59</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2354.2011.01305.x</pub-id></citation></ref>
<ref id="ref26"><label>26.</label><citation citation-type="book"><person-group person-group-type="author"><collab id="coll7">European Commission</collab></person-group>. <source>Regulation (EC) No 1338/2008 of the European Parliament and of the Council of 16 December 2008 on Community statistics on public health and health and safety at work</source>. <publisher-loc>Brussels</publisher-loc>: <publisher-name>European Commission</publisher-name> (<year>2008</year>).</citation></ref>
<ref id="ref27"><label>27.</label><citation citation-type="book"><person-group person-group-type="author"><collab id="coll8">European Commission</collab></person-group>. <source>Commission Regulation (EU) 2018/255 of 19 February 2018 implementing Regulation (EC) No 1338/2008 of the European Parliament and of the Council as regards statistics based on the European Health Interview Survey (EHIS)</source>. <publisher-loc>Brussels</publisher-loc>: <publisher-name>European Commission</publisher-name> (<year>2018</year>).</citation></ref>
<ref id="ref28"><label>28.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Oliver</surname><given-names>S</given-names></name> <name><surname>Kavanagh</surname><given-names>J</given-names></name> <name><surname>Caird</surname><given-names>J</given-names></name> <name><surname>Lorenc</surname><given-names>T</given-names></name> <name><surname>Oliver</surname><given-names>K</given-names></name> <name><surname>Harden</surname><given-names>A</given-names></name> <etal/></person-group>. <source>Health promotion, inequalities and young people's health: a systematic review of research</source>. <publisher-loc>London</publisher-loc>: <publisher-name>EPPI-Centre</publisher-name> (<year>2008</year>).</citation></ref>
<ref id="ref29"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>O'Neill</surname><given-names>J</given-names></name> <name><surname>Tabish</surname><given-names>H</given-names></name> <name><surname>Welch</surname><given-names>V</given-names></name> <name><surname>Petticrew</surname><given-names>M</given-names></name> <name><surname>Pottie</surname><given-names>K</given-names></name> <name><surname>Clarke</surname><given-names>M</given-names></name> <etal/></person-group>. <article-title>Applying an equity lens to interventions: using PROGRESS ensures consideration of socially stratifying factors to illuminate inequities in health</article-title>. <source>J Clin Epidemiol</source>. (<year>2014</year>) <volume>67</volume>:<fpage>56</fpage>&#x2013;<lpage>64</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.jclinepi.2013.08.005</pub-id>, PMID: <pub-id pub-id-type="pmid">24189091</pub-id></citation></ref>
<ref id="ref30"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coetzee</surname><given-names>M</given-names></name> <name><surname>Clifford</surname><given-names>AM</given-names></name> <name><surname>Jordaan</surname><given-names>JD</given-names></name> <name><surname>Louw</surname><given-names>QA</given-names></name></person-group>. <article-title>Global profile of individuals undergoing total knee replacement through the PROGRESS-PLUS equity lens: protocol for a systematic review</article-title>. <source>S Afr J Physiother</source>. (<year>2022</year>) <volume>78</volume>:<fpage>1649</fpage>. doi: <pub-id pub-id-type="doi">10.4102/sajp.v78i1.1649</pub-id>, PMID: <pub-id pub-id-type="pmid">35547962</pub-id></citation></ref>
<ref id="ref31"><label>31.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll9">INE</collab></person-group>. Clasificaci&#x00F3;n Nacional de Educaci&#x00F3;n 2014 (CNED-2014). (<year>2016</year>). <comment>Available at:</comment> <ext-link xlink:href="https://www.ine.es/daco/daco42/clasificaciones/cned14/CNED2014_capitulo0.pdf" ext-link-type="uri">https://www.ine.es/daco/daco42/clasificaciones/cned14/CNED2014_capitulo0.pdf</ext-link>.</citation></ref>
<ref id="ref32"><label>32.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Dardet</surname><given-names>Carlos &#x00C1;lvarez</given-names></name> <name><surname>Alonso</surname><given-names>Jordi</given-names></name> <name><surname>Domingo</surname><given-names>Antonia</given-names></name> <name><surname>Regidor</surname><given-names>E</given-names></name></person-group>. <source>La Medici&#x00F3;n de la clase social en ciencias de la salud: informe de un Grupo de Trabajo de la Sociedad Epa&#x00F1;ola de Epidemiolog&#x00ED;a</source>. <publisher-loc>SG Editores y SEE</publisher-loc> (<year>1995</year>).</citation></ref>
<ref id="ref33"><label>33.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Oliver</surname><given-names>S</given-names></name> <name><surname>Dickson</surname><given-names>K</given-names></name> <name><surname>Newman</surname><given-names>M</given-names></name></person-group>. <article-title>Getting started with a review</article-title>. In: Gough D, Oliver S, Thomas J, editors. <source>An introduction to systematic reviews</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>SAGE Publications</publisher-name> (<year>2012</year>).</citation></ref>
<ref id="ref34"><label>34.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robine</surname><given-names>JM</given-names></name> <name><surname>Jagger</surname><given-names>C</given-names></name></person-group>. <article-title>Creating a coherent set of indicators to monitor health across Europe: the Euro-REVES 2 project</article-title>. <source>Eur J Pub Health</source>. (<year>2003</year>) <volume>13</volume>:<fpage>6</fpage>&#x2013;<lpage>14</lpage>. doi: <pub-id pub-id-type="doi">10.1093/eurpub/13.suppl_1.6</pub-id>, PMID: <pub-id pub-id-type="pmid">14533742</pub-id></citation></ref>
<ref id="ref35"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flaxman</surname><given-names>AD</given-names></name> <name><surname>Vos</surname><given-names>T</given-names></name></person-group>. <article-title>Machine learning in population health: opportunities and threats</article-title>. <source>PLoS Med</source>. (<year>2018</year>) <volume>15</volume>:<fpage>e1002702</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pmed.1002702</pub-id>, PMID: <pub-id pub-id-type="pmid">30481173</pub-id></citation></ref>
<ref id="ref36"><label>36.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Venkatasubramaniam</surname><given-names>A</given-names></name> <name><surname>Wolfson</surname><given-names>J</given-names></name> <name><surname>Mitchell</surname><given-names>N</given-names></name> <name><surname>Barnes</surname><given-names>T</given-names></name> <name><surname>Jaka</surname><given-names>M</given-names></name> <name><surname>French</surname><given-names>S</given-names></name></person-group>. <article-title>Decision trees in epidemiological research</article-title>. <source>Emerg Themes Epidemiol</source>. (<year>2017</year>) <volume>14</volume>:<fpage>11</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12982-017-0064-4</pub-id>, PMID: <pub-id pub-id-type="pmid">28943885</pub-id></citation></ref>
<ref id="ref37"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kass</surname><given-names>GV</given-names></name></person-group>. <article-title>An exploratory techinque for investigating large quantities of categorical data</article-title>. <source>J R Stat Soc</source>. (<year>1980</year>) <volume>29</volume>:<fpage>119</fpage>&#x2013;<lpage>27</lpage>.</citation></ref>
<ref id="ref38"><label>38.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Quinlan</surname><given-names>JR</given-names></name></person-group>. <source>C4.5: programs for machine learning</source>. <publisher-loc>San Mateo</publisher-loc>: <publisher-name>Morgan Kaufmann Publishers Inc.</publisher-name> (<year>1993</year>).</citation></ref>
<ref id="ref39"><label>39.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freund</surname><given-names>Y</given-names></name></person-group>. <article-title>A decision-theoretic generalization of on-line learning and an application to boosting</article-title>. <source>J Comput Syst Sci</source>. (<year>1996</year>) <volume>55</volume>:<fpage>119</fpage>&#x2013;<lpage>39</lpage>. doi: <pub-id pub-id-type="doi">10.1006/jcss.1997.1504</pub-id></citation></ref>
<ref id="ref40"><label>40.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Max Kuhn</surname><given-names>SW</given-names></name> <name><surname>Culp</surname><given-names>Mark</given-names></name> <name><surname>Coulter</surname><given-names>Nathan</given-names></name> <name><surname>Quinlan</surname><given-names>Ross</given-names></name></person-group>. C50: C5.0 decision trees and rule-based models. (<year>2023</year>). <comment>Available at:</comment> <ext-link xlink:href="https://rdrr.io/cran/C50/" ext-link-type="uri">https://rdrr.io/cran/C50/</ext-link>.</citation></ref>
<ref id="ref41"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Littlestone</surname><given-names>N</given-names></name></person-group>. <article-title>Learning quickly when irrelevant attributes abound: a new linear-threshold algorithm</article-title>. <source>Mach Learn</source>. (<year>1988</year>) <volume>2</volume>:<fpage>285</fpage>&#x2013;<lpage>318</lpage>. doi: <pub-id pub-id-type="doi">10.1007/BF00116827</pub-id></citation></ref>
<ref id="ref42"><label>42.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhn</surname><given-names>M</given-names></name></person-group>. <article-title>Building predictive models in R using the caret package</article-title>. <source>J Stat Softw</source>. (<year>2008</year>) <volume>5</volume>:<fpage>1</fpage>&#x2013;<lpage>26</lpage>. doi: <pub-id pub-id-type="doi">10.18637/jss.v028.i05</pub-id></citation></ref>
<ref id="ref43"><label>43.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Therneau</surname><given-names>TM</given-names></name> <name><surname>Atkinson</surname><given-names>B</given-names></name> <name><surname>Ripley</surname><given-names>BD</given-names></name></person-group>. <source>Recursive partitioning and regression trees</source>. (<year>2015</year>).</citation></ref>
<ref id="ref44"><label>44.</label><citation citation-type="other"><person-group person-group-type="author"><collab id="coll10">The FoRt Student Project Team H, T</collab></person-group>. CHAID: CHi-squared automated interaction detection. R package version 0.1&#x2013;2. (<year>2009</year>). <comment>Available at:</comment> <ext-link xlink:href="https://rdrr.io/rforge/CHAID/" ext-link-type="uri">https://rdrr.io/rforge/CHAID/</ext-link>.</citation></ref>
<ref id="ref45"><label>45.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stehman</surname><given-names>SV</given-names></name></person-group>. <article-title>Selecting and interpreting measures of thematic classification accuracy</article-title>. <source>Remote Sens Environ</source>. (<year>1997</year>) <volume>62</volume>:<fpage>77</fpage>&#x2013;<lpage>89</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0034-4257(97)00083-7</pub-id></citation></ref>
<ref id="ref46"><label>46.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname><given-names>L</given-names></name></person-group>. <article-title>Random forests</article-title>. <source>Mach Learn</source>. (<year>2001</year>) <volume>45</volume>:<fpage>5</fpage>&#x2013;<lpage>32</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id></citation></ref>
<ref id="ref47"><label>47.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amjad</surname><given-names>M</given-names></name> <name><surname>Ahmad</surname><given-names>I</given-names></name> <name><surname>Ahmad</surname><given-names>M</given-names></name> <name><surname>Wr&#x00F3;blewski</surname><given-names>P</given-names></name> <name><surname>Kami&#x0144;ski</surname><given-names>P</given-names></name> <name><surname>Amjad</surname><given-names>U</given-names></name></person-group>. <article-title>Prediction of pile bearing capacity using XGBoost algorithm: modeling and performance evaluation</article-title>. <source>Appl Sci</source>. (<year>2022</year>) <volume>12</volume>:<fpage>2126</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app12042126</pub-id></citation></ref>
<ref id="ref48"><label>48.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>Z</given-names></name> <name><surname>Wang</surname><given-names>ZA</given-names></name></person-group>. <article-title>Risk prediction model for type 2 diabetes based on weighted feature selection of random forest and xgboost ensemble classifier</article-title>. <conf-name>Proceedings of the 2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI)</conf-name>; <conf-loc>Guilin, China</conf-loc>. (<year>2019</year>).</citation></ref>
<ref id="ref49"><label>49.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>T</given-names></name> <name><surname>Guestrin</surname><given-names>C</given-names></name></person-group>. <article-title>Scalable tree boosting system proceedings of the 22nd ACM SIGKDD.</article-title> In: <source>Conference on Knowledge Discovery and Data Mining</source> <publisher-loc>San Francisco, California</publisher-loc>: <publisher-name>ACM</publisher-name> (<year>2016</year>).</citation></ref>
<ref id="ref50"><label>50.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sung</surname><given-names>SF</given-names></name> <name><surname>Hung</surname><given-names>LC</given-names></name> <name><surname>Hu</surname><given-names>YH</given-names></name></person-group>. <article-title>Developing a stroke alert trigger for clinical decision support at emergency triage using machine learning</article-title>. <source>Int J Med Inform</source>. (<year>2021</year>) <volume>152</volume>:<fpage>104505</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ijmedinf.2021.104505</pub-id>, PMID: <pub-id pub-id-type="pmid">34030088</pub-id></citation></ref>
<ref id="ref51"><label>51.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leo</surname><given-names>J</given-names></name> <name><surname>Luhanga</surname><given-names>E</given-names></name> <name><surname>Michael</surname><given-names>K</given-names></name></person-group>. <article-title>Machine learning model for imbalanced cholera dataset in Tanzania</article-title>. <source>Sci World J</source>. (<year>2019</year>) <volume>2019</volume>:<fpage>1</fpage>&#x2013;<lpage>12</lpage>. doi: <pub-id pub-id-type="doi">10.1155/2019/9397578</pub-id>, PMID: <pub-id pub-id-type="pmid">31427903</pub-id></citation></ref>
<ref id="ref52"><label>52.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishaq</surname><given-names>A</given-names></name> <name><surname>Sadiq</surname><given-names>S</given-names></name> <name><surname>Umer</surname><given-names>M</given-names></name> <name><surname>Ullah</surname><given-names>S</given-names></name> <name><surname>Mirjalili</surname><given-names>S</given-names></name> <name><surname>Rupapara</surname><given-names>V</given-names></name> <etal/></person-group>. <article-title>Improving the prediction of heart failure Patients' survival using SMOTE and effective data mining techniques</article-title>. <source>IEEE Access</source>. (<year>2021</year>) <volume>9</volume>:<fpage>39707</fpage>&#x2013;<lpage>16</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ACCESS.2021.3064084</pub-id></citation></ref>
<ref id="ref53"><label>53.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carmona-Torres</surname><given-names>JM</given-names></name> <name><surname>Cobo-Cuenca</surname><given-names>AI</given-names></name> <name><surname>Mart&#x00ED;n-Espinosa</surname><given-names>NM</given-names></name> <name><surname>Piriz-Campos</surname><given-names>RM</given-names></name> <name><surname>Laredo-Aguilera</surname><given-names>JA</given-names></name> <name><surname>Rodr&#x00ED;guez-Borrego</surname><given-names>MA</given-names></name></person-group>. <article-title>Prevalence in the performance of mammographies in Spain: analysis by communities 2006-2014 and influencing factors</article-title>. <source>Aten Primaria</source>. (<year>2018</year>) <volume>50</volume>:<fpage>228</fpage>&#x2013;<lpage>37</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.aprim.2017.03.007</pub-id>, PMID: <pub-id pub-id-type="pmid">28732722</pub-id></citation></ref>
<ref id="ref54"><label>54.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Padela</surname><given-names>AI</given-names></name> <name><surname>Murrar</surname><given-names>S</given-names></name> <name><surname>Adviento</surname><given-names>B</given-names></name> <name><surname>Liao</surname><given-names>C</given-names></name> <name><surname>Hosseinian</surname><given-names>Z</given-names></name> <name><surname>Peek</surname><given-names>M</given-names></name> <etal/></person-group>. <article-title>Associations between religion-related factors and breast cancer screening among American Muslims</article-title>. <source>J Immigr Minor Health</source>. (<year>2015</year>) <volume>17</volume>:<fpage>660</fpage>&#x2013;<lpage>9</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10903-014-0014-y</pub-id>, PMID: <pub-id pub-id-type="pmid">24700026</pub-id></citation></ref>
<ref id="ref55"><label>55.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pons-Rodriguez</surname><given-names>A</given-names></name> <name><surname>Martinez-Alonso</surname><given-names>M</given-names></name> <name><surname>Perestelo-Perez</surname><given-names>L</given-names></name> <name><surname>Garcia</surname><given-names>M</given-names></name> <name><surname>Sala</surname><given-names>M</given-names></name> <name><surname>Rue</surname><given-names>M</given-names></name> <etal/></person-group>. <article-title>Informed choice in breast cancer screening: the role of education</article-title>. <source>Gac Sanit</source>. (<year>2020</year>) <volume>35</volume>:<fpage>243</fpage>&#x2013;<lpage>9</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gaceta.2020.01.002</pub-id>, PMID: <pub-id pub-id-type="pmid">32173050</pub-id></citation></ref>
<ref id="ref56"><label>56.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pons-Vigu&#x00E9;s</surname><given-names>M</given-names></name> <name><surname>Puigpin&#x00F3;s-Riera</surname><given-names>R</given-names></name> <name><surname>Rodr&#x00ED;guez-Sanz</surname><given-names>M</given-names></name> <name><surname>Serral</surname><given-names>G</given-names></name> <name><surname>Pal&#x00E8;ncia</surname><given-names>L</given-names></name> <name><surname>Borrell</surname><given-names>C</given-names></name></person-group>. <article-title>Preventive control of breast and cervical cancer in immigrant and native women in Spain: the role of country of origin and social class</article-title>. <source>Int J Health Serv</source>. (<year>2011</year>) <volume>41</volume>:<fpage>483</fpage>&#x2013;<lpage>99</lpage>. doi: <pub-id pub-id-type="doi">10.2190/HS.41.3.e</pub-id></citation></ref>
<ref id="ref57"><label>57.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rodr&#x00ED;guez-Sal&#x00E9;s</surname><given-names>V</given-names></name> <name><surname>Ortiz-Barreda</surname><given-names>G</given-names></name> <name><surname>Sanjos&#x00E9;</surname><given-names>SD</given-names></name></person-group>. <article-title>Revisi&#x00F3;n bibliogr&#x00E1;fica sobre la prevenci&#x00F3;n del c&#x00E1;ncer en personas inmigrantes residentes en Espa&#x00F1;a</article-title>. <source>Rev Esp Salud Publica</source>. (<year>2014</year>) <volume>88</volume>:<fpage>735</fpage>&#x2013;<lpage>43</lpage>. doi: <pub-id pub-id-type="doi">10.4321/S1135-57272014000600006</pub-id>, PMID: <pub-id pub-id-type="pmid">25418564</pub-id></citation></ref>
<ref id="ref58"><label>58.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Molina-Barcel&#x00F3;</surname><given-names>A</given-names></name> <name><surname>Moreno Salas</surname><given-names>J</given-names></name> <name><surname>Peir&#x00F3;-P&#x00E9;rez</surname><given-names>R</given-names></name> <name><surname>Arroyo</surname><given-names>G</given-names></name> <name><surname>Ib&#x00E1;&#x00F1;ez Cabanell</surname><given-names>J</given-names></name> <name><surname>Vanaclocha Esp&#x00ED;</surname><given-names>M</given-names></name> <etal/></person-group>. <article-title>Inequalities in access to cancer screening programmes in Spain and how to reduce them: data from 2013 and 2020</article-title>. <source>Rev Esp Salud Publica</source>. (<year>2021</year>) <volume>95</volume>:<fpage>e202101017</fpage></citation></ref>
<ref id="ref59"><label>59.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>El d&#x00ED;a</surname></name></person-group>. <source>Pacientes con c&#x00E1;ncer de mama denuncian la desigualdad de trato entre comunidades aut&#x00F3;nomas</source>. <publisher-loc>Madrid</publisher-loc>: <publisher-name>Europa Press</publisher-name> (<year>2017</year>).</citation></ref>
<ref id="ref60"><label>60.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borrell</surname><given-names>C</given-names></name> <name><surname>Rodr&#x00ED;guez-Sanz</surname><given-names>M</given-names></name> <name><surname>P&#x00E9;rez</surname><given-names>G</given-names></name> <name><surname>Garcia-Alt&#x00E9;s</surname><given-names>A</given-names></name></person-group>. <article-title>Las desigualdades sociales en salud en el Estado espa&#x00F1;ol</article-title>. <source>Atenci&#x00F3;n Primaria</source>. (<year>2008</year>) <volume>40</volume>:<fpage>59</fpage>&#x2013;<lpage>60</lpage>. doi: <pub-id pub-id-type="doi">10.1157/13116147</pub-id>, PMID: <pub-id pub-id-type="pmid">18358156</pub-id></citation></ref>
<ref id="ref61"><label>61.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>P&#x00E9;rez</surname><given-names>G</given-names></name> <name><surname>Cirera</surname><given-names>E</given-names></name> <name><surname>Rodr&#x00ED;guez-Sanz</surname><given-names>M</given-names></name> <name><surname>Borrell</surname><given-names>C</given-names></name></person-group>. <source>Desigualdades de mortalidad en las comunidades aut&#x00F3;nomas entre 1981 y 2002</source>. <publisher-loc>Madrid</publisher-loc>: <publisher-name>Observatorios de salud de la mujer y del SNS Agencia de Calidad Ministerio de Sanidad y Consumo</publisher-name> (<year>2005</year>).</citation></ref>
<ref id="ref62"><label>62.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Graham</surname><given-names>HD</given-names></name></person-group>. <article-title>Race, language, and social policy: comparing the black and Hispanic experience in the U.S</article-title>. <source>Popul Environ</source>. (<year>1990</year>) <volume>12</volume>:<fpage>43</fpage>&#x2013;<lpage>58</lpage>. doi: <pub-id pub-id-type="doi">10.1007/BF01378551</pub-id></citation></ref>
</ref-list>
</back>
</article>