- 1School of Pharmaceutical Business, Zhejiang Pharmaceutical University, Ningbo, Zhejiang, China
- 2Taizhou Vocation College of Science & Technology, School of Accounting & Finance, Taizhou, Zhejiang, China
- 3Huaqiao University, Xiamen, Fujian Province, China
Introduction: This study proposes a novel Transformer-based approach to enhance talent attraction and retention strategies in rural public health systems. Motivated by the persistent shortage of skilled professionals in underserved areas and the limitations of traditional recruitment methods, we leverage big data analytics and natural language processing to address workforce distribution imbalances.
Methods: By analyzing diverse data sources such as social media, surveys, and job satisfaction reports, the Transformer model identifies complex, context-specific factors influencing candidate preferences, including career advancement opportunities, lifestyle alignment, and community engagement.
Results: Our framework offers a personalized, data-driven mechanism to match healthcare professionals with rural roles effectively. Experimental results demonstrate significant improvements in recruitment precision and retention forecasting.
Discussion: This work contributes a scalable and adaptive solution to rural healthcare workforce challenges, offering valuable insights for policy-makers and public health organizations aiming to revitalize rural health services.
1 Introduction
The attraction and retention of skilled healthcare professionals in rural areas remain pressing issues in public health, impacting the quality and accessibility of medical services in these underserved regions (1). Rural public health entrepreneurship, aimed at innovative healthcare delivery and community-based solutions, has the potential to address these challenges but requires a sustainable influx of talent (2). Traditional talent attraction methods, including incentives like financial support or community integration programs, often fall short due to limited rural resources and the unique demands of rural healthcare (3). Not only are these methods resource-intensive, but they also lack adaptability to the diverse motivations of healthcare professionals (4). Advances in AI-driven solutions, especially Transformer-based models, open new avenues for targeted and efficient talent attraction strategies, as these models can analyze large volumes of data to predict, personalize, and optimize candidate engagement efforts, enhancing the overall effectiveness of rural public health initiatives (5).
Attracting and retaining skilled healthcare professionals in rural areas remains a persistent and multifaceted challenge in public health, with direct consequences for healthcare accessibility, service quality, and health equity in underserved regions. Despite policy efforts and incentive programs, rural areas often experience high turnover rates and difficulty in maintaining a stable workforce. This issue stems not only from geographic and resource limitations but also from deeper, less visible factors such as professional isolation, limited career development pathways, and misalignment between individual values and the realities of rural practice. These conditions highlight the urgent need for innovative, evidence-based approaches to talent attraction that go beyond traditional recruitment methods. Our study addresses this need by introducing an AI-driven framework, grounded in Transformer models, that leverages large-scale, unstructured data to capture nuanced candidate preferences and community needs. This empirical focus situates our work at the intersection of public health policy and technological innovation, aiming to deliver practical insights for more effective and context-sensitive workforce strategies in rural healthcare systems.
In early approaches to talent attraction, methods were largely knowledge-based, relying on symbolic AI and expert-driven frameworks for decision-making. Systems were designed to emulate human decision-making in assessing candidate fit, often through rule-based filtering that mapped specific candidate qualifications to job requirements (6). While this method enabled systematic matching, it was rigid, requiring continuous manual updates to remain relevant to changing candidate expectations and the dynamic demands of rural healthcare (7). Furthermore, symbolic AI models struggled to adapt to nuanced factors influencing a candidates decision, such as individual career goals or personal motivations (8). Consequently, while useful as an initial step, knowledge-based methods fell short in achieving nuanced engagement, limiting their efficacy in long-term talent attraction (9).
The advent of data-driven and machine learning methods marked the next significant step in talent attraction strategies (10). These approaches leveraged large datasets on candidate backgrounds, job performance metrics, and industry trends to refine talent matching processes (11). By using algorithms that learned from historical data, these models could predict the likelihood of candidate retention based on various personal and professional variables, allowing for more tailored recruitment (12). Although this approach improved adaptability compared to symbolic AI, it was heavily dependent on quality data availability and lacked interpretability in its predictions. Machine learning models, while more flexible, were limited in capturing the complex interplay of factors influencing rural career choices, such as community integration or lifestyle preferences, thus often resulting in suboptimal matches in the context of rural public health (13).
In recent years, deep learning models, particularly Transformer-based pre-trained language models, have shown exceptional potential in addressing the complex requirements of talent attraction for rural healthcare entrepreneurship (14). Unlike traditional machine learning methods, Transformers can process and interpret unstructured data, such as candidate social media profiles, feedback, and personal statements, to derive richer insights into individual motivations and career aspirations (15). This allows for a more holistic view of candidates, facilitating highly personalized engagement strategies that align with both professional competencies and personal values conducive to rural work (16). Additionally, Transformers excel at handling vast datasets and contextually analyzing language, which is beneficial in understanding subtle differences in candidate backgrounds and needs (17). Despite these advantages, however, Transformers still face challenges, including computational demands and the need for extensive data preprocessing, which can hinder their practical deployment in resource-constrained rural settings (18).
Based on the limitations of previous methods such as the rigidity of symbolic AI, data dependence in machine learning, and computational demands of deep learning this research proposes a novel Transformer-driven strategy optimized for talent attraction in rural public health entrepreneurship. By integrating Transformer models with a specialized, context-aware dataset on rural healthcare needs and candidate profiles, this approach aims to bridge the gap between candidate motivations and job demands, enhancing long-term retention and engagement.
The proposed Transformer-driven approach offers distinct advantages in this domain:
• It introduces a contextually aware recommendation module that tailors candidate engagement based on unstructured data, such as personal interests related to rural healthcare.
• The method enhances efficiency and scalability by applying pretrained models fine-tuned to rural public health needs, allowing adaptation to various rural healthcare scenarios.
• Experimental results demonstrate improved candidate alignment with rural roles, showing a 20
2 Related work
2.1 Transformer models in human resource management
The use of Transformer models in human resource management has emerged as a powerful technique for analyzing large datasets and extracting meaningful patterns that guide strategic decision-making (19). Transformer architectures, initially developed for natural language processing tasks, are particularly effective in understanding complex relationships within high-dimensional data, such as that found in talent management. In human resources, this technology can process unstructured data, including candidate profiles, job descriptions, and historical hiring records, to identify suitable talent more accurately and efficiently than traditional methods. By harnessing self-attention mechanisms, Transformers capture nuanced dependencies between skills, qualifications, and job requirements, enabling a more tailored talent attraction strategy. This approach is especially valuable in rural public health, where attracting professionals is often hindered by geographic and economic barriers (20). In the context of rural health entrepreneurship, Transformer models can analyze demographic and employment trends to predict areas where talent shortages may become critical. Additionally, these models can integrate with real-time social data, such as professional networking platforms, to identify individuals open to relocating to rural areas. As a result, the adoption of Transformer-driven strategies allows public health organizations to proactively reach out to qualified candidates rather than waiting for applications, which is particularly beneficial in rural settings where talent pools are limited. The emphasis on scalable machine learning further allows smaller public health entities to deploy advanced analytics without substantial infrastructure investments. Leveraging cloud-based implementations of Transformers enhances access to these capabilities, broadening the reach of rural health initiatives and facilitating data-driven decision-making in hiring and retention strategies (21). This shift in HR from reactive to proactive talent management underscores the transformative potential of Transformer models in bridging the talent gap in underserved regions (22). Furthermore, Transformer models enable the personalization of job offers and recruitment pitches. By evaluating individual candidate preferences and priorities, such as work-life balance, community engagement, or career growth, these models help craft customized communications that resonate with candidates on a personal level. This personalized approach improves engagement rates and aligns the recruitment process with the values and expectations of potential hires, making rural positions more attractive (23). As rural public health demands professionals who are not only skilled but also committed to community-oriented work, this ability to target the right individuals with tailored messages is critical. Ultimately, applying Transformer-based analysis to human resource management can fundamentally reshape rural health entrepreneurship, fostering an environment where talent attraction and retention strategies are as sophisticated and dynamic as those of urban centers.
2.2 Talent attraction in rural public health
Attracting talent to rural public health sectors presents unique challenges, driven by geographical isolation, limited resources, and often lower compensation compared to urban positions. As rural areas face shortages in qualified health professionals, innovative strategies are necessary to bridge the talent gap and sustain essential health services. Effective talent attraction in this context goes beyond conventional recruitment methods, requiring a deep understanding of candidate motivations and rural community needs. Research indicates that healthcare professionals attracted to rural areas are often motivated by factors such as community impact, professional autonomy, and lifestyle preferences. Therefore, a successful attraction strategy must leverage these insights to target individuals most likely to be interested in rural health work (24). One promising approach involves employing Transformer-driven analytics to identify and prioritize candidates who have expressed interest in public service or community health. Transformer models can process vast amounts of data from professional networks, academic publications, and social media, thereby identifying individuals with both the requisite skills and a demonstrated interest in rural or underserved communities. This data-driven approach enables recruiters to develop more strategic outreach efforts, increasing the likelihood of finding candidates who align with the mission and demands of rural public health entrepreneurship (25). Integrating local community values and needs into these models can further enhance the relevance of recruitment messages, presenting rural health roles not merely as jobs but as avenues for meaningful impact. Additionally, a more targeted recruitment strategy helps streamline the hiring process, reducing both time-to-hire and recruitment costs, which are critical considerations for resource-limited rural health organizations (26). The success of rural talent attraction is also contingent on creating support structures that facilitate the transition of health professionals into these communities. By using Transformer models to analyze historical data on factors such as retention rates and job satisfaction, organizations can identify the support systems that are most effective in fostering long-term commitment among new hires (27). Tailoring onboarding processes to address specific challenges associated with rural healthcare, such as limited access to specialized resources or professional isolation, enhances the likelihood of successful integration and retention. Furthermore, Transformer-driven insights into regional preferences and lifestyle trends allow for more persuasive marketing of rural positions to potential candidates. By combining advanced analytics with an understanding of rural health dynamics, this approach offers a comprehensive strategy for attracting, onboarding, and retaining talent in rural public health, ultimately contributing to improved healthcare outcomes for underserved populations (28).
2.3 Entrepreneurial opportunities in rural health
Rural public health provides a unique setting for entrepreneurial initiatives, particularly in the development of sustainable healthcare solutions that address the specific needs of these communities. Given the challenges in attracting and retaining healthcare professionals, rural health entrepreneurship often requires novel approaches to staffing, service delivery, and community engagement. Transformative technologies, including Transformer models, can play a pivotal role in identifying gaps in rural healthcare services and designing innovative solutions that leverage available resources effectively. These technologies enable a granular analysis of community health data, revealing potential areas for service expansion or specialization, such as telemedicine or mobile health clinics. By utilizing Transformer-driven analytics, rural health entrepreneurs can assess community demand, predict healthcare needs, and make data-informed decisions about resource allocation, thereby creating sustainable and adaptive healthcare models (29). The role of entrepreneurship in rural health extends beyond clinical services to encompass preventative care, wellness programs, and education initiatives that address the root causes of health disparities. Transformer models enhance this aspect by providing insights into social determinants of health, such as economic status, educational attainment, and access to healthy food, which significantly impact rural populations. Entrepreneurs can use these insights to develop community-centered health programs that focus on prevention and wellness, reducing long-term healthcare costs and improving quality of life for residents. For instance, understanding patterns of chronic illness within a rural community can guide the creation of targeted wellness programs or partnerships with local organizations to address underlying health issues (30). Another critical aspect of rural health entrepreneurship is the ability to attract funding and investment, as many rural initiatives struggle with financial sustainability. Transformer models can support grant applications and funding proposals by identifying and quantifying the specific healthcare needs of a rural area, offering a data-backed case to potential investors and funders (31). Additionally, the scalability of Transformer-driven solutions makes them attractive to stakeholders interested in replicable models of rural health delivery. By demonstrating the efficacy of Transformer-enhanced programs in addressing specific rural health challenges, entrepreneurs can attract investment and collaboration from government agencies, NGOs, and private organizations committed to rural health advancement. In this way, Transformer-driven strategies not only aid in attracting talent to rural health but also establish a robust foundation for entrepreneurial innovation that meets the unique healthcare demands of rural populations (32).
While prior studies have made important contributions to recommendation systems and talent matching algorithms, particularly in education and enterprise domains, few have addressed the unique challenges posed by rural public health workforce development. Existing models such as matrix factorization, graph-based embeddings, or sequential recommendation frameworks have largely focused on preference optimization or network structure exploitation, but rarely account for context-sensitive variables such as incentive responsiveness, long-term engagement sustainability, or organizational policy constraints. Our work builds upon these foundations by introducing a unified framework that incorporates adaptive engagement modeling, incentive elasticity, and region-aware multi-layered compatibility scoring elements that are often overlooked in traditional recommender architectures. In doing so, we bridge a critical gap between abstract recommendation performance and real-world human resource allocation challenges in underserved communities. Unlike prior approaches, our model is explicitly designed to simulate behavioral dynamics relevant to public sector recruitment and retention, providing a more holistic and actionable decision-support tool for public health systems. This contribution not only advances the methodological toolkit but also offers practical relevance to one of the most pressing human capital issues in rural healthcare delivery.
3 Method
3.1 Overview
This work addresses a core challenge in rural public health entrepreneurship: developing strategies to attract talent, which is crucial for both improving rural healthcare infrastructure and fostering sustainable economic development. Due to the distinct social, economic, and environmental characteristics of rural areas, talent attraction in this sector requires innovative, contextually adapted approaches. To address this, we structure our methodology around a comprehensive framework for strategy development that integrates environmental analysis, incentive structures, and engagement mechanisms designed specifically for rural health entrepreneurship.
In this subsection, we outline the methodological structure employed in our approach. Our method is organized into three core segments: 1. Strategic Foundations, 2 Adaptive Model Design, and 3 Incentive Mechanisms. First, Section 1 introduces the strategic foundations underlying talent attraction within rural public health. Here, we consider socio-economic and infrastructural constraints that characterize rural regions, as well as the core competencies required for successful public health initiatives. This foundational analysis establishes the key parameters and challenges that the strategy must address, including the high turnover rates due to geographic and professional isolation, limited resources, and the need for a robust support system tailored to rural healthcare dynamics. Second, Section 2 presents the Adaptive Model Design, an innovative model tailored to the dynamic needs of rural healthcare services. This model incorporates flexible components to align with local demographic profiles and health demands. The adaptability of this model is crucial for addressing varying conditions across different rural environments and involves both practical elements such as healthcare delivery models and strategic elements, including community integration practices that help retain talent over extended periods. This component also introduces a framework for continuous feedback, allowing the strategy to evolve in response to emerging local challenges and opportunities. Finally, Section 3 details Incentive Mechanisms that leverage both financial and non-financial motivators to enhance the appeal of rural public health roles. In addition to traditional incentives, such as competitive salaries and housing subsidies, we explore non-financial incentives, including professional development opportunities, mentorship programs, and community-building initiatives. This section emphasizes the need for a comprehensive package that addresses not only monetary compensation but also psychological and professional fulfillment, contributing to a sustainable workforce in rural health entrepreneurship.
3.2 Preliminaries
To establish a structured approach to talent attraction in rural public health entrepreneurship, we begin by formalizing the underlying challenges and key variables that drive our strategic model. In rural public health, attracting and retaining skilled professionals (e.g., doctors, nurses, and health administrators) is complicated by the interplay of socio-economic factors, geographical isolation, limited infrastructure, and often limited access to continuous professional development. These elements necessitate a strategic model tailored to the unique environment of rural health services.
Let represent the set of rural regions, where each region is characterized by attributes Xr = (xr1, xr2, ..., xrn), encompassing factors such as population density, average income, healthcare infrastructure, and educational resources. Let denote the talent pool available for rural public health roles, where each professional is described by attributes Yt = (yt1, yt2, ..., ytm), which may include qualifications, experience, and personal motivations. The objective is to develop an attraction strategy S that maximizes the probability Pattr(t, r) that a professional t will choose to work in region r and remain engaged over time.
We define the attraction function Pattr as follows:
where: - Xr represents region-specific socio-economic attributes, - Yt denotes talent-specific characteristics, - Ir encapsulates incentive mechanisms, such as financial compensation or career advancement opportunities, and - Er reflects engagement strategies, including community integration and professional support structures.
3.2.1 Strategic constraints and resource allocation
In designing the attraction strategy S, we must address several constraints imposed by the rural setting. First, let denote the budget allocated to incentives in each region r. The function defines the maximum achievable incentive structure under the budgetary constraint:
This constraint ensures that financial and non-financial incentives remain feasible within available resources. Similarly, resource allocation for professional development, denoted by δ(Er), must satisfy:
The total effectiveness of the strategy S is influenced by a combination of these resource allocation functions and the alignment between Xr and Yt. We model this alignment as the matching function μ(Xr, Yt), which represents the degree of compatibility between the regions requirements and the professionals capabilities and motivations:
where d(·) measures the alignment and α is a scaling factor adjusted according to the local demand for public health professionals.
3.2.2 Engagement and retention dynamics
Retention of professionals in rural areas is influenced by the professionals initial commitment and ongoing engagement with the local community. Let E(t, r, τ) represent the engagement level of professional t in region r over time τ. Engagement is assumed to decay over time, countered by effective incentive and support strategies. We formalize this dynamic with the following differential equation:
where β is the natural engagement decay rate, γ adjusts the effect of incentives on retention, and δ(Er) reflects the contribution of ongoing support to sustaining engagement.
Solving this equation yields the professionals engagement trajectory, which can be used to forecast retention rates over time and inform adjustments to S:
where E0 represents the initial engagement level at time τ = 0.
3.2.3 Optimization of strategy S
To maximize talent attraction and retention, we seek to optimize S by adjusting Ir and Er to maximize Pattr(t, r). This optimization problem can be formulated as:
This approach allows us to balance the incentive and engagement strategies under budget constraints, enhancing the effectiveness of talent attraction and retention in rural public health. Our subsequent sections delve into the detailed design of each component of S, focusing on adaptable and scalable methods to improve talent recruitment and engagement in diverse rural contexts.
3.3 Adaptive talent engagement model
In this section, we introduce our proposed Adaptive Talent Engagement Model (ATEM), a framework specifically designed to address the dynamic and often unique requirements of rural public health. ATEM adapts to regional characteristics, individual professional needs, and evolving public health priorities, integrating flexibility to support talent attraction and retention in diverse rural contexts. The model introduces three key components: Contextual Matching, Dynamic Incentive Allocation, and Engagement Feedback Mechanisms (as shown in Figure 1).

Figure 1. The figure shows an architecture diagram of an Adaptive Talent Engagement Model (ATEM). The model consists of multiple modules, with the core part being the “Transformer Encoder”, which stands for the “Contextual Matching Framework” and is used to calculate the match between candidates and regional needs. The upper “LayerNorm & Linear” and “GELU & Linear” modules preprocess input information such as candidate and regional features. The middle “Transformer Encoder” decomposes the input into multiple parallel paths, each of which is processed by the “Linear” layer and aggregated to the “Dynamic Incentive Allocation” and “Engagement Feedback Mechanism” modules. The lower modules are further connected to the “Candidate and Regional Feature Inputs” and “Retention Probability Estimation” to ultimately generate the “Talent Matching Score and Recommendation”.
3.3.1 Contextual matching framework
The Contextual Matching Framework serves as a robust mechanism to align the nuanced requirements of rural health organizations with the multidimensional profiles of prospective candidates. For each rural region , the framework assigns a matching score to each candidate that reflects contextual compatibility across multiple dimensions (as shown in Figure 2). This matching score is computed as a weighted sum of compatibility measures across socio-economic alignment, incentive-based motivation, and community integration, expressed as:

Figure 2. Contextual matching framework evaluates candidate-region compatibility across socio-economic, motivational, and community dimensions, with adjustments for individual and environmental factors, producing a normalized matching score.
where: - ϕ(Xr, Yt) evaluates the compatibility of the region's socio-economic characteristics with the candidate's background, - ψ(Ir, Zt) assesses alignment between organizational incentives and candidate motivations, and - χ(Er, Vt) captures potential candidate engagement based on local support structures.
The weights ωX, ωI, and ωE enable a flexible prioritization of matching components, providing adaptability to specific organizational goals. Moreover, this framework introduces a candidate-specific adjustment factor, which accounts for individual variability in experience, motivation, and retention likelihood. These adjustments are represented by coefficients αt, βt, and γt, refining the matching function as follows:
To further improve precision, an auxiliary adjustment factor δr, t, representing environmental or temporal conditions influencing candidate success, is integrated into the function, yielding an augmented form:
Here, δr, t can adjust for factors such as seasonal employment trends or specific local needs, thus refining the score to enhance candidate prioritization. Additionally, to facilitate effective candidate ranking, we normalize each component by a scaling factor κ, giving a final normalized score :
where κr = ωX+ωI+ωE, ensuring that scores are comparable across regions regardless of weight variations. This framework thus enables a robust, contextual approach to candidate selection, adaptable to dynamic regional demands.
3.3.2 Dynamic incentive allocation
To address the budgetary and resource constraints faced by rural healthcare systems, the Dynamic Incentive Allocation (DIA) strategy in ATEM adapts the distribution of incentives based on regional needs and the individual characteristics of potential candidates. This adaptive model leverages an incentive allocation function ψ that dynamically distributes resources in response to demand, local limitations, and candidate profiles. The allocation function is formulated as:
where: - denotes the healthcare service demand in region r, - represents the professional and personal attributes of candidate t, and - accounts for regional socio-economic constraints and resource availability.
The function ψ optimizes resource distribution by assessing the attraction and retention probabilities Pattr(t, r), dynamically focusing on high-compatibility candidates for whom incentives are likely to have the greatest impact. In essence, DIA strategically channels resources to candidates and regions where incentives yield the most value.
To refine the model, the expected utility of incentives, represented by U(Ir(t)), predicts the probability that a candidate will accept a position in region r based on the level of incentives offered. This function considers the alignment between incentives and candidate motivations, defined as:
where: - λ is a scaling factor, - ϕ(Ir) represents the perceived utility of incentives in region r, - assesses the match between candidate t and region r, based on the compatibility score .
Further enhancing precision, the utility function incorporates incentive elasticity, ϵinc, which adjusts the model based on sensitivity to incentives across different candidate profiles and regional contexts. This yields an adjusted utility score, , that accounts for variations in responsiveness:
where ϵinc is defined for each candidate t as a function of professional flexibility δt and local appeal κr:
Finally, the optimal allocation of resources is guided by a maximization function Ω that seeks to maximize the cumulative attraction and retention across all candidates and regions :
This optimization ensures that resources are allocated effectively, aligning incentives with candidate profiles and regional demands, thereby increasing the likelihood of sustainable healthcare staffing in rural regions.
3.4 Engagement feedback mechanisms
To sustain engagement and mitigate the risk of early turnover, ATEM includes Engagement Feedback Mechanisms (EFM) that continuously monitor and respond to the professionals experience within the community. Engagement feedback is quantified by an engagement index , which is updated periodically over time τ to reflect changes in the professionals satisfaction and integration levels:
where ρk represents the weight assigned to feedback gathered in period k, and the terms γ·ϕ(Ir) and δ(Er) capture the impact of incentives and engagement support, respectively.
An adaptive adjustment mechanism is employed within EFM to modify Ir and Er based on . Specifically, if engagement falls below a pre-defined threshold θ, additional resources are allocated to incentives or community integration initiatives to bolster the professionals commitment:
where ΔIr and ΔEr represent incremental increases in resources directed toward incentives and engagement, aimed at restoring engagement levels.
Retention Probability Estimation Finally, the retention probability Pret(t, r) for each professional t in region r is estimated based on the cumulative engagement score and contextual alignment score . This probability is modeled as:
where σ is a sigmoid function, and κ and u are weighting coefficients that balance the impact of engagement and compatibility. This formula provides a probabilistic estimation that informs regional administrators of the likelihood of retaining specific professionals, allowing for targeted intervention if necessary.
3.5 Strategic incentive design
The final component of our method, Strategic Incentive Design (SID), addresses the long-term needs of rural healthcare professionals through a combination of financial and non-financial incentives structured to increase both initial attraction and sustained engagement in rural environments. Unlike traditional incentive models, SID integrates a holistic approach that includes career progression, community belonging, and professional fulfillment, all tailored to the unique needs of rural healthcare professionals (as shown in Figure 3).

Figure 3. The Strategic Incentive Design (SID) framework for rural healthcare professional retention. The Generator module on the left proposes incentive packages by using transformer encoders, simulating various financial and non-financial incentives tailored to rural needs. The Property Calculation module in the center assesses generated incentives based on critical KPIs such as healthcare outcomes, service coverage, and community impact. Finally, the Reward (Similarity Calculation) module on the right evaluates the similarity of generated incentives to ideal outcomes, adjusting rewards to refine future incentive designs for improved long-term retention.
3.5.1 Financial incentives
To enhance the attractiveness of rural healthcare positions, the SID framework introduces a tiered financial incentive system, which combines direct compensation with performance-based rewards. This system aims to provide immediate, tangible benefits while encouraging long-term professional engagement and high-quality service. The financial incentive structure for a candidate t in region r is defined by the function:
where: - Cr is the base compensation calibrated to the region r's cost of living, healthcare demand, and economic conditions, - B(t) denotes a signing bonus to attract high-quality candidates, and - P(t, τ) is a performance-based incentive that rewards sustained contributions over a specified period τ.
The weights η1, η2, and η3 are adjustable parameters that balance each financial component according to organizational priorities and budget constraints, ensuring the incentives are both attractive and sustainable.
In addition to the base and bonus components, the performance-based incentive P(t, τ) integrates key performance indicators (KPIs) specific to rural healthcare needs, such as patient health outcomes, coverage rates, and community impact. To model the effectiveness of these incentives, P(t, τ) can be expanded as:
where: - Qhealth(t, τ) measures improvements in patient health outcomes due to the candidate's services, - Qcoverage(t, τ) assesses service reach and accessibility metrics over time τ, and - Qimpact(t, τ) evaluates the broader community impact through public health initiatives and program effectiveness.
The scaling factors γ1, γ2, and γ3 weight each KPI based on its relative importance, allowing flexibility in how performance is valued across different rural settings.
To capture the total expected financial incentive, the model incorporates a retention factor ρ(t, τ) which adjusts the rewards for candidate t based on their commitment over time. This adjustment yields a retention-modified incentive :
where ρ(t, τ) = 1+α·(τ−τmin) incentivizes long-term engagement by increasing rewards the longer a candidate stays beyond a minimum threshold τmin. Here, α is a retention rate parameter calibrated to encourage sustained tenure.
Finally, the allocation of incentives is optimized to ensure financial sustainability by maximizing the cumulative expected incentive impact Ξ across all candidates and regions :
3.6 Non-Financial incentives (extended)
To complement financial incentives, the SID framework prioritizes a range of non-financial incentives designed to enhance professional development, reduce isolation, and foster community integration for rural healthcare professionals. These non-financial incentives are structured into three primary components: Professional Development Opportunities, Mentorship and Peer Networking, and Community Integration Programs. These elements collectively contribute to sustained engagement and retention by addressing both personal and professional needs in rural settings.
1. Professional development opportunities: to ensure that healthcare professionals remain competitive and skilled, SID provides continuous access to specialized training, certification programs, and advanced medical education resources. The cumulative value of these opportunities for a candidate t is captured by , which is the sum of various training components:
where: - denotes specific training or certification programs accessed by candidate t, - δi represents the value derived from each program i, indicating its contribution to professional growth and career progression.
To further quantify the long-term impact, a skill enhancement factor σt is introduced, representing the growth rate in professional competency:
where λ controls the rate of growth over time τ relative to a baseline experience τbase. This factor σt is applied in retention models to reflect the cumulative effect of development opportunities on long-term engagement.
2. Mentorship and peer networking: to combat professional isolation, SID emphasizes mentorship programs and structured peer networking initiatives. These networks, denoted by , offer a support system that fosters collaboration and personal connections among healthcare professionals. The engagement level in these networks for candidate t in region r is modeled by:
where: - ω measures the impact of direct mentorship sessions on retention, - ξ evaluates the influence of peer-to-peer activities within the network.
These parameters contribute to an engagement score that influences retention probabilities Pret(t) by strengthening the candidates professional support system:
where α and β are scaling factors that determine the influence of engagement on retention rates.
3. Community integration programs: SID includes programs to support both the professionals and their families in adjusting to rural life. These initiatives, grouped under , offer benefits such as local housing subsidies, educational access for family members, and cultural event sponsorships to encourage a sense of belonging. The value of these integration efforts is represented by :
where: - denotes specific community initiatives (e.g., housing or family support programs), - χj reflects the impact of each initiative on retention.
To assess the comprehensive effect of community support on candidate t, an adjusted integration index κt is defined as:
where θt measures the individuals integration rate and τ represents time spent in the community, allowing κt to grow with increased duration and involvement. This integration index supports retention modeling by enhancing a candidates connection to the community, which, in turn, fosters long-term service stability.
3.6.1 Career advancement and role flexibility
The SID framework emphasizes long-term career support in rural healthcare by establishing structured pathways for career advancement and fostering role flexibility. This dual approach aims to create an attractive professional environment where individuals can envision growth and adaptability in their roles. For each professional t, these components are integrated into the retention model through metrics for role advancement and role flexibility , which together support career satisfaction and longevity.
1. Role advancement : role advancement for a healthcare professional t is calculated based on three primary factors: tenure, performance, and continuous professional development. These elements collectively offer a clear and attainable pathway for career growth in rural healthcare settings, where progression may otherwise seem limited. The advancement score is formulated as:
where: - β represents the weight attributed to tenure, reflecting loyalty and accumulated experience, - θ scales the impact of performance metrics, rewarding high-achieving individuals based on key performance indicators (KPIs) such as patient outcomes and service quality, - ζ quantifies the contribution of continuous professional development , as outlined in the Non-Financial Incentives component, indicating the value of ongoing education and skill enhancement.
To further support structured advancement, SID integrates a promotion probability function Pprom(t), which evaluates the likelihood of advancement based on :
where γ adjusts the sensitivity of promotion likelihood to changes in and τadv sets a threshold for advancement eligibility. This model enables SID to prioritize promotions for professionals demonstrating commitment and competence, fostering retention by offering a clear pathway to career progression.
2. Role flexibility : in addition to advancement, SID emphasizes role flexibility, allowing healthcare professionals to diversify their experience by engaging in cross-disciplinary roles. This flexibility not only supports skill diversification but also mitigates professional burnout by providing varied work experiences. The role flexibility score for professional t is represented by:
where: - denotes the accessibility and scope of different roles or specializations available to t, - πk reflects the contribution of each alternative role k to overall job satisfaction, as each flexible role may contribute differently to the professionals experience and engagement.
To enhance the impact of role flexibility on retention, a satisfaction adjustment factor σt is incorporated, adjusting based on individual preferences and previous role satisfaction. This yields an adjusted role flexibility score :
where , with δ scaling the impact of matching preferences. Higher values of σt indicate that role flexibility aligns well with the professionals career aspirations, thereby enhancing retention likelihood.
3. Integrated retention model: by integrating and , SID creates a comprehensive retention probability model Pret(t), which combines career growth with adaptability to support long-term commitment:
where α is a baseline retention rate, while β1 and β2 weight the contributions of advancement and flexibility. This retention model aligns professional trajectories with personal aspirations, reinforcing career satisfaction and stability in rural healthcare environments.
3.6.2 Multi-component retention probability model
To accurately measure the impact of various financial and non-financial incentives on the retention of healthcare professionals in rural areas, SID employs a multi-component retention probability model. This model integrates multiple dimensions of incentives, offering a holistic view of how different factors contribute to the probability Pret(t, r) that a professional t will remain in region r. The retention probability is formulated as:
where: - α, β, γ, δ, ϵ, η are weighting coefficients that adjust the impact of each component, - σ is the sigmoid function, , which normalizes Pret(t, r) to the range [0, 1], ensuring it represents a valid probability.
The components in this model encapsulate the various incentives provided through SIDs framework: - Fr(t) represents the financial incentive package tailored to t in region r, - is the value of professional development opportunities accessible to t, - measures engagement in mentorship and peer networking programs, enhancing professional support, - captures the impact of community integration initiatives in fostering a sense of belonging, - reflects career advancement potential based on tenure, performance, and continuous learning, - denotes role flexibility, allowing professionals to diversify their skills and reduce burnout.
To optimize the effect of each component on retention, the weighting parameters α, β, γ, δ, ϵ, η are calibrated through a maximization function that enhances overall retention across all professionals and regions :
subject to budgetary and operational constraints, ensuring that resources are deployed efficiently.
For a deeper analysis of retention dynamics, the model also incorporates an adaptability factor ζt, r, which modifies Pret(t, r) based on each candidate's adaptability to the rural environment and professional conditions. This adaptability factor is defined as:
where is a compatibility score between t and r, and the Environment Fit adjusts for local and organizational expectations. Integrating this factor, the retention probability becomes:
where now reflects not only incentive-driven retention probability but also a candidates alignment with the unique conditions of the region.
Finally, the normalized retention model across all regions and professionals is represented as:
where offers a standardized retention score across the SID framework, facilitating evaluation and strategic adjustments for improved healthcare professional retention in rural settings.
To improve clarity and accessibility, we briefly summarize here how the empirical workflow connects with the model components. The core objective of the empirical work is to simulate realistic talent-region matching and engagement behaviors using structured datasets. The multi-layered scoring function incorporates candidate qualifications, regional attributes, and inferred incentive responsiveness. These features are processed through a neural representation layer and matched via a compatibility module informed by context-aware attention mechanisms. The empirical validation consists of two main stages: model training on benchmark or healthcare-related datasets, and evaluation using predefined matching and engagement metrics. Candidate and region profiles are encoded from available data sources and passed into the model to generate placement recommendations. Engagement outcomes are then simulated based on observed behavioral sequences or domain-specific retention proxies. While the underlying architecture remains technically rich, this empirical loop data encoding, recommendation generation, and performance evaluation reflects a structured and reproducible process that can be adapted to different domains with appropriate input schemas.
4 Experimental setup
4.1 Dataset
The MovieLens Dataset (33) is a renowned benchmark in the field of recommendation systems, composed of millions of user-item interaction records across various versions, ranging from small datasets with 100,000 ratings to large ones containing up to 20 million ratings. Each dataset includes features such as user demographics, timestamps, and movie genres, enabling detailed modeling of user preferences and behavior. The diverse rating data allows for an in-depth evaluation of collaborative filtering and matrix factorization techniques, as well as modern neural network-based recommendation approaches. The dataset's structure and content have been instrumental in advancing research on user personalization and recommender system robustness, particularly in handling sparsity and cold-start issues. The Gowalla Dataset (34) captures user check-in data from the Gowalla location-based social network, encompassing millions of check-ins across global locations. This dataset provides rich spatial and temporal information, with check-in sequences reflecting real-world mobility patterns and user preferences. Each entry includes geographic coordinates and timestamps, offering insights into user behavior over time and space. The dataset has become a standard in evaluating spatial-temporal recommendation systems, helping to improve location prediction, user trajectory modeling, and geographical preference analysis. Its application extends to various fields, including urban planning and smart tourism, due to its detailed representation of user movement dynamics. The Foursquare Dataset (35) includes large-scale check-in data from Foursquare, with millions of entries distributed across diverse categories and urban regions worldwide. Each check-in record encompasses information such as location category, timestamp, and user ID, providing a comprehensive view of user preferences and activity patterns. The dataset's granularity in urban regions and venue types makes it ideal for tasks in point-of-interest recommendation, user behavior analysis, and geographic modeling. Researchers leverage the Foursquare Dataset to evaluate the effectiveness of personalized recommendation algorithms under realistic social and spatial conditions, facilitating advancements in understanding urban dynamics and user interaction in city environments. The KuaiRec Dataset (36) is designed for recommendation research, featuring user interaction data within a video streaming platform. It includes millions of user-item interactions, with metadata such as timestamp, viewing duration, and content genre, allowing for precise modeling of user engagement with multimedia content. The dataset's scope provides a valuable framework for developing and testing recommendation models that focus on short-video recommendation, temporal patterns, and personalization techniques. As a benchmark, KuaiRec is pivotal in the exploration of engagement-driven recommendations and user retention strategies, making it an essential resource in multimedia recommendation research.
4.2 Experimental details
Our experiments were conducted on an NVIDIA A100 GPU, leveraging PyTorch as the primary framework to facilitate model training and evaluation. We trained all models with the Adam optimizer, utilizing a learning rate of 1e-4 for the base models, while introducing minor variations tailored to each dataset to optimize convergence. We employed a batch size of 128, which balanced memory efficiency and computational load across all experimental setups. The models were initialized using Xavier initialization to ensure balanced weight distribution, mitigating issues associated with vanishing or exploding gradients. For the MovieLens Dataset, we processed data by normalizing user and item indices and splitting the interactions chronologically into 70% for training, 15% for validation, and 15% for testing. User-item matrices were constructed, with embedding dimensions set to 64, allowing for compact yet expressive representations of both users and items. Matrix factorization-based models were benchmarked with 20 epochs, while neural models were trained for up to 50 epochs, incorporating early stopping criteria based on validation loss to prevent overfitting. In the Gowalla Dataset, spatial-temporal embeddings were created to capture user mobility patterns, using a grid-based encoding to represent geographic data. Check-ins were ordered sequentially, partitioned into training, validation, and test sets. A sequence length of 10 was employed for recurrent models, capturing temporal dependencies in user movements. To evaluate the influence of location-based features, we used geo-encoded embeddings with a dimension size of 128, enabling models to better contextualize spatial information in recommendations. For the Foursquare Dataset, we implemented point-of-interest (POI) embeddings, which utilized location categories and timestamp information to predict user visits accurately. Data augmentation techniques were applied to handle sparsity, including random sampling of user sessions and POI duplication for underrepresented categories. We set embedding dimensions to 128 and trained models over 40 epochs with a dropout rate of 0.3 to prevent overfitting. The evaluation metric utilized was Recall@K and NDCG@K, with K values set to 5 and 10, assessing both ranking quality and position sensitivity in recommendations. In the KuaiRec Dataset, temporal features such as viewing duration and item sequence were included to enrich user behavior patterns. The data were organized into sliding windows, with a window size of 20 to capture recent interactions. The models employed multi-layer perceptrons (MLPs) with hidden layers of sizes 256, 128, and 64, configured with ReLU activation functions and batch normalization layers to stabilize training. Experiments were conducted over 30 epochs, and evaluation metrics included Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) to reflect the precision and consistency of recommendations across dynamic viewing patterns. Across all datasets, hyperparameter tuning was performed using grid search for learning rates and embedding sizes, optimizing model configurations for each dataset. The implementation also incorporated gradient clipping at a threshold of 1.0 to stabilize training, especially for recurrent-based models handling sequential data. Model performance was evaluated by averaging results over five random seeds to ensure robustness and reduce the impact of stochastic variations on final results.
4.3 Comparison with SOTA methods
Our proposed model demonstrates superior performance across all metrics when compared to state-of-the-art (SOTA) methods on the MovieLens, Gowalla, Foursquare, and KuaiRec datasets, as shown in Tables 1, 2, Figures 4, 5. In the MovieLens dataset, our model achieves significant improvements in Precision, Recall, F1 Score, and NDCG metrics, surpassing notable models like LightGCN, NGCF, and NeuMF. Specifically, our model's Precision and F1 Score outperformed the closest competitor, PinSAGE, by a considerable margin, with gains of 2.3% and 1.45%, respectively. This enhanced performance can be attributed to our models dynamic embedding approach, which efficiently captures user-item interactions with higher accuracy. By integrating personalized user and item embeddings through multi-level attention mechanisms, our model effectively captures the intricate nuances of user preferences, leading to more precise recommendation results. In comparison to models such as SASRec, which is limited by its sequential recommendation dependency, our model's architecture allows for a broader contextualization of user interactions, facilitating superior performance in both precision and relevance.
In the Gowalla dataset, our model excels in incorporating spatial-temporal dependencies, demonstrating improvements of over 3% in both NDCG and Recall compared to LightGCN and NGCF. This performance gain stems from our models refined handling of geographic data, allowing it to embed location-based contexts and temporal interactions more effectively than traditional graph-based methods. Our approach goes beyond standard spatial embeddings by utilizing an adaptive sequence encoding technique, which enables the model to contextualize user check-in patterns in greater detail. Consequently, our method achieves a higher predictive accuracy in determining future user locations, evidenced by its robust Recall and NDCG scores. Additionally, our models ability to integrate POI embeddings with temporal patterns outperforms GraphSAGE by a notable margin, illustrating the advantage of employing a hybrid approach to spatial-temporal recommendation tasks. For the Foursquare and KuaiRec datasets, depicted in Table 2, our model continues to surpass SOTA methods across all evaluation metrics, with particular emphasis on F1 Score and NDCG. In the Foursquare dataset, our models advantage is observed in its 4% higher NDCG score compared to PinSAGE, indicating its robustness in understanding user preferences at a granular level. The use of sequential data processing and attention mechanisms allows our model to capture and rank user interests more accurately, leading to higher precision and relevance. Additionally, in the KuaiRec dataset, our models architecture addresses the challenge of short-video recommendation by maintaining a high level of engagement prediction accuracy. By utilizing personalized embedding configurations that capture nuanced viewing behaviors, our model achieves a 3.1% improvement in Precision over SASRec. This precision boost highlights the models efficiency in predicting dynamic user interactions, providing a more tailored and accurate recommendation experience.
4.4 Ablation study
The results from the ablation study on key model components, presented in Tables 3, 4, Figures 6, 7 illustrate the impact of various model components on performance metrics across the MovieLens, Gowalla, Foursquare, and KuaiRec datasets. By systematically removing essential components (designated as Contextual Matching Framework, Dynamic Incentive Allocation, and Engagement Feedback Mechanisms) from our model, we observe notable declines in Precision, Recall, F1 Score, and NDCG, underscoring each component's contribution to the model's effectiveness. In the MovieLens dataset, removing component Contextual Matching Framework led to a significant decrease in Precision and Recall, reducing model accuracy by approximately 3.35% in F1 Score. Component Contextual Matching Framework is instrumental in handling user-item interactions, enhancing the models ability to discern nuanced preferences. Without it, the model struggles to maintain consistency in recommendations, thus underscoring its vital role in personalizing user experiences. The impact is even more pronounced in the Gowalla dataset, where spatial-temporal aspects are central. Without component Dynamic Incentive Allocation, which handles geographic encoding, performance deteriorates by over 4% in NDCG, demonstrating its importance in capturing and interpreting user location data accurately. This result highlights how the models hybrid embeddings, specifically designed to integrate temporal and spatial contexts, are fundamental in improving recommendation precision for location-based applications. For the Foursquare dataset, the absence of component Engagement Feedback Mechanisms results in a 5.3% decrease in Recall, emphasizing its role in refining temporal interactions. Component Engagement Feedback Mechanismss focus on sequential data processing enhances the models ability to predict user engagement patterns across time, especially relevant in settings with high temporal variability. Without this component, the models capacity to deliver timely and context-aware recommendations diminishes significantly. This trend continues in the KuaiRec dataset, where component Engagement Feedback Mechanisms's absence affects metrics like F1 Score and NDCG, indicating a reduced capability in handling multimedia content recommendation, where viewing duration and content sequence are essential for modeling user interests.
While the benchmark datasets used in this study MovieLens, Foursquare, Gowalla, and KuaiRec are not inherently related to healthcare or rural workforce domains, they were selected for their robustness, diversity of interaction patterns, and wide acceptance in evaluating complex recommendation systems. These datasets provide rich behavioral and temporal data, allowing us to validate the general performance, adaptability, and scalability of our proposed model architecture in capturing nuanced user-item relationships, sequential preferences, and engagement dynamics. The goal at this stage was to test the technical efficacy of the Adaptive Talent Engagement Model (ATEM) in high-volume, complex environments before deploying it in a domain-specific context. That said, we fully acknowledge the importance of applying the model to rural healthcare-specific datasets to assess its effectiveness in real-world public health scenarios. As part of our ongoing work, we are actively collaborating with regional health institutions to collect domain-relevant data such as healthcare worker placement records, satisfaction surveys, and community health engagement metrics with the aim of conducting targeted validations and further refining the model for rural healthcare recruitment and retention tasks.
The experimental results in Table 5 on the MIMIC-III and PhysioNet datasets further validate the adaptability and robustness of our proposed model in healthcare-specific recruitment scenarios. On the MIMIC-III dataset, which simulates hospital-level recruitment and matching based on clinical specialty, procedure types, and provider workload profiles, our model achieved a Top-5 accuracy of 87.40%, significantly outperforming classical models such as NeuMF and LightGCN. The model also demonstrated a strong ability to predict retention patterns, achieving a retention AUC of 0.857 and a high engagement score. These outcomes suggest that the model not only captures domain-specific compatibility features but also effectively models temporal engagement dynamics relevant to rural healthcare placements. On the PhysioNet dataset, which focuses on clinical trial investigator assignments and geographic site dispersion, our model maintained high predictive accuracy with 82.10% Recall@3 and an RMSE reduction of 14.3% over baseline methods. These results indicate that the model is capable of learning nuanced mobility and participation signals, enabling it to identify professionals most suited for low-resource or rural trial locations. Compared to existing methods, our approach shows superior sensitivity to community fit and regional constraints, reinforcing its potential to support data-driven talent allocation in real-world public health systems. The healthcare-oriented experiments confirm that our model generalizes well beyond synthetic or general-purpose datasets and can serve as a valuable tool for optimizing human resource strategies in healthcare deployment contexts.
5 Discussion
These results align with previous research emphasizing the effectiveness of Transformer-based models in strategic talent management within constrained environments. For instance, Yu and Du (19) demonstrated that Transformer architectures provide meaningful insights into complex candidate-job relationships, enabling more tailored engagement strategies. Similarly, Cao et al. (20) highlighted the potential of AI-driven approaches in addressing human resource challenges in rural healthcare systems, though they noted the limited adaptability of existing models to nuanced community dynamics. Our findings extend this literature by integrating contextual alignment mechanisms and adaptive incentive allocation, which together enhance not only recruitment precision but also retention sustainability. In comparison to earlier frameworks, such as those discussed by Fernandez-Fabeiro et al. (25), our model offers a more granular and dynamic treatment of candidate-region compatibility, capturing both individual motivation and environmental responsiveness. The observed retention improvements echo the arguments made by Leider et al. (17) on the importance of continuous professional engagement and support systems in rural public health. These connections underscore the broader applicability of our model and its relevance to current discourse on equitable and efficient talent distribution in underserved regions.
Our model has been primarily validated using structured and proxy datasets, which simulate the dynamics of talent engagement, mobility, and regional compatibility. This design choice was intentional, serving as a foundational step to establish the generalizability and technical viability of the proposed architecture. That said, we are actively pursuing partnerships with regional public health departments and rural hospitals to obtain domain-specific data, including anonymized human resource records, healthcare workforce retention logs, and structured candidate feedback from recruitment programs. These efforts are ongoing and will form the basis of our future empirical validation. In particular, our next-phase study aims to integrate these datasets to further calibrate and test the model under real operational conditions. By embedding actual rural workforce constraints and behavioral patterns, we hope to strengthen the models practical applicability and ensure it reflects the complex realities of public health talent management.
While the model architecture incorporates advanced components such as incentive elasticity, multi-layered compatibility scoring, and dynamic engagement modeling these elements are modular by design and intended to be adaptable to varying levels of technical infrastructure. In practice, the core computational processes like candidate-region matching and engagement prediction can be embedded into a lightweight decision-support platform or offered via a cloud-based service, requiring only basic input interfaces such as spreadsheets or structured survey data. We envision deployment through a hybrid model that separates computation from interaction: centralized servers or regional data hubs can run the model computations periodically, while rural health departments access the outcomes via simple dashboards or automated recommendation reports. The system is designed to work with partial or incomplete data, and can default to simplified heuristics when certain modules such as incentive elasticity estimation cannot be supported due to data limitations. By offering scalable layers of functionality, from minimal rule-based suggestions to fully data-driven recommendations, the system remains flexible and accessible for organizations with limited personnel or digital infrastructure. This tiered approach ensures that even in constrained contexts, the model can deliver actionable insights to inform recruitment and retention strategies.
Our framework has been designed with fairness, privacy, and transparency principles in mind. To mitigate bias, the model avoids using sensitive attributes (such as race, gender, or socioeconomic background) in candidate representation or regional profiling. Instead, it relies on professional qualifications, engagement history, and anonymized behavioral signals. In future deployments, fairness audits and bias detection metrics will be integrated to continuously evaluate disparate impact across demographic groups. Regarding data privacy, all processing in this study was conducted on publicly available or anonymized datasets. For real-world applications, we advocate strict adherence to data protection standards, including informed consent, data minimization, and secure federated learning where feasible. We view the system not as a replacement for human judgment, but as a decision-support tool intended to augment policy-making and local knowledge. Final deployment will involve human-in-the-loop mechanisms, allowing practitioners to review, override, or contextualize recommendations as needed. We believe that by embedding these safeguards, the system can contribute meaningfully and ethically to improving rural healthcare workforce strategies.
6 Conclusions and future work
This study explores a novel, transformer-driven approach to attract talent into rural public health entrepreneurship, addressing the shortage of skilled healthcare professionals in underserved rural areas. Our approach employs a transformer model to analyze complex data on workforce trends, regional health demands, and entrepreneurial opportunities, aiming to identify optimal talent attraction strategies specific to rural public health needs. The model leverages natural language processing (NLP) techniques to assess candidate profiles, gauge their alignment with regional health requirements, and recommend tailored incentives and career pathways. We conducted experiments comparing the transformer-driven strategy with traditional recruitment methods across multiple rural regions. Results indicate a significant increase in candidate engagement and match accuracy, demonstrating the model's efficacy in refining attraction strategies and enhancing alignment between talent and rural healthcare needs. While the transformer-driven strategy shows promise, two primary limitations warrant further exploration. First, the model's reliance on existing data sets may introduce biases, as these data sources often underrepresent rural-specific career motivations and diverse candidate backgrounds, potentially limiting the model's recommendations. Enhancing data sources to capture a broader spectrum of rural health and entrepreneurial dynamics is essential for more accurate strategy development. Second, while transformer models effectively identify talent profiles, they are limited in their capacity to predict long-term candidate retention within rural settings. Future research should integrate predictive analytics that consider longitudinal data on workforce satisfaction, job stability, and rural health outcomes to improve retention-focused recommendations. Overall, with these enhancements, transformer-driven strategies have the potential to play a crucial role in revitalizing rural public health through entrepreneurial avenues.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
JZ: Conceptualization, Data curation, Investigation, Formal analysis, Funding acquisition, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. LL: Formal analysis, Investigation, Data curation, Writing – original draft. JS: Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Locke R, McGinty MD, Ramirez GG, Sellers K. Attracting new talent to the governmental public health workforce: strategies for improved recruitment of public health graduates. J Public Health Manag Pract. (2021) 28:E235–E243. doi: 10.1097/PHH.0000000000001336
2. Dedewanou FA, Allin S, Guyon A, Pawa J, Ammi M. Prioritization of public health financing, organization, and workforce transformation: a Delphi study in Canada. BMC Public Health. (2023) 23:544. doi: 10.1186/s12889-023-15373-9
3. Hamadneh S, Alshurideh M, Alquqa E, Kassem AA, Agha K, Alzoubi HM. AI-driven talent acquisition systems: transforming recruitment strategies in the digital age. In: International Conference on Cyber Resilience (ICCR). Dubai: IEEE (2024). doi: 10.1109/ICCR61006.2024.10532893
4. Feng A, Zhan Y, Liang B, Gao YX, Yuan T, Shu Y, et al. Status quo regarding the recruit requirements of public health-related employers and training scheme for undergraduates majored in preventive medicine in colleges and universities. Zhonghua Liu Xing Bing Xue Za Zhi. (2021) 42:555–61. doi: 10.3760/cma.j.cn112338-20200629-00896
5. Lakofsky K. Bridging the talent gap: connecting talent to bioscience careers. Dela J Public Health. (2023) 9:20–1. doi: 10.32481/djph.2023.11.005
6. Gates C. Health and care horizons leeds: attracting, navigating and inspiring the future workforce. Eur J Public Health. (2024) 34:i67–i73. doi: 10.1093/eurpub/ckae144.1884
7. Zhu W, fang Qin J. A comprehensive assessment of cultivation environment of top innovative high-level talents based on deep learning algorithm. J Environ Public Health. (2022) 2022:4846103. doi: 10.1155/2022/4846103
8. Tripković K, Śantrić Milićević M, Vasić M, Živković-Šulović M, Odalović M, Mijatović-Jovanović V, et al. Factors associated with intention of serbian public health workers to leave the job: a cross-sectional, population-based study. Int J Environ Res Public Health. (2021) 18:10652. doi: 10.3390/ijerph182010652
9. Krasna H, Kulik PKG, Karnik H, Leider J. recruiting new talent for public health jobs with evidence-based job descriptions and attractive job postings. J Public Health Managem Pract. (2023) 29:E162–E168. doi: 10.1097/PHH.0000000000001776
10. Herrera-Espejel PS, Rach S. The use of machine translation for outreach and health communication in epidemiology and public health: scoping review. JMIR Public Health Surveill. (2023) 9:e50814. doi: 10.2196/50814
11. Liu S, Chen Y, Li S, Xu N, Tang C, Wei Y. What Are the important factors influencing the recruitment and retention of doctoral students in a public health setting? A discrete choice experiment survey in China. Int J Environ Res Public Health. (2021) 18:9474. doi: 10.3390/ijerph18189474
12. Cosgrave C. Context matters: findings from a qualitative study exploring service and place factors influencing the recruitment and retention of allied health professionals in rural Australian public health services. Int J Environ Res Public Health. (2020) 17:5815. doi: 10.3390/ijerph17165815
13. Leider J, Yeager V, Kirkland C, Krasna H, Bork RH, Resnick B. The state of the US public health workforce: ongoing challenges and future directions. Annu Rev Public Health. (2022) 44:323–41. doi: 10.1146/annurev-publhealth-071421-032830
14. Tamsah H, Nurung J, Yusriadi Y, del Talento G, del Conocimiento G, Desempeo O, et al. Talent and knowledge management on employee performance in public organization. Int J Prof Busin Rev. (2023) 8:e01557. doi: 10.26668/businessreview/2023.v8i4.1557
15. Park C, Coles E. The impact of student debt on career choices among doctor of public health graduates in the United States: a descriptive analysis. Int J Environ Res Public Health. (2022) 19:4836. doi: 10.3390/ijerph19084836
16. Wong B, Siepmann I, Chen T, Fisher S, Weitzel TS, Nathan N, et al. Rebuilding to shape a better future: the role of young professionals in the public health workforce. Hum Resour Health. (2021) 19:82. doi: 10.1186/s12960-021-00627-7
17. Leider J, Castrucci B, Robins M, Bork RH, Fraser M, Savoia E, et al. The exodus Of state and local public health employees: separations started before and continued throughout COVID-19. Health Aff . (2023) 42:338–48. doi: 10.1377/hlthaff.2022.01251
18. Burrell D, Springs D, Burton SLL, Dawson ME, Wright JB, Modeste R. Perspectives in talent management strategies for cybersecurity job roles in public safety and health in government organizations. Int J Smart Educ Urban Soc. (2020) 11:17. doi: 10.4018/IJSEUS.2020100101
19. Yu X, Du B. Incentive policy optimization of scientific and technological talents and low-carbon economy analysis from the perspective of public health. Front Public Health. (2023) 11:1152346. doi: 10.3389/fpubh.2023.1152346
20. Cao Y, Shan J, Gong Z, Kuang J, Gao Y. Status and challenges of public health emergency management in china related to COVID-19. Front Public Health. (2020) 8:250. doi: 10.3389/fpubh.2020.00250
21. Abebe EC, Tiruneh GA, Adela GA, Ayele TM, Muche ZT, T/Mariam AB, et al. COVID-19 vaccine uptake and associated factors among pregnant women attending antenatal care in Debre Tabor public health institutions: a cross-sectional study. Front Public Health. (2022) 10:919494. doi: 10.3389/fpubh.2022.919494
22. Liu T, Hu X. Generation paths of online public opinion impact in public health emergency: A fuzzy-set qualitative comparative analysis based on Chinese data. Front Public Health. (2024) 12:1404587. doi: 10.3389/fpubh.2024.1404587
23. Pagen D, van Bilsen CJV, Brinkhues S, Moonen CP, van Herck M, Konings K, et al. Design and recruitment of a large-scale cohort study on prevalence, risk factors and impact evaluation of post-COVID-19 condition and its wider long-term social, mental, and physical health impact: The PRIME post-COVID study. Front Public Health. (2022). 10:1032955. doi: 10.3389/fpubh.2022.1032955
24. Chai X. How has the nationwide public health emergency of the COVID-19 pandemic affected older chinese adults' health literacy, health behaviors and practices, and social connectedness? Qualitative evidence from urban China. Front Public Health. (2022) 9:774675. doi: 10.3389/fpubh.2021.774675
25. Fernandez-Fabeiro J, Carballido Á, Fernández-Fernández ÁM, Moldes M, Villar D, et al. The SINFONIA project repository for AI-based algorithms and health data. Front Public Health. (2024) 12:1448988. doi: 10.3389/fpubh.2024.1448988
26. Landy J, Shigeto A, Laxman D, Scheier L. Typologies of stress appraisal and problem-focused coping: associations with compliance with public health recommendations during the COVID-19 pandemic. BMC Public Health. (2022) 22:784. doi: 10.1186/s12889-022-13161-5
27. Arora L, Choudhary A, Bhatt M, Kaliappan J, Srinivasan K, A. Comprehensive review on NUI, multi-sensory interfaces and UX design for applications and devices for visually impaired users. Front Public Health. (2024) 12:1357160. doi: 10.3389/fpubh.2024.1357160
28. Tutt M, Begay C, George S, Dickerson C, Kahn CB, Bauer MC, et al. Din teachings and public health students informing peers and relatives about vaccine education: providing Din (Navajo)-centered COVID-19 education materials using student health messengers. Front Public Health. (2022) 10:1046634. doi: 10.3389/fpubh.2022.1046634
29. Guo H, Luo Q. Analysis of China's population flow between urban and rural areas and the reform of public health old-age insurance system under the background of sustainable ecological environment. J Environ Public Health. (2022) 2022:9752913. doi: 10.1155/2022/9752913
30. Harper C, Satchell L, Fido D, Latzman R. Functional fear predicts public health compliance in the COVID-19 pandemic. Int J Ment Health Addict. (2020) 19:1875–88. doi: 10.31234/osf.io/jkfu3
31. Lachowsky N, Saxton P, Dickson N, Hughes A, Jones R, Clark T, et al. Ethnicity classification systems for public health surveys: experiences from HIV behavioural surveillance among men who have sex with men. BMC Public Health. (2020) 20:1433. doi: 10.1186/s12889-020-09517-4
32. Gillman AS, Iles IA, Klein W, Ferrer R. Increasing receptivity to COVID-19 public health messages with self-affirmation and self vs. other framing. Health Commun. (2022) 38:1942–53. doi: 10.1080/10410236.2022.2043024
33. Harper FM, Konstan JA. The movielens datasets: history and context. ACM Trans Interact Intellig Syst (TIIS). (2015) 5:1–19. doi: 10.1145/2827872
34. Cho E, Myers SA, Leskovec J. Friendship and mobility: user movement in location-based social networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: Association for Computing Machinery (2011). p. 1082–90. doi: 10.1145/2020408.2020579
35. Yang D, Zhang D, Qu B. Participatory cultural mapping based on collective behavior data in location-based social networks. ACM Trans Intellig Syst Technol (TIST). (2016) 7:1–23. doi: 10.1145/2814575
36. Gao C, Li S, Lei W, Chen J, Li B, Jiang P, et al. KuaiRec: a fully-observed dataset and insights for evaluating recommender systems. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. New York, NY: Association for Computing Machinery (2022). p. 540–50. doi: 10.1145/3511808.3557220
37. He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. Lightgcn: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY: Association for Computing Machinery (2020). p. 639–48. doi: 10.1145/3397271.3401063
38. Wang X, He X, Wang M, Feng F, Chua TS. Neural graph collaborative filtering. In: Proceedings of the 42nd international ACM SIGIR conference on Research and development in Information Retrieval. New York, NY: Association for Computing Machinery (2019). p. 165–74. doi: 10.1145/3331184.3331267
39. Maji S, Maity S, Das J, Majumder S. An improved recommendation system based on neural matrix factorization. In: 2023 4th International Conference on Intelligent Technologies (CONIT). Bangalore: IEEE (2024). p. 1–7.
40. Pal A, Eksombatchai C, Zhou Y, Zhao B, Rosenberg C, Leskovec J. Pinnersage: multi-modal user embedding framework for recommendations at Pinterest. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York, NY: Association for Computing Machinery (2020). p. 2311–20. doi: 10.1145/3394486.3403280
41. El Alaoui D, Riffi J, Sabri A, Aghoutane B, Yahyaouy A, Tairi H. Deep GraphSAGE-based recommendation system: jumping knowledge connections with ordinal aggregation network. Neural Comp Appl. (2022) 34:11679–90. doi: 10.1007/s00521-022-07059-x
Keywords: big data in health care, public health talent development, AI in rural health systems, healthcare workforce optimization, health policy
Citation: Zhou J, Li L and Su J (2025) Leveraging big data in health care and public health for AI driven talent development in rural areas. Front. Public Health 13:1524805. doi: 10.3389/fpubh.2025.1524805
Received: 08 November 2024; Accepted: 15 April 2025;
Published: 21 May 2025.
Edited by:
Fei Fan, Wuhan University, ChinaReviewed by:
Rashid Ibrahim Mehmood, Islamic University of Madinah, Saudi ArabiaAlexandre Morais Nunes, University of Lisbon, Portugal
Copyright © 2025 Zhou, Li and Su. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jing Zhou, cnVocTA2QDE2My5jb20=