- 1Sandra Rosenbaum School of Social Work, University of Wisconsin-Madison, Madison, WI, United States
- 2New York University Robert F. Wagner Graduate School of Public Service, New York, NY, United States
- 3Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
- 4Columbia University School of Social Work, New York, NY, United States
Objectives: Decision support tools, such as predictive risk models (PRMs), may assist medical personnel identify child abuse and neglect (CAN). However, no prior work has established whether Medicaid claims alone are sufficient for accurate prediction. We developed a Medicaid claims-based PRM to predict CAN, validating against medical encounters and clinical diagnoses that precede and follow a CAN-related diagnosis code.
Methods: We conducted a case–control study of Medicaid-enrolled children born in New York State between 2006 and 2008. Children with a CAN indicator in the first 10 years of life (case) were matched to children without (control) based on sex, race/ethnicity, county, low birthweight (<2,500 g), birth year, and months enrolled in Medicaid. Outcomes in 1 year included medical encounters, ED visits by injury intent, number of providers, and expenditures, as well as death within 5 years.
Results: Of the 50,515 children in the matched sample, 13,109 (26%) were Black, 24,298 (48%) were female, 31,917 (63%) resided in New York City, and by design, 13,429 (26.5%) were diagnosed with CAN before age 10. The final PRM achieved an area under the receiver operating curve of 62%, an accuracy of 75%, a sensitivity of 98.5%, and a specificity of 9.5%. Children with a CAN indicator and those with a high probability of CAN had more high-expenditure medical encounters, particularly injury-related ED visits, compared with those without in the year following the index date.
Conclusions: This claims-based tool demonstrated high sensitivity, identifying nearly all children with a CAN indicator; however, it had very low specificity. Future work should link Medicaid claims to child protective services reports to increase predictive accuracy.
Introduction
Child abuse and neglect (CAN) is one of the nation's most pressing public health epidemics. In 2022, medical personnel submitted reports to child protective services (CPS) on behalf of 11.2% (843,000) of the 7.5 million children reported, primarily for neglect (66.2%) and physical abuse (19.5%) (1). Many physical indications of CAN, such as sentinel bruising, are difficult to distinguish from accidental injuries or conditions. In the emergency department (ED), identification is further hampered by the lack of historical and contextual information about the child and their caregiver(s), as clinicians are limited to the information available in medical records. However, early and accurate identification of CAN is imperative. An estimated 20%–50% of fatal CAN victims had at least one previous medical encounter at which the signs of CAN were present but not diagnosed or reported to CPS (2–6). In response, many US medical systems are piloting or implementing decision support tools to provide additional information to medical personnel when faced with the complex, high-stakes decision of whether to notify CPS (7–10).
In this paper, we develop a predictive risk model (PRM) to identify CAN exclusively from Medicaid claims and enrollment data. Whereas previous work demonstrated that ED encounters can be used to predict CAN when health records are matched to administrative child welfare data, decision tools requiring external data linkages cannot be used in practice due to privacy restrictions. Therefore, the aim of this study is to determine whether unlinked administrative medical records, in this case Medicaid claims, can be used to predict CAN in support of clinician decision-making (11). Specifically, our model identifies the pattern of injuries or conditions that led to the first, formal indication of CAN in Medicaid claims. These indications may be confirmed by medical providers or by CPS. The latter is a function of both objective harm and the medical personnel's propensity to notify CPS, whereas the former can be interpreted as a more objective measure of harm.
Our secondary aim is to document the pattern of injuries and conditions that follow the first indication of CAN in Medicaid claims. Comparing health and medical use outcomes within one year yields the unadjusted “effect” of a CAN indication in medical records. Relative to children at a high risk of CAN, comparing post-visit outcomes yields insight into the consequence of a “missed” diagnosis.
Unlike CPS referrals that generally lack information on contextual and historical factors preceding the CPS encounter, Medicaid claims capture a rich, detailed history of the physical and often social risk factors for CAN. Previous studies have highlighted the potential of Medicaid claims for predicting CAN by using diagnosis and procedure codes to construct the following predictors: types and locations of injuries, the nature and number of previous encounters overall and by specific providers (e.g., emergency department physicians), specific diagnoses, such as traumatic brain injuries and sentinel bruising, and patterns of medical use that are associated with CAN. We leverage Medicaid claims to predict CAN for these reasons. In addition, (1) most children who encounter the child welfare system are enrolled in Medicaid at the time of their encounter (70%–80%) (12, 13) (2) the data are longitudinal and standardized across clinical contexts, and (3) concerns with reporting bias—which plague administrative child welfare data—are minimized (7, 11).
Materials and methods
Data and study design
Our sample consists of 365,598 children with Medicaid-funded births in New York State between 2006 and 2008. Medicaid claims were accessed through the Health Evaluation and Analytics Lab (HEAL) at NYU and include all fee-for-service claims and managed care encounters for individuals insured by Medicaid in NYS. We identify children with any indication of suspected or confirmed CAN (“cases”) according to the International Classification of Diseases Clinical Modification codes, including ICD-9-CM (9955 series, V6121, V7181, V715) and ICD-10-CM (T74, T76, Z69010, Z0472, Z0471, and Z0441) codes. We refer to these codes throughout as CAN indicators, and select the first CAN indicator to appear in Medicaid claims as the index event (14). Children in the control group did not have a CAN indicator at any point in their first ten years of life.
We constructed a case–control sample using a 3-to-1 greedy propensity score matching (with replacement) based on children's residential location (county at time of birth), birth year and month, race/ethnicity (white, Black, Asian or Pacific/Islander, Hispanic, other races, and unknown), sex (male, female), low birthweight (<2,500 g at birth), SSI/TANF receipt, and months of Medicaid enrollment over the 10-year period. Our final analytic sample consisted of 50,515 children, including 26.58% (n = 13,429) case children with a CAN indicator and 73.42% (n = 37,086) control children without. Henceforth, the term “CAN” refers to the first indication of suspected or confirmed CAN in Medicaid claims. This study was reviewed by the Institutional Review Board at New York University and was considered exempt as a secondary analysis of existing data.
Statistical analysis
In the first phase of our analysis, we applied machine learning to identify the patterns of diagnoses and medical encounters that predict CAN. We began with a list of 292 diagnosis codes identified in previous retrospective case note analyses as detailed in Supplementary Material (11, 15). After randomly partitioning our sample into a 70/30 train/test set, we used gradient boosted trees to predict CAN, an algorithm with increased accuracy in predicting rare outcomes (16). We employed fivefold cross-validation using out-of-bag estimates of error to select hyperparameters, including tree number, complexity, and learning rate. We trained models to optimize the area under the receiver operating curve (AUC). We considered models with all candidate predictors, as well as a subset without the medical encounter predictors.
In the second phase of our analysis, we tested for differences across quartiles of predicted CAN risk using chi-square tests and by estimating relative risks. We exploited the longitudinal nature of the data to validate our model by comparing the rate of subsequent outcomes and medical encounters across children with CAN versus quartiles of predicted CAN risk in the following year, except for death, which we observe within 5 years (17–19). We selected a set of outcomes that either reflected an omission of preventative care, potentially excessive emergent care, and any injuries related to these visits. We additionally constructed a new outcome—a high-risk CAN indicator—based on the diagnosis codes identified in phase 1 (shown in Supplementary eTable 2). To account for potential confounding due to changes in Medicaid enrollment, we created average rates by months enrolled. All data manipulation and propensity score matching were conducted in SAS (SAS Institute, Cary, NC). All models were run in R.
Results
Sample characteristics
Table 1 presents the characteristics of the full sample (columns 1 and 2), children with a CAN indicator (case, columns 3 and 4), those without (control, columns 5 and 6), and the control group disaggregated by quartile of CAN risk (columns 7 through 14). The full sample of case and control children are similar on matched characteristics, but the demographic characteristics of children within the control group vary substantially when we disaggregate the control sample by CAN risk quartile.
Predictive risk model
The final gradient boosting algorithm included the medical encounter variables and yielded an AUC of 0.62 in the validation set, meaning that the model is only moderately good at discriminating between children with and without CAN. Overall accuracy in the full sample was 0.748, considered acceptable for prediction. Evaluation metrics for both models with and without the medical encounter variables, for both the test and full dataset, are presented in Supplementary eTable 1. Our final model (column 4, Supplementary eTable 1) exhibited accuracy (precision) of 0.748, sensitivity (recall) of 0.985, and specificity of 0.095. This indicates that our model accurately identifies children who will subsequently have a CAN indicator (few false negatives), but does so by casting a wide net—flagging many children who ultimately never have a CAN indicator (false positives). Despite exhibiting moderate accuracy, the relatively low AUC (0.54) and specificity (0.095) indicate that the yield of this model is low and not particularly informative (11, 20, 21).
We find that eight medical encounter variables are the strongest predictors of CAN (shown in Supplementary eTable 2): Medicaid expenditures, number of claims, number of ED visits, number of primary care providers, number of providers, number of visits to specialists, number of vaccines, and number of well-child visits. Three of these variables, observed in the month before the index visit, were highly associated with CAN: Medicaid expenditures for the last month, number of claims in the last month, and the number of providers in the last month. This result implies that case children receive intensive medical care in the month prior to their index visit.
The top 20 diagnostic predictors from our final model are shown in Supplementary eTable 3, with exclusion codes shown in Supplementary eTable 4. The most predictive diagnoses of CAN include contusions (bruising); asthma; disorders related to short gestation and low birthweight; convulsions; unexplained failure to thrive; housing and economic circumstances; hepatitis B, C, delta, and E; poisoning; unexplained nutritional deficiencies; unexplained abnormal weight loss; scabies; traumatic brain injury; pregnancy; substance poisoning; encopresis; diabetic retinopathy; and dental caries. Of these, only four diagnostic codes appear to predict CAN within the subsequent month: disorders relating to short gestation and low birthweight, contusions, unexplained abnormal weight loss, and convulsions. As there are multiple, plausible explanations for each of these codes, in addition to CAN, we caution against considering them as independently predictive of CAN.
To further facilitate interpretation, we present SHAP (SHapley Additive exPlanations) values in Supplementary eFigure 3. The leftmost bar chart ranks predictors by their absolute SHAP value, indicating that contusions, asthma, and convulsions are the most important features for prediction. The beeswarm figure on the right shows the heterogeneity in feature contributions across individual observations. Although most SHAP values are concentrated near zero, those with non-zero contributions—asthma, disorders relating to preterm birth or low birthweight, and femur fracture—indicate that positive values are concentrated on a small subset of cases, signifying low prevalence but high conditional importance.
Patterns of care after a visit with a CAN indicator
Table 2 presents the outcomes observed after the index visit across case and control groups. Specifically, in the year after the index visit, we compare case and control children in terms of medical encounters, ED visits, and death within 5 years. We find that case children faced a threefold risk of death in 5 years. Case children were 1.88 times more likely to have any of the high-risk CAN indicators (listed in Supplementary eTable 2) in the year following the index visit, relative to those without. On average, case children had 4.21 high-risk CAN indicators compared with 1.72 in the control group. Case children, relative to controls, were also twice as likely to have an ED encounter and over three times as likely to have an inpatient stay. Among the reasons for ED visits shown in Panel B, case children faced 3.35 times the risk of having a self-harm injury, 3.47 times the risk of an unintentional injury, 2.29 times the risk of an injury with an unknown origin, and 4.3 times the risk of having any injury-related visit. Control children had a very low prevalence of assault-related injuries (the exact number was suppressed for privacy reasons) relative to case children (8). Interestingly, case children exhibited higher rates of preventative care, with a moderately higher risk of primary care and well-child visits (1.16 and 1.17 times, respectively).
The results in Table 3 compare case children (columns 1 and 2) across CAN probability quartiles estimated using PRM. Results in Panel A suggest that as CAN risk increases, comparing Q4 to Q1 for example (columns 9 and 10 to 3 and 4), the risk of adverse outcomes resembles case children (columns 1 and 2). The risk of death within 5 years was 2.83 times higher among the children in the top quartile of predicted CAN risk (Q4) relative to those below the median (columns 5 and 6). Children in Q4 exhibited more high-risk CAN diagnosis and were more likely to be admitted to the ED and inpatient. The count of high-risk diagnosis codes in the control group increased between Q1 and Q4 (with those in Q4 having 2.23 more high-risk CAN indicators in Supplementary eTable 2, on average), suggesting that the accumulation of high-risk CAN indicators could be a meaningful indicator of CAN. Like the case–control comparison shown in Table 2, children in Q4 were only modestly more likely to have at least one visit with a primary care provider; however, unlike the case–control comparison, children in Q4 were less likely to have well-child visits, suggesting that children who faced a high CAN were more likely to be observed by specialized providers and emergency physicians. The types of concerns associated with ED visits followed the anticipated pattern (Panel B), with Q4 children facing 2.02 times the risk of self-harm injuries, 1.81 times the risk of unintentional injuries, 3.47 times the risk of injuries with unknown intent, and 1.84 times the risk of any injury-related visit.
Table 3. Risk ratios of children's outcomes 12 months after index visit, by predicted CAN quartiles.
We examine the patterns of medical use among case relative to control children disaggregated by risk quartile in Panel C. These results indicated that as the risk of CAN increased (moving from columns 3 and 4 to 9 and 10), medical use patterns more closely resembled case children in terms of the number of claims per year, the number of emergency department visits/year, and the number of primary care visits/year. Across all three categories of encounters, children with the highest risk of CAN were more likely to rely on emergency and specialized care and have more primary care visits per year. Children in Q4 saw on average 1.22 more providers, incurred $900.86 in additional Medicaid expenditures, and received 3.65 fewer vaccines, relative to low-risk children (quartile 1).
Discussion
Our first aim was to determine whether PRM can predict CAN exclusively from Medicaid claims. We trained a gradient boosted trees machine learning algorithm to predict CAN in Medicaid-enrolled children ages 10 and younger. While our final model exhibited a high sensitivity (98.5%) and moderate accuracy (74.8%), the relatively low specificity (9.5%) and AUC (62.2%) suggest that additional data (e.g., clinical notes, case narratives) and research are needed to improve model discrimination to supplement clinical decision-making in practice. An unwarranted CPS report could bring unnecessary harm to families and children who have a low CAN risk, implying that medical personnel should consider alternative decision support tools (8, 22, 23). Yet, as unsubstantiated CPS reports are highly predictive of later substantiated investigations (24), fatal injuries (25, 26), and ED use (27), the benefits of early intervention for those with a high CAN risk should be carefully weighed against the potential costs of a false positive.
For comparison, the Allegheny Family Screening Tool (AFST)—a PRM trained to predict removal and placement into foster care using CPS data linked to a range of social services data—exhibited a considerably higher AUC of 0.78, emphasizing the potential of data-driven decision support tools (11, 20).
Our second aim was to examine the pattern of diagnosis and medical encounters following the index visit, defined as the medical visit at which the first indication of CAN appears in children's Medicaid claims. Our results indicate that in the year following the index visit, case children had substantially higher risk of adverse outcomes and more high-expenditure medical encounters, such as ED visits and inpatient admissions, as well as slightly more low-expenditure, preventative care visits. Case children exhibited higher rates of ED visits for reasons related to self-harm, unintentional injury, unknown causes, and any injury encounter. They had nearly double the number of monthly medical encounters relative to low-risk children, implying that abused and neglected children should be closely monitored by medical personnel, regardless of whether or not the child is removed from their home.
Although only a small fraction of case children died within 5 years, the risk of death was threefold that of control children. Case children also visited the ED for injury-related concerns at over four times the rate of those without for reasons related to assault, self-harm, and accidental injuries. This finding reiterates previous research revealing that physical abuse injuries are screened and identified at higher rates than injuries or conditions related to neglect, which—while similarly consequential to children's longevity and wellbeing—lack the physical indicators required for unbiased, early diagnosis (7). In addition, these results imply that case children were either not reported to CPS or that CPS interventions did not match the level of risk experienced by the child. Without CPS data, we are unable to rule out either option.
Among control children, those with a high predicted probability of CAN exhibited a patterns similar to case children in terms of high-expenditure ED and inpatient encounters, and fewer low-expenditure, preventative encounters, such as well-child visits and vaccine administrations.
Limitations and ethical considerations
Although our findings offer novel insight into the potential of a Medicaid claim-based screening tool for CAN, our study is not without limitations. First, we were unable to observe CPS reports, investigations, and removals, which prevented us from validating the algorithm as a screening tool. Without additional information on encounters with CPS and later trajectories, we cannot estimate the costs or benefits of formal intervention in terms of children's objective wellbeing. Relatedly, deaths are vastly underreported in Medicaid claims, further limiting validation (28).
Second, our findings are not generalizable to all children because our sample was limited to Medicaid-enrolled children in a single state, New York, for which Medicaid eligibility and enrollment differ from other states. Third, our results should not be interpreted as causal as our approach was implicitly focused on prediction, rather than identifying causal pathways.
Despite the many benefits of using Medicaid claims to predict child maltreatment for the purposes of facilitating early—and ostensibly less severe—intervention, there are two additional ethical considerations worth noting. First, a baseline concern with implementing algorithmic prediction models using a selected sample, as is the case of our reliance on Medicaid claims, is that the resulting predictions may capture variation due to poverty, rather than maltreatment risk, thus reinforcing existing disparities. However, as our model was trained to predict maltreatment identification within the Medicaid system, it is plausibly less biased than a model that predicts CPS reporting, because parents voluntarily seek medical care, whereas CPS reporting is involuntary and subject to race- and class-based bias. Relatedly, a second concern is whether Medicaid claims should be repurposed to identify children who face a high risk of maltreatment, given that the data were not collected for this purpose. The objective of our analysis was to test whether stand-alone claims can reliably predict maltreatment, with particular attention to minimizing type I and type II errors. We conclude that the benefits of using claims alone do not outweigh the accuracy-related costs.
Conclusions
While many county and state CPS agencies use PRM and other decision support tools to aid social workers in high-stakes screening and removal decisions, such as the pathbreaking AFST, tools to support medical personnel in medical settings primarily consist of EHR-based screeners and triggers, training programs, and child abuse pediatric teams (7, 8, 10, 11, 29). Although several screening tools demonstrate remarkably high predictive accuracy for physical abuse injuries, few screen children for all forms of abuse and neglect, and the use of PRMs in medical settings is limited in practice (7–10, 22, 23, 30–32). The results of our analysis suggest that PRMs based entirely upon Medicaid claims exhibits low-to-moderate accuracy and should not be used as a substitute for human intuition.
That said, our analysis yields two findings regarding the patterns of outcomes after maltreatment. First, in the year following the first indication of CAN in medical records, children are more likely to have subsequent high-risk CAN indicators (Supplementary eTable 2) and ED encounters for any form of injury, self-harm or otherwise. Second, the diagnosis and medical use patterns of control children with a high CAN risk resembled those of case children, implying that screening tools could rely on a relatively short list of diagnostic codes and encounters to facilitate earlier intervention for at-risk children. Rather than waiting until CAN is confirmed to generate a CPS referral, earlier identification of CAN risk would enable medical providers to address children's needs immediately through targeted interventions, referrals to public assistance such as nutritional or income support programs, or family support services that are less intensive than CPS.
Data availability statement
The data analyzed in this study are subject to the following licenses/restrictions: the HEAL data are restricted and only accessible by approved researchers. Requests to access these datasets should be directed to John Billings,am9obi5iaWxsaW5nc0BueXUuZWR1.
Author contributions
JP: Conceptualization, Writing – original draft, Writing – review & editing, Methodology, Validation, Investigation. SG: Methodology, Data curation, Supervision, Conceptualization, Writing – review & editing, Resources, Writing – original draft. RH: Validation, Data curation, Methodology, Visualization, Formal analysis, Investigation, Writing – review & editing, Writing – original draft. JW: Supervision, Writing – original draft, Writing – review & editing, Investigation, Resources, Conceptualization.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This research received support from Help For Children for initial data analysis.
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2025.1698582/full#supplementary-material
References
1. U.S. Department of Health & Human Services, Administration for Children and Families, Administration on Children, Youth and Families, Children's Bureau. Child Maltreatment 2022. (2024). Available online at: https://www.acf.hhs.gov/cb/data-research/child-maltreatment (Accessed September 3, 2025).
2. Ravichandiran N, Schuh S, Bejuk M, Al-Harthy N, Shouldice M, Au H, et al. Delayed identification of pediatric abuse-related fractures. Pediatrics. (2010a) 125(1):60–6. doi: 10.1542/peds.2008-3794
3. Thorpe EL, Zuckerbraun NS, Wolford JE, Berger RP. Missed opportunities to diagnose child physical abuse. Pediatr Emerg Care. (2014) 30(11):771–6. doi: 10.1097/PEC.0000000000000257
4. Oral R, Blum KL, Johnson C. Fractures in young children: are physicians in the emergency department and orthopedic clinics adequately screening for possible abuse? Pediatr Emerg Care. (2003) 19(3):148–53. doi: 10.1097/01.pec.0000081234.20228.33
5. King WK, Kiesel EL, Simon HK. Child abuse fatalities: are we missing opportunities for intervention? Pediatr Emerg Care. (2006) 22(4):211–4. doi: 10.1097/01.pec.0000208180.94166.dd
6. Jenny C, Hymel KP, Ritzen A, Reinert SE, Hay TC. Analysis of missed cases of abusive head trauma. J Am Med Assoc. (1999) 281(7):621–6. doi: 10.1001/jama.281.7.621
7. Berger RP, Lindberg DM. Early recognition of physical abuse: bridging the gap between knowledge and practice. J Pediatr. (2019) 204:16–23. doi: 10.1016/j.jpeds.2018.07.081
8. Stilwell L, Golonka M, Ankoma-Sey K, Yancy M, Kaplan S, Terrell L, et al. Electronic health record tools to identify child maltreatment: scoping literature review and key informant interviews. Acad Pediatr. (2022) 22(5):718–28. doi: 10.1016/j.acap.2022.01.017
9. Hanson RF, Zhu V, Are F, Espeleta H, Wallis E, Heider P, et al. Initial development of tools to identify child abuse and neglect in pediatric primary care. BMC Med Inform Decis Mak. (2023) 23(1):1–11. doi: 10.1186/s12911-023-02361-7
10. Shahi N, Shahi AK, Phillips R, Shirek G, Lindberg DM, Moulton SL. Using deep learning and natural language processing models to detect child physical abuse. J Pediatr Surg. (2021) 56(12):2326–32. doi: 10.1016/j.jpedsurg.2021.03.007
11. Vaithianathan R, Putnam-Hornstein E, Chouldechova A, Benavides-Prado D, Berger R. Hospital injury encounters of children identified by a predictive risk model for screening child maltreatment referrals: evidence from the allegheny family screening tool. JAMA Pediatr. (2020) 1010:1–7. doi: 10.1001/jamapediatrics.2020.2770
12. Casanueva C, Armstrong JM, Kluckman M, Larrabee HM, Dolan M, Ringeisen H. OPRE Report #2024-024. NSCAW III Baseline Report (2017-2022): Introduction to NSCAW III. (2024). Available online at: https://acf.gov/opre/report/nscaw-iii-baseline-report-2017-2022-introduction-nscaw-iii (Accessed January 5, 2026).
13. Mark TL, Dolan M, Allaire B, Bradley C. OPRE Report 2023-038. Linking child welfare and medicaid data: lessons learned from two states. Available online at: https://www.ncbi.nlm.nih.gov/books/NBK602544/ (Accessed September 3, 2025).
14. Swedo E, Idaikkadar N, Leemis R, Dias T, Radhakrishnan L, Stein Z, et al. Trends in U.S. Emergency department visits related to suspected or confirmed child abuse and neglect among children and adolescents aged <18 years before and during the COVID-19 pandemic — united States, January 2019–September 2020. MMWR Morb Mortal Wkly Rep. (2020) 69(49):1841–7. doi: 10.15585/mmwr.mm6949a1
15. Schnitzer PG, Slusher PL, Kruse RL, Tarleton MM. Identification of ICD codes suggestive of child maltreatment. Child Abuse Negl. (2011) 35(1):3–17. doi: 10.1016/j.chiabu.2010.06.008
16. Blagus R, Lusa L. Gradient boosting for high-dimensional prediction of rare events. Comput Stat Data Anal. (2017) 113:19–37. doi: 10.1016/j.csda.2016.07.016
17. Abaluck J, Agha L, Kabrhel C, Raja A, Venkatesh AK. The determinants of productivity in medical testing: intensity and allocation of care. Am Econ Rev. (2016) 106(12):3730–64. doi: 10.1257/aer.20140260
18. Chandra A, Staiger D. Productivity spillovers in healthcare: evidence from the treatment of heart attacks. J Pol Econ. (2007) 115:103–40. doi: 10.1086/512249
19. Mullainathan S, Obermeyer Z. Diagnosing physician error: a machine learning approach to low-value health care. Q J Econ (2022) 137(2):679–727. doi: 10.1093/qje/qjab046
20. Goldhaber-Fiebert JD, Prince L. Impact evaluation of a predictive risk modeling tool for Allegheny County's Child Welfare Office (2019). Available online at: https://analytics.alleghenycounty.us/wp-content/uploads/2024/05/23-ACDHS-19_FamilyScreeningToolUpdate_v2.pdf (Accessed January 5, 2026).
21. Ahn E, An R, Jonson-Reid M, Palmer L. Leveraging machine learning for effective child maltreatment prevention: a case study of home visiting service assessments. Child Abuse Negl. (2024) 151. doi: 10.1016/j.chiabu.2024.106706
22. Krawiec C, Gerard S, Iriana S, Berger R, Levi B. What we can learn from failure: an EHR-based child protection alert system. Child Maltreat. (2020) 25(1):61–9. doi: 10.1177/1077559519848845
23. Berger RP, Saladino RA, Fromkin J, Heineman E, Suresh S, McGinn T. Development of an electronic medical record-based child physical abuse alert system. J Am Med Inform Assoc. (2018) 25(2):142–9. doi: 10.1093/jamia/ocx063
24. Palusci VJ, Smith EG, Paneth N. Predicting and responding to physical abuse in young children using NCANDS. Child Youth Serv Rev. (2005) 27(6):667–82. doi: 10.1016/j.childyouth.2004.12.004
25. Putnam-Hornstein E. Report of maltreatment as a risk factor for injury death: a prospective birth cohort study. Child Maltreat. (2011) 16(3):163–74. doi: 10.1177/1077559511411179
26. Putnam-Hornstein E, Cleves MA, Licht R, Needell B. Risk of fatal injury in young children following abuse allegations: evidence from a prospective, population-based study. Am J Public Health. (2013) 103(10):39–44. doi: 10.2105/AJPH.2013.301516
27. Schneiderman JU, Hurlburt MS, Leslie LK, Zhang J, Horwitz SMC. Child, caregiver, and family characteristics associated with emergency department use by children who remain at home after a child protective services investigation. Child Abuse Negl. (2012) 36(1):4–11. doi: 10.1016/j.chiabu.2011.07.008
28. Li G, Udalova V. Evaluating the completeness of mortality information in medicaid records. Health Serv Res. (2025) 60(6). doi: 10.1111/1475-6773.70007
29. Eldred LM, Gifford EJ. Empirical evidence on legal levers aimed at addressing child maltreatment. Child Youth Serv Rev. (2016) 60:11–9. doi: 10.1016/j.childyouth.2015.11.014
30. Gonzalez DO, Deans KJ. Hospital-based screening tools in the identification of non-accidental trauma. Semin Pediatr Surg. (2017) 26(1):43–6. doi: 10.1053/j.sempedsurg.2017.01.002
31. Hoft M, Haddad L. Screening children for abuse and neglect: a review of the literature. J Forensic Nurs. (2017) 13(1):26–34. doi: 10.1097/JFN.0000000000000136
Keywords: child abuse and neglect (CAN), child abuse prevention, child victimization, Medicaid claims data, predictive risk model
Citation: Pac J, Glied S, Howland R and Waldfogel J (2026) Medicaid claims alone have high sensitivity but low specificity in identifying child abuse and neglect. Front. Pediatr. 13:1698582. doi: 10.3389/fped.2025.1698582
Received: 3 September 2025; Revised: 15 December 2025;
Accepted: 22 December 2025;
Published: 2 February 2026.
Edited by:
Melissa Jonson-Reid, Washington University in St. Louis, United StatesReviewed by:
Genevieve Graaf, University of Texas at Arlington, United StatesAviv Landau, University of Pennsylvania, United States
Copyright: © 2026 Pac, Glied, Howland and Waldfogel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jessica Pac, cGFjQHdpc2MuZWR1
Renata Howland3