ORIGINAL RESEARCH article

Front. Genet.

Sec. Statistical Genetics and Methodology

Volume 16 - 2025 | doi: 10.3389/fgene.2025.1554773

Uncovering codon usage patterns during murine embryogenesis and tissue-specific developmental diseases

Provisionally accepted
Sarah  E FumagalliSarah E Fumagalli1Sean  SmithSean Smith2Brian  LinBrian Lin1Rahul  PaulRahul Paul2Collin  CampbellCollin Campbell2Luis  Santana-QuinteroLuis Santana-Quintero2Anton  GolikovAnton Golikov2Juan  IblaJuan Ibla3Haim  BarHaim Bar4Anton  A KomarAnton A Komar5,6Ryan  C HuntRyan C Hunt7Michael  DicuccioMichael Dicuccio8Chava  Kimchi-SarfatyChava Kimchi-Sarfaty1*
  • 1Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research (CBER), United States Food and Drug Administration, Silver Spring, United States
  • 2High-performance Integrated Virtual Environment (HIVE), Office of Biostatistics and Pharmacovigilance (OBPV), Center for Biologics Evaluation and Research (CBER), United States Food and Drug Administration, Silver Spring, United States
  • 3Department of Anesthesiology, Critical Care and Pain Medicine, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States
  • 4Department of Statistics, University of Connecticut, Storrs, CT, United States
  • 5Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, Georgia, United States
  • 6Department of Biochemistry and Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH, United States
  • 7Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies (OTAT), Center for Biologics Evaluation & Research, US Food and Drug Administration, Silver Spring, Maryland, United States
  • 8Not Applicable, Rockville, MD, United States

The final, formatted version of the article will be published soon.

Mouse models share significant genetic similarities with humans and have expanded our understanding of how embryonic tissue-specific genes influence disease states. By improved analyses of temporal, transcriptional data from these models, we can capture unique tissue codon usage patterns and determine how deviations from these patterns can influence developmental disorders. We analyzed transcriptomic-weighted data from four mouse strains across three different germ layer tissues (liver, heart, and eye) and through embryonic stages. Applying a multifaceted approach, we calculated relative synonymous codon usage, reduced the dimensionality, and employed machine learning clustering techniques. These techniques identified relative synonymous codon usage differences/similarities among strains and deviations in codon usage patterns between healthy and disease-linked genes. Original transcriptomic mouse data and RefSeq gene sequences can be found at the associated Mouse Embryo CoCoPUTs (codon and codon pair usage tables) website. Future studies can leverage this resource to uncover further insights into the dynamics of embryonic development and the corresponding codon usage biases that are paramount to understanding disease processes of embryologic origin.

Keywords: mouse embryology, tissue-specific, transcriptomic-weighted, Relative synonymous codon usage, Clustering methods, disease-associated comparison, machine learning

Received: 07 Jan 2025; Accepted: 21 Apr 2025.

Copyright: © 2025 Fumagalli, Smith, Lin, Paul, Campbell, Santana-Quintero, Golikov, Ibla, Bar, Komar, Hunt, Dicuccio and Kimchi-Sarfaty. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Chava Kimchi-Sarfaty, Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research (CBER), United States Food and Drug Administration, Silver Spring, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.