ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Marine Megafauna

Volume 12 - 2025 | doi: 10.3389/fmars.2025.1562045

Genomic infrastructure for cetacean research and conservation: Reference genomes for eight families spanning the cetacean tree of life

Provisionally accepted
Phillip  A MorinPhillip A Morin1*Bernhard  BeinBernhard Bein2,3Chiara  BortoluzziChiara Bortoluzzi4Yury  V BukhmanYury V Bukhman5Taylor  HainsTaylor Hains6,7Dorothea  HeimeierDorothea Heimeier8,9Marcela  Uliano-SilvaMarcela Uliano-Silva10Dominic  E AbsolonDominic E Absolon10Linelle  AbuegLinelle Abueg11Jessica  Antosiewicz- BourgetJessica Antosiewicz- Bourget5Jennifer  R BalaccoJennifer R Balacco11Robert  BondeRobert Bonde12Nadolina  BrajukaNadolina Brajuka11Andrew  C BrownlowAndrew C Brownlow13Emma  L CarrollEmma L Carroll8Molly  CarterMolly Carter14Joanna  CollinsJoanna Collins10Nicholas  John DavisonNicholas John Davison13Amy  DentonAmy Denton10Olivier  FedrigoOlivier Fedrigo11Giulio  FormentiGiulio Formenti11Andrew  FooteAndrew Foote15Guido  GalloGuido Gallo16Carola  GreveCarola Greve2Marlys  L HouckMarlys L Houck17Caroline  HowardCaroline Howard10Jeff  JacobsenJeff Jacobsen18Nivesh  JainNivesh Jain11Ksenia  KrasheninnikovaKsenia Krasheninnikova10Brigid  Mary MaloneyBrigid Mary Maloney11Bethan  F ManleyBethan F Manley10Thomas  C MathersThomas C Mathers10Shane  A MccarthyShane A Mccarthy10Michael  R McgowenMichael R Mcgowen19Susanne  MeyerSusanne Meyer20Jacquelyn  Mountcastle, MPHJacquelyn Mountcastle, MPH11Benjamin  Adam NeelyBenjamin Adam Neely21Brian  O'tooleBrian O'toole11Sarah  PelanSarah Pelan10P  RoselP Rosel22Teri  K RowlesTeri K Rowles23Oliver  A RyderOliver A Ryder17Tilman  SchellTilman Schell2Ying  SimsYing Sims10Judy  St LegerJudy St Leger24Ron  StewartRon Stewart5Kerstin  TernesKerstin Ternes25Tatiana  TilleyTatiana Tilley11Conor  WhelanConor Whelan11Jonathan  WoodJonathan Wood10Michael  HillerMichael Hiller26*Mark  BlaxterMark Blaxter27*Erich  David JarvisErich David Jarvis28,29*
  • 1Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration, La Jolla, United States
  • 2LOEWE Centre for Translational Biodiversity Genomics, Senckenberg Research Institute, Goethe University, Frankfurt, Germany, Frankfurt, Germany
  • 3Institute of Cell Biology & Neuroscience, Faculty of Biological Sciences, Goethe University Frankfurt, Frankfurt am Main, Hesse, Germany
  • 4Wellcome Sanger Institute (WT), Hinxton, Cambridgeshire, United Kingdom
  • 5Morgridge Institute for Research, Madison, Wisconsin, United States
  • 6Committee on Evolutionary Biology, The University of Chicago, Chicago, Illinois, United States
  • 7Earth Sciences, Negaunee Integrative Research Center, Field Museum of Natural History, Chicago, Illinois, United States
  • 8School of Biological Sciences, Faculty of Science, The University of Auckland, Auckland, Auckland, New Zealand
  • 9Faculty of Life Sciences and Aquaculture, Nord University, Bodø, Nordland, Norway
  • 10Tree of Life, Wellcome Sanger Institute, Cambridge, United Kingdom
  • 11Vertebrate Genome Laboratory, The Rockefeller University, New York, United States
  • 12Wetland and Aquatic Research Center, United States Geological Survey (USGS), Gainesville, FL, United States
  • 13School of Biodiversity, One Health and Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
  • 14Wellcome Genome Campus, Hinxton, United Kingdom
  • 15Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
  • 16Department of Biosciences, University of Milan, Milan, Lombardy, Italy
  • 17Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, Escondido, CA, United States
  • 18V.E. Enterprises, Arcata, CA, United States
  • 19Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States
  • 20Neuroscience Research Institute, University of California, Santa Barbara, Santa Barbara, California, United States
  • 21Hollings Marine Laboratory, National Institute of Standards and Technology, Charleston, South Carolina, United States
  • 22Marine Mammal and Turtle Division, Southeast Fisheries Science Center, National Marine Fisheries Service, NOAA, Lafayette, LA, United States
  • 23Office of Protected Resources, NOAA Fisheries, Silver Spring, Maryland, United States
  • 24New York State College of Veterinary Medicine, Cornell University, Ithaca, NY, United States
  • 25Zoo Duisburg AG, Duisburg, North Rhine-Westphalia, Germany
  • 26Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Hesse, Germany
  • 27Tree of Life, Wellcome Sanger Institute (WT), Hinxton, Cambridgeshire, United Kingdom
  • 28Vertebrate Genome Laboratory, The Rockefeller University, New York City, New York, United States
  • 29Howard Hughes Medical Institute (HHMI), Chevy Chase, Maryland, United States

The final, formatted version of the article will be published soon.

Reference genomes from representative species across families provide the critical infrastructure for research and conservation. The Cetacean Genomes Project (CGP) began in early 2020 to facilitate the generation of near error-free, chromosome-resolved reference genomes for all cetacean species. Towards that goal, and using the methods, goals and genome assembly quality standards of the Vertebrate Genomes Project (VGP), we generated 13 new reference genomes across eight of the 14 cetacean families. Additionally, we summarize the genome assembly characteristics for 18 species, including these newly-generated and five published genome assemblies that meet the completeness and quality standards. We infer ancestral linkage groups (ALG) for cetaceans, showing that the ancestral karyotype of 22 ALGs is largely conserved in extant species, except for Ziphiidae, and for Balaenidae and Kogiidae, which exhibit similar independent fusions. Gene annotation, characterization of historical demography, heterozygosity and runs of homozygosity (ROH) reveal important information for conservation applications. By comparing the new reference genomes to previous draft assemblies, we show that the reference genomes have enhanced characteristics that will support and promote scientific research. Specifically, the genomes improve resolution and characterization of repetitive elements, provide validation (or exclusion) of genes linked to complex traits, and allow more complete characterization of gene regions such as the highly complex Major Histocompatibility Complex (MHC) Class I and II gene clusters that are important for population health.

Keywords: Reference genome, Comparative genomics, conservation genomics, whale, dolphin, porpoise, Cetacea Resources, Writing -review & editing. Phillip A Morin: Conceptualization

Received: 16 Jan 2025; Accepted: 30 May 2025.

Copyright: © 2025 Morin, Bein, Bortoluzzi, Bukhman, Hains, Heimeier, Uliano-Silva, Absolon, Abueg, Antosiewicz- Bourget, Balacco, Bonde, Brajuka, Brownlow, Carroll, Carter, Collins, Davison, Denton, Fedrigo, Formenti, Foote, Gallo, Greve, Houck, Howard, Jacobsen, Jain, Krasheninnikova, Maloney, Manley, Mathers, Mccarthy, Mcgowen, Meyer, Mountcastle, MPH, Neely, O'toole, Pelan, Rosel, Rowles, Ryder, Schell, Sims, St Leger, Stewart, Ternes, Tilley, Whelan, Wood, Hiller, Blaxter and Jarvis. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Phillip A Morin, Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration, La Jolla, United States
Michael Hiller, Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Hesse, Germany
Mark Blaxter, Tree of Life, Wellcome Sanger Institute (WT), Hinxton, CB10 1SA, Cambridgeshire, United Kingdom
Erich David Jarvis, Vertebrate Genome Laboratory, The Rockefeller University, New York City, 10065, New York, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.