Tandem Repeats: Methods and Roles in Protein Function and Evolution

  • 4,028

    Total downloads

  • 21k

    Total views and downloads

About this Research Topic

Submission closed

Background

This article collection explores the diverse biological roles, evolutionary significance, and technological advancements related to protein repeats and short tandem repeats (STRs) in genomic and proteomic contexts. The first article examines how mutations affect natural repeat proteins, highlighting the context-dependent nature of mutational impacts within structurally symmetric motifs such as TPR and Ankyrin repeats. The second article leverages the high mutation rate of STRs as powerful markers for tracing recent evolutionary and demographic patterns in wild orangutan populations, showcasing their utility in bridging population genetics and adaptation studies. The third article introduces the Tandem Repeat Annotation Library (TRAL), a comprehensive and integrative open-source platform for detecting and analyzing tandem repeats in genomic sequences, reflecting growing computational capabilities in repeat biology. The final article focuses on human protein STRs, revealing their frequent occurrence within intrinsically disordered regions, enrichment in nuclear and cancer-related functions, and potential contributions to cancer pathology even in tumors lacking classical microsatellite instability. Together, these works reveal a multi-dimensional view of how tandem repeats shape molecular function, drive evolution and adaptation, impact disease, and benefit from advancing analytical tools.
---

Tandem repeats (TRs) are adjacent repetitive stretches of genomic DNA, found in abundance across all kingdoms of life. Recent estimates suggest as much as > 50% of an organism's proteins may contain at least one TR region. With time, the originally perfect repeated unit copies may diverge via mutations, deletions and insertions, which are shaped by existing structural or functional constraints. TRs display a wide variety of repeating units, from homorepeats (one repeating amino acid) and short TRs (1-10 nucleotides or STRs) to whole protein domains. Especially, shorter TRs are known for their orders of magnitude high mutation rates compared to SNPs and indels.

Due to their repetitive structure, TRs have been posing challenges at many stages of genomic analyses: from sequencing and genome assembly to gene annotation and evolutionary analyses. In the last decade, TR-suitable methods and resources are emerging one after the other allowing accurate TR annotation and genotyping integrated in existing genomic pipeline workflows. Studies using these resect approaches in a systematic fashion reveal that TRs are found in both coding and non-coding genomic regions, with various effects on protein function and expression. For example, protein TRs are often cited for their associations with diseases and immunity-related functions. Non-coding TRs, such as those found in promoters, may regulate the expression of neighboring genes.

Recently, numerous studies aimed to evaluate these effects in a more systematic fashion. This included both the development of new methods for TR analyses, as well as genome-wide studies of associations between TR variation, gene expression and human disease, such as cancers. TRs provide a rich source of variation in populations, and therefore a perfect playground for natural selection forces. Therefore, analysis of TR variation in populations and over longer evolutionary time may provide crucial insights leading us to a better understanding of their role in molecular interactions and functions of key proteins.

Within this research topic we welcome any research articles that address some of the current challenges in the TR field. While we welcome articles focusing on protein TRs, the contributions to this research topic may also relate to non-coding TRs and their effects. Overall, potential contributions to this research topic can include (but not limited to):

● Bioinformatics methods for TR detection, annotation and analysis
● Benchmarking existing methods for TR detection and analysis
● Computational methods for structure prediction of protein TR regions
● Computational analyses of TRs and their effects on protein folding, function, and expression
● Modeling of mechanisms of TR evolution and origin
● Methods and analyses of evolution of TRs regions
● Study cases analyzing the functional role of TRs in different biological scenario, such as host-pathogen interactions, immune, resistance or pathogenicity functions
● Large-scale systematic analyses of TR regions with insights on function and organismal biology
● Analyses of TR roles in protein interaction networks and biological pathways
● Classifications of protein TRs based on sequence profile models and structure
● Resources focusing on TRs: relevant databases, workflows, visualization tools
● Reviews of the above

Research Topic Research topic image

Keywords: Tandem repeat, homorepeat, STR, evolution, protein function, protein structure, gene expression

Important note: All contributions to this Research Topic must be within the scope of the section and journal to which they are submitted, as defined in their mission statements. Frontiers reserves the right to guide an out-of-scope manuscript to a more suitable section or journal at any stage of peer review.

Topic editors