AUTHOR=Wang Nuohan , Ma Qiang , Ma Jianjiang , Pei Wenfeng , Liu Guoyuan , Cui Yupeng , Wu Man , Zang Xinshan , Zhang Jinfa , Yu Shuxun , Ma Lingjian , Yu Jiwen TITLE=A Comparative Genome-Wide Analysis of the R2R3-MYB Gene Family Among Four Gossypium Species and Their Sequence Variation and Association With Fiber Quality Traits in an Interspecific G. hirsutum × G. barbadense Population JOURNAL=Frontiers in Genetics VOLUME=Volume 10 - 2019 YEAR=2019 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2019.00741 DOI=10.3389/fgene.2019.00741 ISSN=1664-8021 ABSTRACT=Cotton (Gossypium spp.) is the most important natural fiber crop in the world. The R2R3-MYB gene family is a large gene family involved in many plant functions including cotton fiber development. Although previous studies have reported the phylogenetic relationship, gene structures and expression patterns of the gene family in tetraploid G. hirsutum and diploid G. raimondii, little is known on the sequence variation of the R2R3-MYB gene family members between G. hirsutum and G. barbadense and their involvement in the natural quantitative variation of fiber quality and yield. In this study, a comprehensive genome-wide comparative analysis was performed among the four Gossypium species with genome sequences, i.e., tetraploid G. hirsutum (AD1) and G. barbadense (AD2) as well as their likely ancestral diploid extants G. raimondii (D5) and G. arboreum (A2), leading to the identification of 406, 393, 216 and 213 MYB genes, respectively. To elucidate whether the R2R3-MYB genes are genetically associated with fiber quality traits, 86 R2R3-MYB genes were co-localized with quantitative trait loci (QTL) hotspots for fiber quality and yield, including 42 genes localized within the fiber length QTL hotspots, in interspecific G. hirsutum × G. barbadense populations. There were 20 interspecific nonsynonymous single-nucleotide polymorphism (SNP) sites between the two tetraploid cultivated species, 16 of which developed from 11 R2R3-MYB genes were significantly correlated with fiber quality and yield in a backcross inbred population (BIL) of G. hirsutum × G. barbadense at least in one of the four field tests. Taken together, these results indicate that the sequence variation in these 11 R2R3-MYB genes is associated with the natural variation (i.e., QTL) in fiber quality and yield. Moreover, the functional SNPs of five R2R3-MYB allele pairs from the AD1 and AD2 genomes were significantly correlated with the fiber quality related gene expression in fiber development. The results will be useful for further elucidating the role of the R2R3-MYB genes during fiber development.