<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">972557</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2022.972557</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>G &#xd7; EBLUP: A novel method for exploring genotype by environment interactions and genomic prediction</article-title>
<alt-title alt-title-type="left-running-head">Song et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2022.972557">10.3389/fgene.2022.972557</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Song</surname>
<given-names>Hailiang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1854870/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wang</surname>
<given-names>Xue</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1989665/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Guo</surname>
<given-names>Yi</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1989649/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Ding</surname>
<given-names>Xiangdong</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/25830/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Beijing Key Laboratory of Fisheries Biotechnology</institution>, <institution>Fisheries Science Institute</institution>, <institution>Beijing Academy of Agriculture and Forestry Sciences</institution>, <addr-line>Beijing</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture and Rural Affairs</institution>, <institution>National Engineering Laboratory for Animal Breeding</institution>, <institution>College of Animal Science and Technology</institution>, <institution>China Agricultural University</institution>, <addr-line>Beijing</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/37033/overview">Li Ma</ext-link>, University of Maryland, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1883344/overview">Rostam Abdollahi</ext-link>, Aviagen, United Kingdom</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/230665/overview">Hailan Liu</ext-link>, Maize Research Institute of Sichuan Agricultural University, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Xiangdong Ding, <email>xding@cau.edu.cn</email>, <email>orcid.org/0000000226842551</email>
</corresp>
<fn fn-type="equal" id="fn1">
<label>
<sup>&#x2020;</sup>
</label>
<p>These authors have contributed equally to this work</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>09</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>972557</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>06</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>08</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Song, Wang, Guo and Ding.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Song, Wang, Guo and Ding</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Genotype by environment (G &#xd7; E) interaction is fundamental in the biology of complex traits and diseases. However, most of the existing methods for genomic prediction tend to ignore G &#xd7; E interaction (GEI). In this study, we proposed the genomic prediction method G &#xd7; EBLUP by considering GEI. Meanwhile, G &#xd7; EBLUP can also detect the genome-wide single nucleotide polymorphisms (SNPs) subject to GEI. Using comprehensive simulations and analysis of real data from pigs and maize, we showed that G &#xd7; EBLUP achieved higher efficiency in mapping GEI SNPs and higher prediction accuracy than the existing methods, and its superiority was more obvious when the GEI variance was large. For pig and maize real data, compared with GBLUP, G &#xd7; EBLUP showed improvement by 3% in the prediction accuracy for backfat thickness, while our findings indicated that the trait of days to 100&#xa0;kg of pig was not affected by GEI and G &#xd7; EBLUP did not improve the accuracy of genomic prediction for the trait. A significant advantage was observed for G &#xd7; EBLUP in maize; the prediction accuracy was improved by &#x223c;5.0 and 7.7% for grain weight and water content, respectively. Furthermore, G &#xd7; EBLUP was not influenced by the number of environment levels. It could determine a favourable environment using SNP Bayes factors for each environment, implying that it is a robust and useful method for market-specific animal and plant breeding. We proposed G &#xd7; EBLUP, a novel method for the estimation of genomic breeding value by considering GEI. This method identified the genome-wide SNPs that were susceptible to GEI and yielded higher genomic prediction accuracies and lower mean squared error compared with the GBLUP method.</p>
</abstract>
<kwd-group>
<kwd>G &#xd7; E interaction</kwd>
<kwd>snps</kwd>
<kwd>bayes factors</kwd>
<kwd>traits</kwd>
<kwd>genomic prediction</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Genomic selection (GS) (<xref ref-type="bibr" rid="B18">Meuwissen et al., 2001</xref>), which relies on linkage disequilibrium between single nucleotide polymorphisms (SNPs) and causative variants, has become a useful tool in animal (<xref ref-type="bibr" rid="B30">VanRaden et al., 2009</xref>) and plant (<xref ref-type="bibr" rid="B34">Zhong et al., 2009</xref>) breeding. However, GS analytical modelling usually assumes no G &#xd7; E interaction (GEI) and opposes the true genetic architecture of complex traits. In fact, interaction is fundamental in biology, and there is growing interest in estimating breeding value by considering GEI and using genome-wide SNPs.</p>
<p>The current state-of-the-art methods for the estimation of genomic breeding value without considering GEI include GBLUP (<xref ref-type="bibr" rid="B29">VanRaden, 2008</xref>) and Bayes-Alphabet (e.g., Bayes A, Bayes B and Bayes C) (<xref ref-type="bibr" rid="B8">Habier et al., 2011</xref>). Multi-trait (<xref ref-type="bibr" rid="B23">Richard, 1996</xref>) and reaction norm models (<xref ref-type="bibr" rid="B22">Rebecka et al., 2002</xref>) are the two prevalent GEI-handling methods that are used for genomic evaluations. However, the multi-trait model could only capture GEI in a limited number of environments, and the computational demands of multi-trait models would increase rapidly with an increase in the number of environment levels (<xref ref-type="bibr" rid="B26">Song et al., 2020a</xref>). The reaction norm model captures only part of the GEI because it needs to accommodate a continuous range of environmental values and cannot select excellent individuals using the unique estimated breeding value in actual breeding (<xref ref-type="bibr" rid="B11">Jarquin et al., 2014</xref>; <xref ref-type="bibr" rid="B27">Song et al., 2020b</xref>).</p>
<p>To explore GEI, certain G &#xd7; E interaction-affected methods have been proposed for detecting SNPs. <xref ref-type="bibr" rid="B31">Wang et al. (2019)</xref> proposed several methods (Bartlett, F-Killeen, L-mean and L-median) that can be used to infer GEI from variance quantitative trait locus (vQTL) analysis without requiring environmental factor measurements. <xref ref-type="bibr" rid="B20">Moore et al. (2019)</xref> proposed StructLMM, which is useful for studying interactions with hundreds of environment variables. Moreover, <xref ref-type="bibr" rid="B14">Kerin and Marchini. (2020)</xref> proposed LEMMA, which infers GEI using a Bayesian whole-genome regression model. However, in Wang&#x2019;s method, GEI-affected SNP detection is possible because of selection, epistasis and phantom vQTL, instead of only GEI (<xref ref-type="bibr" rid="B31">Wang et al., 2019</xref>). StructLMM and LEMMA do not currently enable accounting for relatedness, and these methods cannot be efficiently applied to livestock and plant breeding due to the close genetic relationships that widely exists between individuals (Kerin et al., 2020; <xref ref-type="bibr" rid="B20">Moore et al., 2019</xref>). Therefore, it is essential to develop new methods for the estimation of genomic breeding value by considering GEI.</p>
<p>In this study, we proposed a novel approach for the estimation of genomic breeding value by considering GEI, which can handle environment variables in different dimensions. The basic principle of the new approach was to first detect the genome-wide markers affected by GEI (G &#xd7; EWAS), in which a score-test statistics was implemented to identify the significant GEI-associated SNPs. Next, all markers were classified into SNPs with/without GEI to construct the genomic relationship matrices separately and predict the genomic breeding value using the mixed model (G &#xd7; EBLUP). For its general application, the efficiency of the proposed method was evaluated through simulation study and real data from pigs and maize.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec id="s2-1">
<title>Ethics statement</title>
<p>The animal study was reviewed and approved by Animal handling and sample collection were conducted according to protocols approved by the Institutional Animal Care and Use Committee (IACUC) at China Agricultural University. All authors strictly complied with the Regulations on the Administration of Laboratory Animals (Order No. 2 of the State Science and Technology Commission of the People&#x2019;s Republic of China, 1988). There was no use of human participants, data or tissues.</p>
</sec>
<sec id="s2-2">
<title>G &#xd7; EWAS</title>
<p>G &#xd7; EWAS extends the conventional linear mixed model by including an additional per-individual effect term accounting for G &#xd7; E, which can be represented as N <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mo>&#xd7;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> 1 vector, <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. The model was defined as follows:<disp-formula id="e1">
<mml:math id="m3">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mi mathvariant="bold">b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">x</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b2;</mml:mi>
<mml:mi mathvariant="bold-italic">G</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">x</mml:mi>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="bold-italic">G</mml:mi>
<mml:mi mathvariant="bold-italic">x</mml:mi>
<mml:mi mathvariant="bold-italic">E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">u</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold-italic">e</mml:mi>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>where <bold>y</bold> is the vector of observed phenotypic values; <inline-formula id="inf3">
<mml:math id="m4">
<mml:mrow>
<mml:mi mathvariant="normal">b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of fixed effects; <inline-formula id="inf4">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi mathvariant="bold">G</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the average effect of gene substitution of a particular SNP; and <inline-formula id="inf5">
<mml:math id="m6">
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of the genotype indicator variable of the variant coded as 0, 1 or 2. <inline-formula id="inf6">
<mml:math id="m7">
<mml:mrow>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo>&#x2299;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf7">
<mml:math id="m8">
<mml:mrow>
<mml:mo>&#x2299;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> denotes the element-wise (Hadamard) product and <inline-formula id="inf8">
<mml:math id="m9">
<mml:mrow>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> denotes the <inline-formula id="inf9">
<mml:math id="m10">
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">N</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>diagonal matrix whose diagonal is <inline-formula id="inf10">
<mml:math id="m11">
<mml:mrow>
<mml:mi mathvariant="bold-italic">x</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>.The per-individual effect size vector <inline-formula id="inf11">
<mml:math id="m12">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is defined as a random effect, following the multivariate normal distribution <inline-formula id="inf12">
<mml:math id="m13">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x223c;</mml:mo>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf13">
<mml:math id="m14">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, is the variance and covariance matrix of the G &#xd7; E effect, <inline-formula id="inf14">
<mml:math id="m15">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi mathvariant="double-struck">R</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> parameterises how per-individual effects covary across individuals and is calculated as a function of observed environment variables. <inline-formula id="inf15">
<mml:math id="m16">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
<mml:mo>&#x2261;</mml:mo>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="normal">E</mml:mi>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf16">
<mml:math id="m17">
<mml:mrow>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the <inline-formula id="inf17">
<mml:math id="m18">
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">L</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix of L observed environments. The linear covariance function (<inline-formula id="inf18">
<mml:math id="m19">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="normal">E</mml:mi>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>) was primarily used because of two appealing properties. First, as the number of samples typically exceeds the number of environments in larger populations (L &#x3c;&#x3c; N), a low-rank linear covariance is noted, which enables parameter inference with a computational complexity that scales linearly with the increasing population size. Second, a linear covariance is directly interpretable as there is one-to-one correspondence between G &#xd7; EWAS and linear regression using L covariates to account for GEI. Notably, for the special case of <inline-formula id="inf19">
<mml:math id="m20">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, the model G &#xd7; EWAS reduces to a standard linear mixed model for genome-wide association study; thus, G &#xd7; EWAS is a single-SNP regression model; <inline-formula id="inf20">
<mml:math id="m21">
<mml:mrow>
<mml:mi mathvariant="normal">u</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of random polygenic effects with a normal distribution <inline-formula id="inf21">
<mml:math id="m22">
<mml:mrow>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, in which <inline-formula id="inf22">
<mml:math id="m23">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the polygenic variance and <inline-formula id="inf23">
<mml:math id="m24">
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the genomic relationship matrix. It was constructed using the markers according to <xref ref-type="bibr" rid="B29">VanRaden (2008)</xref>; <inline-formula id="inf24">
<mml:math id="m25">
<mml:mrow>
<mml:mi mathvariant="normal">X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the incidence matrix linking <inline-formula id="inf25">
<mml:math id="m26">
<mml:mrow>
<mml:mi mathvariant="bold">b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> to <bold>y</bold>; <inline-formula id="inf26">
<mml:math id="m27">
<mml:mrow>
<mml:mi mathvariant="bold">e</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of random errors with normal distribution of N (0, <bold>I</bold> <inline-formula id="inf27">
<mml:math id="m28">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>), where <inline-formula id="inf28">
<mml:math id="m29">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the residual variance and <inline-formula id="inf29">
<mml:math id="m30">
<mml:mrow>
<mml:mi mathvariant="normal">I</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the identity matrix. The analysis of G &#xd7; EWAS was based only on the reference data to avoid the double counting of the SNP effect in genomic prediction.</p>
<p>For the parameter inference, we considered the marginalised form of the model in <xref ref-type="disp-formula" rid="e1">Eq. 1</xref>, which was obtained by integrating over the G &#xd7; E effects <inline-formula id="inf30">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and the random effect component <inline-formula id="inf31">
<mml:math id="m32">
<mml:mrow>
<mml:mi mathvariant="normal">u</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> <bold>:</bold>
<disp-formula id="e2">
<mml:math id="m33">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi mathvariant="bold">N</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mi mathvariant="bold">b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi mathvariant="bold">G</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>Using the marginalised model in <xref ref-type="disp-formula" rid="e2">Eq. 2</xref>, a G &#xd7; E interaction test corresponds to the alternative hypothesis <inline-formula id="inf32">
<mml:math id="m34">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. We defined an efficient score-based test that enabled the <italic>p</italic>-value calculation with a complexity that scaled linearly with the number of individuals, provided that there is low-rank environment covariance <inline-formula id="inf33">
<mml:math id="m35">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. The null model of the interaction test reduced to a standard linear mixed model with a low-rank covariance matrix for additive genetic effects, and the existing efficient inference strategies for the standard linear mixed model can be reused. The score-test statistics can be computed in an analogous manner according to the procedure described by <xref ref-type="bibr" rid="B32">Wu et al. (2011)</xref>:<disp-formula id="e3">
<mml:math id="m36">
<mml:mrow>
<mml:mi mathvariant="bold">Q</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mi mathvariant="bold">K</mml:mi>
</mml:mrow>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi mathvariant="bold">T</mml:mi>
</mml:msup>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtext>diag</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:msup>
<mml:mtext>diag</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi mathvariant="bold">T</mml:mi>
</mml:msup>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:mi mathvariant="bold">T</mml:mi>
</mml:msup>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>Where<disp-formula id="equ1">
<mml:math id="m37">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">K</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>diag</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2211;</mml:mo>
<mml:mtext>diag</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="equ2">
<mml:math id="m38">
<mml:mrow>
<mml:mi mathvariant="normal">W</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="equ3">
<mml:math id="m39">
<mml:mrow>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">H</mml:mi>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">H</mml:mi>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:msup>
<mml:msubsup>
<mml:mi mathvariant="bold">H</mml:mi>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:msup>
<mml:msubsup>
<mml:mi mathvariant="bold">H</mml:mi>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>The matrix <inline-formula id="inf34">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes the total covariance matrix estimated under the null model <inline-formula id="inf35">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> &#x3d;<inline-formula id="inf36">
<mml:math id="m42">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="normal">G</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> <bold>&#x2b;</bold> <inline-formula id="inf37">
<mml:math id="m43">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="normal">I</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> <bold>Q</bold> follows a mixture of <inline-formula id="inf38">
<mml:math id="m44">
<mml:mrow>
<mml:msup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> distributions (<xref ref-type="bibr" rid="B32">Wu et al., 2011</xref>; <xref ref-type="bibr" rid="B15">Lippert et al., 2014</xref>): <inline-formula id="inf39">
<mml:math id="m45">
<mml:mrow>
<mml:mi mathvariant="normal">Q</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, where the vector of the coefficients <inline-formula id="inf40">
<mml:math id="m46">
<mml:mrow>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> can be computed as the eigenvalues of <inline-formula id="inf41">
<mml:math id="m47">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="normal">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="normal">K</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:msup>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. According to the procedure in SKAT (<xref ref-type="bibr" rid="B32">Wu et al., 2011</xref>), as the distribution of the score-test statistics was a mixture of <inline-formula id="inf42">
<mml:math id="m48">
<mml:mrow>
<mml:msup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, the <italic>p</italic>-values were computed using the Davies method (<xref ref-type="bibr" rid="B7">Davies, 1980</xref>). Alternatively, the Liu method (<xref ref-type="bibr" rid="B10">Huan et al., 2008</xref>) was employed when the Davies method failed to converge.</p>
<p>The evidences for individual environment variables or environment sets for driving the observed G &#xd7; E effects can be assessed by comparing the model log marginal likelihoods between models with and without including these environments. The Bayes factors (BF) obtained from such comparisons is directly calibrated as the parameter number fitted using maximum likelihood and is independent of the environment variable numbers.</p>
<p>Given a variant and set of L environment L &#x3d; (<inline-formula id="inf43">
<mml:math id="m49">
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>),<disp-formula id="e4">
<mml:math id="m50">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
</mml:mrow>
<mml:mi mathvariant="italic">Log</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mi>F</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>L</mml:mi>
<mml:mi>M</mml:mi>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>L</mml:mi>
<mml:mi>M</mml:mi>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>where LML(L) and LML(<inline-formula id="inf44">
<mml:math id="m51">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) represent the marginal log-likelihood of the model described in <xref ref-type="disp-formula" rid="e2">Eq. 2</xref>, either considering the full or reduced environment sets to define the G &#xd7; E environment covariance, respectively. log(BF) &#x3c; 0 indicates the lack of contribution of the environmental impact on G &#xd7; E interaction, whereas log(BF) &#x3e; 3 indicates strong G &#xd7; E environment interaction (<xref ref-type="bibr" rid="B13">Kass and Raftery., 1995</xref>).</p>
</sec>
<sec id="s2-3">
<title>G &#xd7; EBLUP</title>
<p>The G &#xd7; EBLUP model includes additive genetic and GEI effects. The model is as follows:<disp-formula id="e5">
<mml:math id="m52">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mi mathvariant="bold">b</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">Z</mml:mi>
<mml:mi mathvariant="bold">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold">Z</mml:mi>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold">e</mml:mi>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>where <inline-formula id="inf45">
<mml:math id="m53">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>
<bold>,</bold> <inline-formula id="inf46">
<mml:math id="m54">
<mml:mrow>
<mml:mi mathvariant="normal">X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>
<bold>,</bold> <inline-formula id="inf47">
<mml:math id="m55">
<mml:mrow>
<mml:mi mathvariant="normal">b</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf48">
<mml:math id="m56">
<mml:mrow>
<mml:mi mathvariant="normal">e</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> denote the same parameters as in the G &#xd7; EWAS model, <inline-formula id="inf49">
<mml:math id="m57">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of genomic values captured by genetic markers associated with GEI, following a normal distribution of <inline-formula id="inf50">
<mml:math id="m58">
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>); <inline-formula id="inf51">
<mml:math id="m59">
<mml:mrow>
<mml:mi mathvariant="bold">u</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of genomic values captured by the remaining genetic marker sets (SNPs that are not significantly associated with GEI), following a normal distribution <inline-formula id="inf52">
<mml:math id="m60">
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="bold-italic">u</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>u</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) and <inline-formula id="inf53">
<mml:math id="m61">
<mml:mrow>
<mml:mi mathvariant="normal">Z</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is an incidence matrix that links <inline-formula id="inf54">
<mml:math id="m62">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf55">
<mml:math id="m63">
<mml:mrow>
<mml:mi mathvariant="normal">u</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> to <inline-formula id="inf56">
<mml:math id="m64">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Matrices <inline-formula id="inf57">
<mml:math id="m65">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf58">
<mml:math id="m66">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> were constructed similarly as <inline-formula id="inf59">
<mml:math id="m67">
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>; the former was constructed using only the genetic marker set defined by GEI, as described below, and the latter was constructed using the remaining markers. <inline-formula id="inf60">
<mml:math id="m68">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">E</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf61">
<mml:math id="m69">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>u</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> are the variance components explained by the variants with and without GEI, respectively. When an SNP was significant the GEI with phenotypes based on the prespecified significance cutoff level (E01-E05), showing the SNP was considered to impact the GEI.</p>
</sec>
<sec id="s2-4">
<title>Data simulation</title>
<p>So far, only few genomic data simulating softwares considering GEI have been available. In this study, we proposed a reaction norm model accounting for heterogeneous residual variances to simulate phenotypic and environmental values.<disp-formula id="equ4">
<mml:math id="m70">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2217;</mml:mo>
<mml:mi mathvariant="bold-italic">c</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2217;</mml:mo>
<mml:mi mathvariant="bold-italic">c</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>where <inline-formula id="inf62">
<mml:math id="m71">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of phenotypic value, <inline-formula id="inf63">
<mml:math id="m72">
<mml:mrow>
<mml:mi mathvariant="bold">c</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of environmental value; <inline-formula id="inf64">
<mml:math id="m73">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf65">
<mml:math id="m74">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the random additive genetic effects for the intercept and slope, respectively; and <inline-formula id="inf66">
<mml:math id="m75">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf67">
<mml:math id="m76">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the random residual effects for the intercept and slope, respectively.</p>
<p>The environmental value <inline-formula id="inf68">
<mml:math id="m77">
<mml:mrow>
<mml:mi mathvariant="bold-italic">c</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is further divided into two components:<disp-formula id="equ5">
<mml:math id="m78">
<mml:mrow>
<mml:mi mathvariant="bold">c</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="bold">&#x3f5;</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>where <inline-formula id="inf69">
<mml:math id="m79">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of the random genetic effect and <inline-formula id="inf70">
<mml:math id="m80">
<mml:mrow>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of the random residual effect.</p>
<p>We assumed that <inline-formula id="inf71">
<mml:math id="m81">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf72">
<mml:math id="m82">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf73">
<mml:math id="m83">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are affected by all QTLs simultaneously, and these three effects of each QTL are drawn from a multivariate normal distribution with the vector of means 0 and the variance&#x2013;covariance structure <inline-formula id="inf74">
<mml:math id="m84">
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. The genetic variance of each QTL is computed using 2 <inline-formula id="inf75">
<mml:math id="m85">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf76">
<mml:math id="m86">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>is the frequency of one allele of <italic>i</italic>th QTL, <inline-formula id="inf77">
<mml:math id="m87">
<mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the effect of the <italic>i</italic>th QTL for <inline-formula id="inf78">
<mml:math id="m88">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf79">
<mml:math id="m89">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula id="inf80">
<mml:math id="m90">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Then, the substitution effects are rescaled to ensure the total variances <inline-formula id="inf81">
<mml:math id="m91">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf82">
<mml:math id="m92">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf83">
<mml:math id="m93">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> for <inline-formula id="inf84">
<mml:math id="m94">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf85">
<mml:math id="m95">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf86">
<mml:math id="m96">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, respectively. The <inline-formula id="inf87">
<mml:math id="m97">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf88">
<mml:math id="m98">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf89">
<mml:math id="m99">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are re-calculated using the scaled substitution effects of QTL. The <inline-formula id="inf90">
<mml:math id="m100">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf91">
<mml:math id="m101">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf92">
<mml:math id="m102">
<mml:mrow>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> values of each individual are sampled from a multivariate normal distribution with the vector of means 0 and the variance&#x2013;covariance structure <inline-formula id="inf93">
<mml:math id="m103">
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>&#x3b5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>&#x3b5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>&#x3b5;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>For the G &#xd7; E interaction simulations, the parameter <inline-formula id="inf94">
<mml:math id="m104">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> was set to control the extent of the G &#xd7; E interaction, whereas other parameters (<inline-formula id="inf95">
<mml:math id="m105">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf96">
<mml:math id="m106">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf97">
<mml:math id="m107">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf98">
<mml:math id="m108">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf99">
<mml:math id="m109">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf100">
<mml:math id="m110">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf101">
<mml:math id="m111">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf102">
<mml:math id="m112">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>&#x3f5;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf103">
<mml:math id="m113">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf104">
<mml:math id="m114">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mi>&#x3f5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf105">
<mml:math id="m115">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>&#x3f5;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) were fixed. The pseudo true breeding values (TBVs) of an individual for <inline-formula id="inf106">
<mml:math id="m116">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf107">
<mml:math id="m117">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf108">
<mml:math id="m118">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are its QTL effects multiplied by genotypes, followed by the scaling of the means of the pseudo TBVs to 0. Finally, the environmental valuec of each individual is obtained by adding the cumulative effect across all QTLs for <inline-formula id="inf109">
<mml:math id="m119">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> with the residual <inline-formula id="inf110">
<mml:math id="m120">
<mml:mrow>
<mml:mi mathvariant="bold">&#x3f5;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, followed by the generation of the phenotype <inline-formula id="inf111">
<mml:math id="m121">
<mml:mrow>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> of each individual through the model <inline-formula id="inf112">
<mml:math id="m122">
<mml:mrow>
<mml:mi mathvariant="normal">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2a;</mml:mo>
<mml:mi mathvariant="bold-italic">c</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold-italic">e</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, without accounting for heterogeneous residual variance. For the simulated data, <inline-formula id="inf113">
<mml:math id="m123">
<mml:mrow>
<mml:mi mathvariant="bold-italic">c</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> was used as the environment variable <bold>E</bold>, as described in formula (1). The real genotypes of 7,334 individuals determined using the Illumina BovineSNP50 BeadChip from the Chinese Holstein population were referred for phenotype and environment simulation, and 45,323 SNPs remained after imputation of missing genotypes and removal of SNPs with a minor allele frequency (MAF) of &#x3c;0.01. Additional File 3 <xref ref-type="sec" rid="s12">Supplementary Figure S1</xref> presents the heat map of the genomic relationship matrix of 7,334 Chinese Holsteins. Three simulated datasets with GEI effect variances of 0.25, 1 and 2 were obtained, and the corresponding phenotypic variances were 2.25, 3 and 4, respectively. Additionally, when the GEI effect variance was 0.25, the datasets 2 and 3 covariate environments were simulated and compared. For each dataset, 306 SNPs that affected the trait of interest were simulated and referred to as simulated QTLs in this study. For each scenario, the simulation was repeated 20 times. We used the DMU software (<xref ref-type="bibr" rid="B17">Madsen et al., 2006</xref>) to estimate the variances of the additive effect, GEI effect and residual using the reaction norm model for each replicate. As shown in Additional File 2 <xref ref-type="sec" rid="s12">Supplementary Table S1</xref>, these estimated values were close to the assigned values. Moreover, a dataset was randomly selected from the 20 repeated datasets, and 6 SNPs with GEI were randomly selected from this dataset, showing that the phenotypic variation was largely affected by GEI and that it was relatively small in the scenarios with no GEI (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figures S2&#x2013;S6</xref>), thus implying that the simulation fitted well. All analyses with G &#xd7; EBLUP and GBLUP models and simulation were conducted using in-house scripts written in Python3.8 by the first author.</p>
</sec>
<sec id="s2-5">
<title>Real data</title>
<sec id="s2-5-1">
<title>Pig data</title>
<p>Yorkshire pigs were sampled from a breeding company with five breeding farms distributed across China (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figure S7</xref>). Different farms displayed distinct climates, housing systems, nutritional regimes, disease pressures and stocking densities, potentially leading to GEI. <xref ref-type="table" rid="T1">Table 1</xref> presents the phenotype data. We examined two growth traits &#x2018;days to 100&#xa0;kg (AGE)&#x2019; and &#x2018;backfat thickness adjusted to 100&#xa0;kg (BFT)&#x2019;. Genotyping was performed using the PorcineSNP80 BeadChip (Illumina, CA, USA), which included 68,528 SNPs across the entire pig genome. A total of 1,778 animals born between 2011 and 2016 were genotyped (<xref ref-type="table" rid="T1">Table 1</xref>).</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Descriptive statistics for pig and maize population traits.</p>
</caption>
<table>
<tbody valign="top">
<tr>
<td align="left">
<bold>Population</bold>
</td>
<td align="left">
<bold>Trait</bold>
<xref ref-type="table-fn" rid="Tfn1">
<sup>a</sup>
</xref>
</td>
<td align="left">
<bold>N-obs</bold>
<xref ref-type="table-fn" rid="Tfn2">
<sup>b</sup>
</xref>
</td>
<td align="left">
<bold>Genotyped individuals</bold>
</td>
<td align="left">
<bold>N-env</bold>
<xref ref-type="table-fn" rid="Tfn3">
<sup>c</sup>
</xref>
</td>
<td align="left">
<bold>Mean</bold>
</td>
<td align="left">
<bold>SD</bold>
</td>
<td align="left">
<bold>Min</bold>
</td>
<td align="left">
<bold>Max</bold>
</td>
</tr>
<tr>
<td rowspan="2" align="left">Pig</td>
<td align="left">AGE (day)</td>
<td align="left">28,827</td>
<td align="left">1778</td>
<td align="left">5</td>
<td align="left">170.8</td>
<td align="left">13.9</td>
<td align="left">124.0</td>
<td align="left">211.0</td>
</tr>
<tr>
<td align="left">BFT (mm)</td>
<td align="left">28,827</td>
<td align="left">1778</td>
<td align="left">5</td>
<td align="left">11.8</td>
<td align="left">2.4</td>
<td align="left">5.0</td>
<td align="left">30.7</td>
</tr>
<tr>
<td rowspan="2" align="left">Maize</td>
<td align="left">GW (kg)</td>
<td align="left">2676</td>
<td align="left">681</td>
<td align="left">11</td>
<td align="left">6.75</td>
<td align="left">1.39</td>
<td align="left">0.407</td>
<td align="left">11.24</td>
</tr>
<tr>
<td align="left">WC (%)</td>
<td align="left">2676</td>
<td align="left">681</td>
<td align="left">11</td>
<td align="left">26.89</td>
<td align="left">4.58</td>
<td align="left">14.80</td>
<td align="left">47.80</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="Tfn1">
<label>a</label>
<p>AGE: days to 100&#xa0;kg; BFT: backfat thickness adjusted to 100&#xa0;kg; GW: grain weight; WC: water content.</p>
</fn>
<fn id="Tfn2">
<label>b</label>
<p>N-obs: number of observations.</p>
</fn>
<fn id="Tfn3">
<label>c</label>
<p>N-env: number of environments.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="s2-6">
<title>Pig data</title>
<p>Maize is one of the most important crops worldwide. It provides food for humans and animals; it is a raw material for industrial processes and a model plant for understanding evolution, domestication and heterosis (<xref ref-type="bibr" rid="B24">Romay et al., 2013</xref>). Thus, maize data from 11 regions across China (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figure S7</xref>) were obtained to verify G &#xd7; EBLUP, with the regions used as environment variables. Because of varying conditions of light, temperature, air, water and soil in different regions, under which GEI could show its effect, region effect was considered as an environmental covariate in the present study. A total of 681 maize lines were collected and each line had phenotype records of two traits, grain weight (GW) and water content (WC), in 1&#x2013;8 environments. Overall, 2,676 observations were collected for the two traits. <xref ref-type="table" rid="T1">Table 1</xref> lists the detailed information on GW and WC. Meanwhile, all lines were genotyped using the customised SNP panel of 61,224 markers across the maize genome.</p>
<p>For real pig and maize data, Beagle 4.1 (<xref ref-type="bibr" rid="B3">Browning and Browning, 2009</xref>) was used for the imputation of the missing SNP genotypes, and only loci on autosomes were used for further analysis. PLINK software (v1.90) (<xref ref-type="bibr" rid="B4">Chang et al., 2015</xref>) was implemented for quality control. We excluded SNPs with a MAF of &#x3c;0.05, call rate of &#x3c;0.90, or those severely deviating from the Hardy&#x2013;Weinberg equilibrium (HWE) (<italic>p</italic> &#x3c; 10<sup>&#x2013;7</sup>). Similarly, we excluded the pig individuals or maize samples with a call rate of &#x3c;0.90. Finally, 56,463 and 59,401 SNPs were present in the pig and maize data, respectively, and all genotyped pigs and maize were retained.</p>
</sec>
<sec id="s2-7">
<title>Method application</title>
<sec id="s2-7-1">
<title>Application to simulated data</title>
<p>Simulated data analysis was performed using G &#xd7; EWAS proposed in this study to identify markers associated with GEI. We used Bonferroni correction at a significance level of 0.05 to identify significant SNPs. We implemented the four methods (L-median, L-mean, Bartlett and F-Killeen) proposed by <xref ref-type="bibr" rid="B31">Wang et al. (2019)</xref> in addition to StructLMM proposed by <xref ref-type="bibr" rid="B20">Moore et al. (2019)</xref> to identify SNPs affected by GEI. Based on the G &#xd7; EWAS results, we performed genomic prediction on each simulated dataset using G &#xd7; EBLUP. To investigate more SNPs with GEI, <italic>p</italic>-value gradients of 1-E01&#x2013;1-E05 were chosen as threshold standards to select the SNPs associated with GEI. Five 10-fold cross-validation (10-CV) repetitions were used to assess the genomic prediction using G &#xd7; EBLUP. In each cross-validation, the reference and validation populations comprised 6,601 and 733 individuals, respectively. The accuracy of genomic prediction was calculated as the Pearson&#x2019;s correlation between original phenotypes Phe and the genomic estimated breeding values (GEBVs) of all validation individuals <italic>r</italic>(Phe, GEBV). Moreover, the mean squared error (MSE) of the prediction ability matrix was used to evaluate the performance of the models; MSE was computed as the average square of the difference between Phe and GEBV centred on zero. In each scenario, we performed the comparisons between G &#xd7; EBLUP and GBLUP (<xref ref-type="bibr" rid="B29">VanRaden, 2008</xref>) at different GEI variances (0.25, 1 and 2) and different number of environment variables (1, 2 and 3).</p>
</sec>
<sec id="s2-7-2">
<title>Application to real data</title>
<p>Pig data were used to detect the SNPs affected by GEI. The herd-year-season effects, estimated using the conventional pedigree-based BLUP method, were used as environmental covariates, and the corrected phenotype were used as response variables. The calculations for the corrected phenotype value followed the method described by <xref ref-type="bibr" rid="B25">Song et al. (2017)</xref>. Overall, 207 young and the remaining 1,571 individuals were considered as the validation and reference populations, respectively. For the maize data, an environment was randomly selected for each line to ensure that all lines could be used for analysis. Accordingly, we used 681 lines for each analysis and performed five replications of 5-fold cross-validation.</p>
<p>For the G &#xd7; EBLUP method, the <italic>p</italic>-values for all markers were calculated using G &#xd7; EWAS, and a threshold standard with <italic>p</italic>-value gradients of 1-E01&#x2013;1-E05 was selected to screen the GEI-associated SNPs. The GBLUP and G &#xd7; EBLUP methods were then used to estimate GEBVs. The genomic prediction accuracy on the real data was evaluated differently than that on the simulated data, using the correlation between GEBVs and the phenotypic values y (maize data) or corrected phenotypes y<sub>c</sub> (pig data) in the validation population. MSE was computed as the average square of the difference between y or y<sub>c</sub> and GEBVs centred on zero. The BF for each environment variable was calculated to obtain the sensitive environments of markers associated with GEI. For simulated data and real data, the improvement in prediction accuracy for G &#xd7; EBLUP over GBLUP was calculated by subtracting the prediction accuracy obtained by GBLUP from the prediction accuracy obtained by G &#xd7; EBLUP and then dividing by the prediction accuracy obtained by GBLUP.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Genome-wide G &#xd7; E association analysis</title>
<sec id="s3-1-1">
<title>Simulated data</title>
<p>
<xref ref-type="table" rid="T2">Table 2</xref> indicates that G &#xd7; EWAS performed better than the other methods. When the GEI variance was 0.25, G &#xd7; EWAS detected 2,435 significant SNPs with Bonferroni correction (0.05/45,293). Of these SNPs, 43 overlapped with the 306 simulated GEI SNPs. Although a low number of SNPs (41) overlapped with the simulated GEI SNPs, StructLMM detected a high number of significant SNPs (2,472). Similarly, G &#xd7; EWAS detected a higher number of significant SNPs than those detected using the Bartlett, F-Killeen, L-mean and L-median methods, which identified 507, 101, 224 and 188 significant SNPs, respectively. Further, 9, 6, 6 and 6 SNPs overlapped with the simulated GEI QTLs, respectively. A similar trend was also observed in the scenario where GEI variance increased to 1 or 2. Larger GEI effect variances led to the identification of a higher number of real QTLs with GEI (with the exception of the F-Killeen method), as shown in <xref ref-type="table" rid="T2">Table 2</xref> and Additional File 3 <xref ref-type="sec" rid="s12">Supplementary Figures S8&#x2013;S10</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Significant G &#xd7; E interaction single nucleotide polymorphisms (SNPs) detected on simulated data using the proposed G &#xd7; EWAS method and the five approaches, StructLMM, Bartlett, F-Killeen, L-mean and L-median under different variance of G &#xd7; E interactions. The SNP numbers overlapping with simulated genotype&#x2013;environment interaction quantitative trait locus (306) are in parentheses.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Variance of G &#xd7; E interactions</th>
<th align="left">0.25</th>
<th align="left">1</th>
<th align="left">2</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">G &#xd7; EWAS</td>
<td align="char" char="(">2435 (43)</td>
<td align="char" char="(">5081 (64)</td>
<td align="char" char="(">3981 (77)</td>
</tr>
<tr>
<td align="left">StructLMM</td>
<td align="char" char="(">2472 (41)</td>
<td align="char" char="(">5092(62)</td>
<td align="char" char="(">4084 (77)</td>
</tr>
<tr>
<td align="left">Bartlett</td>
<td align="char" char="(">507 (9)</td>
<td align="char" char="(">3495 (33)</td>
<td align="char" char="(">3976 (54)</td>
</tr>
<tr>
<td align="left">F-killeen</td>
<td align="char" char="(">101 (6)</td>
<td align="char" char="(">144 (2)</td>
<td align="char" char="(">109 (3)</td>
</tr>
<tr>
<td align="left">L-mean</td>
<td align="char" char="(">224 (6)</td>
<td align="char" char="(">1037 (8)</td>
<td align="char" char="(">606 (14)</td>
</tr>
<tr>
<td align="left">L-median</td>
<td align="char" char="(">188 (6)</td>
<td align="char" char="(">808 (8)</td>
<td align="char" char="(">439 (9)</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-1-2">
<title>Pig and maize data</title>
<p>
<xref ref-type="fig" rid="F1">Figure 1</xref> illustrates the genome-wide G &#xd7; E marker mapping on pig and maize data. <xref ref-type="fig" rid="F1">Figure 1</xref> shows that a total of 1,164 and 5,448 significant SNPs were detected in pigs for AGE and BFT traits with Bonferroni correction (0.05/56,445), respectively. For maize data, only 1 and 84 genome-wide significant SNPs were detected for the two traits, GW and WC, respectively. Additionally, the genotypic values and SNP effect values of the top most significant SNPs for each trait at different environments in pig and maize populations indicated that BFT, GW and WC were affected by the environment; however, no GEI was detected on AGE (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figures S11, S12</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>G &#xd7; E marker genome-wide association analysis of pig and maize population traits. AGE: days to 100&#xa0;kg; BFT: backfat thickness adjusted to 100&#xa0;kg; GW: grain weight; WC: water content.</p>
</caption>
<graphic xlink:href="fgene-13-972557-g001.tif"/>
</fig>
</sec>
</sec>
<sec id="s3-2">
<title>Accuracy and mean squared error of genomic prediction</title>
<sec id="s3-2-1">
<title>Simulated data</title>
<p>The significance level was set at the <italic>p</italic>-value gradient of 1-E01&#x2013;1-E05 to determine SNPs associated with GEI. <xref ref-type="table" rid="T3">Table 3</xref> presents the number of SNPs affected by GEI. These SNPs along with the corresponding remaining SNPs were used in G &#xd7; EBLUP. All 45,323 qualified SNPs were used in GBLUP. The genomic prediction accuracy and MSE of the simulated data were determined from a randomly selected replicate. <xref ref-type="table" rid="T3">Table 3</xref> shows that the G &#xd7; EBLUP accuracy was different under different <italic>p</italic>-values. G &#xd7; EBLUP showed the highest genomic prediction accuracy and the lowest MSE with a <italic>p</italic>-value of 1-E03. Further, the prediction accuracy of G &#xd7; EBLUP improved by 1.7% compared with that of GBLUP. <xref ref-type="fig" rid="F2">Figure 2</xref> shows the averaged accuracies and MSE of genomic prediction obtained using G &#xd7; EBLUP and GBLUP under different GEI variances. For G &#xd7; EBLUP, the average prediction accuracy was calculated by selecting the highest prediction accuracy values under different <italic>p</italic>-values in each repetition. When the GEI variance was 0.25, the G &#xd7; EBLUP yielded 1.7% higher prediction accuracies than GBLUP. Moreover, G &#xd7; EBLUP yielded lower MSE than GBLUP, with average MSE values of 1.816 and 1.942, respectively. G &#xd7; EBLUP performed significantly better than GBLUP when the GEI variance was increased to 1 and 2, yielding 3.9 and 6.4% higher prediction accuracies, respectively, and a lower MSE than GBLUP.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Genomic prediction accuracies and mean squared error (MSE) for GBLUP and G &#xd7; EBLUP method under different G &#xd7; E interaction <italic>p</italic>-values (E01&#x223c;E05).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Data set</th>
<th rowspan="2" align="left">Trait</th>
<th rowspan="2" align="left">Content</th>
<th rowspan="2" align="left">GBLUP</th>
<th colspan="5" align="left">P-value<xref ref-type="table-fn" rid="Tfn4">
<sup>a</sup>
</xref>
</th>
</tr>
<tr>
<th align="left">E01</th>
<th align="left">E02</th>
<th align="left">E03</th>
<th align="left">E04</th>
<th align="left">E05</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">Simulation<xref ref-type="table-fn" rid="Tfn5">
<sup>b</sup>
</xref>
</td>
<td rowspan="3" align="left">One<xref ref-type="table-fn" rid="Tfn6">
<sup>c</sup>
</xref>
</td>
<td align="left">SNP number</td>
<td align="left">45,323</td>
<td align="left">23,517</td>
<td align="left">14,210</td>
<td align="left">8844</td>
<td align="left">5543</td>
<td align="left">2186</td>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left">0.737</td>
<td align="left">0.735</td>
<td align="left">0.739</td>
<td align="left">0.749</td>
<td align="left">0.738</td>
<td align="left">0.712</td>
</tr>
<tr>
<td align="left">MSE</td>
<td align="left">1.818</td>
<td align="left">1.820</td>
<td align="left">1.810</td>
<td align="left">1.715</td>
<td align="left">1.813</td>
<td align="left">1.894</td>
</tr>
<tr>
<td rowspan="6" align="left">Pig</td>
<td rowspan="3" align="left">AGE</td>
<td align="left">SNP number</td>
<td align="left">56,445</td>
<td align="left">27,762</td>
<td align="left">14,117</td>
<td align="left">7420</td>
<td align="left">3964</td>
<td align="left">2117</td>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left">0.225</td>
<td align="left">0.223</td>
<td align="left">0.226</td>
<td align="left">0.226</td>
<td align="left">0.226</td>
<td align="left">0.224</td>
</tr>
<tr>
<td align="left">MSE</td>
<td align="left">179.81</td>
<td align="left">179.918</td>
<td align="left">179.722</td>
<td align="left">179.703</td>
<td align="left">179.677</td>
<td align="left">179.797</td>
</tr>
<tr>
<td rowspan="3" align="left">BFT</td>
<td align="left">SNP number</td>
<td align="left">56,445</td>
<td align="left">37,448</td>
<td align="left">25,242</td>
<td align="left">17,110</td>
<td align="left">11,801</td>
<td align="left">8098</td>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left">0.268</td>
<td align="left">0.275</td>
<td align="left">0.276</td>
<td align="left">0.272</td>
<td align="left">0.269</td>
<td align="left">0.268</td>
</tr>
<tr>
<td align="left">MSE</td>
<td align="left">2.707</td>
<td align="left">2.693</td>
<td align="left">2.69</td>
<td align="left">2.699</td>
<td align="left">2.704</td>
<td align="left">2.706</td>
</tr>
<tr>
<td rowspan="6" align="left">Maize<xref ref-type="table-fn" rid="Tfn5">
<sup>b</sup>
</xref>
</td>
<td rowspan="3" align="left">GW</td>
<td align="left">SNP number</td>
<td align="left">59,401</td>
<td align="left">17,285</td>
<td align="left">4279</td>
<td align="left">834</td>
<td align="left">421</td>
<td align="left">143</td>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left">0.288</td>
<td align="left">0.290</td>
<td align="left">0.306</td>
<td align="left">0.294</td>
<td align="left">0.271</td>
<td align="left">0.269</td>
</tr>
<tr>
<td align="left">MSE</td>
<td align="left">46.323</td>
<td align="left">46.317</td>
<td align="left">46.174</td>
<td align="left">46.178</td>
<td align="left">46.223</td>
<td align="left">46.347</td>
</tr>
<tr>
<td rowspan="3" align="left">WC</td>
<td align="left">SNP number</td>
<td align="left">59,401</td>
<td align="left">28,636</td>
<td align="left">12,123</td>
<td align="left">4168</td>
<td align="left">2132</td>
<td align="left">875</td>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left">0.295</td>
<td align="left">0.301</td>
<td align="left">0.315</td>
<td align="left">0.318</td>
<td align="left">0.293</td>
<td align="left">0.273</td>
</tr>
<tr>
<td align="left">MSE</td>
<td align="left">721.588</td>
<td align="left">721.229</td>
<td align="left">721.129</td>
<td align="left">720.854</td>
<td align="left">721.590</td>
<td align="left">722.009</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="Tfn4">
<label>a</label>
<p>Cut-off p-values for G &#xd7; E interaction single nucleotide polymorphisms on G &#xd7; E.</p>
</fn>
<fn id="Tfn5">
<label>b</label>
<p>One randomly selected replicate.</p>
</fn>
<fn id="Tfn6">
<label>c</label>
<p>The variance of G &#xd7; E interactions was 0.25.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>GBLUP and G &#xd7; EBLUP method for the <bold>(A)</bold> accuracy and <bold>(B)</bold> mean squared error (MSE) of genomic prediction under different G &#xd7; E interaction variances. Genomic prediction <bold>(C)</bold> accuracy and <bold>(D)</bold> MSE under different numbers of environment variables.</p>
</caption>
<graphic xlink:href="fgene-13-972557-g002.tif"/>
</fig>
<p>
<xref ref-type="fig" rid="F2">Figure 2</xref> also shows how the number of the environment variables influences the genomic prediction. In the scenario with a GEI variance of 0.25, when the number of environment variables was 1, 2 and 3, no significant differences were noted in the prediction accuracy and MSE among the number of different environment variables for G &#xd7; EBLUP. Additionally, in all scenarios, G &#xd7; EBLUP yielded 1.8% higher accuracies and a 12.4% lower MSE than GBLUP, confirming that G &#xd7; EBLUP performed better.</p>
</sec>
<sec id="s3-2-2">
<title>Pig and maize data</title>
<p>
<xref ref-type="fig" rid="F3">Figure 3</xref> and <xref ref-type="table" rid="T3">Table 3</xref> present the accuracy and MSE of genomic prediction on pig and maize data. According to the results of G &#xd7; EWAS regarding AGE trait in pigs, 27,762; 14,117; 7,420; 3964 and 2,117 SNPs were selected as G &#xd7; E markers in G &#xd7; EBLUP, and the accuracies of G &#xd7; EBLUP under different G &#xd7; E markers (1-E01&#x2013;1-E05) were not significantly different. The average prediction accuracy obtained using G &#xd7; EBLUP was 0.225, which was same as that obtained using GBLUP. Moreover, no differences were noted in the MSE between G &#xd7; EBLUP and GBLUP. These findings indicated that AGE was not affected by GEI. However, for BFT, G &#xd7; EBLUP showed the best performance at the <italic>p</italic>-value of 1-E02, the prediction accuracy was improved by 3% compared with that of GBLUP, and the MSE was reduced by 0.107, decreasing from 2.707 to 2.600.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>GBLUP and G &#xd7; EBLUP method for the <bold>(A,C)</bold> accuracy and <bold>(B,D)</bold> mean squared error (MSE) of genomic prediction in pigs and maize. AGE: days to 100&#xa0;kg; BFT: backfat thickness adjusted to 100&#xa0;kg; GW: grain weight; WC: water content. MSE is a relative value by assuming that the MSE of GBLUP method is equal to 0 because of the large MSE value.</p>
</caption>
<graphic xlink:href="fgene-13-972557-g003.tif"/>
</fig>
<p>Compared with pig data, the genomic prediction of G &#xd7; EBLUP showed considerable improvement in maize data. As shown in <xref ref-type="table" rid="T3">Table 3</xref>, for the trait GW from a randomly selected replicate, G &#xd7; EBLUP showed the best performance at the <italic>p</italic>-value of 1-E02, the genomic prediction accuracy was improved by 6.25% compared with that of GBLUP, and MSE was reduced from 46.323 to 46.174. For the trait WC from a randomly selected replicate, the highest prediction accuracy of G &#xd7; EBLUP was obtained at the <italic>p</italic>-value of 1-E03, the prediction accuracy was improved by 7.8% compared with that of GBLUP, and MSE was reduced from 721.588 to 720.854. The average prediction accuracy of 5 repetitions of 10-fold CV for GBLUP and G &#xd7; EBLUP are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>, G &#xd7; EBLUP showed approximately 5.0 and 7.7% improvement in prediction accuracy for GW and WC in maize population, respectively. Moreover, G &#xd7; EBLUP also showed a lower prediction MSE than GBLUP, which further verified the advantages of G &#xd7; EBLUP.</p>
</sec>
</sec>
<sec id="s3-3">
<title>Sensitive environment detection in real data</title>
<p>The BF for each environmental factor was obtained, and then the sensitive environments were assessed accordingly. In pig population, for AGE and BFT, 10 SNPs each with the smallest <italic>p</italic>-values were selected for sensitive environmental detection. As shown in <xref ref-type="fig" rid="F4">Figure 4</xref>, for AGE, the top 10 SNPs regarding season showed the largest BF, indicating that the most sensitive environmental factor for the top 10 SNPs was season. Similarly, the least sensitive environmental factors for SNPs were farm and year, as the values of BF of farm and year were equally low. For BFT, the most sensitive environmental factors for all SNPs were farm and season, and their BF values were also same; year was the least sensitive environmental factor (<xref ref-type="fig" rid="F4">Figure 4</xref>). In maize population, as shown in <xref ref-type="fig" rid="F4">Figure 4</xref>, the averaged log BF values indicated that the region environment variable has a strong GEI (Log(BF) &#x3e; 3) with WC and GW. Additionally, the BF values of WC were higher than that of GW, which was also consistent with the superiority of genomic prediction of WC (<xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Bayes factors of 10 single nucleotide polymorphisms for <bold>(A)</bold> AGE, <bold>(B)</bold> BFT, <bold>(C)</bold> GW and <bold>(D)</bold> WC in different environmental factors. AGE: days to 100&#xa0;kg; BFT: backfat thickness adjusted to 100&#xa0;kg; GW: grain weight; WC: water content.</p>
</caption>
<graphic xlink:href="fgene-13-972557-g004.tif"/>
</fig>
</sec>
<sec id="s3-4">
<title>Computing time</title>
<p>The average computation time for G &#xd7; EBLUP and GBLUP to complete each fold of CV is presented in <xref ref-type="table" rid="T4">Table 4</xref>. Running time of the methods was measured in minutes on an HP server <bold>(</bold>CentOS Linux 7.9.2009, 2.5&#xa0;GHz Intel Xeon processor and 515G total memory). In all scenarios, G &#xd7; EBLUP runs longer than GBLUP, mainly because running G &#xd7; EBLUP requires concurrently running G &#xd7; EWAS, which is a single marker regression model with a long running time, e.g. G &#xd7; EWAS took an average of 45.8&#xa0;min in each fold of CV to complete the analysis, requiring considerably less time than GBLUP (15.05&#xa0;min). In addition, there is no obvious difference here between different traits in the same population due to the same size population and number of SNPs.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Average computation time for G &#xd7; EBLUP and GBLUP to complete each fold of cross-validation.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Date set</th>
<th align="left">Trait<xref ref-type="table-fn" rid="Tfn7">
<sup>a</sup>
</xref>
</th>
<th align="left">G &#xd7; EBLUP</th>
<th align="left">GBLUP</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">Simulation</td>
<td align="left">V0.25</td>
<td align="left">45&#xa0;min 48&#xa0;s</td>
<td align="left">15&#xa0;min 3&#xa0;s</td>
</tr>
<tr>
<td align="left">V1</td>
<td align="left">46&#xa0;min 27&#xa0;s</td>
<td align="left">15&#xa0;min 12&#xa0;s</td>
</tr>
<tr>
<td align="left">V2</td>
<td align="left">46&#xa0;min 13&#xa0;s</td>
<td align="left">15&#xa0;min 9&#xa0;s</td>
</tr>
<tr>
<td rowspan="2" align="left">Pig</td>
<td align="left">AGE</td>
<td align="left">30&#xa0;min 9&#xa0;s</td>
<td align="left">2&#xa0;min 14&#xa0;s</td>
</tr>
<tr>
<td align="left">BFT</td>
<td align="left">30&#xa0;min 14&#xa0;s</td>
<td align="left">2&#xa0;min 18&#xa0;s</td>
</tr>
<tr>
<td rowspan="2" align="left">Maize</td>
<td align="left">GW</td>
<td align="left">26&#xa0;min 13&#xa0;s</td>
<td align="left">1&#xa0;min 7&#xa0;s</td>
</tr>
<tr>
<td align="left">WC</td>
<td align="left">26&#xa0;min 8&#xa0;s</td>
<td align="left">1&#xa0;min 10&#xa0;s</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="Tfn7">
<label>a</label>
<p>V0.25, V1 and V2: The traits with variance of G &#xd7; E interactions of 0.25, 1 and 2, respectively, in simulated data.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>G &#xd7; E interactions play an important role in livestock and plants and should be considered in breeding programmes to select elite individuals in specific environments (<xref ref-type="bibr" rid="B6">Crossa, 2012</xref>; <xref ref-type="bibr" rid="B9">Heslot et al., 2014</xref>; <xref ref-type="bibr" rid="B12">Jarquin et al., 2017</xref>; <xref ref-type="bibr" rid="B21">Perez-Rodriguez et al., 2017</xref>; <xref ref-type="bibr" rid="B28">Tiezzi et al., 2017</xref>; <xref ref-type="bibr" rid="B16">Liu et al., 2019</xref>; <xref ref-type="bibr" rid="B33">Zhang et al., 2019</xref>; <xref ref-type="bibr" rid="B2">Braz et al., 2021</xref>). However, because of their complexity, G&#xd7;E interactions are usually ignored in conventional breeding and the current widely used methods of estimating genomic breeding value, e.g. GBLUP (<xref ref-type="bibr" rid="B29">VanRaden, 2008</xref>), single-step GBLUP (<xref ref-type="bibr" rid="B19">Misztal et al., 2009</xref>), BayesA (<xref ref-type="bibr" rid="B18">Meuwissen et al., 2001</xref>), BayesCpi (<xref ref-type="bibr" rid="B8">Habier et al., 2011</xref>), which could lead to biases in the estimation of breeding values and selection decisions. Although the multi-trait (<xref ref-type="bibr" rid="B23">Richard, 1996</xref>) and reaction norm models (<xref ref-type="bibr" rid="B22">Rebecka et al., 2002</xref>) are two prevalent methods for handling GEI in the estimation of genomic breeding value, our previous studies have explicated the disadvantages of these two types of methods (<xref ref-type="bibr" rid="B27">Song et al., 2020b</xref>). Moreover, these two methods could not detect the markers associated with GEI. In this study, we proposed G &#xd7; EBLUP, which is a novel method for genomic breeding value estimation that takes GEI into account. The core of G &#xd7; EBLUP is the estimation of GEI using G &#xd7; EWAS by including an additional per-individual effect term that accounts for GEI; it is also powerful for the identification of the SNPs that are susceptible to GEI. Comprehensive simulation studies and real data of pigs and maize have demonstrated the superiority of the proposed method.</p>
<p>In the simulated data, our results indicated the superiority of the G &#xd7; EBLUP method for genomic prediction in all scenarios, which was more remarkable when the variance of GEI was large (<xref ref-type="fig" rid="F2">Figures 2A,B</xref>), showing that the new method can appropriately handle GEI. Our results also showed that there was no significant difference in the prediction accuracy of G &#xd7; EBLUP under different numbers of environment variables (<xref ref-type="fig" rid="F2">Figures 2C,D</xref>), implying that the number of environment levels have no effect on our new method. This is a major highlight of the G &#xd7; EBLUP method compared with the methods based on the multi-trait (<xref ref-type="bibr" rid="B23">Richard, 1996</xref>) and reaction norm models (<xref ref-type="bibr" rid="B22">Rebecka et al., 2002</xref>), in which the number of environment levels was the main limitation (<xref ref-type="bibr" rid="B26">Song et al., 2020a</xref>; <xref ref-type="bibr" rid="B27">Song et al., 2020b</xref>). The advantage of G &#xd7; EBLUP for joint G &#xd7; E analysis of multiple environment variables could be that multiple environments can interact with a single genetic locus to influence the phenotypes (<xref ref-type="bibr" rid="B20">Moore et al., 2019</xref>). In our new method, the interactions of genotype with different environments could be represented by one or more markers, as explicated by G &#xd7; EWAS. Therefore, it was not sensitive to the number of environment levels.</p>
<p>In this study, we proposed G &#xd7; EWAS and a score-test statistic to identify the significance of SNP affected by GEI; the details of G &#xd7; EWAS and its computational complexity can be found in Additional File 1 <xref ref-type="sec" rid="s12">Supplementary Material</xref>. In G &#xd7; EWAS, <inline-formula id="inf114">
<mml:math id="m124">
<mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the <inline-formula id="inf115">
<mml:math id="m125">
<mml:mrow>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi mathvariant="normal">L</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix of L observed environments, and <inline-formula id="inf116">
<mml:math id="m126">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">E</mml:mi>
<mml:mi mathvariant="bold">E</mml:mi>
</mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is used as a variance&#x2013;covariance structure for G &#xd7; E effects; thus, G &#xd7; EWAS is a single-SNP regression model, which is different from the multi-trait (<xref ref-type="bibr" rid="B23">Richard, 1996</xref>) and reaction norm models (<xref ref-type="bibr" rid="B22">Rebecka et al., 2002</xref>). The linear covariance function (<bold>EE&#x2019;</bold>) was primarily used because of two appealing properties. First, as the number of samples typically exceeds the number of environments in larger populations (L &#x3c;&#x3c; N), a low-rank linear covariance is noted, which enables parameter inference with a computational complexity that scales linearly with the increasing population size. Second, a linear covariance is directly interpretable, as there is one-to-one correspondence between G &#xd7; EWAS and linear regression using L covariates to account for GEI. Compared with the four methods proposed by <xref ref-type="bibr" rid="B31">Wang et al. (2019)</xref>, our results showed the obvious advantages of G &#xd7; EWAS in GEI detection (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figures S8&#x2013;S10</xref> and <xref ref-type="table" rid="T2">Table 2</xref>). The low efficiency of the other methods could be because the selection, epistasis and phantom vQTL can also cause vQTL instead of just GEI, which may lead to biases in the detection of G &#xd7; E markers, e.g. the overlapped simulated QTLs were decreased for L-mean and L-median when the variance of G &#xd7; E increased. Although the efficiency of G &#xd7; EWAS was improved with increase in the variance of GEI, it yielded higher number of overlap QTLs with GEI. As a combination of the standard linear mixed model for genome-wide association study and StructLMM, our proposed G &#xd7; EWAS performed better than StructLMM (Additional File 3: <xref ref-type="sec" rid="s12">Supplementary Figures S8&#x2013;S10</xref> and <xref ref-type="table" rid="T2">Table 2</xref>). The superiority of G &#xd7; EWAS was mainly because it built a genomic relationship matrix to capture the realised relationships among individuals; moreover, it can accurately capture the effect of each environment on markers by adding the GEI vector in the model, which follows a multivariate normal distribution. In the scenario of larger variance of GEI, more QTLs would contribute to the GEI, increasing the weight of per-individual effect size as described in <xref ref-type="disp-formula" rid="e1">Eq. 1</xref>; thus, it could be easily detected using G &#xd7; EWAS.</p>
<p>Although G &#xd7; EWAS could detect more significant SNPs associated with GEI using Bonferroni correction, only a small amount of the whole markers were detected. The best performance of G &#xd7; EBLUP was obtained at the marker selection criterion of <italic>p</italic>-value of E02 (BFT in pig and WC in maize) or E03 (simulated data and WC in maize) (<italic>p</italic>-values &#x3c; 10<sup>&#x2013;2</sup> or 10<sup>&#x2013;3</sup>) in all scenarios (<xref ref-type="table" rid="T3">Table 3</xref>). In fact, the performance of G &#xd7; EBLUP at E02 and E03 was similar. Accordingly, the selected SNPs with GEI for simulated data, AGE and BFT in pig and WC and GW in maize were enough to build a genomic relationship matrix to elucidate the contribution of GEI. The number of selected SNPs with GEI in G &#xd7; EBLUP was lower than that of the significant SNPs at false discovery rate of 0.05 (Additional File 2: <xref ref-type="sec" rid="s12">Supplementary Table S2</xref>). Therefore, it is reasonable to use <italic>p</italic>-values of &#x3c;10<sup>&#x2013;3</sup> as threshold for determination of SNPs with GEI in G &#xd7; EBLUP. The advantage of G &#xd7; EBLUP over GBLUP is mainly because G &#xd7; EBLUP allows the assignment of different weights to the genomic variants in the different genomic relationship based on their estimated genomic parameters, which can better fit the genetic architecture of the trait, while randomly selected a subset of SNPs that are not all associated with the trait, giving more weight to these SNPs in G &#xd7; EBLUP does not improve the accuracy of genomic prediction (results not shown).</p>
<p>Along with the mapping of G &#xd7; E markers, G &#xd7; EWAS could determine favourable environment using BF of SNPs on each environment. This is extremely helpful for market-specific breeding in animals and plants as it may provide further explanation to those individuals who have a higher risk of being affected by GEI in a certain environment variable (sensitive environment). In this study, farm was identified as a sensitive environment for BFT in pigs, which allows the selection of elite individuals with good performance in specified farms. Further, our results showed that GEI was different for distinct traits, e.g. similar genomic prediction accuracies were obtained for G &#xd7; EBLUP and GBLUP for AGE, whereas G &#xd7; EBLUP showed improvement by approximately 3% in the prediction accuracy for BFT in pigs (<xref ref-type="fig" rid="F3">Figure 3</xref>). This observation is consistent with that of our previous report that showed GEI for BFT but not for AGE (<xref ref-type="bibr" rid="B27">Song et al., 2020b</xref>). This could be explained by values of the variance of the slope (<inline-formula id="inf117">
<mml:math id="m127">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) compared with those of the intercept (<inline-formula id="inf118">
<mml:math id="m128">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) in the reaction norm model; <inline-formula id="inf119">
<mml:math id="m129">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> / <inline-formula id="inf120">
<mml:math id="m130">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">a</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> were 0.002 and 0.348 for AGE and BFT, respectively. Thus, traits with small variance of GEI cannot improve the accuracy of genomic prediction using G &#xd7; EBLUP even after the identification of more significant SNPs on AGE. Similarly, the less significant SNPs in the maize data showed larger variance and greater improvement in the genomic prediction accuracy than those in the pig data (<xref ref-type="fig" rid="F3">Figure 3</xref>). Further, this might be due to the trait characteristics of plants, which are more vulnerable to GEI than livestock, the effect of environment in animal become ignorable as the industrial management. Conversely, the small sample size of the maize may have reduced the power of G &#xd7; EWAS, leading to the identification of a small number of significant GEI markers, which may explain why the gain of G &#xd7; EBLUP was lower than expected. In addition, using sensitive tests to detect sensitive environments is an alternative, and then fitting the overlap environment of SNPs in the G &#xd7; EBLUP model based on the pre-defined environment. However, the effect of this method and how to fit it in G &#xd7; EBLUP model need to be investigated in the future.</p>
<p>Our results showed G &#xd7; EBLUP is a powerful alternative to the conventional method for the estimation of genomic breeding value. However, there are several limitations in this approach. First, G &#xd7; EWAS is not a whole-genome regression model and does not account for the genome-wide contribution of all other variants, thus G &#xd7; EWAS is still a single marker regression model with a long calculation time (<xref ref-type="table" rid="T4">Table 4</xref>). G &#xd7; EWAS assumes all SNPs affected by GEI in the current model, and it could not differentiate between the significant SNPs with or without GEI. It might be the reason why a large number of significant SNPs were detected in the simulated data, although only few overlapped with simulated QTLs. The same phenomenon was also found in other methods, such as Bartlett, F-Killeen, L-mean, L-median and StructLMM, implying that the detection of GEI is more difficult due to the flexibility of the environment. Second, although sensitive environment can be obtained by calculating the BF value for each environment variable, the BF value of each level of an environment variable cannot be obtained (e.g., BF for each farm in pig data), which might be important for directional breeding. Third, G &#xd7; EBLUP has an advantage only when the variance of the G &#xd7; E interaction is relatively large, e.g., similar genomic prediction accuracies were obtained for G &#xd7; EBLUP and GBLUP for AGE, as was not affected by GEI. Finally, G &#xd7; EWAS cannot handle binary traits at present, as G&#xd7;E tests need the estimation of nuisance parameters to capture the main effects of binary traits, and estimating these parameters requires high-dimensional integration and the inversion of a high-dimensional similarity matrix. Nevertheless, it is worth being investigated in the future.</p>
<p>Moreover, G &#xd7; EBLUP can be extended to single-step method (ssG &#xd7; EBLUP), which could improve the genomic prediction accuracy using pedigree and genomic information (<xref ref-type="bibr" rid="B19">Misztal et al., 2009</xref>; <xref ref-type="bibr" rid="B1">Aguilar et al., 2010</xref>; <xref ref-type="bibr" rid="B5">Christensen and Lund., 2010</xref>).</p>
</sec>
<sec sec-type="conclusion" id="s5">
<title>Conclusion</title>
<p>The G &#xd7; EBLUP method proposed in this study showed the following four features: 1) genomic prediction was performed using the G &#xd7; EBLUP method by considering GEI and yielded higher accuracies and lower MSE in both simulated and real pig and maize data when the variance of G &#xd7; E interaction is large; 2) it could powerfully detect the genome-wide SNPs subject to GEI; 3) the number of environment levels did not influence the genomic prediction accuracy of the proposed G &#xd7; EBLUP, circumventing the limitation of current methods; 4) it could determine favourable environment using SNP BF for each environment, thus being useful for market-specific animal and plant breeding.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The simulated data, pig and maize data supporting the conclusions of this article are available from Figshare: <ext-link ext-link-type="uri" xlink:href="https://fshare.com/articles/dataset/GXEBLUP/20347368">https://figshare.com/articles/dataset/GXEBLUP/20347368</ext-link>.</p>
</sec>
<sec id="s7">
<title>Ethics statement</title>
<p>The animal study was reviewed and approved by Animal handling and sample collection were conducted according to protocols approved by the Institutional Animal Care and Use Committee (IACUC) at China Agricultural University. Written informed consent was obtained from the owners for the participation of their animals in this study.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>HS and XD conceived the new method, designed the experiment. HS, XW and YG participated in the data analysis and the result interpretation. HS and XW wrote the paper. XD revised the paper. All authors read and approved the final manuscript.</p>
</sec>
<sec id="s9">
<title>Funding</title>
<p>This work was supported by the National Key Research and Development Project (grant number: 2019YFE0106800); Modern Agriculture Science and Technology Key Project of Hebei Province (grant number: 19226376D); Beijing Natural Science Foundation (grant number: 6222014) and China Agriculture Research System of MOF and MARA.</p>
</sec>
<ack>
<p>The authors gratefully acknowledge Lai Jinsheng and Song Weibin at China Agricultural University, Beijing Tongzhou International Seed Co., Ltd., Beijing Liuma Pig Breeding Co., Ltd. for providing maize data and pig blood samples.</p>
</ack>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s12">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2022.972557/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2022.972557/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet3.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet2.docx" id="SM2" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.docx" id="SM3" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aguilar</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Misztal</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>D. L.</given-names>
</name>
<name>
<surname>Legarra</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tsuruta</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lawlor</surname>
<given-names>T. J.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.</article-title> <source>J. Dairy Sci.</source> <volume>93</volume> (<issue>2</issue>), <fpage>743</fpage>&#x2013;<lpage>752</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2730</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Braz</surname>
<given-names>C. U.</given-names>
</name>
<name>
<surname>Rowan</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Schnabel</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Decker</surname>
<given-names>J. E.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Genome-wide association analyses identify genotype-by-environment interactions of growth traits in Simmental cattle</article-title>. <source>Sci. Rep.</source> <volume>11</volume> (<issue>1</issue>), <fpage>13335</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-92455-x</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Browning</surname>
<given-names>B. L.</given-names>
</name>
<name>
<surname>Browning</surname>
<given-names>S. R.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals</article-title>. <source>Am. J. Hum. Genet.</source> <volume>84</volume> (<issue>2</issue>), <fpage>210</fpage>&#x2013;<lpage>223</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2009.01.005</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Chang</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Chow</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Tellier</surname>
<given-names>L. C.</given-names>
</name>
<name>
<surname>Vattikuti</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Purcell</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J. J.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Second-generation PLINK: rising to the challenge of larger and richer datasets</article-title>. <source>Gigascience</source> <volume>2</volume> (<issue>25</issue>), <fpage>4</fpage>&#x2013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1186/s13742-015-0047-8</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Christensen</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Genomic prediction when some animals are not genotyped</article-title>. <source>Genet. Sel. Evol.</source> <volume>42</volume> (<issue>1</issue>), <fpage>2</fpage>. <pub-id pub-id-type="doi">10.1186/1297-9686-42-2</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Crossa</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>From genotype &#xd7; environment interaction to gene &#xd7; environment interaction</article-title>. <source>Curr. Genomics</source> <volume>13</volume> (<issue>3</issue>), <fpage>225</fpage>&#x2013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.2174/138920212800543066</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davies</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1980</year>). <article-title>Algorithm as 155: The distribution of a linear combination of &#x3c7;2 random variables</article-title>. <source>Appl. Stat.</source> <volume>29</volume> (<issue>3</issue>), <fpage>323</fpage>&#x2013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.2307/2346911</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Habier</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Fernando</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Kizilkaya</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Garrick</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Extension of the bayesian alphabet for genomic selection</article-title>. <source>BMC Bioinforma.</source> <volume>12</volume>, <fpage>186</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-186</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heslot</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Akdemir</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sorrells</surname>
<given-names>M. E.</given-names>
</name>
<name>
<surname>Jannink</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions</article-title>. <source>Theor. Appl. Genet.</source> <volume>127</volume> (<issue>2</issue>), <fpage>463</fpage>&#x2013;<lpage>480</lpage>. <comment>[Journal Article; Research Support, Non-U.S. Gov&#x27;t; Research Support, U.S. Gov&#x27;t, Non-P.H.S.]</comment>. <pub-id pub-id-type="doi">10.1007/s00122-013-2231-5</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yongqiang</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>H. Z.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>A new chi-square approximation to the distribution of non-negative definite quadratic forms in non-central normal variables</article-title>. <source>Comput. Statistics Data Analysis</source> <volume>53</volume> (<issue>4</issue>), <fpage>853</fpage>. <pub-id pub-id-type="doi">10.1016/j.csda.2008.11.025</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jarquin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Crossa</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lacaze</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Du Cheyron</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Daucourt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lorgeou</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>A reaction norm model for genomic selection using high-dimensional genomic and environmental data</article-title>. <source>Theor. Appl. Genet.</source> <volume>127</volume> (<issue>3</issue>), <fpage>595</fpage>&#x2013;<lpage>607</lpage>. <comment>N.I.H., Extramural; Research Support, Non-U.S. Gov&#x27;t]</comment>. <pub-id pub-id-type="doi">10.1007/s00122-013-2243-1</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jarquin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lemes</surname>
<given-names>D. S. C.</given-names>
</name>
<name>
<surname>Gaynor</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Poland</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Fritz</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Howard</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Increasing Genomic-Enabled prediction accuracy by modeling genotype x environment interactions in Kansas wheat</article-title>. <source>The plant genome</source> <volume>10</volume> (<issue>2</issue>). <pub-id pub-id-type="doi">10.3835/plantgenome2016.12.0130</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kass</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Raftery</surname>
<given-names>A. E.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Bayes factors</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>90</volume> (<issue>430</issue>), <fpage>773</fpage>&#x2013;<lpage>795</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1995.10476572</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kerin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Marchini</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Inferring gene-by-environment interactions with a bayesian whole-genome regression model</article-title>. <source>Am. J. Hum. Genet.</source> <volume>107</volume> (<issue>4</issue>), <fpage>698</fpage>&#x2013;<lpage>713</lpage>. <comment>[Journal Article; Research Support, Non-U.S. Gov&#x27;t]</comment>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2020.08.009</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lippert</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Xiang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Horta</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Widmer</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Kadie</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Heckerman</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Greater power and computational efficiency for kernel-based association testing of sets of genetic variants</article-title>. <source>Bioinformatics</source> <volume>30</volume> (<issue>22</issue>), <fpage>3206</fpage>&#x2013;<lpage>3214</lpage>. <comment>N.I.H., Extramural; Research Support, Non-U.S. Gov&#x27;t]</comment>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu504</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Su</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Hoglund</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Thomasen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Christiansen</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Genotype by environment interaction for female fertility traits under conventional and organic production systems in Danish Holsteins</article-title>. <source>J. Dairy Sci.</source> <volume>102</volume> (<issue>9</issue>), <fpage>8134</fpage>&#x2013;<lpage>8147</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2018-15482</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Madsen</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Sorensen</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Su</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Damgaard</surname>
<given-names>L. H.</given-names>
</name>
<name>
<surname>Thomsen</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Labouriau</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2006</year>). &#x201c;<article-title>Dmu - a package for analyzing multivariate mixed models</article-title>,&#x201d; in <conf-name>the proceedings of the 8th World Congress on Genetics Applied to Livestock Production</conf-name>, <conf-loc>Brasil</conf-loc>, <fpage>11</fpage>&#x2013;<lpage>27</lpage>. </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meuwissen</surname>
<given-names>T. H.</given-names>
</name>
<name>
<surname>Hayes</surname>
<given-names>B. J.</given-names>
</name>
<name>
<surname>Goddard</surname>
<given-names>M. E.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Prediction of total genetic value using genome-wide dense marker maps</article-title>. <source>Genetics</source> <volume>157</volume> (<issue>4</issue>), <fpage>1819</fpage>&#x2013;<lpage>1829</lpage>. <pub-id pub-id-type="doi">10.1093/genetics/157.4.1819</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Misztal</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Legarra</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Aguilar</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information.</article-title> <source>J. Dairy Sci.</source> <volume>92</volume> (<issue>9</issue>), <fpage>4648</fpage>&#x2013;<lpage>4655</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2064</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moore</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Casale</surname>
<given-names>F. P.</given-names>
</name>
<name>
<surname>Jan</surname>
<given-names>B. M.</given-names>
</name>
<name>
<surname>Horta</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Franke</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Barroso</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>A linear mixed-model approach to study multivariate gene-environment interactions.</article-title> <source>Nat. Genet.</source> <volume>51</volume> (<issue>1</issue>), <fpage>180</fpage>&#x2013;<lpage>186</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0271-0</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perez-Rodriguez</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Crossa</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Rutkoski</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Poland</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Legarra</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Single-Step genomic and pedigree genotype x environment interaction models for predicting wheat lines in international environments</article-title>. <source>Plant Genome-US</source> <volume>10</volume> (<issue>2</issue>), <fpage>28724079</fpage>. <comment>[Journal Article; Research Support, Non-U.S. Gov&#x27;t; Research Support, U.S. Gov&#x27;t, Non-P.H.S.]</comment>. <pub-id pub-id-type="doi">10.3835/plantgenome2016.09.0089</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rebecka</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Erling</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Per</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Just</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hossein</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Genotype by environment interaction in nordic dairy cattle studied using reaction norms</article-title>. <source>Acta Agric. Scand. Sect. A - Animal Sci.</source> <volume>52</volume> (<issue>1</issue>), <fpage>11</fpage>&#x2013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1080/09064700252806380</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richard</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>Introduction to quantitative genetics (4th edn)</article-title>. <source>Trends Genet.</source> <volume>12</volume> (<issue>7</issue>), <fpage>280</fpage>. <pub-id pub-id-type="doi">10.1016/0168-9525(96)81458-2</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Romay</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Millard</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Glaubitz</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Peiffer</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Swarts</surname>
<given-names>K. L.</given-names>
</name>
<name>
<surname>Casstevens</surname>
<given-names>T. M.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Comprehensive genotyping of the USA national maize inbred seed bank.</article-title> <source>Genome Biol.</source> <volume>14</volume> (<issue>6</issue>), <fpage>R55</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2013-14-6-r55</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Mi</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Genomic prediction for growth and reproduction traits in pig using an admixed reference population.</article-title> <source>J. Anim. Sci.</source> <volume>95</volume> (<issue>8</issue>), <fpage>3415</fpage>&#x2013;<lpage>3424</lpage>. <pub-id pub-id-type="doi">10.2527/jas.2017.1656</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020a</year>). <article-title>The superiority of multi-trait models with genotype-by-environment interactions in a limited number of environments for genomic prediction in pigs</article-title>. <source>J. Anim. Sci. Biotechnol.</source> <volume>11</volume> (<issue>1</issue>), <fpage>88</fpage>. <pub-id pub-id-type="doi">10.1186/s40104-020-00493-8</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Misztal</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020b</year>). <article-title>Genomic prediction of growth traits for pigs in the presence of genotype by environment interactions using single-step genomic reaction norm model.</article-title> <source>J. Anim. Breed. Genet.</source> <volume>137</volume> (<issue>6</issue>), <fpage>523</fpage>&#x2013;<lpage>534</lpage>. <pub-id pub-id-type="doi">10.1111/jbg.12499</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tiezzi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>de Los</surname>
<given-names>C. G.</given-names>
</name>
<name>
<surname>Parker</surname>
<given-names>G. K.</given-names>
</name>
<name>
<surname>Maltecca</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Genotype by environment (climate) interaction improves genomic prediction for production traits in US Holstein cattle</article-title>. <source>J. Dairy Sci.</source> <volume>100</volume> (<issue>3</issue>), <fpage>2042</fpage>&#x2013;<lpage>2056</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2016-11543</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>VanRaden</surname>
<given-names>P. M.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Efficient methods to compute genomic predictions</article-title>. <source>J. Dairy Sci.</source> <volume>91</volume> (<issue>11</issue>), <fpage>4414</fpage>&#x2013;<lpage>4423</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2007-0980</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>VanRaden</surname>
<given-names>P. M.</given-names>
</name>
<name>
<surname>Van Tassell</surname>
<given-names>C. P.</given-names>
</name>
<name>
<surname>Wiggans</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Sonstegard</surname>
<given-names>T. S.</given-names>
</name>
<name>
<surname>Schnabel</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>J. F.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <article-title>Invited review: Reliability of genomic predictions for North American Holstein bulls</article-title>. <source>J. Dairy Sci.</source> <volume>92</volume> (<issue>1</issue>), <fpage>16</fpage>&#x2013;<lpage>24</lpage>. <comment>[Journal Article; Research Support, Non-U.S. Gov&#x27;t; Research Support, U.S. Gov&#x27;t, Non-P.H.S.]</comment>. <pub-id pub-id-type="doi">10.3168/jds.2008-1514</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Kemper</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Genotype-by-environment interactions inferred from genetic effects on phenotypic variability in the UK Biobank</article-title>. <source>Sci. Adv.</source> <volume>5</volume> (<issue>8</issue>), <fpage>w3538</fpage>. <pub-id pub-id-type="doi">10.1126/sciadv.aaw3538</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Rare-variant association testing for sequencing data with the sequence kernel association test</article-title>. <source>Am. J. Hum. Genet.</source> <volume>89</volume> (<issue>1</issue>), <fpage>82</fpage>&#x2013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2011.05.029</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Kargo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Thomasen</surname>
<given-names>J. R.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Su</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Genotype-by-environment interaction of fertility traits in Danish Holstein cattle using a single-step genomic reaction norm model</article-title>. <source>Hered. (Edinb)</source> <volume>123</volume> (<issue>2</issue>), <fpage>202</fpage>&#x2013;<lpage>214</lpage>. <comment>[Journal Article; Research Support, Non-U.S. Gov&#x27;t]</comment>. <pub-id pub-id-type="doi">10.1038/s41437-019-0192-4</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Dekkers</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Fernando</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Jannink</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: A barley case study</article-title>. <source>Genetics</source> <volume>182</volume> (<issue>1</issue>), <fpage>355</fpage>&#x2013;<lpage>364</lpage>. <comment>U.S. Gov&#x27;t, Non-P.H.S.]</comment>. <pub-id pub-id-type="doi">10.1534/genetics.108.098277</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>