<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2014.00341</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Methylated DNA is over-represented in whole-genome bisulfite sequencing data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Ji</surname> <given-names>Lexiang</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Sasaki</surname> <given-names>Takahiko</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Sun</surname> <given-names>Xiaoxiao</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/183388"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ma</surname> <given-names>Ping</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/32146"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Lewis</surname> <given-names>Zachary A.</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Schmitz</surname> <given-names>Robert J.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/135367"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Genetics, University of Georgia, Athens, GA</institution> <country>USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Institute of Bioinformatics, University of Georgia, Athens, GA</institution> <country>USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Microbiology, University of Georgia, Athens, GA</institution> <country>USA</country></aff>
<aff id="aff4"><sup>4</sup><institution>Department of Statistics, University of Georgia, Athens, GA</institution> <country>USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Mark D. Robinson, University of Zurich, Switzerland</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Anne Parle-McDermott, Dublin City University, Ireland; Cuncong Zhong, J. Craig Venter Institute, USA; Michael B. Stadler, Friedrich Miescher Institute, Switzerland</italic></p></fn>
<fn fn-type="corresp" id="fn002"><p>&#x0002A;Correspondence: <italic>Robert J. Schmitz, Department of Genetics, University of Georgia,120 East Green Street, Davison Life Sciences, B416 Athens, GA 30602, USA e-mail: schmitz@uga.edu</italic></p></fn>
<fn fn-type="other" id="fn001"><p>This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>10</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>5</volume>
<elocation-id>341</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>06</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>09</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Ji, Sasaki, Sun, Ma, Lewis and Schmitz.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p> This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The development of whole-genome bisulfite sequencing (WGBS) has resulted in a number of exciting discoveries about the role of DNA methylation leading to a plethora of novel testable hypotheses. Methods for constructing sodium bisulfite-converted and amplified libraries have recently advanced to the point that the bottleneck for experiments that use WGBS has shifted to data analysis and interpretation. Here we present empirical evidence for an over-representation of reads from methylated DNA in WGBS. This enrichment for methylated DNA is exacerbated by higher cycles of PCR and is influenced by the type of uracil-insensitive DNA polymerase used for amplifying the sequencing library. Future efforts to computationally correct for this enrichment bias will be essential to increasing the accuracy of determining methylation levels for individual cytosines. It is especially critical for studies that seek to accurately quantify DNA methylation levels in populations that may segregate for allelic DNA methylation states.</p>
</abstract>
<kwd-group>
<kwd>DNA methylation</kwd>
<kwd>whole genome bisulfite sequencing</kwd>
<kwd>Epigenomics</kwd>
<kwd>epigenetics</kwd>
<kwd>PCR bias</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="54"/>
<page-count count="10"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>INTRODUCTION</title>
<p>The ability to produce genome-wide data sets has radically changed during the last decade. With the advent of high-throughput sequencing, billions of short read data can be generated within days. Currently, one of the primary uses of these short reads is to explore genetic variation within the human population with the goal of linking variants to diseases. Identifying these causal associations is the first major step to understanding how diseases arise, how they manifest and how they can be treated. Additionally, these short reads are used to study a wide range of model systems and an ever-growing list of non-model systems. Although much of the energy in the scientific community focuses on genetic variants there are a growing number of scientists that are fascinated by the epigenome. The NIH Roadmap Epigenomics Mapping Consortium defines the epigenome as a map of features distributed throughout and on top of the genome (<xref ref-type="bibr" rid="B5">Bernstein et al., 2010</xref>). These features can include RNAs such as mRNAs, long non-coding RNAs and small RNAs, DNA-protein interactions, chromatin accessibility, nucleosome positioning, covalent modifications to histone tails and DNA methylation, to name a few. Understandably, refinement of epigenomic techniques lagged behind development of genomic DNA library preparation due to their demand for use and due to the difficulty in execution. Part of the reason for this sudden increase and interest in epigenomics is due to the standardization of protocols and analysis pipelines used to produce and interpret data. No longer is data generation the rate-limiting step. Instead, the ability to accurately analyze and interpret these data sets has become the major bottleneck in the daily workflow.</p>
<p>One of the most exciting technical advances in field of epigenomics was the development of methods to analyze DNA methylation at single-base resolution (<xref ref-type="bibr" rid="B12">Cokus et al., 2008</xref>; <xref ref-type="bibr" rid="B28">Lister et al., 2008</xref>). Generally referred to as whole-genome bisulfite sequencing (WGBS), these methods combine sodium bisulfite conversion (<xref ref-type="bibr" rid="B20">Hayatsu et al., 1970</xref>; <xref ref-type="bibr" rid="B43">Shapiro et al., 1973</xref>, <xref ref-type="bibr" rid="B44">1974</xref>; <xref ref-type="bibr" rid="B19">Hayatsu, 1976</xref>; <xref ref-type="bibr" rid="B15">Frommer et al., 1992</xref>; <xref ref-type="bibr" rid="B10">Clark et al., 1994</xref>, <xref ref-type="bibr" rid="B11">2006</xref>), the gold standard for determining DNA methylation states at individual cytosines, with high-throughput DNA sequencing. Using this technique, unmethylated cytosines are converted into uracils and then into thymines after PCR. The observation of cytosines in sequencing reads typically indicates that the cytosine was methylated, as methylcytosines are protected from conversion by treatment with sodium bisulfite. These methods are capable of testing approximately 90% of all cytosines in genomes studied to date. They also generate quantitative measurements of DNA methylation at each cytosine due to the multiple independent reads that align to a particular sequence. These methods were originally tested on <italic>Arabidopsis thaliana</italic> genomic DNA (<xref ref-type="bibr" rid="B12">Cokus et al., 2008</xref>; <xref ref-type="bibr" rid="B28">Lister et al., 2008</xref>) as this species has a relatively small genome size (&#x0223C;150 Mb) compared to human genomes (&#x0223C;3 Gb per haploid genome), which made testing and troubleshooting cost efficient. Since the first single base resolution DNA methylomes for <italic>A. thaliana</italic> surfaced, larger DNA methylomes have been sequenced such as humans (<xref ref-type="bibr" rid="B29">Lister et al., 2009</xref>), mice (<xref ref-type="bibr" rid="B48">Stadler et al., 2011</xref>), corn (<xref ref-type="bibr" rid="B17">Gent et al., 2013</xref>), and soybean (<xref ref-type="bibr" rid="B38">Schmitz et al., 2013a</xref>), but in general it is cost prohibitive to sequence large numbers of individual samples using this technique. As a result, reduced representation bisulfite sequencing (RRBS) was developed in which &#x0223C;1% of the genome is studied making it possible to investigate large numbers of individuals at the cost of surveying the entire genome (<xref ref-type="bibr" rid="B32">Meissner et al., 2005</xref>). Most recently, additional modifications to cytosines have been identified such as 5-hydroxymethylation, 5-formylcytosine, and 5-carboxylcytosine and methodologies have been developed to study the presence of some of these base modifications genome-wide (<xref ref-type="bibr" rid="B6">Booth et al., 2012</xref>, <xref ref-type="bibr" rid="B7">2014</xref>; <xref ref-type="bibr" rid="B53">Yu et al., 2012</xref>).</p>
<p>The initial genome-wide maps of DNA methylation were invaluable for defining previously unexpected epigenomic signatures in a variety of systems, such as partially methylated domains and non-CG methylation in animals (<xref ref-type="bibr" rid="B29">Lister et al., 2009</xref>) and CHH islands in maize (<xref ref-type="bibr" rid="B17">Gent et al., 2013</xref>). Since these initial maps, WGBS has been used to study epigenome reprogramming of induced pluripotent stem cells (<xref ref-type="bibr" rid="B30">Lister et al., 2011</xref>), genomic imprinting in both plants (<xref ref-type="bibr" rid="B16">Gehring et al., 2009</xref>; <xref ref-type="bibr" rid="B22">Hsieh et al., 2009</xref>; <xref ref-type="bibr" rid="B54">Zemach et al., 2010</xref>; <xref ref-type="bibr" rid="B9">Calarco et al., 2012</xref>) and animals (<xref ref-type="bibr" rid="B52">Xie et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Kobayashi et al., 2013</xref>; <xref ref-type="bibr" rid="B46">Shirane et al., 2013</xref>), germ line epigenome reprogramming in plants (<xref ref-type="bibr" rid="B47">Slotkin et al., 2009</xref>; <xref ref-type="bibr" rid="B9">Calarco et al., 2012</xref>; <xref ref-type="bibr" rid="B23">Ibarra et al., 2012</xref>; <xref ref-type="bibr" rid="B14">Creasey et al., 2014</xref>) and animals (<xref ref-type="bibr" rid="B42">Seisenberger et al., 2012</xref>; <xref ref-type="bibr" rid="B24">Jiang et al., 2013</xref>) and also to study the impact of genetic variation on DNA methylation (<xref ref-type="bibr" rid="B38">Schmitz et al., 2013a</xref>,b; <xref ref-type="bibr" rid="B34">Orozco et al., 2014</xref>). Clearly, mapping and analyses of DNA methylomes is a relatively young field with great potential for exciting discoveries in the next decade.</p>
<p>It is well established that there is a GC content bias in amplification of fragments of DNA used for high-throughput sequencing applications (<xref ref-type="bibr" rid="B1">Aird et al., 2011</xref>; <xref ref-type="bibr" rid="B4">Benjamini and Speed, 2012</xref>). As a result, methods were developed to avoid PCR amplification all together in the construction of DNA sequencing libraries (<xref ref-type="bibr" rid="B27">Kozarewa et al., 2009</xref>). However, this approach is not applicable to bisulfite-treated DNA as the uracils in the fragments would inhibit cluster formation on the instrument unless reagents could be customized to include a uracil-insensitive DNA polymerase. As a result, bisulfite-treated DNA is amplified for varying cycles of PCR to replace uracils in the DNA sequence and to amplify libraries as treatment with sodium bisulfite degrades DNA. The degradation of DNA by sodium bisulfite also creates limitations with regards to the types of samples that can be sequenced, as higher input amounts are often required for most protocols. Moreover, after bisulfite conversion, previously complementary strands are single stranded and will no longer anneal together. In fact, previous observations of bisulfite-conversion and PCR have uncovered strand biases that severely affects estimates of DNA methylation levels (<xref ref-type="bibr" rid="B51">Warnecke et al., 1997</xref>). Correction methods have been developed for targeted bisulfite PCR experiments, but these methods are not feasible for WGBS (<xref ref-type="bibr" rid="B26">Komori et al., 2011</xref>; <xref ref-type="bibr" rid="B33">Moskalev et al., 2011</xref>). Given that methylated DNA will retain higher GC content after bisulfite conversion and PCR compared to unmethylated DNA, over-representation of methylated DNA may occur in the construction of sequencing libraries. Although libraries can be constructed from lower input DNA samples such as those prepared from FFPE (formalin-fixed paraffin-embedded), sequence-capture or single cell approaches, there usually is a cost of increasing the total PCR cycles. This cost likely comes in the form of increasing any bias in read enrichment between methylated and unmethylated sequences at the same locus.</p>
<p>In this study, we explore the presence of an amplification bias created in fragments that retain higher GC content after sodium-bisulfite treatment with unmethylated sequences and with genomic DNA sequences. A strong positive correlation between DNA methylation levels and normalized read counts are observed in the WGBS data. Investigation of three different commercially available uracil-insensitive enzymes reveal differences in amplification bias of highly methylated and high GC content DNA sequences.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>MATERIALS AND METHODS</title>
<sec>
<title>PLANT AND FUNGAL MATERIAL</title>
<p>The <italic>A. thaliana</italic> Col-0 accession was used for all. Plants were grown in 16 h day lengths and tissue was harvested from above ground rosette leaves 3 weeks after planting. DNA was isolated using the Qiagen (Valencia, CA, cat-69106, USA) DNeasy Plant Kit according to the manufacturer&#x02019;s protocol. All <italic>Neurospora</italic> experiments were performed with the wild-type &#x0201C;Oak Ridge&#x0201D; strain (FGSC# 2489) obtained from the Fungal Genetics Stock Center (<xref ref-type="bibr" rid="B31">McCluskey et al., 2010</xref>). &#x0223C;1 &#x000D7; 10<sup>6</sup> conidia/ml were germinated in 50ml of Vogel&#x02019;s Minimal Medium with 1.5% sucrose and grown for 5 h.</p>
</sec>
<sec>
<title>METHYLC-SEQUENCING LIBRARY PREPARATION</title>
<p>MethylC-seq libraries were prepared according to the following protocol (<xref ref-type="bibr" rid="B50">Urich et al., 2014</xref>), with the some modifications. Three different enzyme mixtures were used to amplify bisulfite-converted adapter ligated DNA: Kapa HiFi Uracil+ (cat-KK2802, Kapa Biosystems), Pfu Turbo Cx Hotstart DNA polymerase (Agilent Technologies, Santa Clara, CA, cat-600410, USA), and EpiMark (New England Biolabs., Ipswich, MA, cat-M0490S, USA). PCR cycle number varied from 4, 8, or 15 cycles as described in the main text.</p>
</sec>
<sec>
<title>PREPARATION AND SEQUENCING OF <italic>Neurospora</italic> DNA</title>
<p>Chromatin immunoprecipitation (ChIP) and DNA isolation were performed as described previously (<xref ref-type="bibr" rid="B37">Sasaki et al., 2014</xref>). For Illumina sequencing, libraries were prepared from 10 ng of ChIP input DNA using an Illumina TruSeq kit (Illumina cat-FC-121-2002). Libraries were prepared according to manufacturer instructions except for the following modifications. DNA adaptors were diluted 1:100 prior to ligation. PCR was performed for 4, 8, or 15 cycles, as indicated in the text. In some experiments, a PCR reaction mixture containing Kapa HiFi polymerase (Kapa Biosystems, cat-KK2502), 1X Kapa HiFi buffer, 200 nM dNTPs, 1X TruSeq primer cocktail, and 60 mM tetramethylammonium chloride (TMAC; Sigma-Aldrich, catalog # T3411) was used in place of the TruSeq PCR master mix [as described in (<xref ref-type="bibr" rid="B36">Oyola et al., 2012</xref>)]. Illumina sequencing was performed using an Illumina Next-Seq500 instrument at the University of Georgia genomics facility.</p>
</sec>
<sec>
<title>METHYLC-SEQ DATA ANALYSIS</title>
<p>FASTQ files were trimmed for adapters, preprocessed to remove low quality reads and aligned to the TAIR10 reference genome as previously described in (<xref ref-type="bibr" rid="B39">Schmitz et al., 2013b</xref>). Inefficiencies in the sodium bisulfite conversion reaction are calculated by measuring the fraction of methylated basecalls detected in the chloroplast genome (which is unmethylated). This non-conversion rate is used as the null hypothesis for a binomial test to determine if a cytosine is methylated and the resulting <italic>p</italic>-values are corrected for multiple testing using Benjamini&#x02013;Hochberg with an FDR cut off of 5%.</p>
</sec>
<sec>
<title>CALCULATION OF METHYLATION LEVELS</title>
<p>Weighted methylation levels were performed as described in (<xref ref-type="bibr" rid="B41">Schultz et al., 2012</xref>).</p>
</sec>
<sec>
<title>CALCULATION OF CORRELATION COEFFICIENTS</title>
<p>Pearson correlation was calculated. In <bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>, the outliers were identified using Bonferroni test (<xref ref-type="bibr" rid="B13">Cook and Weisberg, 1982</xref>) and then Pearson correlation was calculated after the outliers were removed.</p>
</sec>
<sec>
<title>CALCULATION OF NORMALIZED READ COVERAGE</title>
<p>For <bold>Figures <xref ref-type="fig" rid="F1">1D&#x02013;F</xref></bold> normalized read coverage was calculated by the total number of reads in the DMR (Differentially Methylated Region) divided by the total number of reads sequenced from the library. This value was then multiplied by 10<sup>5</sup>. For <bold>Figure <xref ref-type="fig" rid="F1">1G</xref></bold>, normalized read coverage was calculated as follows: genome sequencing reads from previously published <italic>A. thaliana</italic> Col-0 data (<xref ref-type="bibr" rid="B3">Becker et al., 2011</xref>) were used to determine a baseline level of mappability by the DNA that had not undergone sodium bisulfite treatment. In this case, all cytosines were converted to thymines in the FASTQ files and then the genome sequencing data set was used in the same analysis pipeline as the one used to map and determine methylation levels for sodium bisulfite-treated libraries. <italic>A. thaliana</italic> mutation accumulation line 1 replicate 2 was selected as bisulfite sequencing sample (<xref ref-type="bibr" rid="B40">Schmitz et al., 2011</xref>). In each 10 kb window, read numbers were counted and then divided by the number of mapped reads in the first tile to normalize them to the same scale. Normalized read coverage was then computed by subtracting the normalized number of reads present in the genomic DNA sample from the normalized number of reads in the bisulfite sequencing sample. 5% outliers were identified, as described in the Calculation of correlation coefficients section, and removed before computing linear regression (<xref ref-type="bibr" rid="B13">Cook and Weisberg, 1982</xref>). For <bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>, the window size was set to 100 kb, the enrichment of mapped reads for each tile was computed using the same method as described for 1 G. For <bold>Figures <xref ref-type="fig" rid="F3">3</xref>&#x02013;<xref ref-type="fig" rid="F5">5</xref></bold>, the methylation levels from <bold>Figure <xref ref-type="fig" rid="F1">1G</xref></bold> were used to set a fixed horizontal axis. Therefore, data points were only able to move in a vertical direction depending on normalized read coverage. This method was used because higher read coverage resulted in higher methylation levels, which made it challenging to accurately reveal and compare the trends in each figure. For <bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold>, to calculate normalized read coverage for <italic>Neurospora</italic>, the genome was divided into 1 kb windows using bedtools, and a gtf annotation file was constructed using custom scripts. Cuffdiff software (<xref ref-type="bibr" rid="B49">Trapnell et al., 2013</xref>) and the custom annotation file were used to calculate the enrichment ratios at each 1 kb window. %GC was determined using a custom script.</p>
</sec>
<sec>
<title>DATA ACCESS</title>
<p>The data generated for this study have been deposited in the Gene Expression Omnibus<sup><xref ref-type="fn" rid="fn01">1</xref></sup> and are accessible through accession number GSE58217. For WGBS data in <bold>Figures <xref ref-type="fig" rid="F1">1</xref></bold> and <bold><xref ref-type="fig" rid="F2">2</xref></bold> data were downloaded from the National Center for Biotechnology Information (NCBI) &#x02013; Sequence Read Archive (SRA)<sup><xref ref-type="fn" rid="fn02">2</xref></sup> and for genomic DNA sequence used in <bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold> data were downloaded from (<ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/sra/?term==ERX386705"></ext-link>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Methylated DNA is enriched in <bold>whole-genome bisulfite sequencing data (WGBS)</bold>. (A&#x02013;C)</bold> Examples of differentially methylated regions (DMRs) that contain identical DNA sequences. <bold>(D&#x02013;F)</bold> Highly methylated regions contain an abundance of bisulfite sequencing reads compared to lowly methylated sequences. <bold>(G)</bold> Highly methylated DNA is positively correlated with bisulfite sequencing read counts. The names of the samples in <bold>(A&#x02013;C)</bold> indicate lineage numbers as previously reported <xref ref-type="bibr" rid="B45">Shaw et al. (2000)</xref>. Pink shaded regions indicate DMRs in <bold>(A&#x02013;C)</bold> and the location of data used to produce <bold>(D&#x02013;F)</bold>. In <bold>(A&#x02013;C)</bold>, gold lines = methylated CGs, blue lines = methylated CHGs (whereH = A, C, or T) and pink lines = methylated CHHs. In <bold>(D&#x02013;F)</bold>, black dots = highly methylated alleles, red dots = lowly methylatedalleles. Data used to produce <bold>Figure <xref ref-type="fig" rid="F1">1G</xref></bold> were obtained from line1 replicate 2 from (<xref ref-type="bibr" rid="B40">Schmitz et al., 2011</xref>).</p></caption>
<graphic xlink:href="fgene-05-00341-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>Methylated DNA has a skewed GC content after sodium bisulfite conversion of DNA and this leads to an enrichment of methylated reads in bisulfite sequencing. (A)</bold> GC content beforebisulfite conversion, after bisulfite conversion, computed full conversion and the absolute difference in GC content between unconverted and bisulfite-treated DNA. <bold>(B)</bold> WGBS reads are positively correlated and enriched with highly methylated regions of the chromosome compared to alignment of sequencing reads from genomic DNA libraries. The black line and the corresponding <italic>y</italic>-axis on the left depict the total level of cytosine methylation in all combined contexts (CG, CHG, or CHH). The orange line and the corresponding <italic>y</italic>-axis on the right depict normalized read coverage. WGBS data used to produce this figure were obtained from line 1 replicate 2 from (<xref ref-type="bibr" rid="B40">Schmitz et al., 2011</xref>). Genomic DNA sequencing data used to produce this figure were obtained from sample 119 in (<xref ref-type="bibr" rid="B3">Becker et al., 2011</xref>).</p></caption>
<graphic xlink:href="fgene-05-00341-g002.tif"/>
</fig>
</sec>
</sec>
<sec>
<title>RESULTS AND DISCUSSIONS</title>
<sec>
<title>METHYLATED DNA IS OVER-REPRESENTED IN BISULFITE SEQUENCING DATA</title>
<p>Our previous studies that applied MethylC-seq (a WBGS method) on a population of <italic>A. thaliana</italic> MA lines (<xref ref-type="bibr" rid="B45">Shaw et al., 2000</xref>) revealed the existence of spontaneous epialleles (<xref ref-type="bibr" rid="B40">Schmitz et al., 2011</xref>; <bold>Figures <xref ref-type="fig" rid="F1">1A&#x02013;C</xref></bold>). These differentially methylated regions (DMRs) of the population have identical underlying DNA sequences as revealed by whole genome sequencing (<xref ref-type="bibr" rid="B35">Ossowski et al., 2010</xref>), but vary significantly in DNA methylation levels (<bold>Figures <xref ref-type="fig" rid="F1">1A&#x02013;C</xref></bold>). Further examination of the underlying data revealed that reads from highly methylated DMRs were over-represented compared to unmethylated regions after bisulfite conversion and DNA sequencing (<bold>Figures <xref ref-type="fig" rid="F1">1D&#x02013;F</xref></bold>). In total, eight different MA lines were sequenced in duplicate for a total of 16 samples. In every case, there was a positive correlation between the methylation level of the DMR and the normalized number of aligned sequenced reads (<bold>Figures <xref ref-type="fig" rid="F1">1D&#x02013;F</xref></bold>). Similarly, a chromosome-wide test revealed a strong positive correlation between DNA methylation levels and normalized (aligned) read counts from WGBS data (<bold>Figure <xref ref-type="fig" rid="F1">1G</xref></bold>; R<sup>2</sup> = 0.55).</p>
</sec>
<sec>
<title>GC CONTENT INFLUENCES ENRICHMENT OF HIGHLY METHYLATED DNA</title>
<p>Although the sequences within DMRs are identical, GC contents were significantly different after sodium bisulfite conversion. Therefore, the possibility exists that the underlying GC content likely influences the enrichment bias of methylated DNA. To further explore this question, the GC content throughout chromosome five was determined prior to and post sodium bisulfite conversion (<bold>Figure <xref ref-type="fig" rid="F2">2A</xref></bold>). Post bisulfite treatment, the GC content is significantly reduced by approximately 50%, which is expected as most of the cytosines are unmethylated and are converted to uracils and ultimately thymines after PCR. The lowest percentage reduction in GC content is observed in the pericentromeric region of the chromosome (<bold>Figure <xref ref-type="fig" rid="F2">2A</xref></bold>) where the highest density of methylation has been observed (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>; <xref ref-type="bibr" rid="B12">Cokus et al., 2008</xref>; <xref ref-type="bibr" rid="B28">Lister et al., 2008</xref>). To empirically test if sodium bisulfite-treated and amplified DNA is preferentially biased toward highly methylated reads, both WBGS and genomic DNA sequencing data from wild-type Col-0 were aligned using the same parameters. The read enrichment was calculated and plotted along chromosome five, which revealed an enrichment of WGBS reads compared to genomic DNA sequencing reads that paralleled DNA methylation levels along the chromosome (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>). Collectively, these data suggest that highly methylated sequences, which retain high GC content after bisulfite conversion and PCR amplification, are preferentially enriched in WGBS data compared to unmethylated sequences.</p>
</sec>
<sec>
<title>NUMBER OF PCR CYCLES INFLUENCES ENRICHMENT OF HIGHLY METHYLATED DNA</title>
<p>The introduction of altered GC content occurs during bisulfite treatment and subsequent PCR amplification. Bisulfite-converted DNA is rich with uracils, which impairs the ability of most DNA polymerases to efficiently amplify these fragments. The B-type DNA polymerase from <italic>Pyrococcus furiosus</italic> is able to polymerize uracil-rich DNA because it binds to deaminated bases (<xref ref-type="bibr" rid="B21">Horvath and Vertessy, 2010</xref>). This enzyme was engineered to polymerize uracil-rich DNA by mutating a single amino acid that blocks recognition of the deaminated bases (<xref ref-type="bibr" rid="B21">Horvath and Vertessy, 2010</xref>). These types of engineered uracil-tolerant enzymes are critical for single-base resolution studies of DNA methylation and as a result, a number of commercially available enzymes have been engineered to amplify uracil-containing templates. We performed a test on the Pfu Turbo Cx enzyme with biological replicates (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref>) using three different cycles (4, 8, and 15 cycles) of PCR (<bold>Figure <xref ref-type="fig" rid="F3">3A</xref></bold>) to assess the degree to which a uracil-insensitive enzyme is prone to an amplification bias of methylated DNA. A comparison between 4, 8, and 15 PCR cycles revealed that 15 cycles results in an exacerbation in the preferential enrichment of methylated DNA (<bold>Figures <xref ref-type="fig" rid="F3">3B&#x02013;G</xref></bold>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>Polymerase chain reaction (PCR) cycle number influences enrichment bias. (A)</bold> Diagram of the experimental design for testing the effect PCR cycle number on representation of methylated DNA. Normalized read counts versus methylation levels of 10 kb fragments from chromosome five for <bold>(B)</bold> 4 cycles, <bold>(C)</bold> 8 cycles, and <bold>(D)</bold> 15 cycles of PCR amplification using Pfu Turbo Cx. <bold>(E&#x02013;G)</bold> Normalized read numbers for each 10 kb window were compared between different PCR cycle numbers and correlation scores were determined as indicated in the upper right hand of the plot. All R<sup>2</sup> values reported reflect Pearson correlation coefficients.</p></caption>
<graphic xlink:href="fgene-05-00341-g003.tif"/>
</fig>
</sec>
<sec>
<title>COMPARATIVE ANALYSIS OF COMMERCIALLY AVAILABLE URACIL-INSENSITIVE ENZYMES</title>
<p>Based on the results from <bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>, an experiment was designed to test two additional commonly used commercially available enzymes in biological replicates (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref>) for their ability to amplify bisulfite-converted DNA (Kapa HiFi Uracil + and EpiMark). Upon constructing the MethylC-seq libraries the concentration of the samples were determined using a Qubit. This revealed that the total library yield for the 4-cycle/8-cycle PCR was similar between the three enzymes tested, but for the 15-cycle PCR, the KAPA HiFi Uracil + enzyme yielded significantly more total library (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>). Regardless, the total amount of library yielded in all experiments is more than adequate to sequence to sufficient depth. Next, the methylation levels were plotted against normalized read counts and as observed with the Pfu Turbo Cx enzyme in <bold>Figures <xref ref-type="fig" rid="F1">1</xref></bold> and <bold><xref ref-type="fig" rid="F2">2</xref></bold>, the other two tested enzymes also preferentially enrich methylated DNA, although to a lesser extent (<bold>Figures <xref ref-type="fig" rid="F4">4</xref></bold> and <bold><xref ref-type="fig" rid="F5">5</xref></bold>). However, both of these enzymes are sensitive to highly methylated sequences as the normalized read numbers started to decrease with higher methylation levels compared to intermediate methylation levels (<bold>Figures <xref ref-type="fig" rid="F4">4</xref></bold> and <bold><xref ref-type="fig" rid="F5">5</xref></bold>). Interestingly, between the 4, 8, and 15 cycle experiments for the Kapa HiFi Uracil + enzyme, extremely high GC content regions did not result in an increase in amplification bias of DNA like it does with both the Pfu Turbo Cx and EpiMark enzymes (<bold>Figures <xref ref-type="fig" rid="F3">3</xref>&#x02013;<xref ref-type="fig" rid="F5">5</xref></bold>), which would indicate this enzyme is likely preferred for most WGBS applications. Interestingly, the EpiMark enzyme revealed that at 15 PCR cycles a strong bias was associated with increasing GC content, but not with increasing levels of DNA methylation compared to the 4- and 8-cycle reactions (<bold>Figures <xref ref-type="fig" rid="F5">5A&#x02013;F</xref></bold>). Therefore, both PCR cycles and enzymes selected for amplification can influence enrichment of highly methylated DNA in sequencing data.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Sample name, cycle number and total library yield for each WGBS library are reported.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Sample name</th>
<th valign="top" align="left">Cycle number</th>
<th valign="top" align="left">Total library yield (ng/ul) replicate 1</th>
<th valign="top" align="left">Total library yield (ng/ul) replicate 2</th>
<th valign="top" align="left">Correlation coefficient between biological replicates</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Kapa HiFi U+</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">0.44</td>
<td valign="top" align="left">0.38</td>
<td valign="top" align="left">0.9167678</td>
</tr>
<tr>
<td valign="top" align="left">Pfu Turbo Cx</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">0.49</td>
<td valign="top" align="left">0.39</td>
<td valign="top" align="left">0.5800957</td></tr>
<tr>
<td valign="top" align="left">NEB EpiMark</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">0.59</td>
<td valign="top" align="left">0.25</td>
<td valign="top" align="left">0.8791214</td>
</tr>
<tr>
<td valign="top" align="left">Kapa HiFi U+</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">2.32</td>
<td valign="top" align="left">1.63</td>
<td valign="top" align="left">0.980213</td>
</tr>
<tr>
<td valign="top" align="left">Pfu Turbo Cx</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">1.45</td>
<td valign="top" align="left">1.03</td>
<td valign="top" align="left">0.8921665</td></tr>
<tr>
<td valign="top" align="left">NEB EpiMark</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">2.64</td>
<td valign="top" align="left">1.89</td>
<td valign="top" align="left">0.9589284</td>
</tr>
<tr>
<td valign="top" align="left">Kapa HiFi U+</td>
<td valign="top" align="left">15</td>
<td valign="top" align="left">80.8</td>
<td valign="top" align="left">62.4</td>
<td valign="top" align="left">0.9677855</td>
</tr>
<tr>
<td valign="top" align="left">Pfu Turbo Cx</td>
<td valign="top" align="left">15</td>
<td valign="top" align="left">40.2</td>
<td valign="top" align="left">36.1</td>
<td valign="top" align="left">0.9730408</td></tr>
<tr>
<td valign="top" align="left">NEB EpiMark</td>
<td valign="top" align="left">15</td>
<td valign="top" align="left">52.6</td>
<td valign="top" align="left">39.2</td>
<td valign="top" align="left">0.8796196</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>The correlation coefficient was calculated based on the normalized reads coverage between the biological replicates.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>Polymerase chain reaction amplification bias observed with Kapa HiFi Uracil +.</bold> Normalized read counts versus methylation levels for 10 kb windows from chromosome five for <bold>(A&#x02013;C)</bold> Kapa HiFi Uracil + from 4-, 8-, and 15-cycles of PCR. <bold>(D&#x02013;F)</bold> Normalized read numbers for each 10 kb window were compared between different PCR cycle numbers and correlation scores were determined as indicated in the upper right hand of the plot. All R<sup>2</sup> values reported reflect Pearson correlation coefficients.</p></caption>
<graphic xlink:href="fgene-05-00341-g004.tif"/>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p><bold>Polymerase chain reaction amplification bias observed with NEB EpiMark.</bold> Normalized read counts versus methylation levels for 10 kb windows from chromosome five for <bold>(A&#x02013;C)</bold> NEB EpiMark from 4-, 8-, and 15-cycles of PCR. <bold>(D&#x02013;F)</bold> Normalized read numbers for each 10 kb window were compared between different PCR cycle numbers and correlation scores were determined as indicated in the upper right hand of the plot. All R<sup>2</sup> values reported reflect Pearson correlation coefficients.</p></caption>
<graphic xlink:href="fgene-05-00341-g005.tif"/>
</fig>
</sec>
<sec>
<title>AT-RICH SEQUENCES ARE DEPLETED FROM DNA SEQUENCING LIBRARIES</title>
<p>ChIP-seq experiments carried out with the filamentous fungus <italic>Neurospora crassa</italic> also revealed a bias against AT-rich DNA sequences, demonstrating reduced coverage at AT-rich DNA is not restricted to uracil-tolerant polymerases. <italic>Neurospora</italic> heterochromatin domains have an average GC content of 30% compared to an average GC content of 51.5% for genes (<bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold>). ChIP input DNA was used to prepare Illumina sequencing libraries using two different polymerases and PCR was performed for 4, 8, and 15 cycles. Libraries prepared using a TruSeq PCR master-mix revealed equal representation of GC-rich and AT-rich regions after 4 and 8 cycles of PCR, but a clear amplification bias was observed after 15 cycles of PCR (<bold>Figures <xref ref-type="fig" rid="F6">6A,C</xref></bold>). It was reported previously that the Kapa HiFi polymerase, a distinct enzyme from the Kapa HiFi Uracil + polymerase used throughout the rest of the manuscript, efficiently amplified AT-rich DNA when supplemented with 60 mM TMAC (<xref ref-type="bibr" rid="B36">Oyola et al., 2012</xref>). However, our experiments with <italic>Neurospora</italic> DNA revealed a significant underrepresentation of AT-rich DNA after only 8 cycles of PCR using the KAPA HiFi DNA polymerase + TMAC (<bold>Figures <xref ref-type="fig" rid="F6">6B,D</xref></bold>). Therefore, the DNA polymerase used for PCR amplification and the number of PCR cycles selected can affect amplification of genomic DNA that contains large disparities in GC content.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p><bold>Polymerase chain reaction amplification bias in <italic>Neurospora crassa</italic> heterochromatin domains <bold>(A&#x02013;B)</bold> Sequence coverage is shown across a 1.2 Mb domain of the <italic>Neurospora</italic> genome (chromosome 1: 200&#x02013;1400 kb).</bold> Enrichment of tri-methyl H3 K9 (H3K9me3; <xref ref-type="bibr" rid="B37">Sasaki et al., 2014</xref>) and GC-content (%GC) are plotted to indicate the position of AT-rich heterochromatin domains. Data are shown for libraries prepared with Illumina PCR master mix <bold>(A)</bold> or Kapa HiFi polymerase + TMAC <bold>(B)</bold> and amplified for 4, 8, and 15 cycles, as indicated. The positions of genes are plotted beneath the coverage tracks. <bold>(C&#x02013;D)</bold> Normalized read coverage and GC-content was calculated for 1 kb windows across the entire <italic>Neurospora</italic> genome for libraries prepared using different conditions. Log<sub>2</sub> values obtained by comparing normalized read coverage after 4 and 8 cycles <bold>(top)</bold> or 4 and 15 cycles <bold>(bottom)</bold> are plotted on the <italic>y</italic>-axis. %GC is plotted on the <italic>x</italic>-axis. Results for TruSeq master mix and Kapa HiFi polymerase are shown in panels <bold>(C,D)</bold>, respectively.</p></caption>
<graphic xlink:href="fgene-05-00341-g006.tif"/>
</fig>
</sec>
</sec>
<sec>
<title>CONCLUSION</title>
<p>Whole-genome bisulfite sequencing is an excellent method for determining base resolution DNA methylomes and although in most genomes studied to date, it has been able to determine methylation levels for >90% of cytosines, it does have some limitations. In this study, we show that there exists a strong amplification bias of methylated DNA compared to unmethylated sequences and that this bias can be affected by PCR cycle number and the enzyme used for PCR amplification. Interestingly, this change in amplification bias is not observed with the Kapa HiFi Uracil + enzyme in comparison to the others tested. This observation suggests that the high PCR cycle number that is necessary to construct a sequenceable library for low input samples such as RRBS (<xref ref-type="bibr" rid="B18">Gu et al., 2011</xref>), single cells, sequence-capture or FFPE samples can be achieved without introducing major artifacts from the number of PCR cycles selected. Therefore, although an amplification bias will occur at these highly methylated sequences, in these types of libraries, it will not grow in strength to the point that it depletes lower methylated sequences from produced sequencing data. This effect is not true for all enzymes used in amplification of bisulfite converted DNA as the Pfu Turbo Cx enzyme resulted in a significant over-enrichment of reads associated with methylated DNA with increasing cycle number. Furthermore, even tests on genomic DNA from <italic>Neurospora</italic> revealed significant variation between different commercially available enzymes and selected PCR cycle number. To improve the quality and to reduce to bias associated with WGBS data it is recommended to start higher concentrations of genomic DNA when possible, limit the number of PCR cycles and optimize the polymerase used for amplification of the library.</p>
<p>In general, it is difficult to precisely quantify the effect of this observed bias in WGBS data on percent methylation calculations. As we show here, bias depends on polymerase choice and PCR cycle number. We expect that other factors can also affect bias, not only the methylation level, which itself can be affected by biological and technical variation, but also factors like raw sequence read length, sequencing errors, efficiency of sodium bisulfite conversion, sequence structure itself, etc. In order to develop computational tools that correct for bias, additional experiments are needed to define how these factors contribute to the observed bias in WGBS data. We note that the bias introduced using current protocols is likely to influence DMR analysis. In particular, loci that are heterozygous for methylation state could be underrepresented by DMR finding algorithms.</p>
<p>The sodium bisulfite conversion and PCR amplification steps are impossible to remove from WGBS library preparation and these are the steps where a bias is introduced. In fact, efforts to correct for amplification bias in targeted bisulfite sequencing of genomic regions have been successfully performed by using a calibration method that requires a parallel analysis of input samples with known methylation levels (<xref ref-type="bibr" rid="B33">Moskalev et al., 2011</xref>). Additionally, RainDance Technology has also proved useful for correction of amplification bias in targeted bisulfite PCR (<xref ref-type="bibr" rid="B26">Komori et al., 2011</xref>), but it is unknown if this correction will be observed on WGBS libraries. Regardless, both of these methods are not feasible alternatives for routine correction of the observed amplification bias in WGBS, as they are technically cumbersome compared to current methods and they are cost prohibitive. This bias will be most pronounced in the study of allele-specific methylation. The results of this study make it clear that PCR enzyme and the PCR amplification cycle used can influence determined methylation levels, which undoubtedly could affect results of previously published studies that were not aware of these biases. Fortunately, most experiments that use this approach can be corrected for this bias by using SNP data in the bisulfite-sequencing reads. Caution should be taken when calculating methylation levels at loci with heterozygous methylation states, especially when experiments study regions that do not have SNPs or do not take into consideration SNPs linked to bisulfite-sequencing reads. Caution should also be taken when using platforms that do not detect SNPs. For example, many microarray platforms have been designed to detect DNA methylation states and although PCR is not generally used to amplify bisulfite-converted DNA, the use of Whole Genome Amplification (WGA) methods will still introduce a bias based on the GC content of DNA sequences (<xref ref-type="bibr" rid="B8">Bredel et al., 2005</xref>; <xref ref-type="bibr" rid="B2">Arriola et al., 2007</xref>). Because WGBS is primarily used on species with published reference genomes, it may be possible to predict this amplification bias based on GC content and the range of potential methylation levels at these regions. Therefore, future efforts to computationally model and predict expected read coverage for regions will be advantageous for studies where SNP information is not available or present at specific regions of interest.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>We are grateful to Nathan Springer for original discussions related to this study and Chad Niederhuth for critical reading of this manuscript. This study was supported by start-up funds from the University of Georgia and by research grants to Robert J. Schmitz from the National Science Foundation (IOS-1339194) and the National Institutes of Health (R00GM100000) and from the March of Dimes Foundation (#5-FY14-89) to Zachary A. Lewis. Xiaoxiao Sun and Ping Ma were partially supported by research grants from the National Science Foundation (DMS-1222718, DMS-1438957).</p>
</ack>
<sec sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fgene.2014.00341/abstract">http://www.frontiersin.org/journal/10.3389/fgene.2014.00341/abstract</ext-link></p>
<supplementary-material xlink:href="Table_1.XLSX" id="SM1" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>REFERENCES</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aird</surname> <given-names>D.</given-names></name> <name><surname>Ross</surname> <given-names>M. G.</given-names></name> <name><surname>Chen</surname> <given-names>W. S.</given-names></name> <name><surname>Danielsson</surname> <given-names>M.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Russ</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries.</article-title> <source><italic>Genome Biol.</italic></source> <volume>12</volume> R18. <pub-id pub-id-type="doi">10.1186/gb-2011-12-2-r18</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arriola</surname> <given-names>E.</given-names></name> <name><surname>Lambros</surname> <given-names>M. B.</given-names></name> <name><surname>Jones</surname> <given-names>C.</given-names></name> <name><surname>Dexter</surname> <given-names>T.</given-names></name> <name><surname>Mackay</surname> <given-names>A.</given-names></name> <name><surname>Tan</surname> <given-names>D. S.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Evaluation of Phi29-based whole-genome amplification for microarray-based comparative genomic hybridisation.</article-title> <source><italic>Lab. Invest.</italic></source> <volume>87</volume> <fpage>75</fpage>&#x02013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1038/labinvest.3700495</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Becker</surname> <given-names>C.</given-names></name> <name><surname>Hagmann</surname> <given-names>J.</given-names></name> <name><surname>Muller</surname> <given-names>J.</given-names></name> <name><surname>Koenig</surname> <given-names>D.</given-names></name> <name><surname>Stegle</surname> <given-names>O.</given-names></name> <name><surname>Borgwardt</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Spontaneous epigenetic variation in the <italic>Arabidopsis thaliana</italic> methylome.</article-title> <source><italic>Nature</italic></source> <volume>480</volume> <fpage>245</fpage>&#x02013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1038/nature10555</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benjamini</surname> <given-names>Y.</given-names></name> <name><surname>Speed</surname> <given-names>T. P.</given-names></name></person-group> (<year>2012</year>). <article-title>Summarizing and correcting the GC content bias in high-throughput sequencing.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <volume>e72.</volume> <pub-id pub-id-type="doi">10.1093/nar/gks001</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernstein</surname> <given-names>B. E.</given-names></name> <name><surname>Stamatoyannopoulos</surname> <given-names>J. A.</given-names></name> <name><surname>Costello</surname> <given-names>J. F.</given-names></name> <name><surname>Ren</surname> <given-names>B.</given-names></name> <name><surname>Milosavljevic</surname> <given-names>A.</given-names></name> <name><surname>Meissner</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The NIH roadmap epigenomics mapping consortium.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>28</volume> <fpage>1045</fpage>&#x02013;<lpage>1048</lpage>. <pub-id pub-id-type="doi">10.1038/nbt1010-1045</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Booth</surname> <given-names>M. J.</given-names></name> <name><surname>Branco</surname> <given-names>M. R.</given-names></name> <name><surname>Ficz</surname> <given-names>G.</given-names></name> <name><surname>Oxley</surname> <given-names>D.</given-names></name> <name><surname>Krueger</surname> <given-names>F.</given-names></name> <name><surname>Reik</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Quantitative sequencing of 5-methylcytosine and 5-hydroxymethylcytosine at single-base resolution.</article-title> <source><italic>Science</italic></source> <volume>336</volume> <fpage>934</fpage>&#x02013;<lpage>937</lpage>. <pub-id pub-id-type="doi">10.1126/science.1220671</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Booth</surname> <given-names>M. J.</given-names></name> <name><surname>Marsico</surname> <given-names>G.</given-names></name> <name><surname>Bachman</surname> <given-names>M.</given-names></name> <name><surname>Beraldi</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Quantitative sequencing of 5-formylcytosine in DNA at single-base resolution.</article-title> <source><italic>Nat. Chem.</italic></source> <volume>6</volume> <fpage>435</fpage>&#x02013;<lpage>440</lpage>. <pub-id pub-id-type="doi">10.1038/NCHEM.1893</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bredel</surname> <given-names>M.</given-names></name> <name><surname>Bredel</surname> <given-names>C.</given-names></name> <name><surname>Juric</surname> <given-names>D.</given-names></name> <name><surname>Kim</surname> <given-names>Y.</given-names></name> <name><surname>Vogel</surname> <given-names>H.</given-names></name> <name><surname>Harsh</surname> <given-names>G. R.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>Amplification of whole tumor genomes and gene-by-gene mapping of genomic aberrations from limited sources of fresh-frozen and paraffin-embedded DNA.</article-title> <source><italic>J. Mol. Diagn.</italic></source> <volume>7</volume> <fpage>171</fpage>&#x02013;<lpage>182</lpage>. <pub-id pub-id-type="doi">10.1016/S1525-1578(10)60543-0</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calarco</surname> <given-names>J. P.</given-names></name> <name><surname>Borges</surname> <given-names>F.</given-names></name> <name><surname>Donoghue</surname> <given-names>M. T.</given-names></name> <name><surname>Van Ex</surname> <given-names>F.</given-names></name> <name><surname>Jullien</surname> <given-names>P. E.</given-names></name> <name><surname>Lopes</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Reprogramming of DNA methylation in pollen guides epigenetic inheritance via small RNA.</article-title> <source><italic>Cell</italic></source> <volume>151</volume> <fpage>194</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2012.09.001</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>S. J.</given-names></name> <name><surname>Harrison</surname> <given-names>J.</given-names></name> <name><surname>Paul</surname> <given-names>C. L.</given-names></name> <name><surname>Frommer</surname> <given-names>M.</given-names></name></person-group> (<year>1994</year>). <article-title>High sensitivity mapping of methylated cytosines.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>22</volume> <fpage>2990</fpage>&#x02013;<lpage>2997</lpage>. <pub-id pub-id-type="doi">10.1093/nar/22.15.2990</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>S. J.</given-names></name> <name><surname>Statham</surname> <given-names>A.</given-names></name> <name><surname>Stirzaker</surname> <given-names>C.</given-names></name> <name><surname>Molloy</surname> <given-names>P. L.</given-names></name> <name><surname>Frommer</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). <article-title>DNA methylation: bisulphite modification and analysis.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>1</volume> <fpage>2353</fpage>&#x02013;<lpage>2364</lpage>.</citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cokus</surname> <given-names>S. J.</given-names></name> <name><surname>Feng</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>Z.</given-names></name> <name><surname>Merriman</surname> <given-names>B.</given-names></name> <name><surname>Haudenschild</surname> <given-names>C. D.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Shotgun bisulphite sequencing of the <italic>Arabidopsis</italic> genome reveals DNA methylation patterning.</article-title> <source><italic>Nature</italic></source> <volume>452</volume> <fpage>215</fpage>&#x02013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.1038/nature06745</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cook</surname> <given-names>R. D.</given-names></name> <name><surname>Weisberg</surname> <given-names>S.</given-names></name></person-group> (<year>1982</year>). <article-title><italic>Residuals and Influence in Regression</italic>.</article-title> <publisher-loc>New York</publisher-loc>: <publisher-name>Chapman and Hall.</publisher-name></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Creasey</surname> <given-names>K. M.</given-names></name> <name><surname>Zhai</surname> <given-names>J.</given-names></name> <name><surname>Borges</surname> <given-names>F.</given-names></name> <name><surname>Van Ex</surname> <given-names>F.</given-names></name> <name><surname>Regulski</surname> <given-names>M.</given-names></name> <name><surname>Meyers</surname> <given-names>B. C.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>miRNAs trigger widespread epigenetically activated siRNAs from transposons in <italic>Arabidopsis</italic>.</article-title> <source><italic>Nature</italic></source> <volume>508</volume> <fpage>411</fpage>&#x02013;<lpage>415</lpage>. <pub-id pub-id-type="doi">10.1038/nature13069</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frommer</surname> <given-names>M.</given-names></name> <name><surname>Mcdonald</surname> <given-names>L. E.</given-names></name> <name><surname>Millar</surname> <given-names>D. S.</given-names></name> <name><surname>Collis</surname> <given-names>C. M.</given-names></name> <name><surname>Watt</surname> <given-names>F.</given-names></name> <name><surname>Grigg</surname> <given-names>G. W.</given-names></name><etal/></person-group> (<year>1992</year>). <article-title>A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>89</volume> <fpage>1827</fpage>&#x02013;<lpage>1831</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.89.5.1827</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gehring</surname> <given-names>M.</given-names></name> <name><surname>Bubb</surname> <given-names>K. L.</given-names></name> <name><surname>Henikoff</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Extensive demethylation of repetitive elements during seed development underlies gene imprinting.</article-title> <source><italic>Science</italic></source> <volume>324</volume> <fpage>1447</fpage>&#x02013;<lpage>1451</lpage>.<pub-id pub-id-type="doi">10.1126/science.1171609</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gent</surname> <given-names>J. I.</given-names></name> <name><surname>Ellis</surname> <given-names>N. A.</given-names></name> <name><surname>Guo</surname> <given-names>L.</given-names></name> <name><surname>Harkess</surname> <given-names>A. E.</given-names></name> <name><surname>Yao</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>CHH islands: de novo DNA methylation in near-gene chromatin regulation in maize.</article-title> <source><italic>Genome Res.</italic></source> <volume>23</volume> <fpage>628</fpage>&#x02013;<lpage>637</lpage>. <pub-id pub-id-type="doi">10.1101/gr.146985.112</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>H.</given-names></name> <name><surname>Smith</surname> <given-names>Z. D.</given-names></name> <name><surname>Bock</surname> <given-names>C.</given-names></name> <name><surname>Boyle</surname> <given-names>P.</given-names></name> <name><surname>Gnirke</surname> <given-names>A.</given-names></name> <name><surname>Meissner</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>6</volume> <fpage>468</fpage>&#x02013;<lpage>481</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2010.190</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hayatsu</surname> <given-names>H.</given-names></name></person-group> (<year>1976</year>). <article-title>Bisulfite modification of nucleic acids and their constituents.</article-title> <source><italic>Prog. Nucleic Acid Res. Mol. Biol.</italic></source> <volume>16</volume> <fpage>75</fpage>&#x02013;<lpage>124</lpage>.</citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hayatsu</surname> <given-names>H.</given-names></name> <name><surname>Wataya</surname> <given-names>Y.</given-names></name> <name><surname>Kai</surname> <given-names>K.</given-names></name> <name><surname>Iida</surname> <given-names>S.</given-names></name></person-group> (<year>1970</year>). <article-title>Reaction of sodium bisulfite with uracil, cytosine, and their derivatives.</article-title> <source><italic>Biochemistry</italic></source> <volume>9</volume> <fpage>2858</fpage>&#x02013;<lpage>2865</lpage>. <pub-id pub-id-type="doi">10.1021/bi00816a016</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horvath</surname> <given-names>A.</given-names></name> <name><surname>Vertessy</surname> <given-names>B. G.</given-names></name></person-group> (<year>2010</year>). <article-title>A one-step method for quantitative determination of uracil in DNA by real-time PCR.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>38</volume> <issue>e196</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkq815</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>T. F.</given-names></name> <name><surname>Ibarra</surname> <given-names>C. A.</given-names></name> <name><surname>Silva</surname> <given-names>P.</given-names></name> <name><surname>Zemach</surname> <given-names>A.</given-names></name> <name><surname>Eshed-Williams</surname> <given-names>L.</given-names></name> <name><surname>Fischer</surname> <given-names>R. L.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Genome-wide demethylation of <italic>Arabidopsis</italic> endosperm.</article-title> <source><italic>Science</italic></source> <volume>324</volume> <fpage>1451</fpage>&#x02013;<lpage>1454</lpage>. <pub-id pub-id-type="doi">10.1126/science.1172417</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ibarra</surname> <given-names>C. A.</given-names></name> <name><surname>Feng</surname> <given-names>X.</given-names></name> <name><surname>Schoft</surname> <given-names>V. K.</given-names></name> <name><surname>Hsieh</surname> <given-names>T. F.</given-names></name> <name><surname>Uzawa</surname> <given-names>R.</given-names></name> <name><surname>Rodrigues</surname> <given-names>J. A.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Active DNA demethylation in plant companion cells reinforces transposon methylation in gametes.</article-title> <source><italic>Science</italic></source> <volume>337</volume> <fpage>1360</fpage>&#x02013;<lpage>1364</lpage>. <pub-id pub-id-type="doi">10.1126/science.1224839</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>J. J.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Sperm, but not oocyte, DNA methylome is inherited by zebrafish early embryos.</article-title> <source><italic>Cell</italic></source> <volume>153</volume> <fpage>773</fpage>&#x02013;<lpage>784</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2013.04.041</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kobayashi</surname> <given-names>H.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name> <name><surname>Miura</surname> <given-names>F.</given-names></name> <name><surname>Imai</surname> <given-names>M.</given-names></name> <name><surname>Mochiduki</surname> <given-names>K.</given-names></name> <name><surname>Yanagisawa</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>High-resolution DNA methylome analysis of primordial germ cells identifies gender-specific reprogramming in mice.</article-title> <source><italic>Genome Res.</italic></source> <volume>23</volume> <fpage>616</fpage>&#x02013;<lpage>627</lpage>. <pub-id pub-id-type="doi">10.1101/gr.148023.112</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Komori</surname> <given-names>H. K.</given-names></name> <name><surname>Lamere</surname> <given-names>S. A.</given-names></name> <name><surname>Torkamani</surname> <given-names>A.</given-names></name> <name><surname>Hart</surname> <given-names>G. T.</given-names></name> <name><surname>Kotsopoulos</surname> <given-names>S.</given-names></name> <name><surname>Warner</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Application of microdroplet PCR for large-scale targeted bisulfite sequencing.</article-title> <source><italic>Genome Res.</italic></source> <volume>21</volume> <fpage>1738</fpage>&#x02013;<lpage>1745</lpage>. <pub-id pub-id-type="doi">10.1101/gr.116863.110 </pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kozarewa</surname> <given-names>I.</given-names></name> <name><surname>Ning</surname> <given-names>Z.</given-names></name> <name><surname>Quail</surname> <given-names>M. A.</given-names></name> <name><surname>Sanders</surname> <given-names>M. J.</given-names></name> <name><surname>Berriman</surname> <given-names>M.</given-names></name> <name><surname>Turner</surname> <given-names>D. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes.</article-title> <source><italic>Nat. Methods</italic></source> <volume>6</volume> <fpage>291</fpage>&#x02013;<lpage>295</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1311</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>O&#x02019;malley</surname> <given-names>R. C.</given-names></name> <name><surname>Tonti-Filippini</surname> <given-names>J.</given-names></name> <name><surname>Gregory</surname> <given-names>B. D.</given-names></name> <name><surname>Berry</surname> <given-names>C. C.</given-names></name> <name><surname>Millar</surname> <given-names>A. H.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Highly integrated single-base resolution maps of the epigenome in <italic>Arabidopsis</italic>.</article-title> <source><italic>Cell</italic></source> <volume>133</volume> <fpage>523</fpage>&#x02013;<lpage>536</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2008.03.029.</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>Pelizzola</surname> <given-names>M.</given-names></name> <name><surname>Dowen</surname> <given-names>R. H.</given-names></name> <name><surname>Hawkins</surname> <given-names>R. D.</given-names></name> <name><surname>Hon</surname> <given-names>G.</given-names></name> <name><surname>Tonti-Filippini</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Human DNA methylomes at base resolution show widespread epigenomic differences.</article-title> <source><italic>Nature</italic></source> <volume>462</volume> <fpage>315</fpage>&#x02013;<lpage>322</lpage>. <pub-id pub-id-type="doi">10.1038/nature08514</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>Pelizzola</surname> <given-names>M.</given-names></name> <name><surname>Kida</surname> <given-names>Y. S.</given-names></name> <name><surname>Hawkins</surname> <given-names>R. D.</given-names></name> <name><surname>Nery</surname> <given-names>J. R.</given-names></name> <name><surname>Hon</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells.</article-title> <source><italic>Nature</italic></source> <volume>471</volume> <fpage>68</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1038/nature09798</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCluskey</surname> <given-names>K.</given-names></name> <name><surname>Wiest</surname> <given-names>A.</given-names></name> <name><surname>Plamann</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>The Fungal Genetics Stock Center: a repository for 50 years of fungal genetics research.</article-title> <source><italic>J. Biosci.</italic></source> <volume>35</volume> <fpage>119</fpage>&#x02013;<lpage>126</lpage>.</citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meissner</surname> <given-names>A.</given-names></name> <name><surname>Gnirke</surname> <given-names>A.</given-names></name> <name><surname>Bell</surname> <given-names>G. W.</given-names></name> <name><surname>Ramsahoye</surname> <given-names>B.</given-names></name> <name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Jaenisch</surname> <given-names>R.</given-names></name></person-group> (<year>2005</year>). <article-title>Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>33</volume> <fpage>5868</fpage>&#x02013;<lpage>5877</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki901</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moskalev</surname> <given-names>E. A.</given-names></name> <name><surname>Zavgorodnij</surname> <given-names>M. G.</given-names></name> <name><surname>Majorova</surname> <given-names>S. P.</given-names></name> <name><surname>Vorobjev</surname> <given-names>I. A.</given-names></name> <name><surname>Jandaghi</surname> <given-names>P.</given-names></name> <name><surname>Bure</surname> <given-names>I. V.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Correction of PCR-bias in quantitative DNA methylation studies by means of cubic polynomial regression.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>39</volume> <issue>e77</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkr213</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Orozco</surname> <given-names>L. D.</given-names></name> <name><surname>Rubbi</surname> <given-names>L.</given-names></name> <name><surname>Martin</surname> <given-names>L. J.</given-names></name> <name><surname>Fang</surname> <given-names>F.</given-names></name> <name><surname>Hormozdiari</surname> <given-names>F.</given-names></name> <name><surname>Che</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Intergenerational genomic DNA methylation patterns in mouse hybrid strains.</article-title> <source><italic>Genome Biol.</italic></source> <volume>15</volume> R68. <pub-id pub-id-type="doi">10.1186/gb-2014-15-5-r68</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ossowski</surname> <given-names>S.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Lucas-Lledo</surname> <given-names>J. I.</given-names></name> <name><surname>Warthmann</surname> <given-names>N.</given-names></name> <name><surname>Clark</surname> <given-names>R. M.</given-names></name> <name><surname>Shaw</surname> <given-names>R. G.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The rate and molecular spectrum of spontaneous mutations in <italic>Arabidopsis thaliana</italic>.</article-title> <source><italic>Science</italic></source> <volume>327</volume> <fpage>92</fpage>&#x02013;<lpage>94</lpage>. <pub-id pub-id-type="doi">10.1126/science.1180677</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oyola</surname> <given-names>S. O.</given-names></name> <name><surname>Otto</surname> <given-names>T. D.</given-names></name> <name><surname>Gu</surname> <given-names>Y.</given-names></name> <name><surname>Maslen</surname> <given-names>G.</given-names></name> <name><surname>Manske</surname> <given-names>M.</given-names></name> <name><surname>Campino</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.</article-title> <source><italic>BMC Genomics</italic></source> <volume>13</volume>:<issue>1</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-13&#x02013;11</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sasaki</surname> <given-names>T.</given-names></name> <name><surname>Lynch</surname> <given-names>K. L.</given-names></name> <name><surname>Mueller</surname> <given-names>C. V.</given-names></name> <name><surname>Friedman</surname> <given-names>S.</given-names></name> <name><surname>Freitag</surname> <given-names>M.</given-names></name> <name><surname>Lewis</surname> <given-names>Z. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Heterochromatin controls gammaH2A localization in <italic>Neurospora crassa</italic>.</article-title> <source><italic>Eukaryot. Cell 1</italic></source> <volume>3</volume> <fpage>990</fpage>&#x02013;<lpage>1000</lpage>. <pub-id pub-id-type="doi">10.1128/EC.00117&#x02013;114</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>He</surname> <given-names>Y.</given-names></name> <name><surname>Valdes-Lopez</surname> <given-names>O.</given-names></name> <name><surname>Khan</surname> <given-names>S. M.</given-names></name> <name><surname>Joshi</surname> <given-names>T.</given-names></name> <name><surname>Urich</surname> <given-names>M. A.</given-names></name><etal/></person-group> (<year>2013a</year>). <article-title>Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population.</article-title> <source><italic>Genome Res.</italic></source> <volume>23</volume> <fpage>1663</fpage>&#x02013;<lpage>1674</lpage>. <pub-id pub-id-type="doi">10.1101/gr.152538.112</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>Schultz</surname> <given-names>M. D.</given-names></name> <name><surname>Urich</surname> <given-names>M. A.</given-names></name> <name><surname>Nery</surname> <given-names>J. R.</given-names></name> <name><surname>Pelizzola</surname> <given-names>M.</given-names></name> <name><surname>Libiger</surname> <given-names>O.</given-names></name><etal/></person-group> (<year>2013b</year>). <article-title>Patterns of population epigenomic diversity.</article-title> <source><italic>Nature</italic></source> <volume>495</volume> <fpage>193</fpage>&#x02013;<lpage>198</lpage>. <pub-id pub-id-type="doi">10.1038/nature11968</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>Schultz</surname> <given-names>M. D.</given-names></name> <name><surname>Lewsey</surname> <given-names>M. G.</given-names></name> <name><surname>O&#x02019;malley</surname> <given-names>R. C.</given-names></name> <name><surname>Urich</surname> <given-names>M. A.</given-names></name> <name><surname>Libiger</surname> <given-names>O.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Transgenerational epigenetic instability is a source of novel methylation variants.</article-title> <source><italic>Science</italic></source> <volume>334</volume> <fpage>369</fpage>&#x02013;<lpage>373</lpage>. <pub-id pub-id-type="doi">10.1126/science.1212959</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname> <given-names>M. D.</given-names></name> <name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>Ecker</surname> <given-names>J. R.</given-names></name></person-group> (<year>2012</year>). <article-title>&#x0201C;Leveling&#x0201D; the playing field for analyses of single-base resolution DNA methylomes.</article-title> <source><italic>Trends Genet.</italic></source> <volume>28</volume> <fpage>583</fpage>&#x02013;<lpage>585</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2012.10.012</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seisenberger</surname> <given-names>S.</given-names></name> <name><surname>Andrews</surname> <given-names>S.</given-names></name> <name><surname>Krueger</surname> <given-names>F.</given-names></name> <name><surname>Arand</surname> <given-names>J.</given-names></name> <name><surname>Walter</surname> <given-names>J.</given-names></name> <name><surname>Santos</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells.</article-title> <source><italic>Mol. Cell</italic></source> <volume>48</volume> <fpage>849</fpage>&#x02013;<lpage>862</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2012.11.001</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname> <given-names>R.</given-names></name> <name><surname>Braverman</surname> <given-names>B.</given-names></name> <name><surname>Louis</surname> <given-names>J. B.</given-names></name> <name><surname>Servis</surname> <given-names>R. E.</given-names></name></person-group> (<year>1973</year>). <article-title>Nucleic acid reactivity and conformation. II. Reaction of cytosine and uracil with sodium bisulfite.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>248</volume> <fpage>4060</fpage>&#x02013;<lpage>4064</lpage>.</citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname> <given-names>R.</given-names></name> <name><surname>Difate</surname> <given-names>V.</given-names></name> <name><surname>Welcher</surname> <given-names>M.</given-names></name></person-group> (<year>1974</year>). <article-title>Deamination of cytosine derivatives by bisulfite. Mechanism of the reaction.</article-title> <source><italic>J. Am. Chem. Soc.</italic></source>. <volume>96</volume> <fpage>906</fpage>&#x02013;<lpage>912</lpage>.</citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaw</surname> <given-names>R. G.</given-names></name> <name><surname>Byers</surname> <given-names>D. L.</given-names></name> <name><surname>Darmo</surname> <given-names>E.</given-names></name></person-group> (<year>2000</year>). <article-title>Spontaneous mutational effects on reproductive traits of <italic>Arabidopsis thaliana</italic>.</article-title> <source><italic>Genetics</italic></source> <volume>155</volume> <fpage>369</fpage>&#x02013;<lpage>378</lpage>.</citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shirane</surname> <given-names>K.</given-names></name> <name><surname>Toh</surname> <given-names>H.</given-names></name> <name><surname>Kobayashi</surname> <given-names>H.</given-names></name> <name><surname>Miura</surname> <given-names>F.</given-names></name> <name><surname>Chiba</surname> <given-names>H.</given-names></name> <name><surname>Ito</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Mouse oocyte methylomes at base resolution reveal genome-wide accumulation of non-CpG methylation and role of DNA methyltransferases.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>9</volume>:<issue>e1003439</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003439</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slotkin</surname> <given-names>R. K.</given-names></name> <name><surname>Vaughn</surname> <given-names>M.</given-names></name> <name><surname>Borges</surname> <given-names>F.</given-names></name> <name><surname>Tanurdzic</surname> <given-names>M.</given-names></name> <name><surname>Becker</surname> <given-names>J. D.</given-names></name> <name><surname>Feijo</surname> <given-names>J. A.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Epigenetic reprogramming and small RNA silencing of transposable elements in pollen.</article-title> <source><italic>Cell</italic></source> <volume>136</volume> <fpage>461</fpage>&#x02013;<lpage>472</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2008.12.038.</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stadler</surname> <given-names>M. B.</given-names></name> <name><surname>Murr</surname> <given-names>R.</given-names></name> <name><surname>Burger</surname> <given-names>L.</given-names></name> <name><surname>Ivanek</surname> <given-names>R.</given-names></name> <name><surname>Lienert</surname> <given-names>F.</given-names></name> <name><surname>Scholer</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>DNA-binding factors shape the mouse methylome at distal regulatory regions.</article-title> <source><italic>Nature</italic></source> <volume>480</volume> <fpage>490</fpage>&#x02013;<lpage>495</lpage>. <pub-id pub-id-type="doi">10.1038/nature10716</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trapnell</surname> <given-names>C.</given-names></name> <name><surname>Hendrickson</surname> <given-names>D. G.</given-names></name> <name><surname>Sauvageau</surname> <given-names>M.</given-names></name> <name><surname>Goff</surname> <given-names>L.</given-names></name> <name><surname>Rinn</surname> <given-names>J. L.</given-names></name> <name><surname>Pachter</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>Differential analysis of gene regulation at transcript resolution with RNA-seq.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>31</volume> <fpage>46</fpage>&#x02013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2450</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Urich</surname> <given-names>M. A.</given-names></name> <name><surname>Nery</surname> <given-names>J. R.</given-names></name> <name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>Schmitz</surname> <given-names>R. J.</given-names></name> <name><surname>Ecker</surname> <given-names>J. R.</given-names></name></person-group> (<year>2014</year>). <article-title>MethylC-seq: base resolution whole genome bisulfite sequencing library preparation.</article-title> <source><italic>Nat. Protoc.</italic></source> <comment>(in press)</comment>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Warnecke</surname> <given-names>P. M.</given-names></name> <name><surname>Stirzaker</surname> <given-names>C.</given-names></name> <name><surname>Melki</surname> <given-names>J. R.</given-names></name> <name><surname>Millar</surname> <given-names>D. S.</given-names></name> <name><surname>Paul</surname> <given-names>C. L.</given-names></name> <name><surname>Clark</surname> <given-names>S. J.</given-names></name></person-group> (<year>1997</year>). <article-title>Detection and measurement of PCR bias in quantitative methylation analysis of bisulphite-treated DNA.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>25</volume> <fpage>4422</fpage>&#x02013;<lpage>4426</lpage>.</citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname> <given-names>W.</given-names></name> <name><surname>Barr</surname> <given-names>C. L.</given-names></name> <name><surname>Kim</surname> <given-names>A.</given-names></name> <name><surname>Yue</surname> <given-names>F.</given-names></name> <name><surname>Lee</surname> <given-names>A. Y.</given-names></name> <name><surname>Eubanks</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Base-resolution analyses of sequence and parent-of-origin dependent DNA methylation in the mouse genome.</article-title> <source><italic>Cell</italic></source> <volume>148</volume> <fpage>816</fpage>&#x02013;<lpage>831</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2011.12.035</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>M.</given-names></name> <name><surname>Hon</surname> <given-names>G. C.</given-names></name> <name><surname>Szulwach</surname> <given-names>K. E.</given-names></name> <name><surname>Song</surname> <given-names>C. X.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Kim</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Base-resolution analysis of 5-hydroxymethylcytosine in the mammalian genome.</article-title> <source><italic>Cell</italic></source> <volume>149</volume> <fpage>1368</fpage>&#x02013;<lpage>1380</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2012.04.027</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zemach</surname> <given-names>A.</given-names></name> <name><surname>Kim</surname> <given-names>M. Y.</given-names></name> <name><surname>Silva</surname> <given-names>P.</given-names></name> <name><surname>Rodrigues</surname> <given-names>J. A.</given-names></name> <name><surname>Dotson</surname> <given-names>B.</given-names></name> <name><surname>Brooks</surname> <given-names>M. D.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Local DNA hypomethylation activates genes in rice endosperm.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>107</volume> <fpage>18729</fpage>&#x02013;<lpage>18734</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1009695107</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/geo/">http://www.ncbi.nlm.nih.gov/geo/</ext-link></p></fn>
<fn id="fn02">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/sra/SRX096372">http://www.ncbi.nlm.nih.gov/sra/SRX096372</ext-link></p></fn>
</fn-group>
</back>
</article>