Abstract
DNA methylation is an epigenetic modification that has the ability to alter gene expression without any change in the DNA sequence. DNA methylation occurs when a methyl chemical group attaches to cytosine bases on the DNA sequence. In mammals, DNA methylation primarily occurs at CG sites, when a cytosine is followed by a guanine in the DNA sequence. In plants, DNA methylation can also occur in other cytosine sequences, such as when a cytosine is not followed directly by a guanine. Many of the statistical methods that have been developed to estimate methylation levels and test differential methylation in whole-genome bisulfite sequencing studies incorporate the observed correlation between methylation levels of neighboring cytosine sites. However, most of these methods have been applied to human studies, where only CG sites are investigated. In this study, we focus on plant studies and show that the correlation between methylation levels at neighboring sites depends on the DNA sequence immediately following the cytosine. We investigate the importance of accounting for these differences in the correlation structure by comparing the performance of three existing methods (MethylSig, MAGI, and M3D) in plants.
Keywords
Methylation, Next-Generation Sequencing, Cytosine Context
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.
Recommended Citation
Baumann, Douglas; Su, Yuqing; Mendis, Iranga; and Olbricht, Gayla R.
(2015).
"DIFFERENTIAL METHYLATION METHODS IN MULTI-CONTEXT ORGANISMS,"
Conference on Applied Statistics in Agriculture.
https://doi.org/10.4148/2475-7772.1089
DIFFERENTIAL METHYLATION METHODS IN MULTI-CONTEXT ORGANISMS
DNA methylation is an epigenetic modification that has the ability to alter gene expression without any change in the DNA sequence. DNA methylation occurs when a methyl chemical group attaches to cytosine bases on the DNA sequence. In mammals, DNA methylation primarily occurs at CG sites, when a cytosine is followed by a guanine in the DNA sequence. In plants, DNA methylation can also occur in other cytosine sequences, such as when a cytosine is not followed directly by a guanine. Many of the statistical methods that have been developed to estimate methylation levels and test differential methylation in whole-genome bisulfite sequencing studies incorporate the observed correlation between methylation levels of neighboring cytosine sites. However, most of these methods have been applied to human studies, where only CG sites are investigated. In this study, we focus on plant studies and show that the correlation between methylation levels at neighboring sites depends on the DNA sequence immediately following the cytosine. We investigate the importance of accounting for these differences in the correlation structure by comparing the performance of three existing methods (MethylSig, MAGI, and M3D) in plants.