A wavelet-based method to exploit epigenomic language in the regulatory region

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

  • Nha Nguyen
  • An Vo
  • Kyoung-Jae Won

MOTIVATION: Epigenetic landscapes in the regulatory regions reflect binding condition of transcription factors and their co-factors. Identifying epigenetic condition and its variation is important in understanding condition-specific gene regulation. Computational approaches to explore complex multi-dimensional landscapes are needed.

RESULTS: To study epigenomic condition for gene regulation, we developed a method, AWNFR, to classify epigenomic landscapes based on the detected epigenomic landscapes. Assuming mixture of Gaussians for a nucleosome, the proposed method captures the shape of histone modification and identifies potential regulatory regions in the wavelet domain. For accuracy estimation as well as enhanced computational speed, we developed a novel algorithm based on down-sampling operation and footprint in wavelet. We showed the algorithmic advantages of AWNFR using the simulated data. AWNFR identified regulatory regions more effectively and accurately than the previous approaches with the epigenome data in mouse embryonic stem cells and human lung fibroblast cells (IMR90). Based on the detected epigenomic landscapes, AWNFR classified epigenomic status and studied epigenomic codes. We studied co-occurring histone marks and showed that AWNFR captures the epigenomic variation across time.

AVAILABILITY AND IMPLEMENTATION: The source code and supplemental document of AWNFR are available at http://wonk.med.upenn.edu/AWNFR.

OriginalsprogEngelsk
TidsskriftBioinformatics (Online)
Vol/bind30
Udgave nummer7
Sider (fra-til)908-14
Antal sider7
ISSN1367-4811
DOI
StatusUdgivet - 1 apr. 2014
Eksternt udgivetJa

ID: 199332127