Learning Protein-DNA Interaction Landscapes by Integrating Experimental Data through Computational Models
نویسندگان
چکیده
MOTIVATION Transcriptional regulation is directly enacted by the interactions between DNA and many proteins, including transcription factors (TFs), nucleosomes and polymerases. A critical step in deciphering transcriptional regulation is to infer, and eventually predict, the precise locations of these interactions, along with their strength and frequency. While recent datasets yield great insight into these interactions, individual data sources often provide only partial information regarding one aspect of the complete interaction landscape. For example, chromatin immunoprecipitation (ChIP) reveals the binding positions of a protein, but only for one protein at a time. In contrast, nucleases like MNase and DNase can be used to reveal binding positions for many different proteins at once, but cannot easily determine the identities of those proteins. Currently, few statistical frameworks jointly model these different data sources to reveal an accurate, holistic view of the in vivo protein-DNA interaction landscape. RESULTS Here, we develop a novel statistical framework that integrates different sources of experimental information within a thermodynamic model of competitive binding to jointly learn a holistic view of the in vivo protein-DNA interaction landscape. We show that our framework learns an interaction landscape with increased accuracy, explaining multiple sets of data in accordance with thermodynamic principles of competitive DNA binding. The resulting model of genomic occupancy provides a precise mechanistic vantage point from which to explore the role of protein-DNA interactions in transcriptional regulation. AVAILABILITY AND IMPLEMENTATION The C source code for compete and Python source code for MCMC-based inference are available at http://www.cs.duke.edu/∼amink. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
Integrating Interactive Whiteboards in EFL Learners' Learning and Retention of Non-congruent Collocations
Drawing on the assumptions of socio-cognitive linguistics, focusing on the effective role of interaction in terms of reducing the cognitive burden in the process of learning, this quasi-experimental study aimed at investigating the effect of the Interactive Whiteboard (IWB) usage on the learning and retention of non-congruent collocations among 60 homogenized Iranian EFL learners, aged 18 to 24...
متن کاملIntegrating Interactive Whiteboards in EFL Learners' Learning and Retention of Non-congruent Collocations
Drawing on the assumptions of socio-cognitive linguistics, focusing on the effective role of interaction in terms of reducing the cognitive burden in the process of learning, this quasi-experimental study aimed at investigating the effect of the Interactive Whiteboard (IWB) usage on the learning and retention of non-congruent collocations among 60 homogenized Iranian EFL learners, aged 18 to 24...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملComputational reconstruction of protein-protein interaction networks: algorithms and issues.
Accurate mapping of protein-protein interaction networks in model organisms is a crucial first step toward subsequent quantitative study of the organization and evolution of biological systems. Data quality of experimental interactome maps can be assessed and improved by integrating multiple sources of evidence using machine learning methods. Here we describe the commonly used algorithms for pr...
متن کاملFrom Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions
DNA-protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA-protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA-protein interaction modes without knowing its specific DNA ta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 30 20 شماره
صفحات -
تاریخ انتشار 2014