Seeder: discriminative seeding DNA motif discovery
نویسندگان
چکیده
MOTIVATION The computational identification of transcription factor binding sites is a major challenge in bioinformatics and an important complement to experimental approaches. RESULTS We describe a novel, exact discriminative seeding DNA motif discovery algorithm designed for fast and reliable prediction of cis-regulatory elements in eukaryotic promoters. The algorithm is tested on biological benchmark data and shown to perform equally or better than other motif discovery tools. The algorithm is applied to the analysis of plant tissue-specific promoter sequences and successfully identifies key regulatory elements.
منابع مشابه
Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences
This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...
متن کاملAn Approach to Fault Modeling and Fault Seeding Using the Program Dependence Graph1
We present a fault-classification scheme and a fault-seeding method that are based on the manifestation of faults in the program dependence graph (PDG). We enhance the domain/computation faultclassification scheme developed by Howden to further characterize faults as structural and statement-level, depending on the differences between the PDG for the original program and the PDG for the faulty ...
متن کاملSeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps
Genome-wide maps of transcription factor (TF) occupancy and regions of open chromatin implicitly contain DNA sequence signals for multiple factors. We present SeqGL, a novel de novo motif discovery algorithm to identify multiple TF sequence signals from ChIP-, DNase-, and ATAC-seq profiles. SeqGL trains a discriminative model using a k-mer feature representation together with group lasso regula...
متن کاملDECOD: fast and accurate discriminative DNA motif finding
MOTIVATION Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The first is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and ...
متن کامل3-D Turbulence Numerical Simulation for the Flow Field of Suction Cylinder-Seeder with Socket-Slots
The flow field has significantly impact on seeding performance in the suction seeding device. A three-dimensional, incompressible, viscous, RNG turbulence model and the SIMPLE method were used by computational fluid dynamics(CFD), and the flow fields of suction cylinder-seeder with different socket’s radiuses were simulated by Fluent. When vacuum is 4kPa and productivity is 350 trays/h, the sim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 شماره
صفحات -
تاریخ انتشار 2008