Building approximate overlap graphs for DNA assembly using random-permutations-based search

نویسنده

  • Roy R. Lederman
چکیده

between two reads indicates an approximate overlap between the reads. Since the algorithm finds approximate overlaps directly, it can process reads without errorcorrection preprocessing steps. Extensions of the algorithm, such as construction graphs of overlapping pairs of reads, are discussed. The algorithm can be used to construct graphs for assembly and for other related applications such as error correction. Preliminary experimental results indicate that the algorithm constructs graphs which lead to relatively long contigs even in the presence of sequencing errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diamondoids and DNA Nanotechnologies

Diamondoids are cage-like saturated hydrocarbons consisting of fused cyclohexane rings. The Diamondoids family of compounds is one of the best candidates for molecular building blocks (MBBs) in nanotechnology to construct organic nanostructures compared to other MBBs known so far. The challenge is to find a route for self-assembly of these cage hydrocarbons and their applications in the bottom-...

متن کامل

Toward a Distance Oracle for Billion-Node Graphs

The emergence of real life graphs with billions of nodes poses significant challenges for managing and querying these graphs. One of the fundamental queries submitted to graphs is the shortest distance query. Online BFS (breadth-first search) and offline pre-computing pairwise shortest distances are prohibitive in time or space complexity for billion-node graphs. In this paper, we study the fea...

متن کامل

An Effective Path-aware Approach for Keyword Search over Data Graphs

Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...

متن کامل

A memory-efficient data structure representing exact-match overlap graphs with application for next-generation DNA assembly

MOTIVATION Exact-match overlap graphs have been broadly used in the context of DNA assembly and the shortest super string problem where the number of strings n ranges from thousands to billions. The length ℓ of the strings is from 25 to 1000, depending on the DNA sequencing technologies. However, many DNA assemblers using overlap graphs suffer from the need for too much time and space in constr...

متن کامل

F-Permutations induce Some Graphs and Matrices

In this paper, by using the notion of fuzzy subsets, the concept of F-permutation is introduced. Then by applying this notion the concepts of presentation of an F-polygroup, graph of an F-permutation and F-permutation matrices are investigated.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012