spaced seeds

نتایج جستجو برای: spaced seeds

تعداد نتایج: 55222 فیلتر نتایج به سال:

Spaced Seed Data Structures for De Novo Assembly

2015

Inanç Birol Justin Chu Hamid Mohamadi Shaun D. Jackman Karthika Raghavan Benjamin P. Vandervalk Anthony Raymond René L. Warren

De novo assembly of the genome of a species is essential in the absence of a reference genome sequence. Many scalable assembly algorithms use the de Bruijn graph (DBG) paradigm to reconstruct genomes, where a table of subsequences of a certain length is derived from the reads, and their overlaps are analyzed to assemble sequences. Despite longer subsequences unlocking longer genomic features fo...

متن کامل

Sensitivity analysis and efficient method for identifying optimal spaced seeds

Journal: :Journal of Computer and System Sciences 2004

متن کامل

Spaced Seeds for Cross-species Cdna-to-genome Sequence Alignment

2009

LEMING ZHOU LILIANA FLOREA INGRID MIHAI

We review recent developments in spaced seed design for cross-species sequence alignment. We start with a brief overview of original ideas and early techniques, and then focus on more recent work on finding accurate (sensitive and specific) seeds for cross-species cDNA-to-genome alignment. These recent developments include methods and models for estimating seed specificity and determining sensi...

متن کامل

Optimal Spaced Seeds for Faster Approximate String Matching

Journal: :J. Comput. Syst. Sci. 2005

Martin Farach-Colton Gad M. Landau Süleyman Cenk Sahinalp Dekel Tsur

Filtering is a standard technique for fast approximate string matching in practice. In filtering, a quick first step is used to rule out almost all positions of a text as possible starting positions for a pattern. Typically this step consists of finding the exact matches of small parts of the pattern. In the followup step, a slow method is used to verify or eliminate each remaining position. Th...

متن کامل

Spaced seeds improve k-mer-based metagenomic classification

Journal: :Bioinformatics 2015

Karel Brinda Maciej Sykulski Gregory Kucherov

MOTIVATION Metagenomics is a powerful approach to study genetic content of environmental samples, which has been strongly promoted by next-generation sequencing technologies. To cope with massive data involved in modern metagenomic projects, recent tools rely on the analysis of k-mers shared between the read to be classified and sampled reference genomes. RESULTS Within this general framework...

متن کامل

Designing Efficient Spaced Seeds for SOLiD Read Mapping

2010

Laurent Noé Marta Gîrdea Gregory Kucherov

The advent of high-throughput sequencing technologies constituted a major advance in genomic studies, offering new prospects in a wide range of applications.We propose a rigorous and flexible algorithmic solution to mapping SOLiD color-space reads to a reference genome. The solution relies on an advanced method of seed design that uses a faithful probabilistic model of read matches and, on the ...

متن کامل

YASS: Similarity search in DNA sequences

2003

Laurent Noé Gregory Kucherov

We describe YASS – a new tool for finding local similarities in DNA sequences. The YASS algorithm first scans the sequence(s) and creates on the fly groups of seeds (small exact repeats obtained by hashing) according to statistically-founded criteria. Then it tries to extend those groups into similarity regions on the basis of a new extension criterion. The method can be seen as a compromise be...

متن کامل

Fast and accurate correction of optical mapping data via spaced seeds

Journal: :Bioinformatics 2020

متن کامل

A bioinformatician’s guide to the forefront of suffix array construction algorithms

2014

Anish Man Singh Shrestha Martin C. Frith Paul Horton

The suffix array and its variants are text-indexing data structures that have become indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an accessible exposition of the SA-IS algorithm, which is the state of the art in suffix array construction. We also describe DisLex, a technique that allows standard suffix array construction algorithms to create modified su...

متن کامل

Effective cluster-based seed design for cross-species sequence comparisons

Journal: :Bioinformatics 2008

Leming Zhou Ingrid Mihai Liliana Florea

SUMMARY To annotate newly sequenced organisms, cross-species sequence comparison algorithms can be applied to align gene sequences to the genome of a related species. To improve the accuracy of alignment, spaced seeds must be optimized for each comparison. As the number and diversity of genomes increase, an efficient alternative is to cluster pairwise comparisons into groups and identify seeds ...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید