Cookiecutter: a tool for kmer-based read filtering and extraction

نویسندگان

  • Ekaterina Starostina
  • Gaik Tamazian
  • Pavel Dobrynin
  • Stephen O’Brien
  • Aleksey Komissarov
چکیده

Bioinformatics Institute, St. Petersburg, Russia Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, 41A Sredniy Avenue, 198207, St. Petersburg, Russia Oceanographic Center, Nova Southeastern University Ft Lauderdale, 8000 N. Ocean Drive, Ft Lauderdale, Florida 33004, USA

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using a Data Mining Tool and FP-Growth Algorithm Application for Extraction of the Rules in two Different Dataset (TECHNICAL NOTE)

In this paper, we want to improve association rules in order to be used in recommenders. Recommender systems present a method to create the personalized offers. One of the most important types of recommender systems is the collaborative filtering that deals with data mining in user information and offering them the appropriate item. Among the data mining methods, finding frequent item sets and ...

متن کامل

Comprehensive Analysis of Dense Point Cloud Filtering Algorithm for Eliminating Non-Ground Features

Point cloud and LiDAR Filtering is removing non-ground features from digital surface model (DSM) and reaching the bare earth and DTM extraction. Various methods have been proposed by different researchers to distinguish between ground and non- ground in points cloud and LiDAR data. Most fully automated methods have a common disadvantage, and they are only effective for a particular type of surf...

متن کامل

Informed kmer selection for de novo transcriptome assembly

MOTIVATION De novo transcriptome assembly is an integral part for many RNA-seq workflows. Common applications include sequencing of non-model organisms, cancer or meta transcriptomes. Most de novo transcriptome assemblers use the de Bruijn graph (DBG) as the underlying data structure. The quality of the assemblies produced by such assemblers is highly influenced by the exact word length k As su...

متن کامل

Pruning Rule for kMER-Based Acquisition of the Global Topographic Feature Map

For a kernel-based topographic map formation, kMER (kernel-based maximum entropy learning rule) was proposed by Van Hulle, and some effective learning rules related to kMER have been proposed so far with many applications. However, no discusions have been made concerning the determination of the number of units in kMER. This letter describes a unit-pruning rule, which permits automatic contruct...

متن کامل

A hybrid cloud read aligner based on MinHash and kmer voting that preserves privacy

Low-cost clouds can alleviate the compute and storage burden of the genome sequencing data explosion. However, moving personal genome data analysis to the cloud can raise serious privacy concerns. Here, we devise a method named Balaur, a privacy preserving read mapper for hybrid clouds based on locality sensitive hashing and kmer voting. Balaur can securely outsource a substantial fraction of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015