Implementation of Parallel Local Alignment Method for DNA Sequence using Apache Spark
نویسندگان
چکیده
منابع مشابه
An Apache Spark Implementation for Sentiment Analysis on Twitter Data
Sentiment Analysis on Twitter Data is a challenging problem due to the nature, diversity and volume of the data. In this work, we implement a system on Apache Spark, an open-source framework for programming with Big Data. The sentiment analysis tool is based on Machine Learning methodologies alongside with Natural Language Processing techniques and utilizes Apache Spark’s Machine learning libra...
متن کاملImage Correlation Method for DNA Sequence Alignment
The complexity of searches and the volume of genomic data make sequence alignment one of bioinformatics most active research areas. New alignment approaches have incorporated digital signal processing techniques. Among these, correlation methods are highly sensitive. This paper proposes a novel sequence alignment method based on 2-dimensional images, where each nucleic acid base is represented ...
متن کاملReal-time News Recommendations using Apache Spark
Recommending news articles is a challenging task due to the continuous changes in the set of available news articles and the contextdependent preferences of users. Traditional recommender approaches are optimized for analyzing static data sets. In news recommendation scenarios, characterized by continuous changes, high volume of messages, and tight time constraints, alternative approaches are n...
متن کاملDNA Sequence Alignment by Parallel Dynamic Programming
Of late Molecular biology is becoming increasingly dependent on computer science algorithms as research tools. The process of aligning DNA sequences is widely used in modern biological sciences. Genetics databases hold extremely large amount of raw data. The human genome alone has approximately 3 billion DNA base pairs. To search through all this data and to find meaningful relationships in it,...
متن کاملParallel Maritime Traffic Clustering Based on Apache Spark
Maritime traffic patterns extraction is an essential part for maritime security and surveillance and DBSCANSD is a density based clustering algorithm extracting the arbitrary shapes of the normal lanes from AIS data. This paper presents a parallel DBSCANSD algorithm on top of Apache Spark. The project is an experimental research work and the results shown in this paper is preliminary. The exper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of the Korea Contents Association
سال: 2016
ISSN: 1598-4877
DOI: 10.5392/jkca.2016.16.10.608