Pse-Analysis: a python package for DNA/RNA and protein/peptide sequence analysis based on pseudo components and kernel methods

نویسندگان

  • Bin Liu
  • Hao Wu
  • Deyuan Zhang
  • Xiaolong Wang
  • Kuo-Chen Chou
چکیده

To expedite the pace in conducting genome/proteome analysis, we have developed a Python package called Pse-Analysis. The powerful package can automatically complete the following five procedures: (1) sample feature extraction, (2) optimal parameter selection, (3) model training, (4) cross validation, and (5) evaluating prediction quality. All the work a user needs to do is to input a benchmark dataset along with the query biological sequences concerned. Based on the benchmark dataset, Pse-Analysis will automatically construct an ideal predictor, followed by yielding the predicted results for the submitted query samples. All the aforementioned tedious jobs can be automatically done by the computer. Moreover, the multiprocessing technique was adopted to enhance computational speed by about 6 folds. The Pse-Analysis Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/Pse-Analysis/, and can be directly run on Windows, Linux, and Unix.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phylogenetic and sequence analysis of the growth hormone gene of two sturgeons, Huso huso and Acipenser Gueldenstaedtii

In this study, the cDNA Growth Hormone (cGH) of the Belugasturgeon (Husohuso) and Russian sturgeon (Acipensergueldenstaedtii) were cloned and sequenced, and phylogenetic relationships were examined using nucleic acid and amino acid sequences. The nucleotide sequence of the Beluga GH has an open reading frame of 645 nucleotides encoding a protein 214 amino acid residues. The signal peptide cleav...

متن کامل

Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences

With the avalanche of biological sequences generated in the post-genomic age, one of the most challenging problems in computational biology is how to effectively formulate the sequence of a biological sample (such as DNA, RNA or protein) with a discrete model or a vector that can effectively reflect its sequence pattern information or capture its key features concerned. Although several web ser...

متن کامل

Assessment of humoral immune response of a Cytomegalovirus DNA-vaccine candidate in BALB/c mice

Introduction: Glycoprotein B (gB) is the major antigen for induction of humoral responses against human cytomegalovirus (HCMV) making it an attractive candidate for immune prophylaxis. In the present study, the humoral immune response of BALB/c mice to a truncated HCMV gB protein fused with GFP was evaluated. Methods: The truncated gB coding sequence was synthesized and cloned in pEGFPN1 eukary...

متن کامل

Phylogenetic analysis of two Iranian grapevine virus A isolates using coat protein gene sequence

Symptomatic grapevine samples were collected from vineyards in Zanjan province to detect Grapevine virus A. Total RNA was extracted from symptomatic leaf samples and subjected to cDNA synthesis using random hexamer primers. Then, a DNA fragment around 800 bp including the complete coat protein (CP) gene was amplified from nine out of 57 samples by polymerase chain reaction (PCR) using specific ...

متن کامل

Evaluation of Cell Penetrating Peptide Delivery System on HPV16E7 Expression in Three Types of Cell Line

Background: The poor permeability of the plasma and nuclear membranes to DNA plasmids are two major barriers for the development of these therapeutic molecules. Therefore, success in gene therapy approaches depends on the development of efficient and safe non-viral delivery systems. Objectives: The aim of this study was to investigate the in vitro delivery of plasmid DNA encoding HPV16 E7 gene...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2017