The InterPro database, an integrated documentation resource for protein families, domains and functional sites

Authors

  • Rolf Apweiler
  • Terri K. Attwood
  • Amos Bairoch
  • Alex Bateman
  • Ewan Birney
  • Margaret Biswas
  • Philipp Bucher
  • Lorenzo Cerutti
  • Florence Corpet
  • Michael D. R. Croning
  • Richard Durbin
  • Laurent Falquet
  • Wolfgang Fleischmann
  • Jérôme Gouzy
  • Henning Hermjakob
  • Nicolas Hulo
  • Inge Jonassen
  • Daniel Kahn
  • Alexander Kanapin
  • Youla Karavidopoulou
  • Rodrigo Lopez
  • Beate Marx
  • Nicola J. Mulder
  • Thomas M. Oinn
  • Marco Pagni
  • Florence Servant
  • Christian J. A. Sigrist
  • Evgeny M. Zdobnov
Abstract

Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1,000,000 hits from 462,500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to [email protected].

Similar resources

InterPro: An Integrated Documentation Resource for Protein Families, Domains and Functional Sites

The exponential increase in the submission of nucleotide sequences to the nucleotide sequence database by genome sequencing centres has resulted in a need for rapid, automatic methods for classification of the resulting protein sequences. There are several signature and sequence cluster-based methods for protein classification, each resource having distinct areas of optimum application owing to...

InterPro, progress and status in 2005

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is pro...

The InterPro Database, 2003 brings increased coverage and new features

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. The results are pro...

Genomic Functional Investigation through Statistical Analysis of Protein Families and Domains

Protein families and domains represent a very relevant resource useful to understand protein functions and interactions among their codifying genes. To perform evaluations of gene annotations sparsely available in numerous different databanks accessible via Internet, we previously developed GFINDer, a Web server that performs statistical analysis of functional and phenotypic annotations of gene...

Journal:
  • Nucleic acids research

Volume 29, Issue 1

Pages  -

Publication date: 2001