protein families

The structure of BVU2987 from Bacteroides vulgatus reveals a superfamily of bacterial periplasmic proteins with possible inhibitory function

2010

Debanu Das Robert D. Finn Dennis Carlton Mitchell D. Miller Polat Abdubek Tamara Astakhova Herbert L. Axelrod Constantina Bakolitsa Connie Chen Hsiu-Ju Chiu Michelle Chiu Thomas Clayton Marc C. Deller Lian Duan Kyle Ellrott Dustin Ernst Carol L. Farr Julie Feuerhelm Joanna C. Grant Anna Grzechnik Gye Won Han Lukasz Jaroszewski Kevin K. Jin Heath E. Klock Mark W. Knuth Piotr Kozbial S. Sri Krishna Abhinav Kumar David Marciano Daniel McMullan Andrew T. Morse Edward Nigoghossian Amanda Nopakun Linda Okach Christina Puckett Ron Reyes Christopher L. Rife Natasha Sefcovic Henry J. Tien Christine B. Trame Henry van den Bedem Dana Weekes Tiffany Wooten Qingping Xu Keith O. Hodgson John Wooley Marc-André Elsliger Ashley M. Deacon Adam Godzik Scott A. Lesley Ian A. Wilson

Proteins that contain the DUF2874 domain constitute a new Pfam family PF11396. Members of this family have predominantly been identified in microbes found in the human gut and oral cavity. The crystal structure of one member of this family, BVU2987 from Bacteroides vulgatus, has been determined, revealing a β-lactamase inhibitor protein-like structure with a tandem repeat of domains. Sequence a...

متن کامل

ProtoMap: automatic classification of protein sequences and hierarchy of protein families

Journal: :Nucleic Acids Research 2000

متن کامل

Oxygen tolerant, photoinduced controlled radical polymerization approach for the synthesis of <i>giant amphiphiles</i>

Journal: :Polymer Chemistry 2021

New families of amphiphilic protein–polymer bioconjugates readily synthesized via an oxygen tolerant, photoinduced RDRP approach.

متن کامل

Redefining the PF06864 Pfam Family Based on Burkholderia pseudomallei PilO2Bp S-SAD Crystal Structure

2014

Patricia Lassaux Oscar Conchillo-Solé Babu A. Manjasetty Daniel Yero Lucia Perletti Hassan Belrhali Xavier Daura Louise J. Gourlay Martino Bolognesi

Type IV pili are surface-exposed filaments and bacterial virulence factors, represented by the Tfpa and Tfpb types, which assemble via specific machineries. The Tfpb group is further divided into seven variants, linked to heterogeneity in the assembly machineries. Here we focus on PilO2(Bp), a protein component of the Tfpb R64 thin pilus variant assembly machinery from the pathogen Burkholderia...

متن کامل

Computational prediction of SEG (single exon gene) function in humans.

Journal: :Frontiers in bioscience : a journal and virtual library 2005

Meena K Sakharkar Vincent T K Chow Kingshuk Ghosh Iti Chaturvedi Pern Chern Lee Sundara Perumal Bagavathi Paul Shapshak Subramanian Subbiah Pandjassarame Kangueane

Human genes are often interrupted by non-coding, intragenic sequences called introns. Hence, the gene sequence is divided into exons (coding segments) and introns (non-coding segments). Consequently, a majority of them are multi exon genes (MEG). However, a considerable amount of single exon genes (SEG) are present in the human genome (approximately 12%). This amount is sizeable and it is impor...

متن کامل

Protein Family Databases for Automated Protein Domain Identification

1998

ERIK L.L. SONNHAMMER

Automatic identification and annotation of protein domains is a major challenge for genome sequencing projects. Simple transfer of the annotation from the overall most similar protein with a known function is relatively reliable for prokaryotic proteins, but often produces misleading and incomplete results for multi-domain proteins, which are common in higher organisms. An alternative approach ...

متن کامل

PROCOGNATE: a cognate ligand domain mapping for enzymes

2008

Matthew Bashton Irene Nobeli Janet M. Thornton

PROCOGNATE is a database of protein cognate ligands for the domains in enzyme structures as described by CATH, SCOP and Pfam, and is available as an interactive website or a flat file. This article gives an overview of the database and its generation and presents a new website front end, as well as recent increased coverage in our dataset via inclusion of Pfam domains. We also describe navigati...

متن کامل

Learning generative models for protein fold families.

Journal: :Proteins 2011

Sivaraman Balakrishnan Hetunandan Kamisetty Jaime G Carbonell Su-In Lee Christopher James Langmead

We introduce a new approach to learning statistical models from multiple sequence alignments (MSA) of proteins. Our method, called GREMLIN (Generative REgularized ModeLs of proteINs), learns an undirected probabilistic graphical model of the amino acid composition within the MSA. The resulting model encodes both the position-specific conservation statistics and the correlated mutation statistic...

متن کامل

Generalized Baum-Welch Algorithm Based on the Similarity between Sequences

2013

Vahid Rezaei Hamid Pezeshk Horacio Pérez-Sa'nchez

The profile hidden Markov model (PHMM) is widely used to assign the protein sequences to their respective families. A major limitation of a PHMM is the assumption that given states the observations (amino acids) are independent. To overcome this limitation, the dependency between amino acids in a multiple sequence alignment (MSA) which is the representative of a PHMM can be appended to the PHMM...

متن کامل

Genome3D: exploiting structure to help users understand their sequences

2015

Tony E. Lewis Ian Sillitoe Antonina Andreeva Tom L. Blundell Daniel W. A. Buchan Cyrus Chothia Domenico Cozzetto Jose M. Dana Ioannis Filippis Julian Gough David T. Jones Lawrence A. Kelley Gerard J. Kleywegt Federico Minneci Jaina Mistry Alexey G. Murzin Bernardo Ochoa-Montaño Matt E. Oates Marco Punta Owen J. L. Rackham Jonathan Stahlhacke Michael J. E. Sternberg Sameer Velankar Christine A. Orengo

Genome3D (http://www.genome3d.eu) is a collaborative resource that provides predicted domain annotations and structural models for key sequences. Since introducing Genome3D in a previous NAR paper, we have substantially extended and improved the resource. We have annotated representatives from Pfam families to improve coverage of diverse sequences and added a fast sequence search to the website...

متن کامل