Physiological genomics of Escherichia coli protein families.
نویسندگان
چکیده
The well-researched Escherichia coli genome offers the opportunity to explore the value of using protein families within a single organism to enrich functional annotation procedures and to study mechanisms of protein evolution. Having identified multimodular proteins resulting from gene fusion, and treated each module as a separate protein, nonoverlapping sequence-similar families in E. coli could be assembled. Of 3,902 proteins of length 100 residues or more, 2,415 clustered into 609 protein families. The relatedness of function among members of each family was dissected in detail. Data on paralogous protein families provides valuable information in attributing putative function to unknown genes, supplementing existing function annotation. Enzymes, transporters, and regulators represent the three major types of proteins in E. coli. They are shown to have distinctive patterns in gene duplication and divergence and gene fusion, suggesting that details of protein evolution have been different for genes in these categories. Data for the complete list of paralogous protein families and updated functional annotation for E. coli K-12 are accessible in GenProtEC (http://genprotec.mbl.edu).
منابع مشابه
Protein families reflect the metabolic diversity of organisms and provide support for functional prediction.
Comparative genomics has shown that protein families vary significantly within and across organisms in both number and functional composition. In the present work, we tested how the diversity at the family level reflects biological differences among organisms and contributes to their unique characteristics. For this purpose, we collected sequence-similar proteins of three selected families from...
متن کاملRecent Advances in High Cell Density Cultivation for Production of Recombinant Protein
This paper reviews recent strategies used for increasing specific yield and productivity in high cell density cultures. High cell density cultures offer an efficient means for the economical production of recombinant proteins. However, there are still some challenges associated with high cell density cultivation (HCDC) techniques. A variety of strategies in several aspects including host design...
متن کاملThe Escherichia coli proteome: past, present, and future prospects.
Proteomics has emerged as an indispensable methodology for large-scale protein analysis in functional genomics. The Escherichia coli proteome has been extensively studied and is well defined in terms of biochemical, biological, and biotechnological data. Even before the entire E. coli proteome was fully elucidated, the largest available data set had been integrated to decipher regulatory circui...
متن کاملHigh-level expression of tetanus toxin fragment C in Escherichia coli
Fragment C is the C-terminal domain of the heavy chain of tetanus toxin that can promote the immune response against the lethal dose of this toxin. Therefore, this portion can be considered as a candidate vaccine against tetanus infection, which occurs by Clostridium tetani. The present study aimed to compare the expression of tetanus toxin fragment C in Escherichia coli BL21 (DE3) pLysS cells...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Physiological genomics
دوره 9 1 شماره
صفحات -
تاریخ انتشار 2002