PubChem3D: Diversity of shape
Abstract
BACKGROUND The shape diversity of 16.4 million biologically relevant molecules from the PubChem Compound database, and of their 1.46 billion diverse conformers, was explored as a function of molecular volume.

RESULTS The diversity of shape space was investigated by determining the shape similarity threshold that achieves a maximum count of reference shapes per unit of conformer volume. The rate of growth in shape space, as represented by a decreasing shape similarity threshold, was found to be remarkably smooth as a function of volume. There was no apparent correlation between the count of conformers per unit volume and their diversity, meaning that a single reference shape can describe the shape space of many chemical structures. The ability of a volume to describe the shape space of lesser volumes was also examined. For the majority of the volume range considered in this study, a given volume was able to describe 40-70% of the shape diversity of lesser volumes.

CONCLUSION The relative growth of shape diversity as a function of volume and shape similarity is surprisingly uniform. Given the distribution of chemicals in PubChem versus what is theoretically synthetically possible, the results of this analysis should be considered a conservative estimate of the true diversity of shape space.
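As a rough illustration of how a similarity threshold controls the number of reference shapes, the sketch below picks references greedily from a toy data set. It is not the procedure used in the study: the abstract does not specify the algorithm, and both function names (`tanimoto`, `pick_reference_shapes`) and the representation of a shape as a set of occupied voxel indices are assumptions made for this example only.

```python
from typing import FrozenSet, List, Sequence

Shape = FrozenSet[int]  # hypothetical: a shape as a set of occupied voxel indices


def tanimoto(a: Shape, b: Shape) -> float:
    """Tanimoto similarity of two voxel sets (toy stand-in for shape overlap)."""
    if not a and not b:
        return 1.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def pick_reference_shapes(shapes: Sequence[Shape], threshold: float) -> List[Shape]:
    """Greedy selection: a shape becomes a new reference only when no
    already-chosen reference is at least `threshold` similar to it."""
    references: List[Shape] = []
    for shape in shapes:
        if not any(tanimoto(shape, ref) >= threshold for ref in references):
            references.append(shape)
    return references


toy_shapes = [
    frozenset({1, 2, 3, 4}),
    frozenset({1, 2, 3, 5}),
    frozenset({7, 8, 9, 10}),
    frozenset({1, 2, 3, 4, 5}),
]

# Count the references needed at several similarity thresholds.
counts = {t: len(pick_reference_shapes(toy_shapes, t)) for t in (0.5, 0.7, 0.9)}
print(counts)
```

On this toy data, lowering the threshold from 0.9 to 0.5 reduces the number of references from 4 to 2, which mirrors the idea that a looser similarity threshold lets fewer reference shapes describe the same set of conformers.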
Similar resources
PubChem3D: conformer ensemble accuracy
BACKGROUND PubChem is a free and publicly available resource containing substance descriptions and their associated biological activity information. PubChem3D is an extension to PubChem containing computationally-derived three-dimensional (3-D) structures of small molecules. All the tools and services that are a part of PubChem3D rely upon the quality of the 3-D conformer models. ...
PubChem3D: a new resource for scientists
BACKGROUND PubChem is an open repository for small molecules and their experimental biological activity. PubChem integrates and provides search, retrieval, visualization, analysis, and programmatic access tools in an effort to maximize the utility of contributed information. There are many diverse chemical structures with similar biological efficacies against targets available in PubChem that a...
Evaluation of Genetic Variation of Common Fig (Ficus carica L.) in West of Iran
This study describes morphological diversity and relationship of 14 cultivars and 133 wild fig accessions from central Zagros Mountains located in the west of Iran, based on 58 morphological characters. Among all characters, secondary drooping branches, number of bark tubers, shape of central lobe, length of central lobe/length of lamina, little lateral lobes, shape of leaf without lobed, fruit...
The effect of size and shape of woodland patches on bird species richness and diversity in Karkas Protected Area
Determining landscape parameters influencing species richness of habitat patches is one of the most important issues in conservation biology. Many previous studies have investigated the influence of habitat parameters on bird assemblages in forest patches, but studies seeking effects of oasis parameters on bird assemblages are very scarce. Karkas Protected Area is located in semi-arid zone in...
Study of genetic diversity in pomegranate germplasm of Yazd province of Iran
A total of 117 pomegranate genotypes collected from different areas of Yazd province of Iran were studied for genetic variation by evaluating 23 morphological traits according to the international descriptor. Similar diversity pattern of the measured characteristics was observed in three types of sweet, sweet-sour and sour varieties. The traits shape of fruit base, suckering tendency, vigor of ...