Remarkably little variation in proteins encoded by the Y chromosome's single-copy genes, implying effective purifying selection.

Authors

  • Steve Rozen
  • Janet D Marszalek
  • Raaji K Alagappan
  • Helen Skaletsky
  • David C Page
Abstract

Y-linked single-nucleotide polymorphisms (SNPs) have served as powerful tools for reconstructing the worldwide genealogy of human Y chromosomes and for illuminating patrilineal relationships among modern human populations. However, there has been no systematic, worldwide survey of sequence variation within the protein-coding genes of the Y chromosome. Here we report and analyze coding sequence variation among the 16 single-copy "X-degenerate" genes of the Y chromosome. We examined variation in these genes in 105 men representing worldwide diversity, resequencing in each man an average of 27 kb of coding DNA, 40 kb of intronic DNA, and, for comparison, 15 kb of DNA in single-copy Y-chromosomal pseudogenes. There is remarkably little variation in X-degenerate protein sequences: two chromosomes drawn at random differ on average by a single amino acid, with half of these differences arising from a single, conservative Asp-->Glu mutation that occurred approximately 50,000 years ago. Further analysis showed that nucleotide diversity and the proportion of variant sites are significantly lower for nonsynonymous sites than for synonymous sites, introns, or pseudogenes. These differences imply that natural selection has operated effectively in preserving the amino acid sequences of the Y chromosome's X-degenerate proteins during the last approximately 100,000 years of human history. Thus our findings are at odds with prominent accounts of the human Y chromosome's imminent demise.
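
The two summary statistics named in the abstract, nucleotide diversity and the proportion of variant sites, can be illustrated with a minimal sketch. It assumes pre-aligned, equal-length sequences with no gaps or missing data; the function names and the toy sequences are hypothetical and are not taken from the study, whose actual analysis pipeline is not described here.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average number of differences per site between all pairs of sequences.

    Assumes the sequences are already aligned and of equal length
    (an assumption of this sketch, not a detail from the paper).
    """
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions for every pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(x, y)) for x, y in pairs
    )
    return total_diffs / (len(pairs) * length)


def proportion_variant_sites(seqs):
    """Fraction of alignment columns showing more than one allele."""
    length = len(seqs[0])
    variant = sum(len(set(column)) > 1 for column in zip(*seqs))
    return variant / length


# Toy data: three hypothetical 4-bp sequences, one of which carries a variant.
toy = ["ACGT", "ACGA", "ACGT"]
pi = nucleotide_diversity(toy)
s = proportion_variant_sites(toy)
```

Computing these statistics separately for nonsynonymous sites, synonymous sites, introns, and pseudogenes would give the kind of per-class comparison the abstract reports, although classifying sites into those categories is outside the scope of this sketch.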

Similar resources

Global human frequencies of predicted nuclear pathogenic variants and the role played by protein hydrophobicity in pathogenicity potential

Mitochondrial proteins are coded by nuclear (nDNA) and mitochondrial (mtDNA) genes, implying a complex cross-talk between the two genomes. Here we investigated the diversity displayed in 104 nuclear-coded mitochondrial proteins from 1,092 individuals from the 1000 Genomes dataset, in order to evaluate if these genes are under the effects of purifying selection and how that selection compares wi...

The evolution of functionally novel proteins after gene duplication

A widely cited model of the evolution of functionally novel proteins (here called the model of mutation during non-functionality (mdn model)) holds that, after gene duplication, one gene copy is redundant and free to accumulate substitutions at random. By chance, some of these substitutions may suit the protein encoded by such a non-functional gene to a new function, which it can subsequently a...

I-49: Human Y Chromosome Proteome Project

The success of the Human Genome Project (HGP) has provided a blueprint for the approximately 20,000 gene-encoded proteins potentially active in all of the hundreds of cell types that make up the human body. Yet we still have limited knowledge about a majority of the gene-encoded proteins which are the “building blocks of life” and “cellular machinery”. It is estimated that for nearly half of th...

Escape from Preferential Retention Following Repeated Whole Genome Duplications in Plants

The well supported gene dosage hypothesis predicts that genes encoding proteins engaged in dose-sensitive interactions cannot be reduced back to single copies once all interacting partners are simultaneously duplicated in a whole genome duplication. The genomes of extant flowering plants are the result of many sequential rounds of whole genome duplication, yet the fraction of genomes devoted to...

Journal title:
  • American journal of human genetics

Volume 85, Issue 6

Pages -

Publication date: 2009