Genome-wide analysis reveals strong correlation between CpG islands with nearby transcription start sites of genes and their tissue specificity

Riu Yamashita, Yutaka Suzuki, Sumio Sugano, Kenta Nakai

Research output: Contribution to journal › Article › peer-review

80 Citations (Scopus)

Abstract

It has been envisaged that CpG islands are often observed near the transcriptional start sites (TSS) of housekeeping genes. However, neither the precise positions of CpG islands relative to the TSS of genes nor the correlation between the presence of CpG islands and the expression specificity of these genes is well understood. Using thousands of sequences with known TSS in human and mouse, we found that there is a clear peak in the distribution of CpG islands around the TSS in the genes of these two species. Thus, we classified human (mouse) genes into 6600 (2948) CpG+ genes and 2619 (1830) CpG- ones, based on the presence of a CpG island within the -100:+100 region. We estimated the degree of each gene being a housekeeper by the number of cDNA libraries where its ESTs were detected. Then, the tendency that a gene lacking CpG islands around its TSS is expressed with a higher degree of tissue specificity turned out to be evolutionarily conserved. We also confirmed this tendency by analyzing the gene ontology annotation of classified genes. Since no such clear correlation was found in the control data (mRNAs, pre-mRNAs, and chromosome banding pattern), we concluded that the effect of a CpG island near the TSS should be more important than the global GC content of the region where the gene resides.
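
The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the input layout, and the choice to treat any overlap between a CpG island and the window around the TSS as "presence" are all assumptions made for illustration. CpG island coordinates are taken as given, since the abstract does not state the detection criteria.

```python
from typing import Iterable, Tuple

# (start, end) of a CpG island, inclusive, on the same chromosome as the TSS.
Interval = Tuple[int, int]


def classify_gene(tss: int, islands: Iterable[Interval], flank: int = 100) -> str:
    """Label a gene CpG+ if any island overlaps [tss - flank, tss + flank].

    Assumption: "presence within the -100:+100 region" is read as overlap,
    not full containment. The window is symmetric, so strand is ignored.
    """
    lo, hi = tss - flank, tss + flank
    overlaps = any(start <= hi and end >= lo for start, end in islands)
    return "CpG+" if overlaps else "CpG-"


def expression_breadth(est_libraries: Iterable[str]) -> int:
    """Count the distinct cDNA libraries in which a gene's ESTs were detected.

    A larger count is taken as a higher degree of being a housekeeper.
    """
    return len(set(est_libraries))


# Hypothetical toy records, not data from the study.
genes = {
    "geneA": {"tss": 5000, "islands": [(4800, 5300)],
              "libraries": ["liver", "brain", "liver", "kidney"]},
    "geneB": {"tss": 9000, "islands": [(12000, 12400)],
              "libraries": ["testis"]},
}

for name, rec in genes.items():
    label = classify_gene(rec["tss"], rec["islands"])
    breadth = expression_breadth(rec["libraries"])
    print(name, label, breadth)
```

With labels and breadth values in hand, the two groups can be compared by the distribution of library counts, which is the kind of comparison the abstract summarizes.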

Original language: English
Pages (from-to): 129-136
Number of pages: 8
Journal: Gene
Volume: 350
Issue number: 2
Publication status: Published - 2005 May 9
Externally published: Yes

Keywords

  • CpG islands
  • Housekeeping genes
  • Isochores
  • Tissue specificity

ASJC Scopus subject areas

  • Genetics
