TY - JOUR
T1 - Identification of reference genes for quantitative expression analysis using large-scale RNA-seq data of arabidopsis thaliana and model crop plants
AU - Kudo, Toru
AU - Sasaki, Yohei
AU - Terashima, Shin
AU - Matsuda-Imai, Noriko
AU - Takano, Tomoyuki
AU - Saito, Misa
AU - Kanno, Maasa
AU - Ozaki, Soichi
AU - Suwabe, Keita
AU - Suzuki, Go
AU - Watanabe, Masao
AU - Matsuoka, Makoto
AU - Takayama, Seiji
AU - Yano, Kentaro
N1 - Publisher Copyright:
© 2016, Genetics Society of Japan. All rights reserved.
PY - 2016
Y1 - 2016
N2 - In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
AB - In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
KW - Gene expression analysis
KW - Reference gene
KW - RNA-seq
UR - http://www.scopus.com/inward/record.url?scp=84991501212&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84991501212&partnerID=8YFLogxK
U2 - 10.1266/ggs.15-00065
DO - 10.1266/ggs.15-00065
M3 - Article
C2 - 27040147
AN - SCOPUS:84991501212
SN - 1341-7568
VL - 91
SP - 111
EP - 125
JO - Genes and Genetic Systems
JF - Genes and Genetic Systems
IS - 2
ER -