TY - JOUR
T1 - Estimating genome-wide gene networks using nonparametric bayesian network models on massively parallel computers
AU - Tamada, Yoshinori
AU - Imoto, Seiya
AU - Araki, Hiromitsu
AU - Nagasaki, Masao
AU - Print, Cristin
AU - Charnock-Jones, D. Stephen
AU - Miyano, Satoru
N1 - Funding Information:
Computation time was provided by the Super Computer System, Human Genome Center, Institute of Medical Science, University of Tokyo. This research was supported in part by Grant-in-Aid for Research and Development Project of the Next Generation Integrated Simulation of Living Matter at RIKEN and MEXT Japan. The authors would like to acknowledge Yuki Tomiyasu and Kaori Yasuda at Cell Innovator Inc., Muna Affara, Ben Dunmore, Deborah Sanders, and Sally Humphreys at University of Cambridge, and Kousuke Tashiro and Satoru Kuhara at Kyushu University for their contributions to generating the HUVEC microarray data. The authors also would like to thank the anonymous referees for their valuable comments. Hiromitsu Araki was with Cell Innovator Inc. when he was doing this research.
PY - 2011
Y1 - 2011
N2 - We present a novel algorithm to estimate genome-wide gene networks consisting of more than 20,000 genes from gene expression data using nonparametric Bayesian networks. Due to the difficulty of learning Bayesian network structures, existing algorithms cannot be applied to more than a few thousand genes. Our algorithm overcomes this limitation by repeatedly estimating subnetworks in parallel for genes selected by neighbor node sampling. Through numerical simulation, we confirmed that our algorithm outperformed a heuristic algorithm in a shorter time. We applied our algorithm to microarray data from human umbilical vein endothelial cells (HUVECs) treated with siRNAs, to construct a human genome-wide gene network, which we compared to a small gene network estimated for the genes extracted using a traditional bioinformatics method. The results showed that our genome-wide gene network contains many features of the small network, as well as others that could not be captured during the small network estimation. The results also revealed master-regulator genes that are not in the small network but that control many of the genes in the small network. These analyses were impossible to realize without our proposed algorithm.
AB - We present a novel algorithm to estimate genome-wide gene networks consisting of more than 20,000 genes from gene expression data using nonparametric Bayesian networks. Due to the difficulty of learning Bayesian network structures, existing algorithms cannot be applied to more than a few thousand genes. Our algorithm overcomes this limitation by repeatedly estimating subnetworks in parallel for genes selected by neighbor node sampling. Through numerical simulation, we confirmed that our algorithm outperformed a heuristic algorithm in a shorter time. We applied our algorithm to microarray data from human umbilical vein endothelial cells (HUVECs) treated with siRNAs, to construct a human genome-wide gene network, which we compared to a small gene network estimated for the genes extracted using a traditional bioinformatics method. The results showed that our genome-wide gene network contains many features of the small network, as well as others that could not be captured during the small network estimation. The results also revealed master-regulator genes that are not in the small network but that control many of the genes in the small network. These analyses were impossible to realize without our proposed algorithm.
KW - Bayesian network structure learning
KW - Biology and genetics
KW - gene expression data analysis
KW - gene networks
UR - http://www.scopus.com/inward/record.url?scp=79952856971&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952856971&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2010.68
DO - 10.1109/TCBB.2010.68
M3 - Article
C2 - 20714027
AN - SCOPUS:79952856971
SN - 1545-5963
VL - 8
SP - 683
EP - 697
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 3
M1 - 5551118
ER -