TY - GEN
T1 - Identifying hidden confounders in gene networks by Bayesian networks
AU - Higashigaki, Tomoya
AU - Kojima, Kaname
AU - Yamaguchi, Rui
AU - Inoue, Masato
AU - Imoto, Seiya
AU - Miyano, Satoru
PY - 2010/9/6
Y1 - 2010/9/6
N2 - In the estimation of gene networks from microarray gene expression data, we propose a statistical method for quantification of the hidden confounders in gene networks, which were possibly removed from the set of genes on the gene networks or are novel biological elements that are not measured by microarrays. Due to high computational cost of the structural learning of Bayesian networks and the limited source of the microarray data, it is usual to perform gene selection prior to the estimation of gene networks. Therefore, there exist missing genes that decrease accuracy and interpretability of the estimated gene networks. The proposed method can identify hidden confounders based on the conflicts of the estimated local Bayesian network structures and estimate their ideal profiles based on the proposed Bayesian networks with hidden variables with an EM algorithm. From the estimated ideal profiles, we can identify genes which are missing in the network or suggest the existence of the novel biological elements if the ideal profiles are not significantly correlated with any expression profiles of genes. To the best of our knowledge, this research is the first study to theoretically characterize missing genes in gene networks and practically utilize this information to refine network estimation.
AB - In the estimation of gene networks from microarray gene expression data, we propose a statistical method for quantification of the hidden confounders in gene networks, which were possibly removed from the set of genes on the gene networks or are novel biological elements that are not measured by microarrays. Due to high computational cost of the structural learning of Bayesian networks and the limited source of the microarray data, it is usual to perform gene selection prior to the estimation of gene networks. Therefore, there exist missing genes that decrease accuracy and interpretability of the estimated gene networks. The proposed method can identify hidden confounders based on the conflicts of the estimated local Bayesian network structures and estimate their ideal profiles based on the proposed Bayesian networks with hidden variables with an EM algorithm. From the estimated ideal profiles, we can identify genes which are missing in the network or suggest the existence of the novel biological elements if the ideal profiles are not significantly correlated with any expression profiles of genes. To the best of our knowledge, this research is the first study to theoretically characterize missing genes in gene networks and practically utilize this information to refine network estimation.
UR - http://www.scopus.com/inward/record.url?scp=77956135180&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956135180&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2010.35
DO - 10.1109/BIBE.2010.35
M3 - Conference contribution
AN - SCOPUS:77956135180
SN - 9780769540832
T3 - 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010
SP - 168
EP - 173
BT - 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010
T2 - 10th IEEE International Conference on Bioinformatics and Bioengineering, BIBE-2010
Y2 - 31 May 2010 through 3 June 2010
ER -