Abstract
Mining patterns in graphs has become an important issue in real applications, such as bioinformatics and web mining. We address a graph clustering problem where a cluster is a set of densely connected nodes, under a practical setting that 1) the input is multiple graphs which share a set of nodes but have different edges and 2) a true cluster cannot be found in all given graphs. For this problem, we propose a probabilistic generative model and a robust learning scheme based on variational Bayesian estimation. A key feature of our probabilistic framework is that not only nodes but also given graphs can be clustered at the same time, allowing our model to capture clusters found in only part of all given graphs. We empirically evaluated the effectiveness of the proposed framework on not only a variety of synthetic graphs but also real gene networks, demonstrating that our proposed approach can improve the clustering performance of competing methods in both synthetic and real data.
Original language | English |
---|---|
Article number | 5677533 |
Pages (from-to) | 577-589 |
Number of pages | 13 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 24 |
Issue number | 4 |
DOIs | |
Publication status | Published - 2012 |
Externally published | Yes |
Keywords
- Clustering
- graphs
- localized clusters
- statistical machine learning
- variational Bayesian learning
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics