TY - GEN
T1 - Learning co-substructures by kernel dependence maximization
AU - Yokoi, Sho
AU - Mochihashi, Daichi
AU - Takahashi, Ryo
AU - Okazaki, Naoaki
AU - Inui, Kentaro
N1 - Funding Information:
This work was supported by JST CREST Grant Number JP-MJCR1513, Japan. We are grateful to Prof. K. Fukumizu and Prof. H. Kashima for giving us valuable advice. We would also like to thank Dr. R. Tian and S. Kobayashi for meaningful discussions.
PY - 2017
Y1 - 2017
N2 - Modeling associations between items in a dataset is a problem that is frequently encountered in data and knowledge mining research. Most previous studies have simply applied a predefined fixed pattern for extracting the substructure of each item pair and then analyzed the associations between these substructures. Using such fixed patterns may not, however, capture the significant association. We, therefore, propose the novel machine learning task of extracting a strongly associated substructure pair (co-substructure) from each input item pair. We call this task dependent co-substructure extraction (DCSE), and formalize it as a dependence maximization problem. Then, we discuss critical issues with this task: the data sparsity problem and a huge search space. To address the data sparsity problem, we adopt the Hilbert-Schmidt independence criterion as an objective function. To improve search efficiency, we adopt the Metropolis-Hastings algorithm. We report the results of empirical evaluations, in which the proposed method is applied for acquiring and predicting narrative event pairs, an active task in the field of natural language processing.
AB - Modeling associations between items in a dataset is a problem that is frequently encountered in data and knowledge mining research. Most previous studies have simply applied a predefined fixed pattern for extracting the substructure of each item pair and then analyzed the associations between these substructures. Using such fixed patterns may not, however, capture the significant association. We, therefore, propose the novel machine learning task of extracting a strongly associated substructure pair (co-substructure) from each input item pair. We call this task dependent co-substructure extraction (DCSE), and formalize it as a dependence maximization problem. Then, we discuss critical issues with this task: the data sparsity problem and a huge search space. To address the data sparsity problem, we adopt the Hilbert-Schmidt independence criterion as an objective function. To improve search efficiency, we adopt the Metropolis-Hastings algorithm. We report the results of empirical evaluations, in which the proposed method is applied for acquiring and predicting narrative event pairs, an active task in the field of natural language processing.
UR - http://www.scopus.com/inward/record.url?scp=85031924508&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85031924508&partnerID=8YFLogxK
U2 - 10.24963/ijcai.2017/465
DO - 10.24963/ijcai.2017/465
M3 - Conference contribution
AN - SCOPUS:85031924508
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 3329
EP - 3335
BT - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
A2 - Sierra, Carles
PB - International Joint Conferences on Artificial Intelligence
T2 - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
Y2 - 19 August 2017 through 25 August 2017
ER -