TY - GEN
T1 - Neural joint learning for classifying Wikipedia articles into fine-grained named entity types
AU - Suzuki, Masatoshi
AU - Matsuda, Koji
AU - Sekine, Satoshi
AU - Okazaki, Naoaki
AU - Inui, Kentaro
PY - 2016
Y1 - 2016
N2 - This paper addresses the task of assigning fine-grained NE type labels to Wikipedia articles. To address the data sparseness problem, which is salient particularly in fine-grained type classification, we introduce a multi-task learning framework where type classifiers are all jointly learned by a neural network with a hidden layer. In addition, we also propose to learn article vectors (i.e. entity embeddings) from Wikipedia's hypertext structure using a Skipgram model and incorporate them into the input feature set. To conduct large-scale practical experiments, we created a new dataset containing over 22,000 manually labeled instances. The dataset is available. The results of our experiments show that both ideas gained their own statistically significant improvement separately in classification accuracy.
AB - This paper addresses the task of assigning fine-grained NE type labels to Wikipedia articles. To address the data sparseness problem, which is salient particularly in fine-grained type classification, we introduce a multi-task learning framework where type classifiers are all jointly learned by a neural network with a hidden layer. In addition, we also propose to learn article vectors (i.e. entity embeddings) from Wikipedia's hypertext structure using a Skipgram model and incorporate them into the input feature set. To conduct large-scale practical experiments, we created a new dataset containing over 22,000 manually labeled instances. The dataset is available. The results of our experiments show that both ideas gained their own statistically significant improvement separately in classification accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85015954846&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85015954846&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85015954846
T3 - Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
SP - 535
EP - 543
BT - Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
A2 - Park, Jong C.
A2 - Chung, Jin-Woo
PB - Institute for the Study of Language and Information
T2 - 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
Y2 - 28 October 2016 through 30 October 2016
ER -