Building a corpus for Japanese wikification with fine-grained entity classes

Davaajav Jargalsaikhan, Naoaki Okazaki, Koji Matsuda, Kentaro Inui

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

In this research, we build a Wikification corpus for advancing Japanese Entity Linking. This corpus consists of 340 Japanese newspaper articles with 25,675 entity mentions. All entity mentions are labeled by a fine-grained semantic classes (200 classes), and 19,121 mentions were successfully linked to Japanese Wikipedia articles. Even with the fine-grained semantic classes, we found it hard to define the target of entity linking annotations and to utilize the fine-grained semantic classes to improve the accuracy of entity linking.

Original languageEnglish
Title of host publication54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Student Research Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages138-144
Number of pages7
ISBN (Electronic)9781510827608
DOIs
Publication statusPublished - 2016
Event54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Berlin, Germany
Duration: 2016 Aug 72016 Aug 12

Publication series

Name54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Student Research Workshop

Conference

Conference54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
Country/TerritoryGermany
CityBerlin
Period16/8/716/8/12

Fingerprint

Dive into the research topics of 'Building a corpus for Japanese wikification with fine-grained entity classes'. Together they form a unique fingerprint.

Cite this