Annotating a Japanese text corpus with predicate-argument and coreference relations

Ryu Iida, Mamoru Komachi, Kentaro Inui, Yuji Matsumoto

Research output: Contribution to conferencePaperpeer-review

73 Citations (Scopus)

Abstract

In this paper, we discuss how to annotate coreference and predicate-argument relations in Japanese written text. There have been research activities for building Japanese text corpora annotated with coreference and predicate-argument relations as are done in the Kyoto Text Corpus version 4.0 (Kawahara et al., 2002) and the GDATagged Corpus (Hasida, 2005). However, there is still much room for refining their specifications. For this reason, we discuss issues in annotating these two types of relations, and propose a new specification for each. In accordance with the specification, we built a large-scaled annotated corpus, and examined its reliability. As a result of our current work, we have released an annotated corpus named the NAIST Text Corpus1, which is used as the evaluation data set in the coreference and zero-anaphora resolution tasks in Iida et al. (2005) and Iida et al. (2006).

Original languageEnglish
Pages132-139
Number of pages8
DOIs
Publication statusPublished - 2007
EventLinguistic Annotation Workshop, LAW 2007 - Prague, Czech Republic
Duration: 2007 Jun 282007 Jun 29

Conference

ConferenceLinguistic Annotation Workshop, LAW 2007
Country/TerritoryCzech Republic
CityPrague
Period07/6/2807/6/29

Fingerprint

Dive into the research topics of 'Annotating a Japanese text corpus with predicate-argument and coreference relations'. Together they form a unique fingerprint.

Cite this