TY - GEN
T1 - Annotating geographical entities on microblog text
AU - Matsuda, Koji
AU - Sasaki, Akira
AU - Okazaki, Naoaki
AU - Inui, Kentaro
N1 - Funding Information:
This research was supported by the program Research and Development on Real World Big Data Integration and Analysis of the Ministry of Education, Culture, Sports, Science and Technology, Japan and by the Precursory Research for Embryonic Science and Technology (PRESTO), Japan Science and Technology Agency (JST).
Publisher Copyright:
© 2015 Association for Computational Linguistics
PY - 2020
Y1 - 2020
N2 - This paper presents a discussion of the problems surrounding the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we not only annotate geographical location entities but also facility entities, such as stations, restaurants, shopping stores, hospitals and schools. We discuss ways in which to build a gazetteer, the types of ambiguities that need to be considered, reasons why the annotator tends to disagree, and the problems that need to be solved to automate the task of annotating the geographical entities. All the annotation data and the annotation guidelines are publicly available for research purposes from our web site.
AB - This paper presents a discussion of the problems surrounding the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we not only annotate geographical location entities but also facility entities, such as stations, restaurants, shopping stores, hospitals and schools. We discuss ways in which to build a gazetteer, the types of ambiguities that need to be considered, reasons why the annotator tends to disagree, and the problems that need to be solved to automate the task of annotating the geographical entities. All the annotation data and the annotation guidelines are publicly available for research purposes from our web site.
UR - http://www.scopus.com/inward/record.url?scp=85084356534&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084356534&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85084356534
T3 - LAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop
SP - 85
EP - 94
BT - LAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop
A2 - Meyers, Adam
A2 - Rehbein, Ines
A2 - Zinsmeister, Heike
PB - Association for Computational Linguistics (ACL)
T2 - 9th Linguistic Annotation Workshop, LAW 2015, held in conjuncion with NAACL 2015
Y2 - 5 June 2015
ER -