TY - GEN
T1 - A corpus study for identifying evidence on microblogs
AU - Reisert, Paul
AU - Mizuno, Junta
AU - Kanno, Miwa
AU - Okazaki, Naoaki
AU - Inui, Kentaro
N1 - Funding Information:
We would like to acknowledge MEXT (Ministry of Education, Culture, Sports, Science and Technology) for their generous financial support via the Research Student Scholarship. This study was partly supported by Japan Society for the Promotion of Science (JSPS) KAKENHI Grant No. 23240018 and Japan Science and Technology Agency (JST). Furthermore, we would like to also thank Eric Nichols (Honda Research Institute Japan Co., Ltd.) for his discussions on the topic of evidence relations.
Publisher Copyright:
© LAW 2014 - 8th Linguistic Annotation Workshop, in conjunction with COLING 2014 - Proceedings of the Workshop. All rights reserved.
PY - 2020
Y1 - 2020
N2 - Microblogs are a popular way for users to communicate and have recently caught the attention of researchers in the natural language processing (NLP) field. However, regardless of their rising popularity, little attention has been given towards determining the properties of discourse relations for the rapid, large-scale microblog data. Therefore, given their importance for various NLP tasks, we begin a study of discourse relations on microblogs by focusing on evidence relations. As no annotated corpora for evidence relations on microblogs exist, we conduct a corpus study to identify such relations on Twitter, a popular microblogging service. We create annotation guidelines, conduct a large-scale annotation phase, and develop a corpus of annotated evidence relations. Finally, we report our observations, annotation difficulties, and data statistics.
AB - Microblogs are a popular way for users to communicate and have recently caught the attention of researchers in the natural language processing (NLP) field. However, regardless of their rising popularity, little attention has been given towards determining the properties of discourse relations for the rapid, large-scale microblog data. Therefore, given their importance for various NLP tasks, we begin a study of discourse relations on microblogs by focusing on evidence relations. As no annotated corpora for evidence relations on microblogs exist, we conduct a corpus study to identify such relations on Twitter, a popular microblogging service. We create annotation guidelines, conduct a large-scale annotation phase, and develop a corpus of annotated evidence relations. Finally, we report our observations, annotation difficulties, and data statistics.
UR - http://www.scopus.com/inward/record.url?scp=85084342462&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084342462&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85084342462
T3 - LAW 2014 - 8th Linguistic Annotation Workshop, in conjunction with COLING 2014 - Proceedings of the Workshop
SP - 70
EP - 74
BT - LAW 2014 - 8th Linguistic Annotation Workshop, in conjunction with COLING 2014 - Proceedings of the Workshop
A2 - Levin, Lori
A2 - Stede, Manfred
PB - Association for Computational Linguistics (ACL)
T2 - 8th Linguistic Annotation Workshop, LAW 2014, in conjunction with COLING 2014
Y2 - 23 August 2014 through 24 August 2014
ER -