TY - GEN
T1 - A discriminative alignment model for abbreviation recognition
AU - Okazaki, Naoaki
AU - Ananiadou, Sophia
AU - Tsujii, Jun'ichi
PY - 2008
Y1 - 2008
N2 - This paper presents a discriminative alignment model for extracting abbreviations and their full forms appearing in actual text. The task of abbreviation recognition is formalized as a sequential alignment problem, which finds the optimal alignment (origins of abbreviation letters) between two strings (abbreviation and full form). We design a large number of fine-grained features that directly express the events where letters produce or do not produce abbreviations. We obtain the optimal combination of features on an aligned abbreviation corpus by using the maximum entropy framework. The experimental results show the usefulness of the alignment model and corpus for improving abbreviation recognition.
AB - This paper presents a discriminative alignment model for extracting abbreviations and their full forms appearing in actual text. The task of abbreviation recognition is formalized as a sequential alignment problem, which finds the optimal alignment (origins of abbreviation letters) between two strings (abbreviation and full form). We design a large number of fine-grained features that directly express the events where letters produce or do not produce abbreviations. We obtain the optimal combination of features on an aligned abbreviation corpus by using the maximum entropy framework. The experimental results show the usefulness of the alignment model and corpus for improving abbreviation recognition.
UR - http://www.scopus.com/inward/record.url?scp=80053399837&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053399837&partnerID=8YFLogxK
U2 - 10.3115/1599081.1599164
DO - 10.3115/1599081.1599164
M3 - Conference contribution
AN - SCOPUS:80053399837
SN - 9781905593446
T3 - Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
SP - 657
EP - 664
BT - Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 22nd International Conference on Computational Linguistics, Coling 2008
Y2 - 18 August 2008 through 22 August 2008
ER -