Abstract
In this paper, we describe a method of automatic sentence alignment for building extracts from abstracts in automatic summarization research. Our method is based on two steps. First, we introduce the “dependency tree path” (DTP). Next, we calculate the similarity between DTPs based on the ESK (Extended String Subsequence Kernel), which considers sequential patterns. By using these procedures, we can derive one-to-many or many-to-one correspondences among sentences. Experiments using different similarity measures show that DTP consistently improves the alignment accuracy and that ESK gives the best performance.
Original language | English |
---|---|
Publication status | Published - 2004 |
Externally published | Yes |
Event | 20th International Conference on Computational Linguistics, COLING 2004 - Geneva, Switzerland Duration: 2004 Aug 23 → 2004 Aug 27 |
Conference
Conference | 20th International Conference on Computational Linguistics, COLING 2004 |
---|---|
Country/Territory | Switzerland |
City | Geneva |
Period | 04/8/23 → 04/8/27 |
ASJC Scopus subject areas
- Computational Theory and Mathematics
- Language and Linguistics
- Linguistics and Language