Abstract
We present a multi-dialect neural machinetranslation (NMT) model tailored to Japanese.While the surface forms of Japanese dialectsdiffer from those of standard Japanese, mostof the dialects share fundamental properties such as word order, and some also usemany of the same phonetic correspondencerules. To take advantage of these properties,we integrate multilingual, syllable-level, andfixed-order translation techniques into a general NMT model. Our experimental resultsdemonstrate that this model can outperform abaseline dialect translation model. In addition,we show that visualizing the dialect embeddings learned by the model can facilitate geographical and typological analyses of dialects.
Original language | English |
---|---|
Pages | 1-10 |
Number of pages | 10 |
Publication status | Published - 2018 |
Event | 32nd Pacific Asia Conference on Language, Information and Computation, PACLIC 2018 - Hong Kong, Hong Kong Duration: 2018 Dec 1 → 2018 Dec 3 |
Conference
Conference | 32nd Pacific Asia Conference on Language, Information and Computation, PACLIC 2018 |
---|---|
Country/Territory | Hong Kong |
City | Hong Kong |
Period | 18/12/1 → 18/12/3 |
ASJC Scopus subject areas
- Language and Linguistics
- Computer Science (miscellaneous)