TY - GEN
T1 - Multi-label text categorization with model combination based on F1-score maximization
AU - Fujino, Akinori
AU - Isozaki, Hideki
AU - Suzuki, Jun
N1 - Publisher Copyright:
© 2008 IJCNLP 2008 - 3rd International Joint Conference on Natural Language Processing, Proceedings of the Conference. All rights reserved.
PY - 2008
Y1 - 2008
N2 - Text categorization is a fundamental task in natural language processing, and is generally defined as a multi-label categorization problem, where each text document is assigned to one or more categories. We focus on providing good statistical classifiers with a generalization ability for multi-label categorization and present a classifier design method based on model combination and F1-score maximization. In our formulation, we first design multiple models for binary classification per category. Then, we combine these models to maximize the F1-score of a training dataset. Our experimental results confirmed that our proposed method was useful especially for datasets where there were many combinations of category labels.
AB - Text categorization is a fundamental task in natural language processing, and is generally defined as a multi-label categorization problem, where each text document is assigned to one or more categories. We focus on providing good statistical classifiers with a generalization ability for multi-label categorization and present a classifier design method based on model combination and F1-score maximization. In our formulation, we first design multiple models for binary classification per category. Then, we combine these models to maximize the F1-score of a training dataset. Our experimental results confirmed that our proposed method was useful especially for datasets where there were many combinations of category labels.
UR - http://www.scopus.com/inward/record.url?scp=84878385982&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878385982&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84878385982
T3 - IJCNLP 2008 - 3rd International Joint Conference on Natural Language Processing, Proceedings of the Conference
SP - 823
EP - 828
BT - IJCNLP 2008 - 3rd International Joint Conference on Natural Language Processing, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 3rd International Joint Conference on Natural Language Processing, IJCNLP 2008
Y2 - 7 January 2008 through 12 January 2008
ER -