TY - JOUR
T1 - Lagrangian relaxation for scalable text summarization while maximizing multipleobjectives
AU - Nishino, Masaaki
AU - Yasuda, Norihito
AU - Tsutomu, Hirao
AU - Suzuki, Jun
AU - Nagata, Masaaki
PY - 2013/7/10
Y1 - 2013/7/10
N2 - Multi-document summarization is the task of generating a summary from multiple documents, and the generated summary is expected to contain much of the information contained in the original documents. Previous work tries to realize this by (i) formulating the task as the combinatorial optimization problem of simultaneously maximizing relevance and minimizing redundancy, or (ii) formulating the task as a graph-cut problem. This paper improves summary quality by combining these two approaches into a synthesized optimization problem that is formulated in Integer Linear Programming (ILP). Though an ILP problem can be solved with an ILP solver, the problem is NP-hard and it is difficult to obtain the exact solution in situations where immediate responses are needed. Our solution is to propose optimization heuristics that exploit Lagrangian relaxation to obtain good appro ximate solutions within feasible computation times. Experiments on the document understanding conference 2004 (DUC 04) dataset show that our Lagrangian relaxation based heuristics completes in feasible computation time but achieves higher ROUGE scores than state-of-the-art approximate methods.
AB - Multi-document summarization is the task of generating a summary from multiple documents, and the generated summary is expected to contain much of the information contained in the original documents. Previous work tries to realize this by (i) formulating the task as the combinatorial optimization problem of simultaneously maximizing relevance and minimizing redundancy, or (ii) formulating the task as a graph-cut problem. This paper improves summary quality by combining these two approaches into a synthesized optimization problem that is formulated in Integer Linear Programming (ILP). Though an ILP problem can be solved with an ILP solver, the problem is NP-hard and it is difficult to obtain the exact solution in situations where immediate responses are needed. Our solution is to propose optimization heuristics that exploit Lagrangian relaxation to obtain good appro ximate solutions within feasible computation times. Experiments on the document understanding conference 2004 (DUC 04) dataset show that our Lagrangian relaxation based heuristics completes in feasible computation time but achieves higher ROUGE scores than state-of-the-art approximate methods.
KW - Combinatorial optimization
KW - Lagrangian relaxation
KW - Multi-document summarization
UR - http://www.scopus.com/inward/record.url?scp=84940363175&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84940363175&partnerID=8YFLogxK
U2 - 10.1527/tjsai.28.433
DO - 10.1527/tjsai.28.433
M3 - Article
AN - SCOPUS:84940363175
SN - 1346-0714
VL - 28
SP - 433
EP - 441
JO - Transactions of the Japanese Society for Artificial Intelligence
JF - Transactions of the Japanese Society for Artificial Intelligence
IS - 5
ER -