TY - GEN
T1 - Notes on sequence binary decision diagrams
T2 - Prague Stringology Conference 2011, PSC 2011
AU - Denzumi, Shuhei
AU - Yoshinaka, Ryo
AU - Arimura, Hiroki
AU - Minato, Shin ichi
PY - 2011/12/1
Y1 - 2011/12/1
N2 - Manipulation of large sequence data is one of the most important problems in string processing. Recently, Loekito et al. (Knowl. Inf. Syst., 24(2), 235-268, 2009) have introduced a new data structure, called Sequence Binary Decision Diagrams (SeqBDDs, or SDDs), which are descendants of both acyclic DFAs (ADFAs) and binary decision diagrams (BDDs). SDDs can compactly represent sets of sequences as well as minimal ADFAs, while SDDs allow efficient set operations inherited from BDDs. A novel feature of the SDDs is that different SDDs can share equivalent subgraphs and duplicated computation in common to save the time and space in various operations. In this paper, we study fundamental properties of SDDs. In particular, we first present non-trivial relationships between sizes of minimum SDDs and minimal ADFAs. We then analyze the complexities of algorithms for Boolean set operations, called the binary synthesis. Finally, we show experimental results to confirm the results of the theoretical analysis on real data sets.
AB - Manipulation of large sequence data is one of the most important problems in string processing. Recently, Loekito et al. (Knowl. Inf. Syst., 24(2), 235-268, 2009) have introduced a new data structure, called Sequence Binary Decision Diagrams (SeqBDDs, or SDDs), which are descendants of both acyclic DFAs (ADFAs) and binary decision diagrams (BDDs). SDDs can compactly represent sets of sequences as well as minimal ADFAs, while SDDs allow efficient set operations inherited from BDDs. A novel feature of the SDDs is that different SDDs can share equivalent subgraphs and duplicated computation in common to save the time and space in various operations. In this paper, we study fundamental properties of SDDs. In particular, we first present non-trivial relationships between sizes of minimum SDDs and minimal ADFAs. We then analyze the complexities of algorithms for Boolean set operations, called the binary synthesis. Finally, we show experimental results to confirm the results of the theoretical analysis on real data sets.
UR - http://www.scopus.com/inward/record.url?scp=84861213080&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84861213080&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84861213080
SN - 9788001048702
T3 - Proceedings of the Prague Stringology Conference 2011
SP - 147
EP - 161
BT - Proceedings of the Prague Stringology Conference 2011
Y2 - 29 August 2011 through 31 August 2011
ER -