TY - GEN
T1 - Category mining by heterogeneous data fusion using PdLSI model in a retail service
AU - Ishigaki, Tsukasa
AU - Takenaka, Takeshi
AU - Motomura, Yoichi
PY - 2010
Y1 - 2010
N2 - This paper describes an appropriate category discovery method that simultaneously involves a customer's lifestyle category and item category for the sustainable management of retail services, designated as "category mining". Category mining is realized using a large-scale ID-POS data and customer's questionnaire responses with respect to their lifestyle. For the heterogeneous data fusion, we propose a probabilistic double-latent semantic indexing (PdLSI) model that is an extension of PLSI model. In the PdLSI model, customers and items are classified probabilistically into some latent lifestyle categories and latent item category. Then, understanding of relation between the latent categories and various purchased situations is realized using Bayesian network modeling. This method provides useful knowledge based on a large-scale data for efficient customer relationship management and category management, and can be applicable for other service industries.
AB - This paper describes an appropriate category discovery method that simultaneously involves a customer's lifestyle category and item category for the sustainable management of retail services, designated as "category mining". Category mining is realized using a large-scale ID-POS data and customer's questionnaire responses with respect to their lifestyle. For the heterogeneous data fusion, we propose a probabilistic double-latent semantic indexing (PdLSI) model that is an extension of PLSI model. In the PdLSI model, customers and items are classified probabilistically into some latent lifestyle categories and latent item category. Then, understanding of relation between the latent categories and various purchased situations is realized using Bayesian network modeling. This method provides useful knowledge based on a large-scale data for efficient customer relationship management and category management, and can be applicable for other service industries.
KW - Bayesian network
KW - Heterogeneous data fusion
KW - Large-scale ID-POS data
KW - Service engineering
KW - Topic model
UR - http://www.scopus.com/inward/record.url?scp=79951732561&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951732561&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2010.83
DO - 10.1109/ICDM.2010.83
M3 - Conference contribution
AN - SCOPUS:79951732561
SN - 9780769542560
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 857
EP - 862
BT - Proceedings - 10th IEEE International Conference on Data Mining, ICDM 2010
T2 - 10th IEEE International Conference on Data Mining, ICDM 2010
Y2 - 14 December 2010 through 17 December 2010
ER -