TY - GEN
T1 - Improving the web text content by extracting significant pages into a Web Site
AU - Ríos, Sebastián A.
AU - Velásquez, Juan D.
AU - Vera, Eduardo S.
AU - Yasuda, Hiroshi
AU - Aoki, Terumasa
PY - 2005/12/1
Y1 - 2005/12/1
N2 - Web Systems have reached a very important role in today's business world. Every day organizations fight to keep their present clients and to gain new ones. In order to accomplish this goal it is very important to make precise changes in the web site content. However, the development of these improvements is a complex and specialized task because of the nature of the web data itself. We propose a novel approach to successfully make changes to improve the web site content using text mining. We use a Self Organizing Feature Map (SOFM) to find the most relevant text content, and then we propose a reverse clustering analysis in order to extract the most significant pages of the whole web site. The effectiveness of this method was experimentally tested in a real web site.
AB - Web Systems have reached a very important role in today's business world. Every day organizations fight to keep their present clients and to gain new ones. In order to accomplish this goal it is very important to make precise changes in the web site content. However, the development of these improvements is a complex and specialized task because of the nature of the web data itself. We propose a novel approach to successfully make changes to improve the web site content using text mining. We use a Self Organizing Feature Map (SOFM) to find the most relevant text content, and then we propose a reverse clustering analysis in order to extract the most significant pages of the whole web site. The effectiveness of this method was experimentally tested in a real web site.
UR - http://www.scopus.com/inward/record.url?scp=33846993083&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33846993083&partnerID=8YFLogxK
U2 - 10.1109/ISDA.2005.55
DO - 10.1109/ISDA.2005.55
M3 - Conference contribution
AN - SCOPUS:33846993083
SN - 0769522866
SN - 9780769522869
T3 - Proceedings - 5th International Conference on Intelligent Systems Design and Applications 2005, ISDA '05
SP - 32
EP - 36
BT - Proceedings - 5th International Conference on Intelligent Systems Design and Applications, ISDA '05
T2 - 5th International Conference on Intelligent Systems Design and Applications, ISDA '05
Y2 - 8 September 2005 through 10 September 2005
ER -