TY - GEN
T1 - Web spam detection by exploring densely connected subgraphs
AU - Leon-Suematsu, Yutaka I.
AU - Inui, Kentaro
AU - Kurohashi, Sadao
AU - Kidawara, Yutaka
PY - 2011
Y1 - 2011
N2 - In this paper, we present a Web spam detection algorithm that relies on link analysis. The method consists of three steps: (1) decomposition of webgraphs in densely connected subgraphs and calculation of the features for each subgraph; (2) use of SVM classifiers to identify subgraphs composed of Web spam; and (3) propagation of predictions over webgraphs by a biased PageRank algorithm to expand the scope of identification. We performed experiments on a public benchmark. An empirical study of the core structure of webgraphs suggests that highly ranked non-spam hosts can be identified by viewing the coreness of the webgraph elements.
AB - In this paper, we present a Web spam detection algorithm that relies on link analysis. The method consists of three steps: (1) decomposition of webgraphs in densely connected subgraphs and calculation of the features for each subgraph; (2) use of SVM classifiers to identify subgraphs composed of Web spam; and (3) propagation of predictions over webgraphs by a biased PageRank algorithm to expand the scope of identification. We performed experiments on a public benchmark. An empirical study of the core structure of webgraphs suggests that highly ranked non-spam hosts can be identified by viewing the coreness of the webgraph elements.
KW - Biased pagerank
KW - Dense subgraphs
KW - Web spam
UR - http://www.scopus.com/inward/record.url?scp=80155182266&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80155182266&partnerID=8YFLogxK
U2 - 10.1109/WI-IAT.2011.152
DO - 10.1109/WI-IAT.2011.152
M3 - Conference contribution
AN - SCOPUS:80155182266
SN - 9780769545134
T3 - Proceedings - 2011 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2011
SP - 124
EP - 129
BT - Proceedings - 2011 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2011
T2 - 2011 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2011
Y2 - 22 August 2011 through 27 August 2011
ER -