Extracting the author of web pages

Yoshikiyo Kato, Daisuke Kawahara, Kentaro Inui, Sadao Kurohashi, Tomohide Shibata

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

In this paper, we define the problem of identifying the author of a Web page as a sub-problem of identifying the information sender configuration of a Web page. We propose a method that extracts the author name candidates from a Web page based on linguistic features, and rank the candidates based on local features such as distance from the main content. The evaluation shows that we can achieve more than 75% precision when evaluated with candidates ranked within top five.

Original languageEnglish
Title of host publicationProceedings of the 2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
Pages35-41
Number of pages7
DOIs
Publication statusPublished - 2008
Event2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08 - Napa Valley, CA, United States
Duration: 2008 Oct 262008 Oct 30

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Conference

Conference2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
Country/TerritoryUnited States
CityNapa Valley, CA
Period08/10/2608/10/30

Keywords

  • Algorithms
  • Experimentation

Fingerprint

Dive into the research topics of 'Extracting the author of web pages'. Together they form a unique fingerprint.

Cite this