TY - GEN
T1 - A case study on freshness based scoring for fresh information retrieval
AU - Sato, Nobuyoshi
AU - Uehara, Minoru
AU - Sakai, Yoshifumi
PY - 2004/12/1
Y1 - 2004/12/1
N2 - For most businesses, fresh information retrieval is very important However, it is difficult for conventional search engines based on centralized architecture to retrieve really fresh information, because they take a long time to collect documents via Web robots. In contrast to a centralized architecture, a search engine based on a distributed architecture does not need to collect documents, because each site independently makes an index. As this result, distributed search engines can retrieve really fresh information. However, fast indexing is not enough to easily retrieve fresh information. The value of information is determined by both freshness and relevance. Traditional ranking methods consider either freshness or relevance; so, we proposed FTF-IDF (Fresh Term Frequency multiplied by Inverse Document Frequency) as a scoring method that considers both freshness and relevance. In this paper, we will describe a verification of FTF-IDF on an actual web diary.
AB - For most businesses, fresh information retrieval is very important However, it is difficult for conventional search engines based on centralized architecture to retrieve really fresh information, because they take a long time to collect documents via Web robots. In contrast to a centralized architecture, a search engine based on a distributed architecture does not need to collect documents, because each site independently makes an index. As this result, distributed search engines can retrieve really fresh information. However, fast indexing is not enough to easily retrieve fresh information. The value of information is determined by both freshness and relevance. Traditional ranking methods consider either freshness or relevance; so, we proposed FTF-IDF (Fresh Term Frequency multiplied by Inverse Document Frequency) as a scoring method that considers both freshness and relevance. In this paper, we will describe a verification of FTF-IDF on an actual web diary.
UR - http://www.scopus.com/inward/record.url?scp=21844478312&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=21844478312&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:21844478312
SN - 0780385934
T3 - IEEE International Symposium on Communications and Information Technologies: ISCIT 2004
SP - 210
EP - 215
BT - Proceedings - IEEE International Symposium on Communications and Information Technologies, ISCIT 2004
T2 - IEEE International Symposium on Communications and Information Technologies: Smart Info-Media Systems, ISCIT 2004
Y2 - 26 October 2004 through 29 October 2004
ER -