RSS
热门关键字:  数据挖掘  人工智能  数据仓库  搜索引擎  数据挖掘导论

A taxonomy of web search

来源: 作者:unkonwn 时间:2004-12-05 点击:

A central tenet of classical information retrieval is that the user is driven by an information need. Schneiderman, Byrd, and Croft [SBC97] define information need as "the perceived need for information that leads to someone using an information retrieval system in the first place." But the intent behind a web search is often not informational -- it might be navigational (show me the url of the site I want to reach) or transactional (show me sites where I can perform a certain transaction, e.g., shop, download a file, or find a map). In fact as we show later, informational queries constitute less than 50% of web searches.

数据挖掘实验室

The main aim of this paper is to point out this difference and introduce and analyze a taxonomy of web searches. Secondly, we show how search engines evolved to deal with these web-specific needs. The remainder of the paper is organized as follows: in section 2 we discuss the classic model for information retrieval; section 3 introduces a taxonomy of web searches; section 4 presents some statistics extracted from AltaVista surveys and logs regarding the prevalence of various types of searches; section 5 analyzes the evolution of search engines in light of this taxonomy; section 6 discusses some related work; finally, section 7 draws certain conclusions and points to some further directions for research.

2. The classic model for information retrieval We start from the basic model used in many standard information retrieval references textbooks, for instance, van Rijsbergen [R79]. See also [BK94] and references therein for a detailed discussion. Essentially, a user, driven by an information need, constructs a query in some query language. The query is submitted to a system that selects from a collection of documents (corpus), those documents that match the query as indicated by certain matching rules. A query refinement process might be used to create new queries and/or to refine the results. (Figure 1) 数据挖掘研究院

资料全文下载

数据挖掘研究院

最新评论共有 0 位网友发表了评论
发表评论
评论内容:不能超过250字,需审核,请自觉遵守互联网相关政策法规。
匿名?