”Findability”: The Key to Enterprise Search

 

To assist with the task of retrieving information, search has been primarily a keywordbased endeavor, where searchers have attempted as best they could to retrieve relevant documents accurately by matching up the keywords in a given query with the same words in the documents. Without some means of deriving the correct senses for words in the query and at data indexing time, lexical polysemy and homonymy result in a significant number of misinterpretations and thus blur the quality of retrieval results. We are now seeing more promotion of improvements to better achieve accurate and relevant search results through “smart query modification,” which allows for variations in spelling (through sophisticated pattern search) as well as allowing for use of synonyms of query terms. Most of this development is tailored at achieving best results while optimizing index size, providing a scalable indexing and querying paradigm as well as promoting flexibility through SDKs that allow for customization of search and retrieval.

数据挖掘论坛

But information management paradigms are changing. Unit costs of disk space, RAM and processing units are decreasing. The world where recall was of the utmost importance (because it paid to retrieve more than was necessary to ensure finding all relevant information) worked on the smaller document collections of decades past, but cannot possibly apply to the vast quantities of information that we must now be able to handle. Precision is becoming ever more important in the context where the simple increase in document collection size translates into increases in the number of relevant documents for a given query. The technological approach must be able to account for this and include metrics to quantify achievements in precision enhancement, all the while attempting to minimize sacrificing completeness of search results sets.

资料全文下载

[数据挖掘专家] [数据挖掘研究院] [数据挖掘论坛] [数据挖掘实验室]
上一篇:ETL中的数据清洗设计
下一篇:Sapphire: Large Scale Data Mining and Pattern Recognition
最新评论共有 0 位网友发表了评论 , 查看所有评论
发表评论( 不能超过250字,需审核,请自觉遵守互联网相关政策法规。 )
匿名?
数据挖掘网站导航 数据挖掘论坛导航
  • 数据挖掘工具
  • 数据挖掘论坛
  • DataCruncher - Cognos
  • MineSet - MathSoft
  • Intelligent Miner - GainSmarts
  • Sqlserver - SAS - Clementine
  • CART - Weka - WizSoft
  • NeuroShell - ModelQuest
  • data mining tools - Darwin
  • 数据挖掘交友
  • 数据挖掘博客
  • 数据挖掘工具
  • 数据挖掘资源
  • 数据挖掘技术算法
  • 数据挖掘相关期刊、会议
  • 研究院联盟合作专区
  • 数据挖掘基础与相关技术
  • 数据挖掘厂商与就业
  • 数据挖掘研究者乐园
  • 知名厂商数据挖掘工具资料
  • 国内数据挖掘实验室
  • Foreign Data Mining Lab
  • 热点关注
  • :::数据挖掘未来研究方向:::
  • :::数据挖掘常用技术:::
  • :::数据挖掘研究内容和本质:::
  • :::数据挖掘的功能:::
  • 数据挖掘测试数据集大全
  • :::数据挖掘的研究历史和现状:::
  • Making the Most of Operational Analytics
  • 近期与数据挖掘相关的一些重要会议的截止日
  • :::数据挖掘热点:::
  • 韩家炜的论文下载
  • 论坛最新话题
  • Foundations of Statistical Natural Langu
  • Game Theory meet Data Mining: A Recent P
  • System Building: How does it help or hin
  • 数据挖掘与Clementine培训
  • 新手报到
  • 求 SASEM 客户流失预测分析
  • 数据挖掘工程师/搜索研究院—北京——无线
  • 数据挖掘入门介绍(如何着手数据挖掘)
  • Information Overload Survey Results
  • The INEX 2005 Workshop on Element Retrie
  • 相关资讯
  • 从影响圈到关注圈,从数据挖掘到价值挖掘
  • SAS Updates BI Products
  • Call for Papers & Invited Session Propos
  • IEEE Intelligent Systems Special Issue
  • 近期与数据挖掘相关的一些重要会议的截止日
  • Data mining program near rock bottom
  • IDC Names Oracle as Leader in Data Wareh
  • Characterizing the Function Space for Ba
  • German scientists develop software to re
  • deviantART.com Web Application Software
  • 数据挖掘实验室资料
  • 数据挖掘博客地址
  • 数据挖掘实验室网站地址
  • Prepare for Medicare audits by using dat
  • 注册成为SAS用户与爱好者俱乐部会员
  • 水南梅
  • 明日烟
  • 新人报道
  • 下载
  • 厦门服务器托管,450元/月—0592-5177319 高
  • 买空间送域名--0592-5177319 高静