The Data Warehousing Insititute (TDWI) conducted a survey in late 2006 as well as interviews with data management practitioners, consultants, and software vendors via an Internet-based survey. 370 respondents are the data sample for this report entitled, "BI Search and Text Analytics: New Additions to the BI Technology Stack."
数据挖掘交友
The technology stack for business intelligence (BI) and data warehousing (DW) is currently expanding to accommodate two relatively new additions, namely BI search and text analytics. Although each stands ably on its own, the two are related in that they tend to operate on unstructured data. In fact, the growing use of BI search and text analytics is part of a larger trend toward leveraging unstructured data in BI and DW, fields that previously have relied almost exclusively on structured data. Another way to put it is that unstructured data is playing a larger role in BI and DW over time, and that role is today supported largely by tools and techniques for BI search and text analytics. 数据挖掘论坛
New Data Warehouse Sources from the Data Continuum
The data continuum divides into three broad segments for structured, semistructured and unstructured data. In turn, each of these segments is populated by various types of systems, files and documents that can serve as data sources for a data warehouse or other BI solution. These range from flat files, to databases, to XML documents, to email and so on.
数据挖掘实验室
To understand which of these are feeding data into data warehouses today - and in the near future - TDWI asked, "Which types of data and source systems feed your data warehouse?" Survey respondents selected those in use today as well as those they anticipate using in three years in Figure 1; it also calculates the expected rate of change (or delta). Judging by users' responses to this question, the kinds of data sources for the average data warehouse will change dramatically in the next few years: 数据挖掘工具
- Unstructured data sources will soon be more common for data warehouse feeds.