RSS
热门关键字:  数据挖掘  人工智能  数据仓库  搜索引擎  数据挖掘导论
当前位置 :| 首页>编程技术>xml技术>

What's Still Wrong with SQL Native XML Integration Solution

来源: 作者:unkonwn 时间:2006-12-21 点击:

The SQL native XML integration market has not yet taken off. On the surface, it is easy to see why. Every vendor has a different proprietary solution, all of these solutions are incompatible with each other, and none of these solutions has satisfactorily solved the relational and XML data integration problem. This data integration problem is at the heart of this industry acceptance problem. Let's examine the separate data integration areas that comprise this basic data integration industry problem and then look at a possible solution that is already in the ANSI SQL processing box. 数据挖掘研究院

XML Database Processing Industry Basic Problems

A SQL processor with native XML integration is an XML database processor. One of an XML database processors' primary requirements is to output fully structured XML. XML is hierarchical, so the XML input and output needs to be hierarchically valid to be useful. This also means it needs to be hierarchically processed, preserved and structured correctly. XML database processor designers seem to have ignored this requirement. Their products do not impose or require hierarchical processing. Their XML product's data may not be hierarchically correct, which also means their result data is incorrect. So why do current XML database processor products allow nonhierarchical operations that will invalidate the XML hierarchical result? 数据挖掘研究院

This nonhierarchical problem stems from the fact that XML was designed for markup and not for database data. These are very different uses. XML's use for database data was realized after the XML specification had already been finalized and put into practice. For this reason, XML for database use was not thought out very well and evolved in a haphazard fashion. This occurred because the XML spec does not say one word on how the XML database data is to be hierarchically processed because it was not intended for that purpose. This is another significant problem. Even when XML data is processed hierarchically, there is no XML specification for this. So every vendor has come up with their own XML processing procedures that may not be hierarchically valid. 数据挖掘研究院

A third problem is the collective relational mindset in the XML industry that continues using a relational processing model for XML processing. As evidence of this relational mindset, even XQuery designed from scratch to handle XML processing naturally supports the relational inner join operation by default. The inner join is not a hierarchical operation and will invalidate hierarchical data it is applied to. This does not make good sense for an XML product that is designed for XML processing and XML hierarchically structured output.

数据挖掘研究院

SQL Native XML Integration Externally Perceived Problems

The above-mentioned XML database processing industry has basic problems:

数据挖掘研究院

  1. XML database processing is not fully hierarchical,
  2. It has no hierarchical processing specification that must be followed, and
  3. It is overly influenced by the relational mindset of the product designers.

These come together to produce many SQL native XML integration externally perceived processing problems. These are:

  1. Multiple incompatible solutions are on the market;
  2. Solutions that are proprietary and/or nonstandard;
  3. Having to use SQL native XML-centric and procedural syntax;
  4. Solutions that do not solve XML and relational data integration;
  5. Nonprincipled hierarchical processing, producing invalid hierarchical results; and
  6. Hierarchical processing basically limited to single leg (linear) processing.

An ANSI SQL Native XML Integration Solution

Interestingly, the wrong solution to use, relational SQL for XML processing, turns into the perfect solution when it is realized that ANSI SQL can perform full hierarchical processing automatically when properly instructed in SQL. The ANSI SQL-92 standard introduced the Left Outer Join operation that precisely models hierarchical structures. Using only this Left Outer Join operation to model full hierarchical structures produces the correct hierarchical results based on natural hierarchical principles followed precisely by the Left Outer Join hierarchical semantics. Since the Left Outer Join can model multi-leg structures, advanced nonlinear hierarchical processing is also inherently supported. This powerful hierarchical processing can be used to automatically solve all the SQL native XML integration problems identified above.

This is not an immediate SQL native XML integration solution. While every ANSI SQL processor can perform full hierarchical processing when instructed, the internal relational engine does not know that it is processing the data hierarchically. This means it cannot automatically take the final step to fully utilize this powerful hierarchical processing for XML integration. But the capability is available and is currently being seamlessly extended to XML. For additional information and proof of this inherent ANSI SQL hierarchical processing capability and its applicability to solve SQL native XML integration, interested readers can access the Web site found in my bio below and check out the ANSI SQL Hierarchical Processing tutorial located on the "Nonlinear Process" page. This material remains ANSI SQL standard while showing how valid hierarchical query results can be produced automatically and transparently for XML output.

Producing correct XML hierarchical results automatically from generic ANSI SQL query processing leaves very little room for different proprietary and incompatible solutions. There can be only one correct answer for each ANSI SQL query.

最新评论共有 0 位网友发表了评论
发表评论
评论内容:不能超过250字,需审核,请自觉遵守互联网相关政策法规。
匿名?