Lecture Notes in Computer Science, 2001, Volume 2097/2001, 43-56, DOI: 10.1007/3-540-45754-2_4

An Effective Data Placement Strategy for XML Documents

Yuanling Zhu and Kevin Lü

View Related Documents

Abstract

As XML is increasingly being used in Web applications, new technologies need to be investigated for processing XML documents with high performance. Parallelism is a promising solution for structured document processing and data placement is a major factor for system performance improvement in parallel processing. This paper describes an effective XML document data placement strategy. The new strategy is based on a multilevel graph partitioning algorithm with the consideration of the unique features of XML documents and query distributions. A new algorithm, which is based on XML query schemas to derive the weighted graph from the labelled directed graph presentation of XML documents, is also proposed. Performance analysis on the algorithm presented in the paper shows that the new data placement strategy exhibits low workload skew and a high degree of parallelism.

Keywords  Data Placement - XML Documents - Graph Partitioning - Parallel Data Processing

Fulltext Preview

Image of the first page of the fulltext document