Partitioning methods for multi-version XML data warehouses

Laura Irina Rusu, Wenny Rahayu and David Taniar

(1)  Department of Computer Science and Computer Engineering, La Trobe University, Bundoora, VIC, 3086, Australia
(2)  Clayton School of Information Technology, Monash University, Clayton, VIC, 3800, Australia

Published online: 20 February 2009

Communicated by Ladjel Bellatreche.
Abstract  Due to the explosive increase in XML documents, it is imperative to manage XML data in an XML data warehouse. XML warehousing poses challenges that are not found in relational data warehouses. In this paper, we first present a framework for building an XML data warehouse schema. To achieve scalability as data volume grows, we propose a number of partitioning techniques for multi-version XML data warehouses, including document-based partitioning, schema-based partitioning, and a cascaded (mixed) partitioning model. Finally, we formulate cost models to evaluate various types of queries on an XML data warehouse.

Keywords  Multi-version XML warehouse - Data partitioning - Data warehousing physical design - Storage for multi-version XML documents - XML data warehousing


Laura Irina Rusu
Email: l.rusu@latrobe.edu.au

Wenny Rahayu
Email: w.rahayu@latrobe.edu.au

David Taniar (Corresponding author)
Email: david.taniar@infotech.monash.edu.au
