Informatics Report Series


Report   

EDI-INF-RR-0595


Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home
Title:Vectorizing and Querying Large XML Repositories
Authors: Peter Buneman ; Byron Choi ; Wenfei Fan ; Robert Hutchison ; Robert Mann ; Stratis Viglas
Date:Apr 2005
Publication Title:Proceedings of ICDE 2005 (International Conference on Data Engineering)
Publisher:IEEE Computer Society
Publication Type:Conference Paper Publication Status:Published
Page Nos:261-272
Abstract:
Vertical partitioning is a well-known technique for optimizing query performance in relational databases. An extreme form of this technique, which we call vectorization, is to store each column separately. We use a generalization of vectorization as the basis for a native XML store. The idea is to decompose an XML document into a set of vectors that contain the data values and a compressed skeleton that describes the structure. In order to query this representation and produce results in the same vectorized format, we consider a practical fragment of XQuery and introduce the notion of query graphs and a novel graph reduction algorithm that allows us to leverage relational optimization techniques as well as to reduce the unnecessary loading of data vectors and decompression of skeletons. A preliminary experimental study based on some scientific and synthetic XML data repositories in the order of gigabytes supports the claim that these techniques are scalable and have the potential to provide performance comparable with established relational database technology.
Links To Paper
No links available
Bibtex format
@InProceedings{EDI-INF-RR-0595,
author = { Peter Buneman and Byron Choi and Wenfei Fan and Robert Hutchison and Robert Mann and Stratis Viglas },
title = {Vectorizing and Querying Large XML Repositories},
book title = {Proceedings of ICDE 2005 (International Conference on Data Engineering)},
publisher = {IEEE Computer Society},
year = 2005,
month = {Apr},
pages = {261-272},
}


Home : Publications : Report 

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh