- Abstract:
-
Central to any XML query language is a path language such as XPath which operates on the tree structure of the XML document. We demonstrate in this paper that the tree structure can be effectively compressed and manipulated using techniques derived from symbolic model checking. Specifically, we show first that succinct representations of document tree structures based on sharing subtrees are highly effective. Second, we show that compressed structures can be queried directly and efficiently through a process of manipulating selections of nodes and partial decompression. We study both the theoretical and experimental properties of this technique and provide algorithms for querying our compressed instances using node-selecting path query languages such as XPath. We believe the ability to store and manipulate large portions of the structure of very large XML documents in main memory is crucial to the development of efficient, scalable native XML databases and query engines.
- Links To Paper
- 1st Link
- 2nd Link
- Bibtex format
- @InProceedings{EDI-INF-RR-0636,
- author = {
Peter Buneman
and Martin Grohe
and Christoph Koch
},
- title = {Path Queries on Compressed XML},
- book title = {Very Large Databases (VLDB 2003), Berlin Germany},
- publisher = {Morgan Kaufmann},
- year = 2003,
- pages = {12},
- url = {http://www.lfcs.inf.ed.ac.uk/research/database/publications/vldb03_path.pdf},
- }
|