Unranked trees can be represented using their minimal dag (directed acyclic graph). For XML this achieves high compression ratios due to their repetitive mark up. Unranked trees are often represented through first child/next sibling (fcns) encoded binary trees. We study the difference in size ( = number of edges) of minimal dag versus minimal dag of the fcns encoded binary tree. One main finding is that the size of the dag of the binary tree can never be smaller than the square root of the size of the minimal dag, and that there are examples that match this bound. We introduce a new com-bined structure, the hybrid dag, which is guaranteed to be smaller than (or equal in size to) both dags. Interestingly, we find through experiments that las...
Consider an ordered, static tree T on t nodes where each node has a label from alphabet set ~. Tree ...
An ancestry labeling scheme labels the nodes of any tree in such a way that ancestry queries between...
Tree data such as XML trees have recently been getting larger and larger. Parallel and distributed p...
Trees are a ubiquitous data structure in computer science. LISP, for instance, was designed to manip...
Many XML documents are data-centric and do not make use of the inherent document order. Can we provi...
We introduce forest straight-line programs (FSLPs) as a compressed representation of unranked ordere...
We consider the minimal directed acyclyc graph (DAG) lossless compression strategy introduced in Kie...
International audienceBy nature, tree structures frequently present similarities between their sub-p...
Trees are a fundamental structure in computing. They are used in almost every aspect of modeling and...
We analyze lossless tree compression algorithms under information-theoretic and combinatorial aspect...
International audienceA classical compression method for trees is to exploit subtree repeats in the ...
Abstract. Sharing of common subtrees has been reported useful not only for XML compression but also ...
We study the average size of the minimal directed acyclic graph (DAG) with respect to so-called leaf...
There is a common problem of operating on hash values of elements of some database. In this paper th...
this paper, we discuss the requirements for an XDS bulkload component and examine existing algorithm...
Consider an ordered, static tree T on t nodes where each node has a label from alphabet set ~. Tree ...
An ancestry labeling scheme labels the nodes of any tree in such a way that ancestry queries between...
Tree data such as XML trees have recently been getting larger and larger. Parallel and distributed p...
Trees are a ubiquitous data structure in computer science. LISP, for instance, was designed to manip...
Many XML documents are data-centric and do not make use of the inherent document order. Can we provi...
We introduce forest straight-line programs (FSLPs) as a compressed representation of unranked ordere...
We consider the minimal directed acyclyc graph (DAG) lossless compression strategy introduced in Kie...
International audienceBy nature, tree structures frequently present similarities between their sub-p...
Trees are a fundamental structure in computing. They are used in almost every aspect of modeling and...
We analyze lossless tree compression algorithms under information-theoretic and combinatorial aspect...
International audienceA classical compression method for trees is to exploit subtree repeats in the ...
Abstract. Sharing of common subtrees has been reported useful not only for XML compression but also ...
We study the average size of the minimal directed acyclic graph (DAG) with respect to so-called leaf...
There is a common problem of operating on hash values of elements of some database. In this paper th...
this paper, we discuss the requirements for an XDS bulkload component and examine existing algorithm...
Consider an ordered, static tree T on t nodes where each node has a label from alphabet set ~. Tree ...
An ancestry labeling scheme labels the nodes of any tree in such a way that ancestry queries between...
Tree data such as XML trees have recently been getting larger and larger. Parallel and distributed p...