Paper 145 (Research track)

HDTQ: Managing RDF Datasets in Compressed Space

Author(s): Javier D. Fernández, Miguel A. Martinez-Prieto, Axel Polleres, Julian Reindorf

Abstract: HDT (Header-Dictionary-Triples) is a well-known compressed representation of RDF data that supports retrieval features without prior decompression. Yet, RDF datasets often contain additional graph information, such as the origin, version or validity time of a triple. Traditional HDT is not capable of handling this additional parameter(s). This work introduces HDTQ (HDT Quads), an extension of HDT, which is able to represent quadruples (or quads) while still being highly compact and \queryable{}. Two approaches of this extension, Annotated Triples and Annotated Graphs, are introduced and their performance is compared to the leading open-source RDF stores on the market, Results show that HDTQ achieves the best compression rates and is a competitive alternative to well-established systems.

Keywords: RDF compression; RDF indexing; Linked Data management

Leave a Reply

Your email address will not be published. Required fields are marked *