Artículo de revista
Characterising RDF data sets
Fecha
2018Registro en:
Journal of Information Science, 44 (2): 203-229
10.1177/0165551516677945
Autor
Fernandez, Javier D.
Martínez Prieto, Miguel A.
Fuente Redondo, Pablo de la
Gutiérrez Gallardo, Claudio
Institución
Resumen
The publication of semantic web data, commonly represented in Resource Description Framework (RDF), has experienced outstanding growth over the last few years. Data from all fields of knowledge are shared publicly and interconnected in active initiatives such as Linked Open Data. However, despite the increasing availability of applications managing large-scale RDF information such as RDF stores and reasoning tools, little attention has been given to the structural features emerging in real-world RDF data. Our work addresses this issue by proposing specific metrics to characterise RDF data. We specifically focus on revealing the redundancy of each data set, as well as common structural patterns. We evaluate the proposed metrics on several data sets, which cover a wide range of designs and models. Our findings provide a basis for more efficient RDF data structures, indexes and compressors.