Author(s): Wouter Beek, Joe Raad, Jan Wielemaker, Frank van Harmelen
Abstract: The owl:sameAs predicate is an essential ingredient of the Semantic Web architecture. It allows parties to independently mint names, while at the same time ensuring that these parties are able to understand each other’s data. An online resource that collects all owl:sameAs statements on the Linked Open Data Cloud has therefore both practical impact (it helps data users and providers to find different names for the same entity) as well as analytical value (it reveals important aspects of the connectivity of the LOD Cloud).
This paper presents the largest dataset of identity statements that has been gathered from the LOD Cloud to date. We describe an efficient representation and algorithm to calculate and store the full equivalence closure over this dataset.
Finally, we present analytics over these datasets, gaining insights in the use of owl:sameAs in the LOD cloud.
All datasets are published online, as well as a web service from which the data and its equivalence closure can be queried.
Keywords: linked open data; identity; owl:sameAs; reasoning