Data&Musée

Explorer les données de l'héritage culturel français

Creative Commons By-SA licenses and URIs

21/07/2022 by Moissinac

My attention was recently drawn to the DBPedia license, which would imply that any dataset that relies on DBPedia should be made public with the same license. A stakeholder in a project told me that they had given up using DBPedia for this reason. According to the article https://fr.wikipedia.org/wiki/DBpedia, two licenses are applicable to DBPedia data; Creative Commons BY-SA 3.0 and GFDL.

I have found that other semantic datasets use the Creative Commons BY-SA 3.0 license. For example, the vocabularies proposed by the Ministry of Culture (http://data.culture.fr/thesaurus/static/en-savoir-plus) are released with this license.

This license stipulates in particular:

Share Under the Same Conditions - In the case that you remix, transform, or create from the material composing the original Work, you must distribute the modified Work under the same conditions, that is, with the same license with which the original Work was distributed.

It seems to me that creating many links to DBPedia uris in my own datasets implies that I am creating "from" DBPedia component material. The license seems to indicate that my data is a "mix" of DBPedia data, so it will have to be made public and accessible under the same license.

As an example, I had to deal with confidential data from an organization. This data had a geographic dimension regarding the origin of transactions. For this geographic data, we used URIs from DBpedia. The DBpedia license could imply that our data must be made public and reusable. Of course, in this simple case, a solution exists: create our own URIs for the geographic entities that are useful to us and isolate in a separate graph the sameAs links with the corresponding DBpedia URIs; only this last graph would be concerned by the license.

In conclusion, it seems necessary to be very careful when reusing sets of URIs proposed by the Linked Open Data (LOD).

Translated with www.DeepL.com/Translator (free version)

Author: Moissinac

Maitre de conférence à Télécom Paris, Département Image, Données, Signal - Groupe Multimédia Jean-Claude Moissinac a mené des recherches sur les techniques avancées pour la production, le transport, la représentation et l’utilisation des documents multimédia. Ces travaux d'abord ont évolué vers la représentation sémantique de données liées au multimédia (process de traitement de médias, description d'adaptations de média, description formelle d'interactions utilisateurs). Aujourd'hui, les travaux portent sur la constitution de graphes de connaissances. Principaux axes de recherche actuel : représentations sémantiques de connaissances, constitution de graphes de connaissances, techniques d'apprentissage automatique sur ces graphes

Comments are closed.