Maximising (Re)Usability
of Library metadata using
Linked Data
Asunción Gómez-Pérez
Ontology Engineering Group
Universidad Politécnica de Madrid
asun@fi.upm.es
@asungomezperez
Acknowledgments:
Daniel Vila Suero
(Library LD, and main developer of datos.bne.es)
Victor Rodríguez Doncel (licensed linked data)
Jorge Gracia (linguistic linked data)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
License
• This work is licensed under the Creative Commons
Attribution – Non Commercial – Share Alike License
• You are free:
- to Share — to copy, distribute and transmit the work
- to Remix — to adapt the work
• Under the following conditions
- Attribution — You must attribute the work by inserting
• “[source http://www.oeg-upm.net/]” at the footer of each
reused slide
• a credits slide stating: “Maximising (Re)Usability of
Library metadata using Linked Data” by A. Gómez-Pérez
- Non-commercial
- Share-Alike
3
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Table of Content
• Motivation
• How to deal with
- Multilingualism and Language
- Provenance
- License
• Linked Data Process
• Uses of library linked metadata
4
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
A world of digital data
Providers
Languages
Licenses
Domains Heterogeneous
Formats
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Lack of interoperability: Language, Syntax, Semantic &Technical
• Ecosystem of
- Open Resources in silos
- Complementary domains
- Heterogeneous formats
- Different languages
- Repositories with different
metadata
- Many APIs and services
for querying
Complementary
but
not connected
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data allows uniform access
1. Agree on vocabularies for
describing
• metadata
• domain data
2. Unified and standardized language
for describing resources ( RDF(S))
3. Unified and standardized query
language (SPARQL)
4. Standardized non-proprietary APIs
5. Links to other resources
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Questions of the end-user …
- Who generated the data set
- When the dataset was created?
- How the dataset was built?
- Is that dataset the last version?
- Is license information clearly stated?
- Is there any condition that prevents my organization to use the data set?
- In which formats the data are delivered?
- Are data monolingual or multilingual?
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Metadata matters
9
Provenance
Licenses
Language
Privacy
GeoLocation
Time
Spatial
Provides vocabularies for representing these dimensions
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Multilingualism and
Language in Linked Data
www.lider-project.eu
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Libraries store multilingual data
11
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
The Web of Data links
12
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linguistic Linked Data Cloud
Linguistic Linked Data
helps to translate the same term
(book title, author name, place, …)
in different languages and
to deal with acronyms
Linguistic Linked
Data Cloud
Linguistic Linked Data Cloud
 Subset of LOD
 Linguistic domain
 Many type of resources
 Interconnected with other Language Resources
 Enables the lexicalization of data on the web, not
necessarily data in the LD format
 Helps with Multilingual Search
 Enables a new generation of LD-aware NLP and MT
Services
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
The core of the Linguistic Linked Open Data cloud!
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
What is BabelNet?
• A merger of resources of different kinds:
From Roberto Navigli (Sapienza University)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November 26/11/201
Why do we need BabelNet?
• Multilinguality: the same concept is expressed in tens of
languages
• Coverage: 272 languages and 14 million entries!
- 6M concepts and 7.7M named entities
- 119M word senses
- 378M semantic relations (27 relations per concept on avg.)
- 11M images associated with concepts
- 41M textual definitions
- 2M concepts with domains associated
From Roberto Navigli (Sapienza University)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November 18
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linguistic Linked Licensed Data
3LD
Linguistic Linked Licensed Data
Language resources
such as:
- Lexica
- Corpora
- Dictionaries ..
NIF
NLP Interchange Format
Using RDF and
standard data
models
(vocabularies):
- Lexica
- Corpora
ODRL
Open Digital Rights Language
Published along with
a machine-readable
license.
www.lider-project.eu
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
How to represent in Linked Data ...
 Traditional annotation properties to represent
language
 Richer models to represent linguistic information
for more demanding applications
bne:XX1718747
rdfs:label ”Θερβάντες, Μιγκέλ ντε"@gr.
”Miguel de Cervantes"@es.
“Cervantes di Saavedra, Michele"@it.
# LEMON
Bne:OP5001 lemon:isReferenceOf [lemon:isSenseOf :author_of].
:author_of a lemon:LexicalEntry;
lemon:form [lemon:writtenRep “es autor de”@es;
isocat:grammaticalGender isocat:masculine];
lemon:form [lemon:writtenRep “es autora de”@es;
isocat:grammaticalGender isocat:feminine].
isocat:grammaticalGender rdfs:subPropertyOf lemon:property.
Association of the vocabulary to an external lexicon model: Is author of (Femenine and
Masculine)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
lemon:LexicalEntry
lemon:LexicalEntry
lemon:LexicalSense
lemon:LexicalSense
lemon:Lexicon
lexiconRU
lemon:Lexicon
lexiconES
tr:Translation
“Миге́ль де
Серва́нтес
Сааве́дра”@ru
“Miguel de
Cervantes”@es
lemon:entry
lemon:entry
lemon:isSenseOf
lemon:isSenseOf tr:translationTarget
tr:translationSource
tr:trans
lemon:lexicalForm
lemon:lexicalForm
lemon:Form
lemon:Form
lemon:writtenRep
tr:TranslationSet
translationSetRU-ES
lemon:writtenRep
How to represent translations
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
How to add the language to the dataset description
# VoiD description
:bne a void:Dataset;
dcterms:language <http://id.loc.gov/vocabulary/iso639-1/es> .
# DCAT description
:bne a dcat:Dataset;
dcterms:language <http://id.loc.gov/vocabulary/iso639-1/es>
VOID: W3C Vocabulary of Interlinked Datasets
DCAT: W3C Data Catalog vocabulary
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data and Linguistic Linked Data
1. Agree on vocabularies for
describing
• Domain vocabularies
• LR metadata and content (Lemon-
Ontolex, NIF, …)
2. Unified and standardized language
for describing resources ( RDF(S))
3. Unified and standardized query
language (SPARQL)
4. Standardized non-proprietary APIs
5. Links to other resources
Linguistic LD
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Multilingual search: retrieve a book in german
• Servantes Saavedra, Migel
• Therbantes, Minkel nte
• Cervantes di Saavedra, Michele
• ‫دى‬ ‫ميجيل‬ ،‫ساڤدرا‬ ‫سرفنتس‬
• Zerbantes eta Saabedra, Mikel
• Θερβάντες, Μιγκέλ ντε
• Cervantes
• Sirfantis Saafedrā, Mīgīl dī
• Сервантес Сааведра, Мигель де
• Sewantisi Saweidela, Migai'er de
• 塞万提斯·萨维德拉, 米盖尔 德
• Cervantes, Miguel de
24
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Expand the information based on language
Asuncion
Multilingual titles
Links
(not available in
BNF and BNE)
Dbpedia
Information about
the author in
English
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Expand the information based on language
“La cité antique”@fr
Digital resource
available
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Multilingual and
complementary
information
Multilingual Content Aggregation
“La cité antique”@es
Work by F. de Coulanges
Subject: “Ciudades antiguas”@es
Only available non-digitized
manifestations in Spanish (title “La
ciudad antigua”).
“La cité antique”
Date: 1864
Digitization available:
http://gallica.bnf.fr/ark:/12148/bpt6k6105986m
“La cité antique”
Translations: “The Ancient City”@en, ”
古代城邦”@zh
Links: Dbpedia, Wikidata
Full text
“La cité antique”
More links
Link to full text:
http://remacle.org/bloodwolf/livre
s/Fustel/intro.htm
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linguistic Linked Data helps with alternative names and titles
29
:un a ontolex:LexicalEntry, lexinfo:Acronym ;
ontolex:canonicalForm :form_un ;
rdfs:label ”UN"@en .
rdfs:label ”ONU"@es .
:form_un a ontolex:Form ;
ontolex:writtenRep ”UN"@en .
ontolex:writtenRep ”ONU"@es .
:united_nations a ontolex:MultiwordExpression ;
ontolex:canonicalForm :form_united_nations ;
lexinfo:abbreviationFor :un;
rdfs:label ”United Nations"@en .
:form_united_nations a ontolex:Form ;
ontolex:writtenRep ”United Nations"@en .
ontolex:writtenRep ”Naciones Unidas"@es .
Organización de Naciones Unidas
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data have
Provenance
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Modeling provenance
RDF
Store
PROVENANCE
Model (RDF(S))
Process centric provenance
• PROV-O @W3C
Filev1.
txt
Revision
Process
wasGeneratedBy
File.txt
used
Metadata provenance
• DC, PROV @ W3C
Resource provenance
• DC, PROV-O, Premis, SWANL
• EDM (including agregation)
creator
rights
creationDate
John
12-2-1900
GPL
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Example of provenance with datos.bne.es
Macr21
Dataset
(prov:Entity)
Conversion Process
(prov:Avtivity)
TTL file
(prov:Entity)
Process centric provenance
prov:used
prov:wasGeneratedBy
dc:license
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November33M3iLD Hearing - Luxembourg, 19 June 2012
Macr21
Dataset
(prov:Entity)
Conversion Process
(prov:Avtivity)
TTL file
(prov:Entity)
BNE (prov:Agent)
“2010-07-14T01:01:01Z”^^xsd:dateTime
CC0
CC0
Resource provenanceProcess centric provenance
“2011-07-14T01:01:01Z”^^xsd:dateTime
prov:used
prov:wasGeneratedBy
“2011-07-14T02:02:02Z”^^xsd:dateTime
prov:startedAtTime
prov:endedAtTime
BNE
(prov:Agent)
prov:wasAssociatedWith
prov:actedOnBehalfOf
prov:wasAttributedTo,
dc:creator
prov:generatedAtTime, dc:created
dc:license
prov:wasAttributedTo,
dc:creator
Marimba
(prov:Agent)
dc:license
prov:generatedAtTime,
dc:created
Example of provenance with datos.bne.es
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Example of provenance with datos.bne.es
Macr21
Dataset
(prov:Entity)
Conversion Process
(prov:Avtivity)
TTL file
(prov:Entity)
BNE (prov:Agent)
“2010-07-14T01:01:01Z”^^xsd:dateTime
CC0
CC0
Resource provenance Metadata provenanceProcess centric provenance
BNE Digital
library
department
(prov:Agent)
GPL
“2011-07-14T01:01:01Z”^^xsd:dateTime
prov:used
prov:wasGeneratedBy
“2011-07-14T02:02:02Z”^^xsd:dateTime
prov:startedAtTime
prov:endedAtTime
BNE
(prov:Agent)
prov:wasAssociatedWith
prov:actedOnBehalfOf
prov:wasAttributedTo,
dc:creator
prov:generatedAtTime, dc:created
dc:license
prov:wasAttributedTo,
dc:creator
Marimba
(prov:Agent)
dc:license
Metadata
provenance file
(prov:Bundle)
prov:generatedAtTime,
dc:created
prov:wasAttributedTo,
dc:creator
dc:license
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data have
Licenses
Research funded by the project
4V: Volumen, Velocidad, Variedad y Validez en la gestión innovadora de datos (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Create, consume, aggregate,
derive and publish Linked Data
in a lawful environment
0
Always license your data
…
Data shops Governments Individuals
36
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
How Open is the Linked Open Data Cloud?
37
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linguistic Linked Licensed Data
How do we represent license information?
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Licensed Data
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
(Linked) Licensed Data in practice
40
Published
Open License
(Linked) Open Data (Linked) Closed Data
Published
No Open License
(Linked) Private Data
Not Published
Available Data without explicit license
Published
Without License
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
RDF Licensing support
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
«An action is (permitted /prohibited / obliged)
to be acted by the party over the asset,
provided that the constraints hold»
Asset:
Statement (rdf:Statement)
Dataset (void:Dataset)
Ontology (owl:Ontology)
Mapping (void:Linkset)
LDP Container
(ldp:Resource)
Action:
Derive (cc:DerivativeWorks)
Translate (odrl:translate)
Distribute (cc:Distribution)
Reproduce (cc:Reproduce)
Print (odrl:print)
Anonymize (odrl:anonymize)
Index (odrl:index)
… plus ~30 others in
ODRL/CC…
Party:
One individual (ej: mailto:timbl@w3.org)
One organization: (http://www.oeg-upm.net)
One key owner using Web of Trust: (using
http://xmlns.com/wot/0.1/hasKey)
Constraint:
Acknowledgement (cc:Attribution)
A country, city… (odrl:spatial)
A time frame (odrl:timeInterval,
odrl:dateTime…)
Data Model: ODRL (Open Digital Rights Language)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
RDFLicense – Dataset of licenses in RDF
― Content negotiation: machine readable version of common licenses
― Based on ODRL (W3C spec)
― 148 licenses (as of June 2015): Creative Commons, ODC, GNU…
― Permanent URIs. Example: http://purl.org/NET/rdflicense/gpl2.0.ttl
― Browse them here: http://rdflicense.appspot.com/
― Contribute here: https://github.com/oeg-upm/rdflicense
― Catalogued here: http://datahub.io/dataset/rdflicense
― Read more here: A Dataset of RDF Licenses, V. Rodriguez-Doncel, S. Villata, A. Gomez-
Perez, in Proc. of the 27th Int. Conf. on Legal Knowledge and Information System (JURIX), R.
Hoekstra (Ed.), ISBN 978-1-61499-467-1, pp. 187-189, IOS Press, 2014
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
<http://purl.org/NET/rdflicense/cc-by-sa3.0.ttl>
a odrl:Policy ;
rdfs:label "Creative Commons CC-BY-SA" ;
rdfs:seeAlso <http://creativecommons.org/licenses/by-sa/3.0/rdf> ;
cc:legalcode <http://creativecommons.org/licenses/by-sa/3.0/legalcode> ;
dct:hasVersion "3.0" ;
dct:language <http://www.lexvo.org/page/iso639-3/eng> ;
dct:publisher "Creative Commons" ;
odrl:permission
[
odrl:action cc:Distribution , cc:DerivativeWorks , cc:Reproduction ;
odrl:duty
[
odrl:action cc:Attribution , cc:Notice , cc:ShareAlike
]
] .
Sample license in ODRL: Creative Commons CC-BY-SA
No Constraints (spatial, temporal,
are not found in Creative Commons
licenses), they are universal
A generic license (like Creative Commons’)
has no party, as the recipient is anybody
accessing the licensed work
URI
:myDataset dct:license <http://purl.org/NET/rdflicense/cc-by-sa3.0>
How do I use in my data set?
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Example of a license offering Paid Linked Data
@prefix gr: <http://purl.org/goodrelations/> .
@prefix dcat: <http://www.w3.org/ns/dcat#> .
<http://samplepolicy/1234>
a odrl:Offer ;
rdfs:label "License Offering Paid Linked Data" ;
odrl:permission [
odrl:target <http://example.org/dataset/ds01> ;
odrl:action odrl:reproduce ;
odrl:duty [
rdfs:label "Pay" ;
gr:UnitOfMeasurement dcat:Dataset ;
gr:amountOfThisGood "1" ;
odrl:action odrl:pay ;
odrl:target "15,00 EUR“
] ;
odrl:constraint
[
odrl:operator odrl:lt ;
odrl:dateTime "2015-12-31"^^xsd:date ]
] ;
]
The reproduction of a dataset is limited until the end of this year after paying 15€.
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Example of a license for classes/concepts
• Old book offered for free, new book to be sold for 5 EUR
46
@prefix cc: <http://creativecommons.org/ns#> .
@prefix odrl: <http://www.w3.org/ns/odrl/2/> .
:example a odrl:Policy ;
odrl:permission [
odrl:action odrl:reproduce ;
odrl:target :oldBook
] ;
odrl:permission [
odrl:action odrl:reproduce ;
odrl:target :newBook ;
odrl:duty [
odrl:action odrl:pay ;
odrl:target “5,00 EUR"
]
] .
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Using Policies to govern conditional access to Linked Data
Example of access to Linked Data for a price (15EUR for the dataset or 0.01EUR for a triple
thereof)
@prefix gr: <http://purl.org/goodrelations/> .
@prefix dcat: <http://www.w3.org/ns/dcat#> .
<http://salonica.dia.fi.upm.es/ldr/policy/cdaddba4-fc2e-4ee0-a784-e62f1db259bf>
a odrl:Set ;
rdfs:label "License Offering Paid Linked Data" ;
odrl:permission [ a odrl:Permission ;
odrl:target <http://example.org/dataset/ds01> ;
odrl:action odrl:reproduce ;
odrl:duty [ a odrl:Duty ;
rdfs:label "Pay" ;
gr:UnitOfMeasurement dcat:Dataset ;
gr:amountOfThisGood "1" ;
odrl:action odrl:pay ;
odrl:target "15,00 EUR"
]
] , [ a odrl:Permission ;
odrl:action odrl:reproduce ;
odrl:target <http://example.org/dataset/ds01> ;
odrl:duty [ a odrl:Duty ;
rdfs:label "Pay" ;
gr:UnitOfMeasurement rdf:Statement ;
gr:amountOfThisGood "1" ;
odrl:action odrl:pay ;
odrl:target "0,01 EUR”
]
] ..
4V (TIN2013-46238-C4-2-R)
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data, Linguistic Linked Data and License
1. Agree on vocabularies for
describing
• Domain data
• Language related information (Lemon)
• LR license metadata (ODRL)
2. Unified and standardized language
for describing resources ( RDF(S))
3. Unified and standardized query
language (SPARQL)
4. Standardized non-proprietary APIs
5. Links to other resources
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linked Data Process
Examples from datos.bne.es
datos.bne.es and MARiMba: an insight into library linked data D. Vila-Suero, A. Gómez-Pérez
Library Hi-tech (ISSN: 0737-8831); Emerald Group Publishing Limited . Vol.: 31(4).Pages: 575-601 Noviembre 2013
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
datos.bne.es: the team
Cataloguing services
Digital Library and
automatization
Focus group
R&D in Linked Data
Ontology engineering
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Methodology
Specification
Modelling
(Ontologies)
RDF
Generation
Publication
Exploitation
Data Linking
Data
Curation
Many technologies
involved
Villazón-Terrazas, B.; Vilches. L.; Corcho, O.; Gómez-Pérez, A.
Methodological Guidelines for Publishing Government Linked Data. In
D. Wood, ed. Linking Government Data. Springer. (pp, 27-49). 2011
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Specification
Goal
Linked Data generation of the Spanish National
Library Metadata
• Records in the MARC 21 format
• 4.5 million bibliographical records
• 4.5 million authority records
• Version: September, 2015
52
Specification
Modelling
RDF Generation
Publication
Links Generation
Exploitation
Persons
1.5 M
Expressions
1 M
Subjects
0.5 M
Works
1.5 M
Editions
4.5 M
Digital items
150 m
Maps
Modern
Ancient
Drawings
Musical scores
Written music
Recorded music
Different types of records in MARC21
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Specification
• Domain experts from BNE (catalogers) part of the
mapping process.
• Multilinguality, collaboration with IFLA
Sources
BNE
VIAF
LoC
BNF
…
BNE
Mappings
RDF
Subjects
Authorities
Bibliographic
Specification
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Modeling in datos.bne.es
Two phases:
1. Re-using URIs: from standard vocabs, mainly IFLA FRBR,
ISBD and FRAD
2. Integrating ontology: BNE ontology that re-uses and links to
external vocabs.
Motivation for the creation of BNE ontology:
- To provide stability under the datos.bne.es domain
- Documented in a central document
- Provide new properties and relationships (e.g., spanish legal
deposit)
- Control over descriptions and labels (e.g., labels for
visualization (bne:label))
- Multilingualism: BNE ontology is described in English and
Spanish
- Content-negotiation enables using programming APIs (Jena,
Sesame, etc.) and Ontology editors (Protégé, TopBraid,
etc.)
54
Specification
Modelling
RDF Generation
Publication
Links Generation
Exploitation
Available and documented at http://datos.bne.es/def/
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
IFLA Vocabulary-based Ontology - Reusing URIs
55
Specification
Modelling
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Datos.bne.es Ontology : - Second phase
56
bne:PERSON
bne:CORPORATE BODY
bne:EXPRESSIONbne:WORK
bne:CONCEPT
bne:MANIFESTATION
Specification
Modelling
• Available and documented at http://datos.bne.es/def/
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Exploiting alternative names and titles
Person
Work
Org
Authority Name
Cervantes Saavedra
Variant names
Servantes
Сервантес Сааведра
Authority title
Rome and Juliet
Variant titles
The tragedy of Romeo and Juliet
Romeo y Julieta
Romeu i Julieta
Authority Name
Naciones Unidas
Variant names
United Nations
ONU (Spanish acronym)
UN (English acronym)
• Indexing alternatives (acronyms,
translations,etc.) increases recall of search
resultsSpecification
Modelling
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Who will be the mapping generator?
Specification
Modelling
RDF
Generation
Publication
Exploitation
Links
Generation
• Librarians built mappings
bne:PERSON
bne:CORPORATE BODY
bne:EXPRESSIONbne:WORK
bne:CONCEPT
bne:MANIFESTATION
001 XX1721208
005 200012181124
008 901120nn aijnnaabn n aaa
016 $a BNE19900178994
040 $a SpMaBN $b spa $c SpMaBN $e rdc $f embne
100 10 $a Camus, Albert
$d 1913-1960
670 $a El mite de Sísif, 1987 $b port. (Albert Camus)
670 $a Dic. de filosofía, de J. Ferrater Mora, 1980$b(Camus., Albert
(1913-1960); n. Mondovi, Argel)
670 $a Aut. BN-OPALE, 1995 $b (Camus, Albert)
100at Work
property
subfield
maps
100t title of work
maps
is creator of
Person100a maps
Content
(100a)
Content
(100at)contained in
maps
MARC 21 records Datos.bne.es ontology
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
MARiMbA generates RDF using RDFS/OWL ontologies
BNE
59
bne:PERSON
bne:CONCEPT
bne:EXPRESSION
bne:WORK
bne:MANIFESTATION
Specification
Modelling
RDF
Generation
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Marimba links with other resources:
VIAF, DNB, SUDOC, LIBRIS, DBpedia
BNE
http://datos.bne.es/resource/XX1718747
Same As
Same As
Same As
Same As
Same As
LIBRIS
http://libris.kb.se/resource/auth/45369
SUDOC
http://www.idref.fr/026774771/id
DNB
http://d-nb.info/gnd/11851993X
DBpedia
http://dbpedia.org/resource/Miguel_de_Cervantes
VIAF
http://viaf.org/viaf/17220427
Specification
Modelling
RDF
Generation
Links
Generation
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Publication
• Combines DCAT and VoID, to include:
- Licenses: Pointing to rdflicense dataset
- Provenance: Using PROV-O
- Language information: Using lexvo.org
- Access mechanisms: SPARQL endpoint, data dumps
• datos.bne.es is a Catalog
- composed of 7 (interlinked) Datasets
- with different Distributions
• Data dumps
• SPARQL endpoint
• Search API
• Available at http://datos.bne.es/inicio.ttl
Specification
Modelling
RDF
Generation
Publication
Exploitation
Links
Generation
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Full DCAT Graph
62
Specification
Modelling
RDF
Generation
Publication
Exploitation
Links
Generation
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Datos.bne.es: DCAT Catalog
:catalog
a dcat:Catalog ;
dct:title "The RDF catalog of the National Library of Spain"@en ,
"El catálogo RDF de la Biblioteca Nacional de España"@es ;
rdfs:label "datos.bne.es RDF catalog"@en ,
"Catálogo RDF datos.bne.es RDF"@es ;
foaf:homepage <http://datos.bne.es/inicio> ;
dct:publisher :bne ;
dct:license rdflicense:cc-zero1.0 ;
dct:language <http://lexvo.org/id/iso639-3/spa>
,<http://lexvo.org/id/iso639-3/eus>
,<http://lexvo.org/id/iso639-3/por>
,<http://lexvo.org/id/iso639-3/fra>
,<http://lexvo.org/id/iso639-3/lat>
,<http://lexvo.org/id/iso639-3/ita>
,<http://lexvo.org/id/iso639-3/cat>
# Up to 195 languages
63
RDFLicense dataset
Publication
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Datos.bne.es: The publisher as agent
:catalog
dct:publisher :bne ;
:bne
a foaf:Agent ;
owl:sameAs datosbne:XX4891886 .
64
BNE as publisher of the catalog
BNE as agent
Linked to the RDF resource in datosbne
Publication
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Datos.bne.es: The catalog contains 7 datasets
:catalog
dcat:dataset :entities,
:works,
:expressions,
:manifestations,
:items,
:subjects,
:persons;
65
# One dataset per type of entity
Publication
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Datos.bne.es: Persons dataset
:persons
a dcat:Dataset ;
dct:title "Persons in datos.bne.es"@en,
"Personas en datos.bne.es"@es ;
dcat:keyword "persons"@en, "personas"@es,
"authors"@en, "autores"@es ;
dct:issued "2011-12-05"^^xsd:date ;
dct:modified "2011-12-05"^^xsd:date ;
dcat:contactPoint
<mailto:info.datosenlazados@bne.es?subject='Persons%20dataset'> ;
dcat:distribution :persons-nt, :sparql ;
.
:persons-nt
a dcat:Distribution ;
dcat:downloadURL <http://datos.bne.es/datadumps/persons101115.nt.bz2>;
dct:title
"Distribution of persons description in RDF (N-triples)"@en ,
"Distribución de descripciones de personas en RDF (N-triples)"@es ;
dcat:mediaType "application/x-bzip2" ;
dct:license rdflicense:cc-zero1.0 ;
66
# Persons dataset
is distributed in two ways:
SPARQL and N-triples dump
# Each distribution has its own license
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Browsing and querying the data
select distinct COUNT(?Obras) where {
http://datos.bne.es/resource/XX1718747
<http://datos.bne.es/def/OP501>
?Obras
}
URI Cervantes
Is creator
SPARQL queries
Web portal and service
Specification
Modelling
RDF
Generation
Publication
Exploitation
Links
Generation
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Exploitation: Ranking and recommedantion in datos.bne.es
Weighting function based on: the ontology, incoming and
outgoing links, string similarity
Query text
Weight =
613
Weight = 37
Ranked
recommendations:
Works and similar
persons
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Results: datos.bne.es
• Total number of authority records: 4.784.303
• Total number of bibliographic records: 4.083.671
• Total number of RDF triples: 143.153.218
• Total number of owl:sameAs links: 1.395.108
• Linked sources:
- VIAF
- SUDOC (French collective university catalogue) FR
- GND (German National Library of authorities)
- LIBRIS Sweden
- DBPedia
- BNF, France
- geo.linkeddata.es
69
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Uses of Library Linked
Metadata
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Validación y enriquecimiento de registros MARC21
BIBLIOTECA 2
BIBLIOTECA 3
BIBLIOTECA 1
…
Catálogos mejorados
Reducción de costes
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Long Tale: Specialize forums for rare editions
73
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Street detective: citizen science
74
• To improve data quality and access to
cultural data in a particular city
• Gaming exercise: Ask questions that link
que asocien personajes históricos a las
calles de su ciudad.
• Link them with datos.bne.es
• Performed by childrens (between 10 and
15 years old) in Zaragoza and Madrid in
collaborartion with the city hall of
Zaragoza and BNE
“Fujitsu Laboratories of
Europe Innovation Award
2015”
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Linking: Integration of cultural data and geographical data
75
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
“El Quijote” route
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Locations related with “El Quijote”
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
From metadata in images to full text
A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November
Messages to take home
1. Library Linked Data can be easily integrated in other data (e.g.
geographical, education, etc.)
2. Data providers should include language, provenance and license metadata
in their datasets
• in the original data sources (e.g., MARC21 records)
• tags into RDF (e.g., @es, @ fr at least)
• language URIs in the VOID or DCAT descriptions
• License and provenance information in RDF
3. Benefits of adding language, provenance and license metadata in
datasets
• Reduce the time and cost of identifying language in resources and
terminology
• Foster the aggregation and enrichment of data across complementary
resources
• Enhances data curation
• Improves precision and recall in information retrieval and search

More Related Content

PPTX
Big Linked Data - Creating Training Curricula
PDF
Documents, services, and data on the web
PDF
The Nature.com ontologies portal - Linked Science 2015
PPTX
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
PDF
ESWC 2017 Tutorial Knowledge Graphs
PDF
Discovering Related Data Sources in Data Portals
PDF
Getting Started with Knowledge Graphs
PPTX
Scaling up Linked Data
Big Linked Data - Creating Training Curricula
Documents, services, and data on the web
The Nature.com ontologies portal - Linked Science 2015
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
ESWC 2017 Tutorial Knowledge Graphs
Discovering Related Data Sources in Data Portals
Getting Started with Knowledge Graphs
Scaling up Linked Data

What's hot (20)

PPTX
Usage of Linked Data: Introduction and Application Scenarios
PDF
Linked Data
PDF
Linked Data Experiences at Springer Nature
PPTX
Building Linked Data Applications
PDF
ODI Summit 2016 - Linked Open Data at Springer Nature
PDF
Linked data experience at Macmillan: Building discovery services for scientif...
PPTX
Interaction with Linked Data
PDF
Wikidata
PPTX
Providing Linked Data
PDF
Ephedra: efficiently combining RDF data and services using SPARQL federation
PDF
Smart Data Applications powered by the Wikidata Knowledge Graph
PDF
nanopub-java: A Java Library for Nanopublications
PPT
euclid_linkedup WWW tutorial (Besnik Fetahu)
PPTX
Microtask Crowdsourcing Applications for Linked Data
PDF
Mind the gap! Reflections on the state of repository data harvesting
PDF
The Web of Data is Our Opportunity
PPT
Scripting User Contributed Interlinking
PPTX
Querying Linked Data
PDF
The web of interlinked data and knowledge stripped
PDF
Linked Data Best Practices and BibFrame
Usage of Linked Data: Introduction and Application Scenarios
Linked Data
Linked Data Experiences at Springer Nature
Building Linked Data Applications
ODI Summit 2016 - Linked Open Data at Springer Nature
Linked data experience at Macmillan: Building discovery services for scientif...
Interaction with Linked Data
Wikidata
Providing Linked Data
Ephedra: efficiently combining RDF data and services using SPARQL federation
Smart Data Applications powered by the Wikidata Knowledge Graph
nanopub-java: A Java Library for Nanopublications
euclid_linkedup WWW tutorial (Besnik Fetahu)
Microtask Crowdsourcing Applications for Linked Data
Mind the gap! Reflections on the state of repository data harvesting
The Web of Data is Our Opportunity
Scripting User Contributed Interlinking
Querying Linked Data
The web of interlinked data and knowledge stripped
Linked Data Best Practices and BibFrame

Viewers also liked (20)

ODP
DataIncubator at Linked Data Meetup February 2010
PDF
Deletreando Android
DOCX
AroraManisha_FinalProject
PPTX
Trabajadores más productivos, motivados y felices gracias a los wearables
PPT
The Community is Where the Rapport is - On Sense and Structure in the YouTube...
PDF
abbie sewing
PDF
Unicorn Course - The Science of Building a Billion-Dollar Startup
PPTX
La base Ancien Possesseur, projet CRIICO
PPTX
Najczęstsze błędy, które ograniczają potencjał SEO sklepu internetowego
PPTX
Desarrollo con stack MEAN
PDF
Zaldibiko euskal barnetegia
DOCX
Archivists toolkit SQL - a tutorial
PPTX
Charla estrategia desarrollo aplicaciones móviles Universidad Girona
PPTX
Mushroom and poision ccuring A Lecture By Mr Allah Dad Khan Former DG Agric...
PPTX
匿名バイナリ配布集団rwinlib
PDF
Présentation du portail Biblissima
PPTX
Antiquarians in the 21st Century: Opening up our data
PDF
Programación Orientada a Objetos en Java - Parte I 2015
DataIncubator at Linked Data Meetup February 2010
Deletreando Android
AroraManisha_FinalProject
Trabajadores más productivos, motivados y felices gracias a los wearables
The Community is Where the Rapport is - On Sense and Structure in the YouTube...
abbie sewing
Unicorn Course - The Science of Building a Billion-Dollar Startup
La base Ancien Possesseur, projet CRIICO
Najczęstsze błędy, które ograniczają potencjał SEO sklepu internetowego
Desarrollo con stack MEAN
Zaldibiko euskal barnetegia
Archivists toolkit SQL - a tutorial
Charla estrategia desarrollo aplicaciones móviles Universidad Girona
Mushroom and poision ccuring A Lecture By Mr Allah Dad Khan Former DG Agric...
匿名バイナリ配布集団rwinlib
Présentation du portail Biblissima
Antiquarians in the 21st Century: Opening up our data
Programación Orientada a Objetos en Java - Parte I 2015

Similar to Maximising (Re)Usability of Library metadata using Linked Data (20)

PPT
Linked library data
PPTX
Linked Data: Why Bother?
PPT
EIFL 2014 - Linked Open Data
PDF
The state of the art in Linked Data
PPT
W3C Library Linked Data Incubator Group: Review of the Final Report
DOCX
LODLAM Landscape NOTES
PDF
Linked Open Data: Identifying Opportunities
PDF
Lider Reference Model ld4lt session March, 3rd, 2015
PDF
Linked Data Visualization 1st Edition Laura Po
PDF
Linked Data Visualization 1st Edition Laura Po
PPTX
Moving to an open world
PDF
What flavor of linked data is best for your collection?
PPTX
The Impact of Linked Data in Digital Curation and Application to the Catalogu...
PPTX
Relationship status: Libraries and linked data in Europe
PDF
Breaking the catalog
PPT
Charper.lawdi.20120601
PPT
W3C Library Linked Data Incubator Group - 2011
ODP
Charper.lawdi.20130531
PPTX
Semantic Web Technologies: Changing Bibliographic Descriptions?
PPTX
Intro to the semantic web (for libraries)
Linked library data
Linked Data: Why Bother?
EIFL 2014 - Linked Open Data
The state of the art in Linked Data
W3C Library Linked Data Incubator Group: Review of the Final Report
LODLAM Landscape NOTES
Linked Open Data: Identifying Opportunities
Lider Reference Model ld4lt session March, 3rd, 2015
Linked Data Visualization 1st Edition Laura Po
Linked Data Visualization 1st Edition Laura Po
Moving to an open world
What flavor of linked data is best for your collection?
The Impact of Linked Data in Digital Curation and Application to the Catalogu...
Relationship status: Libraries and linked data in Europe
Breaking the catalog
Charper.lawdi.20120601
W3C Library Linked Data Incubator Group - 2011
Charper.lawdi.20130531
Semantic Web Technologies: Changing Bibliographic Descriptions?
Intro to the semantic web (for libraries)

More from Asuncion Gomez-Perez (9)

PDF
Presentación del nodo ODI Madrid / Introduction to ODI Madrid
PDF
Maximising (Re)Usability of Resources using Linked Data
PPTX
Lecciones aprendidas al publicar datos enlazados
PDF
Uso de datos.bne.es: imaginando el futuro
PDF
Linked data and language technologies
PDF
Linked DAta Applications: There is no One-Size-Fits All Formula (Long present...
PDF
Linked DAta Applications: There is no One-Size-Fits All Formula (Short presen...
PDF
W3c app ld-asun(v5)-final
PDF
Datos enlazados en la Biblioteca Nacional de España
Presentación del nodo ODI Madrid / Introduction to ODI Madrid
Maximising (Re)Usability of Resources using Linked Data
Lecciones aprendidas al publicar datos enlazados
Uso de datos.bne.es: imaginando el futuro
Linked data and language technologies
Linked DAta Applications: There is no One-Size-Fits All Formula (Long present...
Linked DAta Applications: There is no One-Size-Fits All Formula (Short presen...
W3c app ld-asun(v5)-final
Datos enlazados en la Biblioteca Nacional de España

Recently uploaded (20)

PDF
Teal Blue Futuristic Metaverse Presentation.pdf
PPT
BME 301 Lecture Note 1_2.ppt mata kuliah Instrumentasi
PDF
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
PPTX
Basic Statistical Analysis for experimental data.pptx
PPT
Classification methods in data analytics.ppt
PDF
Q1-wK1-Human-and-Cultural-Variation-sy-2024-2025-Copy-1.pdf
PDF
Mcdonald's : a half century growth . pdf
PPTX
Overview_of_Computing_Presentation.pptxxx
PDF
Delhi c@ll girl# cute girls in delhi with travel girls in delhi call now
PDF
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
PDF
Nucleic-Acids_-Structure-Typ...-1.pdf 011
PPTX
1.Introduction to orthodonti hhhgghhcs.pptx
PPTX
lung disease detection using transfer learning approach.pptx
PPTX
Fkrjrkrkekekekeekkekswkjdjdjddwkejje.pptx
PPT
2011 HCRP presentation-final.pptjrirrififfi
PDF
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
PPT
Technicalities in writing workshops indigenous language
PDF
American Journal of Multidisciplinary Research and Review
PDF
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
PPTX
AI-Augmented Business Process Management Systems
Teal Blue Futuristic Metaverse Presentation.pdf
BME 301 Lecture Note 1_2.ppt mata kuliah Instrumentasi
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
Basic Statistical Analysis for experimental data.pptx
Classification methods in data analytics.ppt
Q1-wK1-Human-and-Cultural-Variation-sy-2024-2025-Copy-1.pdf
Mcdonald's : a half century growth . pdf
Overview_of_Computing_Presentation.pptxxx
Delhi c@ll girl# cute girls in delhi with travel girls in delhi call now
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
Nucleic-Acids_-Structure-Typ...-1.pdf 011
1.Introduction to orthodonti hhhgghhcs.pptx
lung disease detection using transfer learning approach.pptx
Fkrjrkrkekekekeekkekswkjdjdjddwkejje.pptx
2011 HCRP presentation-final.pptjrirrififfi
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
Technicalities in writing workshops indigenous language
American Journal of Multidisciplinary Research and Review
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
AI-Augmented Business Process Management Systems

Maximising (Re)Usability of Library metadata using Linked Data

  • 1. Maximising (Re)Usability of Library metadata using Linked Data Asunción Gómez-Pérez Ontology Engineering Group Universidad Politécnica de Madrid [email protected] @asungomezperez Acknowledgments: Daniel Vila Suero (Library LD, and main developer of datos.bne.es) Victor Rodríguez Doncel (licensed linked data) Jorge Gracia (linguistic linked data)
  • 2. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November License • This work is licensed under the Creative Commons Attribution – Non Commercial – Share Alike License • You are free: - to Share — to copy, distribute and transmit the work - to Remix — to adapt the work • Under the following conditions - Attribution — You must attribute the work by inserting • “[source http://www.oeg-upm.net/]” at the footer of each reused slide • a credits slide stating: “Maximising (Re)Usability of Library metadata using Linked Data” by A. Gómez-Pérez - Non-commercial - Share-Alike 3
  • 3. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Table of Content • Motivation • How to deal with - Multilingualism and Language - Provenance - License • Linked Data Process • Uses of library linked metadata 4
  • 4. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November A world of digital data Providers Languages Licenses Domains Heterogeneous Formats
  • 5. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Lack of interoperability: Language, Syntax, Semantic &Technical • Ecosystem of - Open Resources in silos - Complementary domains - Heterogeneous formats - Different languages - Repositories with different metadata - Many APIs and services for querying Complementary but not connected
  • 6. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data allows uniform access 1. Agree on vocabularies for describing • metadata • domain data 2. Unified and standardized language for describing resources ( RDF(S)) 3. Unified and standardized query language (SPARQL) 4. Standardized non-proprietary APIs 5. Links to other resources
  • 7. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Questions of the end-user … - Who generated the data set - When the dataset was created? - How the dataset was built? - Is that dataset the last version? - Is license information clearly stated? - Is there any condition that prevents my organization to use the data set? - In which formats the data are delivered? - Are data monolingual or multilingual?
  • 8. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Metadata matters 9 Provenance Licenses Language Privacy GeoLocation Time Spatial Provides vocabularies for representing these dimensions
  • 9. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Multilingualism and Language in Linked Data www.lider-project.eu
  • 10. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Libraries store multilingual data 11
  • 11. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November The Web of Data links 12
  • 12. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linguistic Linked Data Cloud Linguistic Linked Data helps to translate the same term (book title, author name, place, …) in different languages and to deal with acronyms Linguistic Linked Data Cloud Linguistic Linked Data Cloud  Subset of LOD  Linguistic domain  Many type of resources  Interconnected with other Language Resources  Enables the lexicalization of data on the web, not necessarily data in the LD format  Helps with Multilingual Search  Enables a new generation of LD-aware NLP and MT Services
  • 13. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November The core of the Linguistic Linked Open Data cloud!
  • 14. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November What is BabelNet? • A merger of resources of different kinds: From Roberto Navigli (Sapienza University)
  • 15. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November 26/11/201 Why do we need BabelNet? • Multilinguality: the same concept is expressed in tens of languages • Coverage: 272 languages and 14 million entries! - 6M concepts and 7.7M named entities - 119M word senses - 378M semantic relations (27 relations per concept on avg.) - 11M images associated with concepts - 41M textual definitions - 2M concepts with domains associated From Roberto Navigli (Sapienza University)
  • 16. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November 18
  • 17. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linguistic Linked Licensed Data 3LD Linguistic Linked Licensed Data Language resources such as: - Lexica - Corpora - Dictionaries .. NIF NLP Interchange Format Using RDF and standard data models (vocabularies): - Lexica - Corpora ODRL Open Digital Rights Language Published along with a machine-readable license. www.lider-project.eu
  • 18. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November How to represent in Linked Data ...  Traditional annotation properties to represent language  Richer models to represent linguistic information for more demanding applications bne:XX1718747 rdfs:label ”Θερβάντες, Μιγκέλ ντε"@gr. ”Miguel de Cervantes"@es. “Cervantes di Saavedra, Michele"@it. # LEMON Bne:OP5001 lemon:isReferenceOf [lemon:isSenseOf :author_of]. :author_of a lemon:LexicalEntry; lemon:form [lemon:writtenRep “es autor de”@es; isocat:grammaticalGender isocat:masculine]; lemon:form [lemon:writtenRep “es autora de”@es; isocat:grammaticalGender isocat:feminine]. isocat:grammaticalGender rdfs:subPropertyOf lemon:property. Association of the vocabulary to an external lexicon model: Is author of (Femenine and Masculine)
  • 19. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November lemon:LexicalEntry lemon:LexicalEntry lemon:LexicalSense lemon:LexicalSense lemon:Lexicon lexiconRU lemon:Lexicon lexiconES tr:Translation “Миге́ль де Серва́нтес Сааве́дра”@ru “Miguel de Cervantes”@es lemon:entry lemon:entry lemon:isSenseOf lemon:isSenseOf tr:translationTarget tr:translationSource tr:trans lemon:lexicalForm lemon:lexicalForm lemon:Form lemon:Form lemon:writtenRep tr:TranslationSet translationSetRU-ES lemon:writtenRep How to represent translations
  • 20. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November How to add the language to the dataset description # VoiD description :bne a void:Dataset; dcterms:language <http://id.loc.gov/vocabulary/iso639-1/es> . # DCAT description :bne a dcat:Dataset; dcterms:language <http://id.loc.gov/vocabulary/iso639-1/es> VOID: W3C Vocabulary of Interlinked Datasets DCAT: W3C Data Catalog vocabulary
  • 21. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data and Linguistic Linked Data 1. Agree on vocabularies for describing • Domain vocabularies • LR metadata and content (Lemon- Ontolex, NIF, …) 2. Unified and standardized language for describing resources ( RDF(S)) 3. Unified and standardized query language (SPARQL) 4. Standardized non-proprietary APIs 5. Links to other resources Linguistic LD
  • 22. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Multilingual search: retrieve a book in german • Servantes Saavedra, Migel • Therbantes, Minkel nte • Cervantes di Saavedra, Michele • ‫دى‬ ‫ميجيل‬ ،‫ساڤدرا‬ ‫سرفنتس‬ • Zerbantes eta Saabedra, Mikel • Θερβάντες, Μιγκέλ ντε • Cervantes • Sirfantis Saafedrā, Mīgīl dī • Сервантес Сааведра, Мигель де • Sewantisi Saweidela, Migai'er de • 塞万提斯·萨维德拉, 米盖尔 德 • Cervantes, Miguel de 24
  • 23. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Expand the information based on language Asuncion Multilingual titles Links (not available in BNF and BNE) Dbpedia Information about the author in English
  • 24. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Expand the information based on language “La cité antique”@fr Digital resource available
  • 25. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Multilingual and complementary information Multilingual Content Aggregation “La cité antique”@es Work by F. de Coulanges Subject: “Ciudades antiguas”@es Only available non-digitized manifestations in Spanish (title “La ciudad antigua”). “La cité antique” Date: 1864 Digitization available: http://gallica.bnf.fr/ark:/12148/bpt6k6105986m “La cité antique” Translations: “The Ancient City”@en, ” 古代城邦”@zh Links: Dbpedia, Wikidata Full text “La cité antique” More links Link to full text: http://remacle.org/bloodwolf/livre s/Fustel/intro.htm
  • 26. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linguistic Linked Data helps with alternative names and titles 29 :un a ontolex:LexicalEntry, lexinfo:Acronym ; ontolex:canonicalForm :form_un ; rdfs:label ”UN"@en . rdfs:label ”ONU"@es . :form_un a ontolex:Form ; ontolex:writtenRep ”UN"@en . ontolex:writtenRep ”ONU"@es . :united_nations a ontolex:MultiwordExpression ; ontolex:canonicalForm :form_united_nations ; lexinfo:abbreviationFor :un; rdfs:label ”United Nations"@en . :form_united_nations a ontolex:Form ; ontolex:writtenRep ”United Nations"@en . ontolex:writtenRep ”Naciones Unidas"@es . Organización de Naciones Unidas
  • 27. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data have Provenance
  • 28. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Modeling provenance RDF Store PROVENANCE Model (RDF(S)) Process centric provenance • PROV-O @W3C Filev1. txt Revision Process wasGeneratedBy File.txt used Metadata provenance • DC, PROV @ W3C Resource provenance • DC, PROV-O, Premis, SWANL • EDM (including agregation) creator rights creationDate John 12-2-1900 GPL
  • 29. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Example of provenance with datos.bne.es Macr21 Dataset (prov:Entity) Conversion Process (prov:Avtivity) TTL file (prov:Entity) Process centric provenance prov:used prov:wasGeneratedBy dc:license
  • 30. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November33M3iLD Hearing - Luxembourg, 19 June 2012 Macr21 Dataset (prov:Entity) Conversion Process (prov:Avtivity) TTL file (prov:Entity) BNE (prov:Agent) “2010-07-14T01:01:01Z”^^xsd:dateTime CC0 CC0 Resource provenanceProcess centric provenance “2011-07-14T01:01:01Z”^^xsd:dateTime prov:used prov:wasGeneratedBy “2011-07-14T02:02:02Z”^^xsd:dateTime prov:startedAtTime prov:endedAtTime BNE (prov:Agent) prov:wasAssociatedWith prov:actedOnBehalfOf prov:wasAttributedTo, dc:creator prov:generatedAtTime, dc:created dc:license prov:wasAttributedTo, dc:creator Marimba (prov:Agent) dc:license prov:generatedAtTime, dc:created Example of provenance with datos.bne.es
  • 31. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Example of provenance with datos.bne.es Macr21 Dataset (prov:Entity) Conversion Process (prov:Avtivity) TTL file (prov:Entity) BNE (prov:Agent) “2010-07-14T01:01:01Z”^^xsd:dateTime CC0 CC0 Resource provenance Metadata provenanceProcess centric provenance BNE Digital library department (prov:Agent) GPL “2011-07-14T01:01:01Z”^^xsd:dateTime prov:used prov:wasGeneratedBy “2011-07-14T02:02:02Z”^^xsd:dateTime prov:startedAtTime prov:endedAtTime BNE (prov:Agent) prov:wasAssociatedWith prov:actedOnBehalfOf prov:wasAttributedTo, dc:creator prov:generatedAtTime, dc:created dc:license prov:wasAttributedTo, dc:creator Marimba (prov:Agent) dc:license Metadata provenance file (prov:Bundle) prov:generatedAtTime, dc:created prov:wasAttributedTo, dc:creator dc:license
  • 32. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data have Licenses Research funded by the project 4V: Volumen, Velocidad, Variedad y Validez en la gestión innovadora de datos (TIN2013-46238-C4-2-R)
  • 33. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Create, consume, aggregate, derive and publish Linked Data in a lawful environment 0 Always license your data … Data shops Governments Individuals 36 4V (TIN2013-46238-C4-2-R)
  • 34. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November How Open is the Linked Open Data Cloud? 37 4V (TIN2013-46238-C4-2-R)
  • 35. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linguistic Linked Licensed Data How do we represent license information? 4V (TIN2013-46238-C4-2-R)
  • 36. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Licensed Data 4V (TIN2013-46238-C4-2-R)
  • 37. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November (Linked) Licensed Data in practice 40 Published Open License (Linked) Open Data (Linked) Closed Data Published No Open License (Linked) Private Data Not Published Available Data without explicit license Published Without License 4V (TIN2013-46238-C4-2-R)
  • 38. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November RDF Licensing support 4V (TIN2013-46238-C4-2-R)
  • 39. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November «An action is (permitted /prohibited / obliged) to be acted by the party over the asset, provided that the constraints hold» Asset: Statement (rdf:Statement) Dataset (void:Dataset) Ontology (owl:Ontology) Mapping (void:Linkset) LDP Container (ldp:Resource) Action: Derive (cc:DerivativeWorks) Translate (odrl:translate) Distribute (cc:Distribution) Reproduce (cc:Reproduce) Print (odrl:print) Anonymize (odrl:anonymize) Index (odrl:index) … plus ~30 others in ODRL/CC… Party: One individual (ej: mailto:[email protected]) One organization: (http://www.oeg-upm.net) One key owner using Web of Trust: (using http://xmlns.com/wot/0.1/hasKey) Constraint: Acknowledgement (cc:Attribution) A country, city… (odrl:spatial) A time frame (odrl:timeInterval, odrl:dateTime…) Data Model: ODRL (Open Digital Rights Language)
  • 40. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November RDFLicense – Dataset of licenses in RDF ― Content negotiation: machine readable version of common licenses ― Based on ODRL (W3C spec) ― 148 licenses (as of June 2015): Creative Commons, ODC, GNU… ― Permanent URIs. Example: http://purl.org/NET/rdflicense/gpl2.0.ttl ― Browse them here: http://rdflicense.appspot.com/ ― Contribute here: https://github.com/oeg-upm/rdflicense ― Catalogued here: http://datahub.io/dataset/rdflicense ― Read more here: A Dataset of RDF Licenses, V. Rodriguez-Doncel, S. Villata, A. Gomez- Perez, in Proc. of the 27th Int. Conf. on Legal Knowledge and Information System (JURIX), R. Hoekstra (Ed.), ISBN 978-1-61499-467-1, pp. 187-189, IOS Press, 2014 4V (TIN2013-46238-C4-2-R)
  • 41. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November <http://purl.org/NET/rdflicense/cc-by-sa3.0.ttl> a odrl:Policy ; rdfs:label "Creative Commons CC-BY-SA" ; rdfs:seeAlso <http://creativecommons.org/licenses/by-sa/3.0/rdf> ; cc:legalcode <http://creativecommons.org/licenses/by-sa/3.0/legalcode> ; dct:hasVersion "3.0" ; dct:language <http://www.lexvo.org/page/iso639-3/eng> ; dct:publisher "Creative Commons" ; odrl:permission [ odrl:action cc:Distribution , cc:DerivativeWorks , cc:Reproduction ; odrl:duty [ odrl:action cc:Attribution , cc:Notice , cc:ShareAlike ] ] . Sample license in ODRL: Creative Commons CC-BY-SA No Constraints (spatial, temporal, are not found in Creative Commons licenses), they are universal A generic license (like Creative Commons’) has no party, as the recipient is anybody accessing the licensed work URI :myDataset dct:license <http://purl.org/NET/rdflicense/cc-by-sa3.0> How do I use in my data set?
  • 42. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Example of a license offering Paid Linked Data @prefix gr: <http://purl.org/goodrelations/> . @prefix dcat: <http://www.w3.org/ns/dcat#> . <http://samplepolicy/1234> a odrl:Offer ; rdfs:label "License Offering Paid Linked Data" ; odrl:permission [ odrl:target <http://example.org/dataset/ds01> ; odrl:action odrl:reproduce ; odrl:duty [ rdfs:label "Pay" ; gr:UnitOfMeasurement dcat:Dataset ; gr:amountOfThisGood "1" ; odrl:action odrl:pay ; odrl:target "15,00 EUR“ ] ; odrl:constraint [ odrl:operator odrl:lt ; odrl:dateTime "2015-12-31"^^xsd:date ] ] ; ] The reproduction of a dataset is limited until the end of this year after paying 15€.
  • 43. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Example of a license for classes/concepts • Old book offered for free, new book to be sold for 5 EUR 46 @prefix cc: <http://creativecommons.org/ns#> . @prefix odrl: <http://www.w3.org/ns/odrl/2/> . :example a odrl:Policy ; odrl:permission [ odrl:action odrl:reproduce ; odrl:target :oldBook ] ; odrl:permission [ odrl:action odrl:reproduce ; odrl:target :newBook ; odrl:duty [ odrl:action odrl:pay ; odrl:target “5,00 EUR" ] ] .
  • 44. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Using Policies to govern conditional access to Linked Data Example of access to Linked Data for a price (15EUR for the dataset or 0.01EUR for a triple thereof) @prefix gr: <http://purl.org/goodrelations/> . @prefix dcat: <http://www.w3.org/ns/dcat#> . <http://salonica.dia.fi.upm.es/ldr/policy/cdaddba4-fc2e-4ee0-a784-e62f1db259bf> a odrl:Set ; rdfs:label "License Offering Paid Linked Data" ; odrl:permission [ a odrl:Permission ; odrl:target <http://example.org/dataset/ds01> ; odrl:action odrl:reproduce ; odrl:duty [ a odrl:Duty ; rdfs:label "Pay" ; gr:UnitOfMeasurement dcat:Dataset ; gr:amountOfThisGood "1" ; odrl:action odrl:pay ; odrl:target "15,00 EUR" ] ] , [ a odrl:Permission ; odrl:action odrl:reproduce ; odrl:target <http://example.org/dataset/ds01> ; odrl:duty [ a odrl:Duty ; rdfs:label "Pay" ; gr:UnitOfMeasurement rdf:Statement ; gr:amountOfThisGood "1" ; odrl:action odrl:pay ; odrl:target "0,01 EUR” ] ] .. 4V (TIN2013-46238-C4-2-R)
  • 45. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data, Linguistic Linked Data and License 1. Agree on vocabularies for describing • Domain data • Language related information (Lemon) • LR license metadata (ODRL) 2. Unified and standardized language for describing resources ( RDF(S)) 3. Unified and standardized query language (SPARQL) 4. Standardized non-proprietary APIs 5. Links to other resources
  • 46. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linked Data Process Examples from datos.bne.es datos.bne.es and MARiMba: an insight into library linked data D. Vila-Suero, A. Gómez-Pérez Library Hi-tech (ISSN: 0737-8831); Emerald Group Publishing Limited . Vol.: 31(4).Pages: 575-601 Noviembre 2013
  • 47. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November datos.bne.es: the team Cataloguing services Digital Library and automatization Focus group R&D in Linked Data Ontology engineering
  • 48. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Methodology Specification Modelling (Ontologies) RDF Generation Publication Exploitation Data Linking Data Curation Many technologies involved Villazón-Terrazas, B.; Vilches. L.; Corcho, O.; Gómez-Pérez, A. Methodological Guidelines for Publishing Government Linked Data. In D. Wood, ed. Linking Government Data. Springer. (pp, 27-49). 2011
  • 49. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Specification Goal Linked Data generation of the Spanish National Library Metadata • Records in the MARC 21 format • 4.5 million bibliographical records • 4.5 million authority records • Version: September, 2015 52 Specification Modelling RDF Generation Publication Links Generation Exploitation Persons 1.5 M Expressions 1 M Subjects 0.5 M Works 1.5 M Editions 4.5 M Digital items 150 m Maps Modern Ancient Drawings Musical scores Written music Recorded music Different types of records in MARC21
  • 50. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Specification • Domain experts from BNE (catalogers) part of the mapping process. • Multilinguality, collaboration with IFLA Sources BNE VIAF LoC BNF … BNE Mappings RDF Subjects Authorities Bibliographic Specification
  • 51. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Modeling in datos.bne.es Two phases: 1. Re-using URIs: from standard vocabs, mainly IFLA FRBR, ISBD and FRAD 2. Integrating ontology: BNE ontology that re-uses and links to external vocabs. Motivation for the creation of BNE ontology: - To provide stability under the datos.bne.es domain - Documented in a central document - Provide new properties and relationships (e.g., spanish legal deposit) - Control over descriptions and labels (e.g., labels for visualization (bne:label)) - Multilingualism: BNE ontology is described in English and Spanish - Content-negotiation enables using programming APIs (Jena, Sesame, etc.) and Ontology editors (Protégé, TopBraid, etc.) 54 Specification Modelling RDF Generation Publication Links Generation Exploitation Available and documented at http://datos.bne.es/def/
  • 52. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November IFLA Vocabulary-based Ontology - Reusing URIs 55 Specification Modelling
  • 53. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Datos.bne.es Ontology : - Second phase 56 bne:PERSON bne:CORPORATE BODY bne:EXPRESSIONbne:WORK bne:CONCEPT bne:MANIFESTATION Specification Modelling • Available and documented at http://datos.bne.es/def/
  • 54. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Exploiting alternative names and titles Person Work Org Authority Name Cervantes Saavedra Variant names Servantes Сервантес Сааведра Authority title Rome and Juliet Variant titles The tragedy of Romeo and Juliet Romeo y Julieta Romeu i Julieta Authority Name Naciones Unidas Variant names United Nations ONU (Spanish acronym) UN (English acronym) • Indexing alternatives (acronyms, translations,etc.) increases recall of search resultsSpecification Modelling
  • 55. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Who will be the mapping generator? Specification Modelling RDF Generation Publication Exploitation Links Generation • Librarians built mappings bne:PERSON bne:CORPORATE BODY bne:EXPRESSIONbne:WORK bne:CONCEPT bne:MANIFESTATION 001 XX1721208 005 200012181124 008 901120nn aijnnaabn n aaa 016 $a BNE19900178994 040 $a SpMaBN $b spa $c SpMaBN $e rdc $f embne 100 10 $a Camus, Albert $d 1913-1960 670 $a El mite de Sísif, 1987 $b port. (Albert Camus) 670 $a Dic. de filosofía, de J. Ferrater Mora, 1980$b(Camus., Albert (1913-1960); n. Mondovi, Argel) 670 $a Aut. BN-OPALE, 1995 $b (Camus, Albert) 100at Work property subfield maps 100t title of work maps is creator of Person100a maps Content (100a) Content (100at)contained in maps MARC 21 records Datos.bne.es ontology
  • 56. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November MARiMbA generates RDF using RDFS/OWL ontologies BNE 59 bne:PERSON bne:CONCEPT bne:EXPRESSION bne:WORK bne:MANIFESTATION Specification Modelling RDF Generation
  • 57. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Marimba links with other resources: VIAF, DNB, SUDOC, LIBRIS, DBpedia BNE http://datos.bne.es/resource/XX1718747 Same As Same As Same As Same As Same As LIBRIS http://libris.kb.se/resource/auth/45369 SUDOC http://www.idref.fr/026774771/id DNB http://d-nb.info/gnd/11851993X DBpedia http://dbpedia.org/resource/Miguel_de_Cervantes VIAF http://viaf.org/viaf/17220427 Specification Modelling RDF Generation Links Generation
  • 58. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Publication • Combines DCAT and VoID, to include: - Licenses: Pointing to rdflicense dataset - Provenance: Using PROV-O - Language information: Using lexvo.org - Access mechanisms: SPARQL endpoint, data dumps • datos.bne.es is a Catalog - composed of 7 (interlinked) Datasets - with different Distributions • Data dumps • SPARQL endpoint • Search API • Available at http://datos.bne.es/inicio.ttl Specification Modelling RDF Generation Publication Exploitation Links Generation
  • 59. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Full DCAT Graph 62 Specification Modelling RDF Generation Publication Exploitation Links Generation
  • 60. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Datos.bne.es: DCAT Catalog :catalog a dcat:Catalog ; dct:title "The RDF catalog of the National Library of Spain"@en , "El catálogo RDF de la Biblioteca Nacional de España"@es ; rdfs:label "datos.bne.es RDF catalog"@en , "Catálogo RDF datos.bne.es RDF"@es ; foaf:homepage <http://datos.bne.es/inicio> ; dct:publisher :bne ; dct:license rdflicense:cc-zero1.0 ; dct:language <http://lexvo.org/id/iso639-3/spa> ,<http://lexvo.org/id/iso639-3/eus> ,<http://lexvo.org/id/iso639-3/por> ,<http://lexvo.org/id/iso639-3/fra> ,<http://lexvo.org/id/iso639-3/lat> ,<http://lexvo.org/id/iso639-3/ita> ,<http://lexvo.org/id/iso639-3/cat> # Up to 195 languages 63 RDFLicense dataset Publication
  • 61. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Datos.bne.es: The publisher as agent :catalog dct:publisher :bne ; :bne a foaf:Agent ; owl:sameAs datosbne:XX4891886 . 64 BNE as publisher of the catalog BNE as agent Linked to the RDF resource in datosbne Publication
  • 62. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Datos.bne.es: The catalog contains 7 datasets :catalog dcat:dataset :entities, :works, :expressions, :manifestations, :items, :subjects, :persons; 65 # One dataset per type of entity Publication
  • 63. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Datos.bne.es: Persons dataset :persons a dcat:Dataset ; dct:title "Persons in datos.bne.es"@en, "Personas en datos.bne.es"@es ; dcat:keyword "persons"@en, "personas"@es, "authors"@en, "autores"@es ; dct:issued "2011-12-05"^^xsd:date ; dct:modified "2011-12-05"^^xsd:date ; dcat:contactPoint <mailto:[email protected]?subject='Persons%20dataset'> ; dcat:distribution :persons-nt, :sparql ; . :persons-nt a dcat:Distribution ; dcat:downloadURL <http://datos.bne.es/datadumps/persons101115.nt.bz2>; dct:title "Distribution of persons description in RDF (N-triples)"@en , "Distribución de descripciones de personas en RDF (N-triples)"@es ; dcat:mediaType "application/x-bzip2" ; dct:license rdflicense:cc-zero1.0 ; 66 # Persons dataset is distributed in two ways: SPARQL and N-triples dump # Each distribution has its own license
  • 64. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Browsing and querying the data select distinct COUNT(?Obras) where { http://datos.bne.es/resource/XX1718747 <http://datos.bne.es/def/OP501> ?Obras } URI Cervantes Is creator SPARQL queries Web portal and service Specification Modelling RDF Generation Publication Exploitation Links Generation
  • 65. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Exploitation: Ranking and recommedantion in datos.bne.es Weighting function based on: the ontology, incoming and outgoing links, string similarity Query text Weight = 613 Weight = 37 Ranked recommendations: Works and similar persons
  • 66. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Results: datos.bne.es • Total number of authority records: 4.784.303 • Total number of bibliographic records: 4.083.671 • Total number of RDF triples: 143.153.218 • Total number of owl:sameAs links: 1.395.108 • Linked sources: - VIAF - SUDOC (French collective university catalogue) FR - GND (German National Library of authorities) - LIBRIS Sweden - DBPedia - BNF, France - geo.linkeddata.es 69
  • 67. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Uses of Library Linked Metadata
  • 68. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Validación y enriquecimiento de registros MARC21 BIBLIOTECA 2 BIBLIOTECA 3 BIBLIOTECA 1 … Catálogos mejorados Reducción de costes
  • 69. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Long Tale: Specialize forums for rare editions 73
  • 70. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Street detective: citizen science 74 • To improve data quality and access to cultural data in a particular city • Gaming exercise: Ask questions that link que asocien personajes históricos a las calles de su ciudad. • Link them with datos.bne.es • Performed by childrens (between 10 and 15 years old) in Zaragoza and Madrid in collaborartion with the city hall of Zaragoza and BNE “Fujitsu Laboratories of Europe Innovation Award 2015”
  • 71. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Linking: Integration of cultural data and geographical data 75
  • 72. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November “El Quijote” route
  • 73. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Locations related with “El Quijote”
  • 74. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November From metadata in images to full text
  • 75. A. Gómez-Pérez Maximising (Re)Usability of Library metadata using Linked Data SWIB’2015 Semantic Web in Libraries 2015 Hamburg 2015, 23-25 November Messages to take home 1. Library Linked Data can be easily integrated in other data (e.g. geographical, education, etc.) 2. Data providers should include language, provenance and license metadata in their datasets • in the original data sources (e.g., MARC21 records) • tags into RDF (e.g., @es, @ fr at least) • language URIs in the VOID or DCAT descriptions • License and provenance information in RDF 3. Benefits of adding language, provenance and license metadata in datasets • Reduce the time and cost of identifying language in resources and terminology • Foster the aggregation and enrichment of data across complementary resources • Enhances data curation • Improves precision and recall in information retrieval and search