This document presents the ODRL Profile for accessing Language Resources and Technologies (LRTs). It aims to support entities (repositories, infrastructures, archives, libraries, etc.) that enable the sharing of LRTs and act as intermediaries between LRT providers and consumers; from here on, we refer to them as "distributing entities". To this end, it extends the ODRL model to define policies that express actions, rules, permissions and/or prohibitions related to the sharing and processing of LRTs, and re-uses the META-SHARE ontology to describe these policies and other access- or rights-related features.

Introduction

Objectives of the profile

The current profile introduces elements that serve one of these purposes:

To achieve these goals, the following requirements motivate our approach: we extend the ODRL Vocabulary and Expression specification to define permissions and/or prohibitions associated with access-granting operations on LRTs stored and distributed by the above entities, and we re-use and extend the META-SHARE vocabulary to express legal and technical concepts associated with these operations.

R1. Support specifying user preferences as policies.

R2. Incorporate vocabulary specifying or aligned to legal concepts.

R3. Support specifying permissions and prohibitions typical of policies for LRTs.

R4. Describe policies used to authorise access to data resources and data processing services.

R5. Specify procedures and algorithms that can generate templates for policies typical of access to LRTs.

Overview of the LRT distribution landscape

Sharing of Language Resources and Technologies (LRTs) is included in the mission of infrastructures and distribution bodies, such as the European Language Grid, CLARIN and its national nodes (e.g. the Greek CLARIN), META-SHARE, ELRA-ELDA. To this end, they create and maintain repositories that enable the upload and storage of resources, their description in a structured form (metadata records), the searching and browsing of these resources through dedicated catalogue pages, as well as access to them, for instance by downloading content files or executing web services. These hosting/distributing entities (in short "distributing entities") act as intermediaries between providers and consumers of LRTs, both humans and machines. Therefore, the requirements addressed in the current profile are set by the operational needs of the distributing entities, and indirectly by the needs of providers and consumers.

To make the requirements clearer, we briefly outline here the landscape of LRTs with a focus on issues related to legal rights and access procedures.

Under the term "Language Resource and Technology" we include any resource composed of linguistic material used in the construction, improvement and/or evaluation of language processing applications, but also, in a broader sense, in language and language-mediated research studies and applications. The term is used with a broader meaning, encompassing (a) data sets (textual, multimodal/multimedia and lexical data, grammars, language models, etc.) in machine readable form, and (b) tools/technologies/services used for their processing and management.

LRTs are distributed and consumed in various forms: e.g., corpora may be distributed as content files to be downloaded, or made available through dedicated interfaces (e.g., a corpus workbench, or a SPARQL endpoint); tools/services may be distributed as content files containing the source code, as executable files to be downloaded and run locally, or as web services.

Distributable forms of an LRT may be accessed with the same access conditions or with different ones, which must be described in the form of a policy statement; this can range from a "standard" licence (e.g., one from the Creative Commons suite of licences) to a commercial licence with terms for the use of an individual resource.

Entities that provide access to LRTs typically support one or more of the following operations:

The policy statements (licences and terms of use) that accompany a resource regulate its use by end-users but do not explicitly describe the rules (duties, restrictions, permissions) to be imposed by the entities when providing access to them. This set of rules may be inferred from the licensing conditions and needs to be calculated (semi-)automatically. One of the main objectives of this profile is to support the expression of these conditions, their automatic computation when possible and their representation in the form of authorisation statements and actions.

Detailed requirements

Entities responsible for sharing LRTs must satisfy the user requirements of both LRT providers and consumers, as described in Appendix A. To achieve this goal, the following requirements are set:

R1. For granting access to an LRT, the LRT must be released with a clear and accessible usage licence, which must be uniquely referred to in the formal description (metadata record) of the LRT (see also FAIR principle R1.1).

a. The licence must be described at the level of the distributable form of the LRT; different distributable forms may be released with different licences.

R2. The licence text must be available in a digital form and accessible to the end users so that they are aware of the terms under which they can use the LRT.

R3. The formal description of the licence should include metadata elements (names, broad usage conditions) that support findability of the LRT.

R4. The formal description of the licence should be machine actionable, i.e. include terms and indications that help computational agents perform the actions that enable access to LRTs according to their licensing conditions. More precisely, it must include at least those terms that the entities granting access can use to restrict access to LRTs; such restrictions are typically based on user type, billing model, soliciting consent by users, tracking of users, monitoring of usage, etc.

a. For data resources, access may be limited to download and/or processing based on user type (e.g., for academic users only, for members of a group), following payment, or deposition of a research plan, etc.

b. For services, in addition, access may be limited and/or billed based on a count of specific units (e.g., count of API requests, characters to be processed, audio/video seconds to be processed) for a limited time period (e.g., on a weekly basis).

c. Entities granting end users access to an LRT must ensure they grant such access in accordance with its licensing conditions to the extent that these can be controlled by them; licensing conditions that cannot be controlled by them remain at the end-user's responsibility/liability.

R5. To allow combination of LRTs or processing of LRTs, their licensing conditions must be compatible and allow combination or processing actions; these permissions should be included in the formal description of the licences.

R6. The formal description of the LRT should include elements that may help resolve issues related to rights and access.
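
Requirement R5 can be illustrated with a small check that an action such as combination is allowed by every licence involved. The following is only a sketch: the `LicencePolicy` class, its fields and the licence names are hypothetical simplifications that reduce a licence to sets of permitted and prohibited action names, and it assumes that an action must be explicitly permitted by each licence.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class LicencePolicy:
    """Hypothetical, simplified view of a licence: action names only."""
    name: str
    permitted: frozenset = field(default_factory=frozenset)
    prohibited: frozenset = field(default_factory=frozenset)

    def allows(self, action: str) -> bool:
        # Assumption: an action is allowed only if it is explicitly
        # permitted and not prohibited by the same licence.
        return action in self.permitted and action not in self.prohibited


def allowed_by_all(action: str, policies) -> bool:
    """Return True if every licence involved allows the action."""
    policies = list(policies)
    # With no licence information at all, nothing can be concluded.
    return bool(policies) and all(p.allows(action) for p in policies)


# Illustrative licences (names and contents are made up for this sketch).
licence_a = LicencePolicy("licence-A", frozenset({"ms:combine", "ms:process"}))
licence_b = LicencePolicy("licence-B", frozenset({"ms:process"}),
                          frozenset({"ms:combine"}))

print(allowed_by_all("ms:process", [licence_a, licence_b]))
print(allowed_by_all("ms:combine", [licence_a, licence_b]))
```

A real compatibility check would also have to compare constraints and duties, which this sketch deliberately ignores.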

Document Conventions

Within this document, the following namespace prefix bindings are used:

Prefix Namespace Description
odrl http://www.w3.org/ns/odrl/2/ ODRL Vocabulary
rdf http://www.w3.org/1999/02/22-rdf-syntax-ns# [[rdf11-concepts]]
rdfs http://www.w3.org/2000/01/rdf-schema# [[rdf-schema]]
owl http://www.w3.org/2002/07/owl# [[owl2-overview]]
xsd http://www.w3.org/2001/XMLSchema# [[xmlschema11-2]]
skos http://www.w3.org/2004/02/skos/core# [[skos-reference]]
dcterms http://purl.org/dc/terms/ [[dcterms]]
vcard http://www.w3.org/2006/vcard/ns# [[vcard-rdf]]
foaf http://xmlns.com/foaf/0.1/ [[foaf]]
schema http://schema.org/ schema.org
cc https://creativecommons.org/ns# creativecommons.org
dcat http://www.w3.org/ns/dcat# DCAT Vocabulary
ms http://w3id.org/meta-share/meta-share/ META-SHARE ontology
ex http://example.com/ns# Example namespace
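
As an illustration of how these bindings are used, the sketch below expands a prefixed name into a full IRI. It copies a subset of the table; the function name `expand` is hypothetical, and the `ms` namespace is assumed to end with a slash so that the result matches the identifiers listed in this document.

```python
# Prefix bindings copied from the table above (subset).
PREFIXES = {
    "odrl": "http://www.w3.org/ns/odrl/2/",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "xsd": "http://www.w3.org/2001/XMLSchema#",
    "dcterms": "http://purl.org/dc/terms/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    # Assumption: trailing slash, so that identifiers such as
    # http://w3id.org/meta-share/meta-share/Corpus are produced.
    "ms": "http://w3id.org/meta-share/meta-share/",
}


def expand(curie: str) -> str:
    """Expand a prefixed name such as 'ms:Corpus' into a full IRI."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in PREFIXES:
        raise ValueError(f"unknown prefix in {curie!r}")
    return PREFIXES[prefix] + local


print(expand("ms:Corpus"))
print(expand("odrl:execute"))
```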

Profile specification

Entities

The current specification focuses on three classes taken from the META-SHARE vocabulary: Resource, Distribution and LicenceTerms. For the first two classes, we reuse the META-SHARE vocabulary and include here only those properties that are of interest to this specification. In addition, the LicenceTerms is further represented as a policy in accordance with the ODRL model.

Resources

A distinction is made between the language resource / technology, as the object created and described at an abstract level, and its distributable form, i.e., the form as packaged, distributed and consumed by end users. Following the DCAT terminology, the former is represented with the class LanguageResource and the latter as DatasetDistribution and SoftwareDistribution.

LanguageResource

Definition:A resource composed of linguistic material used in the construction, improvement and/or evaluation of language processing applications, but also, in a broader sense, in language and language-mediated research studies and applications; the term is used with a broader meaning, encompassing (a) data sets (textual, multimodal/multimedia and lexical data, grammars, language models, etc.) in machine readable form, and (b) tools/technologies/services used for their processing and management.
Label:Language Resource / Technology
Identifier:http://w3id.org/meta-share/meta-share/LanguageResource
Sub-class Of:rdfs:Resource, dcat:Resource

Corpus

Definition:A structured collection of pieces of data (textual, audio, video, multimodal/multimedia, etc.) typically of considerable size and selected according to criteria external to the data (e.g., size, type of language, type of text producers or expected audience, etc.) to represent as comprehensively as possible the object of study.
Label:Corpus
Identifier:http://w3id.org/meta-share/meta-share/Corpus
Sub-class Of:ms:LanguageResource

Lexical/Conceptual Resource

Definition:A resource organised on the basis of lexical or conceptual entries (lexical items, terms, concepts, etc.) with their supplementary information (e.g., grammatical, semantic, statistical information, etc.).
Label:Lexical/Conceptual Resource
Identifier:http://w3id.org/meta-share/meta-share/LexicalConceptualResource
Sub-class Of:ms:LanguageResource

Language description

Definition:A resource that describes a language or some aspect(s) of a language via a systematic documentation of linguistic structures.
Label:Language description
Identifier:http://w3id.org/meta-share/meta-share/LanguageDescription
Sub-class Of:ms:LanguageResource

Tool/Service

Definition:A tool/service/any piece of software that performs language processing and/or any Language Technology related operation.
Label:Tool/Service
Identifier:http://w3id.org/meta-share/meta-share/ToolService
Sub-class Of:ms:LanguageResource

Resource properties

The following properties address R6, i.e. can help determine and/or resolve rights and access issues.

resourceProvider

Definition:The person/organization responsible for providing, curating, maintaining and making available (publishing) the resource
Label:resource provider
Identifier:http://w3id.org/meta-share/meta-share/resourceProvider
Domain:LanguageResource
Range:foaf:Agent

IPR holder

Definition:A person or an organization who holds the full Intellectual Property Rights (Copyright, trademark, etc.) that subsist in the resource. The IPR holder could be different from the creator that may have assigned the rights to the IPR holder (e.g., an author as a creator assigns her rights to the publisher who is the IPR holder) and the distributor that holds a specific licence (i.e. a permission) to distribute the work via a specific distributor.
Label:IPR holder
Identifier:http://w3id.org/meta-share/meta-share/iprHolder
Domain:LanguageResource
Range:foaf:Agent

personalDataIncluded

Definition:Specifies whether the language resource contains personal data (mainly in the sense falling under the GDPR)
Label:Personal data included
Identifier:http://w3id.org/meta-share/meta-share/personalDataIncluded
Domain:LanguageResource
Range:xsd:boolean

personalDataDetails

Definition:If the resource includes personal data, this field can be used for entering more information, e.g., whether special handling of the resource is required (e.g., anonymization, further request for use, etc.)
Label:Personal data details
Identifier:http://w3id.org/meta-share/meta-share/personalDataDetails
Domain:LanguageResource
Range:rdf:langString

sensitiveDataIncluded

Definition:Specifies whether the language resource contains sensitive data (mainly in the sense falling under the GDPR)
Label:Sensitive data included
Identifier:http://w3id.org/meta-share/meta-share/sensitiveDataIncluded
Domain:LanguageResource
Range:xsd:boolean

sensitiveDataDetails

Definition:If the resource includes sensitive data, this field can be used for entering more information, e.g., whether special handling of the resource is required (e.g., anonymization, further request for use, etc.)
Label:Sensitive data details
Identifier:http://w3id.org/meta-share/meta-share/sensitiveDataDetails
Domain:LanguageResource
Range:rdf:langString

anonymized

Definition:Indicates whether the language resource has been anonymized
Label:anonymized
Identifier:http://w3id.org/meta-share/meta-share/anonymized
Domain:LanguageResource
Range:xsd:boolean

anonymizationDetails

Definition:If the resource has been anonymized, this field can be used for entering more information, e.g., tool or method used for the anonymization, by whom it has been performed, whether there was any check of the results, etc.
Label:Anonymization details
Identifier:http://w3id.org/meta-share/meta-share/anonymizationDetails
Domain:LanguageResource
Range:rdf:langString

DatasetDistribution and SoftwareDistribution

These are subclasses of dcat:Distribution catering for the distinct features of data and software resources, such as different distribution media (service or source code for software, content files or forms accessible via interfaces for data resources) and descriptive features (size for text, audio, video, image files, technical requirements on the input / output of software).

DatasetDistribution

Definition:Any form with which a dataset is distributed, such as a downloadable form in a specific format (e.g., spreadsheet, plain text, etc.) or an API with which it can be accessed
Label:Dataset Distribution
Identifier:http://w3id.org/meta-share/meta-share/DatasetDistribution
Sub-class Of:dcat:Distribution

SoftwareDistribution

Definition:Any form with which software is distributed (e.g., web services, executable or code files, etc.)
Label:Software Distribution
Identifier:http://w3id.org/meta-share/meta-share/SoftwareDistribution
Sub-class Of:dcat:Distribution

Distribution properties

Properties of the Distribution subclasses include:

The property licenceTerms addresses R1 and R1a. It is the property used for linking LRTs to the policy statements (class LicenceTerms) and it is mandatory in order to ensure legitimate access by end-users (see examples 1 - 5). Normally, each Distribution must have one licence attached. Multiple licences are also possible for the same Distribution in the following cases:

In both cases, the access procedure must take into account all licences attached to this Distribution.

LRTs whose licence is not identified in the formal description may be included in a catalogue for information purposes only, clearly marked as such and described with at least an access statement which is added with the accessRights property.
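
The rule above can be sketched as a simple check on a distribution record. The dictionary layout, the function name `classify_distribution` and the two returned labels are hypothetical; only the property names licenceTerms and accessRights come from this profile.

```python
def classify_distribution(record: dict) -> str:
    """Classify a distribution record (hypothetical dict layout).

    Returns 'accessible' when at least one licence is attached and
    'information-only' when there is no licence but an access rights
    statement is present; raises ValueError when both are missing.
    """
    licences = record.get("licenceTerms") or []
    if licences:
        # All attached licences must later be taken into account
        # by the access procedure.
        return "accessible"
    if record.get("accessRights"):
        return "information-only"
    raise ValueError("a distribution needs licenceTerms or accessRights")


print(classify_distribution(
    {"licenceTerms": ["http://purl.org/NET/rdflicense/cc-by4.0"]}))
print(classify_distribution({"accessRights": "restricted access"}))
```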

licenceTerms

Definition:Links the distribution (distributable form) of a language resource to the licence or terms of use/service (a specific legal document) with which it is distributed
Label:licence
Identifier:http://w3id.org/meta-share/meta-share/licenceTerms
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:ms:LicenceTerms

accessRights

Definition:Specifies the rights for accessing the distributable form(s) of a language resource (preferably in accordance with a formalised vocabulary)
Label:access rights
Identifier:http://w3id.org/meta-share/meta-share/accessRights
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:ms:AccessRightsStatement
Note:The property is mandatory when no licence is available for a resource; it is intended for resources catalogued for information purposes only. For the AccessRightsStatement, it is recommended to use a term from standardized vocabularies, such as the Open COAR Access Rights vocabulary or the statements provided by RightsStatements.org.

cost

Definition:Introduces the cost for accessing a resource or the overall budget of a project, formally described as a set of amount and currency unit
Label:cost
Identifier:http://w3id.org/meta-share/meta-share/cost
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:ms:Cost (ms:Cost is formally represented using two properties, amount and currency.)

membershipInstitution

Definition:Introduces an institution with members that can benefit from specific conditions on the use of a resource (e.g. discount, unlimited access, etc.)
Label:membership institution
Identifier:http://w3id.org/meta-share/meta-share/membershipInstitution
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:foaf:Organization

availabilityStartDate

Definition:Specifies the start date of availability of a resource - only for cases where a resource is available for a restricted time period.
Label:availability start date
Identifier:http://w3id.org/meta-share/meta-share/availabilityStartDate
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:xsd:date

availabilityEndDate

Definition:Specifies the end date of availability of a resource - only for cases where a resource is available for a restricted time period.
Label:availability end date
Identifier:http://w3id.org/meta-share/meta-share/availabilityEndDate
Domain:Union of ms:DatasetDistribution and ms:SoftwareDistribution
Range:xsd:date
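
A distributing entity could evaluate the two availability dates as in the sketch below. It assumes that both bounds are inclusive and that a missing date means no restriction on that side; neither point is stated in the definitions above, and the function name `is_available` is hypothetical.

```python
from datetime import date
from typing import Optional


def is_available(on: date,
                 start: Optional[date] = None,
                 end: Optional[date] = None) -> bool:
    """Check whether a distribution is available on a given day.

    Assumptions: bounds are inclusive; None means 'unbounded'.
    """
    if start is not None and on < start:
        return False
    if end is not None and on > end:
        return False
    return True


# Availability starting on 2021-06-30, with no end date.
print(is_available(date(2021, 7, 1), start=date(2021, 6, 30)))
print(is_available(date(2021, 6, 29), start=date(2021, 6, 30)))
```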

Policies (Licences and Terms of use/service)

LicenceTerms

This class covers legal documents, i.e., licences and terms of use or service for data resources and software.

LicenceTerms (capitalized)

Definition:A legal document (licence or terms of use/service) with which the language resource is distributed
Label:Licence
Identifier:http://w3id.org/meta-share/meta-share/LicenceTerms
Sub-class Of:cc:License

Licence properties

The following properties address identification of the licence and access to the licence text (R2 and R3):

name, short name, alternative title, URL (where the licence text can be found), logo, identifier.

The property conditionOfUse encodes the main conditions associated with the use of an LRT (e.g. non-commercial use, use by academic users only, etc.) as described in R4a and R4b. These can also be represented with ODRL properties (permission, prohibition, duty), but they are additionally provided as an object property for two reasons: (a) to help LRT providers with little or no prior experience of writing policies in ODRL; (b) to support the implementation of a filter enhancing the discoverability of LRTs in a catalogue (R3). The class ConditionOfUse contains a set of instances that are typical of licences in the LRT domain.

The object property licenceCategory addresses R4; i.e., it supports the encoding of terms that can be mapped to actions granting access to LRTs and the authorization procedures required for them. As with conditionOfUse, these can be mapped to ODRL rules and added to the RDF representations of a licence.

A set of such mappings has already been made for the typical cases in the LRT domain and is available below.

licenceTermsName

Definition:The name by which a legal document (e.g., licence, terms of use, terms of service) is known
Label:licence name
Identifier:http://w3id.org/meta-share/meta-share/licenceTermsName
Domain:ms:LicenceTerms
Range:rdf:langString
Subproperty of:dcterms:title

licenceTermsAlternativeName

Definition:Introduces an alternative name (other than the short name) used for a licence or terms of use
Label:licence alternative name
Identifier:http://w3id.org/meta-share/meta-share/licenceTermsAlternativeName
Domain:ms:LicenceTerms
Range:rdf:langString
Subproperty of:dcterms:alternative

licenceTermsShortName

Definition:Introduces the short name (abbreviation, acronym, etc.) used for a licence or terms of use document
Label:licence short name
Identifier:http://w3id.org/meta-share/meta-share/licenceTermsShortName
Domain:ms:LicenceTerms
Range:rdf:langString
Subproperty of:dcterms:alternative

licenceTermsURL

Definition:Links to the URL where the text of a licence/terms of use/service is found
Label:licence URL
Identifier:http://w3id.org/meta-share/meta-share/licenceTermsURL
Domain:ms:LicenceTerms
Range:xsd:anyURI

logo

Definition:Links to a symbol or graphic object used to identify an entity; please, add a URL with an image file
Label:logo
Identifier:http://w3id.org/meta-share/meta-share/logo
Domain:ms:LicenceTerms
Range:xsd:anyURI

dct:identifier

Definition:An unambiguous reference to the licence within a given context
Label:identifier
Identifier:http://purl.org/dc/terms/identifier
Domain:ms:LicenceTerms
Range:ms:LicenceIdentifier ("A string used to uniquely identify a licence")
Usage note:The LicenceIdentifier must be defined within a LicenceIdentifierScheme. The use of a standardized vocabulary, such as the SPDX identifiers, is highly recommended.

conditionOfUse

Definition:Links a licence with a specific condition/term of use imposed for accessing a language resource. It is an optional element and only to be taken as providing brief human readable information on the fact that the language resource is provided under a specific set of conditions. These correspond to the most frequently used conditions imposed by the licensor (via the specified licence). The proper exposition of all conditions and possible exceptions is to be found inside the licence text. Depositors should, hence, carefully choose the values of this field to match the licence chosen and users should carefully read that licence before using the language resource.
Label:condition of use
Identifier:http://w3id.org/meta-share/meta-share/conditionOfUse
Domain:ms:LicenceTerms
Range:ms:ConditionOfUse ("A condition imposed via a specified licence on the use of a language resource (e.g., non-commercial use, no derivatives, etc.)")
Usage note:The ConditionOfUse class includes a list of individuals with the most common conditions imposed in the domain of LRTs.

licenceCategory

Definition:Categorises a licence according to a classification scheme
Label:licence category
Identifier:http://w3id.org/meta-share/meta-share/licenceCategory
Domain:ms:LicenceTerms
Range:ms:LicenceCategory ("A classification scheme for licences")
Usage note:The LicenceCategory class includes a list of individuals that formalise the restrictions to be taken into account by hosting entities when giving access to LRTs and the type of actions that must or can be performed; for instance, whether users need to be authenticated, belong to a specific user type, whether the resource can be downloaded or processed, etc. For more information, see Section Actions and examples 7 - 12.
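
To illustrate how a hosting entity might act on licence categories, the sketch below turns a list of categories into the steps to enforce before granting access. The category names are those used in the licence examples of this document; the mapping to the constraint actions SignIn, SignLicence and Authorize is an assumption made for illustration, not a normative part of the profile, and `access_plan` is a hypothetical helper.

```python
# Assumed mapping from licence categories to the steps a hosting entity
# enforces before granting access (illustrative, not normative).
REQUIRED_STEPS = {
    "ms:allowsDirectAccess": [],
    "ms:requiresUserAuthentication": ["ms:SignIn"],
    "ms:allowsAccessWithSignature": ["ms:SignLicence"],
    "ms:requiresUserAuthorization": ["ms:Authorize"],
}


def access_plan(categories):
    """Derive the required steps and the processing flag for a licence."""
    steps = []
    for category in categories:
        for step in REQUIRED_STEPS.get(category, []):
            if step not in steps:  # keep first occurrence, drop repeats
                steps.append(step)
    return {
        "steps": steps,
        "processing_allowed": "ms:allowsProcessing" in categories,
    }


plan = access_plan([
    "ms:allowsAccessWithSignature",
    "ms:requiresUserAuthentication",
    "ms:allowsProcessing",
])
print(plan)
```

Categories that are not in the table are ignored here; a production system would more likely reject unknown categories.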

Parties

Party roles and types

With respect to the transactions of LRT distribution, the following roles can be identified:

For the scope of the current profile, these roles do not need to be formally represented, as the ODRL roles odrl:assigner and odrl:assignee suffice.

In terms of legal status, distributing entities can only be organizations, resource providers can be either organizations or individuals, and resource consumers can only be individuals, whether or not they are affiliated to organizations. These can be described with any relevant vocabulary, such as the W3C vCard Ontology or the FOAF Vocabulary, enriched with META-SHARE and ODRL properties as required.

Party properties

The only property specific to this profile is userType, which divides LRT consumers into commercial and academic ones. This distinction is required because it determines restrictions on using an LRT, or its price or billing model.

userType

Definition:Used to identify the type of user of the resource (affiliated with a commercial vs. academic institution); it is usually associated with licensing or pricing conditions on the use of a resource
Label:user type
Identifier:http://w3id.org/meta-share/meta-share/userType
Domain:foaf:Agent
Range:ms:UserType ("Classification type for users of a language resource; usually required for assessing the licensing or pricing policies for the use of the resource")
Usage note:The UserType class lists two individuals, namely academic and commercial.

Actions

Operations

To cover the operations performed in the context of this profile, the following concepts from the ODRL and META-SHARE vocabularies are used: odrl:execute, odrl:aggregate, ms:Download, ms:Process and ms:Combine.

Download

Definition:Download of a content file (with data or executable file) on one's local system
Label:Download
Identifier:http://w3id.org/meta-share/meta-share/Download
Sub-class Of:odrl:Action

Process

Definition:To process a data resource with a language processing service (e.g. to create an annotated version, or to extract elements from it)
Label:Process
Identifier:http://w3id.org/meta-share/meta-share/Process
Sub-class Of:odrl:Action

Combine

Definition:Combine two or more resources of different resource types (e.g. a model and a tool/service) in order to deploy them together
Label:Combine
Identifier:http://w3id.org/meta-share/meta-share/Combine
Sub-class Of:odrl:Action

Actions for Constraints

The following subclasses of odrl:Action are used in constraints on rules to describe common practices in the domain of LRT distribution: SignIn, Authorize, SignLicence, Report, Deposit.

SignIn

Definition:To sign in to a system or application
Label:SignIn
Identifier:http://w3id.org/meta-share/meta-share/SignIn
Sub-class Of:odrl:Action

Authorize

Definition:To be formally allowed to access and make use of a resource
Label:Authorize
Identifier:http://w3id.org/meta-share/meta-share/Authorize
Sub-class Of:odrl:Action

SignLicence

Definition:The obligation of the assignee to sign a licence before being allowed to use a resource
Label:SignLicence
Identifier:http://w3id.org/meta-share/meta-share/SignLicence
Sub-class Of:odrl:Action

Report

Definition:Give an account of an event related to the resource (e.g. a research plan, the use of the resource in a project or publication, etc.)
Label:Report
Identifier:http://w3id.org/meta-share/meta-share/Report
Sub-class Of:odrl:Action

Deposit

Definition:The action of uploading a resource to a hosting entity (repository), e.g. for preservation purposes
Label:Deposit
Identifier:http://w3id.org/meta-share/meta-share/Deposit
Sub-class Of:odrl:Action

Examples

Examples of LRTs

# A corpus with one downloadable distribution under a single licence
ex:Resource1 a ms:Corpus ;
  ms:distribution
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://purl.org/NET/rdflicense/cc-by4.0>
    ] .

# A corpus with two distributable forms, each with its own licence statement
ex:Resource1 a ms:Corpus ;
  ms:distribution
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://purl.org/NET/rdflicense/cc-by4.0>
    ] ;
  ms:distribution
    [ ms:datasetDistributionForm ms:accessibleWithInterface ;
      dct:license <http://purl.org/NET/rdflicense/cc-by4.0>
    ] .

# A web service whose distribution carries two licences
ex:Resource1 a ms:ToolService ;
  ms:distribution
    [ ms:softwareDistributionForm ms:webservice ;
      dct:license <http://rdflicense.appspot.com/rdflicense/cc-by4.0.ttl> ;
      dct:license <http://purl.org/NET/rdflicense/APACHE2.0.ttl>
    ] .

# A corpus whose downloadable distribution carries two licences
ex:Resource1 a ms:Corpus ;
  ms:distribution
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://rdflicense.appspot.com/rdflicense/cc-by4.0.ttl> ;
      dct:license <http://rdflicense.appspot.com/rdflicense/cc-by-nc-4.0.ttl>
    ] .

# The same licence offered to academic users and, at a cost, to commercial users
ex:Resource1 a ms:Corpus ;
  ms:distribution
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://rdflicense.appspot.com/rdflicense/elra-end-user.ttl> ;
      ms:userType ms:academic
    ] ,
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://rdflicense.appspot.com/rdflicense/elra-end-user.ttl> ;
      ms:cost [
        ms:amount "10000" ;
        ms:currency ms:euro
      ] ;
      ms:userType ms:commercial
    ] .

# A distribution that becomes available on a given date
ex:Resource1 a ms:Corpus ;
  ms:distribution
    [ ms:datasetDistributionForm ms:downloadable ;
      dct:license <http://purl.org/NET/rdflicense/cc-by4.0> ;
      ms:availabilityStartDate "2021-06-30"^^xsd:date
    ] .
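
The example with different conditions for academic and commercial users can be read as a selection problem: given the user type of the consumer, which distributions apply? The sketch below mirrors that example as plain dictionaries. The dictionary layout and the function `offers_for` are hypothetical, and a distribution without a userType is assumed to be open to every user.

```python
def offers_for(distributions, user_type):
    """Return the distributions offered to a given user type.

    Assumption: a distribution with no 'userType' applies to everyone.
    """
    return [d for d in distributions
            if d.get("userType") in (None, user_type)]


# Data mirroring the academic/commercial example above.
distributions = [
    {"form": "ms:downloadable", "userType": "ms:academic"},
    {"form": "ms:downloadable", "userType": "ms:commercial",
     "cost": {"amount": "10000", "currency": "ms:euro"}},
]

for offer in offers_for(distributions, "ms:commercial"):
    print(offer.get("cost"))
```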

Examples of Licences

<http://purl.org/NET/rdflicense/ilsp/CC0-1.0>
  a odrl:Policy ;
  rdfs:label "Creative Commons Zero v1.0 Universal" ;
  dct:source <https://creativecommons.org/publicdomain/zero/1.0/legalcode> ;
  dct:language <http://www.lexvo.org/page/iso639-3/eng> ;
  dct:identifier "CC0-1.0" ;
  rdfs:seeAlso <http://spdx.org/licenses/CC0-1.0> ;
  ms:licenceCategory ms:allowsDirectAccess ;
  ms:licenceCategory ms:allowsProcessing ;
  ms:conditionOfUse ms:noConditions ;
  odrl:permission [
          a odrl:Permission ;
          odrl:action ms:download , odrl:execute , ms:process, odrl:aggregate 
  ] .
 <http://purl.org/NET/rdflicense/elra-var>
  a odrl:Policy ;
  rdfs:label " LANGUAGE RESOURCES VALUE-ADDED-RESELLER AGREEMENT" ;
  dct:source <http://www.elda.org/media/filer_public/2014/08/01/var_140312.pdf> ;
  dct:language <http://www.lexvo.org/page/iso639-3/eng> ;
  ms:conditionOfUse ms:noRedistribution , ms:languageEngineeringResearch , ms:spatialConstraint , cc:Attribution ;
  ms:licenceCategory ms:allowsAccessWithSignature ;
  ms:licenceCategory ms:requiresUserAuthentication ;
  ms:licenceCategory ms:allowsProcessing ;
  odrl:permission [
          a odrl:Permission ;
          odrl:action ms:download , ms:process, ms:execute, odrl:aggregate, ms:combine ;
                odrl:constraint
                        [ a odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:spatial "only at assignee's site"
                        ] ;
                        [ a odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:purpose ms:languageEngineeringResearch
                        ] ;
                        [ a odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:assignee ms:sign
              ] ;
                        [ a odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:assignee ms:signIn
                        ] 
                odrl:duty
                        [ odrl:action cc:Attribution
                        ] 
    ] ;
  odrl:prohibition
              [ odrl:action cc:Distribution
              ] . 
<http://purl.org/NET/rdflicense/ilsp/ms-c-nored>
  a odrl:Policy ;
  rdfs:label "META-SHARE Commercial NoRedistribution" ;
  dct:source <http://www.meta-net.eu/meta-share/meta-share-licenses/META-SHARE%20Commercial%20%20NoRedistribution_v0%207.pdf> ;
  ms:conditionOfUse ms:noRedistribution , ms:languageEngineeringResearch , ms:attribution ;
  ms:licenceCategory ms:requiresUserAuthorization ;
  ms:licenceCategory ms:allowsProcessing ;
  odrl:permission [
          a odrl:Permission ;
          odrl:action ms:download , ms:process, ms:execute, odrl:aggregate, ms:combine ;
                odrl:constraint
                        [ a       odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:spatial "only at assignee's site"
                        ] ;
                odrl:constraint
                        [ a       odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:purpose ms:languageEngineeringResearch
                        ] ;
                odrl:constraint
                        [ a odrl:Constraint ;
                          odrl:operator odrl:eq ;
                          odrl:assignee ms:authorize
                        ] ;
                odrl:duty
                        [ odrl:action cc:Attribution
                        ] 
  ] ;
  odrl:prohibition
              [ odrl:action cc:Distribution
              ] .

Processing

  • 0 to 100 characters per day: the user has to pay 10 USD;
  • 101 to 500 characters per day: the user has to pay 50 USD.
[] a odrl:Duty ;
   odrl:action
     [ a md:Compensate ;
       odrl:payAmount 10.00 ;
       odrl:unit <https://www.wikidata.org/wiki/Q4917> ; # USD
       odrl:unitOfCount [ a ms:Character ] ;
       odrl:refinement
         [ odrl:operator odrl:lt ;
           odrl:count 100 ;
           odrl:meteredTime odrl:lteq "P1D" ]
     ] ,
     [ a md:Compensate ;
       odrl:payAmount 50.00 ;
       odrl:unit <https://www.wikidata.org/wiki/Q4917> ; # USD
       odrl:unitOfCount [ a ms:Character ] ;
       odrl:refinement
         [ odrl:operator odrl:gt ;
           odrl:count 100 ;
           odrl:meteredTime odrl:lteq "P1D" ] ,
         [ odrl:operator odrl:lt ;
           odrl:count 500 ;
           odrl:meteredTime odrl:lteq "P1D" ]
     ] .
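The tiered compensation duty above can also be read procedurally. The following is a minimal Python sketch of how a billing module might look up the daily fee; it is an illustration and not part of the profile. The names `DAILY_TIERS` and `daily_fee` are hypothetical, and the tier bounds follow the inclusive ranges of the bullet list (0 to 100 and 101 to 500) rather than the strict operators of the policy. The sketch makes no assumption about counts above 500, since no tier is defined for them.

```python
from decimal import Decimal

# Hypothetical in-memory form of the two pricing tiers listed above:
# (lowest count, highest count, fee in USD), with inclusive bounds.
DAILY_TIERS = [
    (0, 100, Decimal("10.00")),
    (101, 500, Decimal("50.00")),
]


def daily_fee(characters: int) -> Decimal:
    """Return the fee in USD for the characters processed in one day."""
    if characters < 0:
        raise ValueError("character count cannot be negative")
    for low, high, fee in DAILY_TIERS:
        if low <= characters <= high:
            return fee
    # No tier is defined beyond 500 characters, so the sketch refuses to guess.
    raise ValueError("no pricing tier covers this character count")


print(daily_fee(80))   # 10.00
print(daily_fee(250))  # 50.00
```

Amounts are kept as `Decimal` values so that fees are not subject to binary floating-point rounding.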

Execution of service for up to 100 characters per day

[] a odrl:Permission ;
   odrl:action
     [ a odrl:execute ;
       odrl:unitOfCount [ a ms:Character ] ;
       odrl:refinement
         [ odrl:operator odrl:lt ;
           odrl:count 100 ;
           odrl:meteredTime odrl:lteq "P1D" ]
     ] .

Execution of service for up to 100 API calls and up to 10 GB of processing volume per day

[] a odrl:Permission ;
   odrl:action
     [ a odrl:execute ;
       odrl:unitOfCount [ a ms:apiCall ] ;
       odrl:refinement
         [ odrl:operator odrl:lt ;
           odrl:count 100 ;
           odrl:meteredTime odrl:lteq "P1D" ] ;
       odrl:unitOfCount [ a ms:processedGB ] ;
       odrl:refinement
         [ odrl:operator odrl:lt ;
           odrl:count 10 ;
           odrl:meteredTime odrl:lteq "P1D" ]
     ] .
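To illustrate how a distributing entity might enforce such a metered permission, here is a minimal Python sketch of a quota check. It is an assumption-laden illustration, not part of the profile: `DailyUsage`, `may_execute` and the two limit constants are hypothetical names, and both limits are treated as strict upper bounds because the policy uses `odrl:lt`. Resetting the counters at the end of each day (the "P1D" period) is left out.

```python
from dataclasses import dataclass


@dataclass
class DailyUsage:
    """Hypothetical per-user counters, assumed to be reset every day."""
    api_calls: int = 0
    processed_gb: float = 0.0


# Limits taken from the permission above; strict because the policy uses odrl:lt.
MAX_API_CALLS = 100
MAX_PROCESSED_GB = 10.0


def may_execute(usage: DailyUsage, request_gb: float) -> bool:
    """Return True if one more call of request_gb stays under both daily limits."""
    calls_after = usage.api_calls + 1
    volume_after = usage.processed_gb + request_gb
    return calls_after < MAX_API_CALLS and volume_after < MAX_PROCESSED_GB


print(may_execute(DailyUsage(api_calls=10, processed_gb=2.0), request_gb=1.0))
```

Both conditions must hold at the same time, which mirrors the two refinements attached to the same action in the policy.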

Acknowledgements

This work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182 (Prêt-à-LLOD) and from the COST Action "Nexus Linguarum" (CA18209).

A service that transforms XML files of licences into ODRL policies was developed in Athens in July 2021 by the Institute for Language and Speech Processing (Athena RC) and the Ontology Engineering Group (Universidad Politécnica de Madrid). It is available at https://rdflicense.linkeddata.es/service.html and as an API at https://rdflicense.linkeddata.es/swagger-ui.html.

A. User requirements

User requirements on LRT hosting/distributing entities from LRT providers

UR1. As an LRT provider, I want to make my LRT available with a licence.

UR2. As an LRT provider, I want to make available different forms of my LRT with different licences.

UR3. As an LRT provider, I want to be able to release my LRT with a custom licence that includes access restrictions.

UR4. As a provider of a commercial LT, I want to limit access to my service based on a count of specific units (e.g., count of API requests, characters to be processed, audio/video seconds to be processed) for a limited time period (e.g., on a weekly basis).

UR5. As a provider of a commercial LT, I want to give access to my service with a billing model that takes such counts into account (cf. UR4).

User requirements on LRT hosting/distributing entities from LRT consumers

UR6. As an LRT consumer, I want to search for LRTs based on their licences.

UR7. As an LRT consumer, I want to search for LRTs based on broad usage conditions.

UR8. As an LRT consumer, I want to obtain access to (use, deploy, process, download, …) an LRT.

UR9. As an LRT consumer, I want to view the licence of an LRT.

UR10. As an LRT consumer, I want to be able to combine LRTs.

User requirements of LRT hosting/distributing entities

UR11. As a distributing entity, I want to give access to LRTs and impose restrictions based on their licensing conditions:

a. allow users to download/access/consume data resources/services/tools with permissive licences without any interaction;

b. give access only to authorized users for LRTs requiring authorization;

c. require users to sign before they get access to a resource/service whose licence requires a signature;

d. enforce conditions stemming from multiple licences on the same distribution (e.g. combination of software & data);

e. give access only to authenticated users for LRTs requiring authentication;

f. support a billing module that takes into account reductions / restrictions for members of a group.

UR12. As a distributing entity, I want to monitor the use of LT processing services in order to limit or bill users when accessing them.

UR13. As a distributing entity, I want to support LRT providers in attaching one or more licences to distributions of LRTs.

UR14. As a distributing entity, I want to ask the providers to describe the conditions of use for the new licences.

UR15. As a distributing entity, I want to have information about an LRT that may affect its licence (e.g. inclusion of non-anonymized sensitive data).

UR16. As a distributing entity, I want to know the usage conditions under which an LRT is released, preferably in a machine-readable form.

UR17. As a distributing entity, I want to aggregate the set of licences of multiple data resources into one set of licences for a collection, or attach a single licence to a collection of resources.
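Requirements UR11(d) and UR17 both involve combining the conditions of several licences. As a rough illustration only, the Python sketch below merges the conditions of use of the licences in a collection by taking their union, on the assumption that every condition of every member licence must be respected. The function name `aggregate_conditions` and the licence keys are hypothetical, and the sketch does not detect licences whose terms are legally incompatible.

```python
def aggregate_conditions(licences: dict[str, set[str]]) -> set[str]:
    """Return the union of the conditions of use of all licences in a collection."""
    combined: set[str] = set()
    for conditions in licences.values():
        combined |= conditions
    return combined


# Hypothetical collection: a corpus and a tool distributed together.
collection = {
    "corpus-licence": {"ms:attribution", "ms:nonCommercialUse"},
    "tool-licence": {"ms:attribution", "ms:noRedistribution"},
}

print(sorted(aggregate_conditions(collection)))
```

A union is the simplest conservative choice; a production service would also need rules for conditions that cannot be satisfied together.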

B. Dataset of licences

To assist providers of LRTs, a set of licences in ODRL is available at ? in the form of three interpretations:

The META-SHARE interpretation of licences includes the SPDX identifier, thus linking the licences to the official SPDX License List. For each licence and exception, the SPDX License List provides a standardized short identifier, the full name, the vetted licence text (including matching guidelines markup where appropriate) and a canonical permanent URL; it is available in various serialization formats in the SPDX GitHub repository.

The META-SHARE set of licences includes representations of licences that are popular in the domain of LRTs: both general-purpose standard licences, such as Creative Commons, and standard licences created specifically for the distribution of LRTs, namely the ELRA, META-SHARE and CLARIN licences.

C. Mappings for transformation of META-SHARE elements to ODRL policies

Mappings for condition of use

| Rule type | Condition of use | ODRL action/rule representation |
|---|---|---|
| Prohibition | ms:noDerivatives | odrl:prohibition [ odrl:action cc:DerivativeWorks ] |
| Prohibition | ms:nonCommercialUse | odrl:prohibition [ odrl:action cc:CommercialUse ] |
| Prohibition | ms:noRedistribution | odrl:prohibition [ odrl:action cc:Distribution ] |
| Duty | ms:attribution | odrl:duty [ odrl:action cc:Attribution ] |
| Duty | ms:shareAlike | odrl:duty [ odrl:action cc:ShareAlike ] |
| Duty | ms:redeposit | odrl:duty [ odrl:action [ rdf:value ms:deposit ; odrl:refinement [ a odrl:Constraint ; odrl:leftOperand ms:depositingParty ; odrl:operator odrl:eq ; odrl:rightOperand odrl:assigner ] ] ] |
| Duty | ms:informLicensor | odrl:duty [ odrl:action [ rdf:value ms:report ; odrl:refinement [ a odrl:Constraint ; odrl:leftOperand odrl:informedParty ; odrl:operator odrl:eq ; odrl:rightOperand odrl:assigner ] , [ a odrl:Constraint ; odrl:leftOperand ms:objectOfReport ; odrl:operator odrl:eq ; odrl:rightOperand odrl:Use ] ] ] |
| Constraint | ms:spatial | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:spatial ; odrl:rightOperand "only at assignee's site" ] |
| Constraint | ms:academicUseOnly | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:purpose ; odrl:rightOperand ms:academicUse ] |
| Constraint | ms:evaluationUse | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:purpose ; odrl:rightOperand ms:evaluation1 ] |
| Constraint | ms:languageEngineeringResearchUse | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:purpose ; odrl:rightOperand ms:languageEngineeringResearch ] |
| Constraint | ms:researchUse | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:purpose ; odrl:rightOperand ms:research ] |
| Constraint | ms:trainingUse | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:purpose ; odrl:rightOperand ms:training ] |
| Constraint | ms:academicUser | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:userType ; odrl:rightOperand ms:academic ] |
| Constraint | ms:commercialUser | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:eq ; odrl:leftOperand odrl:userType ; odrl:rightOperand ms:commercial ] |
| Constraint | ms:memberOfAssociation1 | odrl:constraint [ a odrl:Constraint ; odrl:operator odrl:isPartOf ; odrl:leftOperand odrl:assignee ; odrl:rightOperand "association X" ] |
| Constraint | ms:requestPlan | odrl:duty [ odrl:action [ rdf:value ms:report ; odrl:refinement [ a odrl:Constraint ; odrl:leftOperand odrl:informedParty ; odrl:operator odrl:eq ; odrl:rightOperand odrl:assigner ] , [ a odrl:Constraint ; odrl:leftOperand ms:objectOfReport ; odrl:operator odrl:eq ; odrl:rightOperand ms:ResearchPlan ] ] ] |
| Constraint | ms:userIdentified | odrl:duty [ odrl:action ms:signIn ] |
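The mappings above lend themselves to a simple lookup when generating policy templates. The Python sketch below covers only the single-action rules and the purpose constraints; it is a hedged illustration, not the implementation of any existing service. The names `SIMPLE_RULES`, `PURPOSES` and `to_turtle` are hypothetical, the action for ms:noRedistribution is spelled cc:Distribution as in the policy examples, and the refined duties (ms:redeposit, ms:informLicensor, ms:requestPlan) are deliberately left out.

```python
# Condition of use -> (ODRL rule property, action), for single-action rules.
SIMPLE_RULES = {
    "ms:noDerivatives": ("odrl:prohibition", "cc:DerivativeWorks"),
    "ms:nonCommercialUse": ("odrl:prohibition", "cc:CommercialUse"),
    "ms:noRedistribution": ("odrl:prohibition", "cc:Distribution"),
    "ms:attribution": ("odrl:duty", "cc:Attribution"),
    "ms:shareAlike": ("odrl:duty", "cc:ShareAlike"),
}

# Condition of use -> right operand of a purpose constraint.
PURPOSES = {
    "ms:academicUseOnly": "ms:academicUse",
    "ms:languageEngineeringResearchUse": "ms:languageEngineeringResearch",
    "ms:researchUse": "ms:research",
    "ms:trainingUse": "ms:training",
}


def to_turtle(condition: str) -> str:
    """Return a Turtle fragment for one condition of use (sketch only)."""
    if condition in SIMPLE_RULES:
        rule, action = SIMPLE_RULES[condition]
        return f"{rule} [ odrl:action {action} ]"
    if condition in PURPOSES:
        return (
            "odrl:constraint [ a odrl:Constraint ; "
            "odrl:leftOperand odrl:purpose ; "
            "odrl:operator odrl:eq ; "
            f"odrl:rightOperand {PURPOSES[condition]} ]"
        )
    raise KeyError(f"no mapping sketched for {condition}")


for value in ("ms:attribution", "ms:researchUse"):
    print(to_turtle(value))
```

Keeping the mapping in plain dictionaries makes it easy to extend the sketch with further rows of the table.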

Mappings for licence category

The first two values of the licence category (allowsDirectAccess and allowsProcessing) specify the types of actions that the distributing entity may perform, while the other values refine the constraints under which these actions may be performed.
