Introduction
From an inventory among the various subdomain work packages in ENVRI-FAIR, WP6 concluded that Provenance Tracing was high on the priority list of training requirements for RI’s.
Using the very relevant outcomes of the ENVRIplus project as written down in Deliverable D8.5 (Data provenance and tracing for environmental sciences: system design), we have put together a series of events all focussing on this topic:
- An introductory webinar addressing the basic theoretical background:
- What is provenance
- Why is it important
- What does provenance relate to
- How is provenance recorded
- Recording methods
- Provenance and FAIRness;
- Three technical demonstrators of provenance tracing:
- The PROV Template approach – Doron Goldfarb
- The ENVRI-FAIR demonstrator focusing on provenance, and modeling a specific data production/processing workflow used at NILU – Markus Stocker
- The EPOS Approach using the main catalogue – Keith Jeffery, Daniele Bailo;
- A workshop-type session in collaboration with WP7 to discuss cases, issues, problems and questions collected from the community, working towards practical support and -possibly- solutions.
During the technical demonstrator sessions, we aim to collect input for the final workshop, like concrete cases, questions, requests for help on specific issues from the ENV RI’s.
Presenters
- Introductory course: hosted by Keith Jeffery (EPOS) and Doron Goldfarb (LTER)
- Technical Demonstrators: Markus Stocker, Doron Goldfarb, Keith Jeffery, Daniele Bailo
- ‘Bring your own provenance cases’ workshop: All trainers from this training event, WP7 staff (Zhiming Zhao et al.) plus relevant experts from the RI’s
Target groups
The primary target group for the training is staff at the ENVRI data centres, especially those concerned with data management and service architecture. Other target groups are staff of data centres of key local, regional and national institutions dealing with environmental data, and early-career scientists (MS and PhD students and post-docs) associated with the ENVRIs and their respective end-user communities.
Dates
- Introductory course: Thursday 19th November, 14.00-16.30h CET
- What is provenance
- Why is it important
- What does provenance relate to
- How is provenance recorded
- Recording methods
- Provenance and FAIRness;
- Technical Demonstrators:
- Wednesday 2nd December, 15.00-16.00h CET: PROV Template is a proposed extension to the existing set of W3C PROV specifications, dedicated to providing a formal language to describe templates for PROV conforming documents, a formal specification to describe bindings between source documents and template variables and a mechanism to instantiate templates using such bindings. This demonstrator will provide an introduction how to use this specification to turn existing log files available e.g. as csv into conforming PROV documents and point to existing tools in this regard – Doron Goldfarb;
- Wednesday 9th December, 15.00-16.00h CET: The ENVRI-FAIR demonstrator, focusing on provenance, and modeling a specific data production/processing workflow used at NILU. This technology demonstrator shows how to use PROV-O to represent provenance information for an example data processing workflow. – Markus Stocker
- Wednesday 16th December, 15.00-16.00h CET: The EPOS Approach using the main catalogue – Daniele Bailo, Keith Jeffery
- ‘Bring your own provenance cases’ workshop: Thursday 18th February, 14.00-17.30h CET
Location
All events will be organized as online events, using GoToMeeting as tool (participants will receive the link to the online meetings well in time before the respective events) and the ENVRI training platform for materials (https://training.envri.eu)
Organizers
WP6 team, together with the trainers and WP7
Registration
Registration to these events is required. You can register here: https://zfrmz.eu/F5iMdqJK3gLUXVynEmSM
Materials
For preparatory reading:
- D. Goldfarb et al.: ENVRIplus D8.5 Data provenance and tracing for environmental sciences: system design
- Z. Zhao, M. Hellström (Eds.): Towards Interoperable Research Infrastructures for Environmental and Earth Sciences, Data Provenance: https://envri.eu/wp-content/uploads/2020/07/Data-Provenance.pdf
- L. Moreau: PROV-Template: A Quick Start
- L. Moreau et al: A Templating System to Generate Provenance ProvToolbox (Software)
- D. Goldfarb et al: EnvriPlus D8.6 Data provenance and tracing for environmental sciences: Prototype and deployment
Contact
For more information about the training: Jacco Konijn