Provenance Description of Metadata using PROV with PREMIS for Long-term Use of Metadata

Chunqiu Li, Shigeo Sugimoto

Abstract


Provenance description is necessary for the long-term preservation of digital resources. PREMIS and OAIS, two well-known standards designed for digital preservation, define descriptive elements for this purpose. Metadata, which is data about a primary digital resource, has to be preserved along with the primary resource. However, because technology and the information context change, metadata is at risk of damage or even loss. Thus, metadata preservation is as important as digital object preservation. Metadata provenance is a rather new research topic, but it is critical for keeping metadata about preserved resources consistent over time. This paper discusses provenance description in two aspects: provenance of digital objects and provenance of metadata, including metadata schemas. These are called digital provenance and metadata provenance, respectively. The goal of this paper is to clarify the concepts of digital provenance and metadata provenance based on well-known standards such as PREMIS, OAIS, and PROV, and to propose a novel model of provenance description for digital preservation based on the ontologies of PREMIS and PROV. The paper first explains digital provenance and metadata provenance. Next, it outlines some major models and standards for provenance description. Then, it proposes integrating PROV-O with the PREMIS OWL Ontology in order to merge the provenance description model in PROV-O with the digital preservation model in the PREMIS OWL Ontology. The paper also presents the merged model using some maintenance scenarios of digital objects and metadata. Lastly, it discusses metadata schema provenance, metadata object provenance, and some other open issues.
