Full Paper

A Model to Support Interpretation of Embedded Metadata without Formal Schema by Linking a Metadata Instance to DCMI Description Set Profiles

Abstract

Many HTML documents on the Web include metadata, and many information services provide metadata embedded using metadata standards across domains. That metadata is, however, encoded in various schemas and serialization formats, which makes it hard to extract and interpret automatically. The primary reason for the difficulty is the lack of interpretation rules for the metadata, e.g., missing definitions of metadata vocabularies and of encoding syntax. This paper proposes a model to support interpretation of embedded metadata without a formal schema by linking a metadata instance to DCMI Description Set Profiles (DSP). An XPath expression addresses a metadata instance encoded in HTML, and a DSP defines the metadata schema. We propose extending DSP to include XPath expressions for linking a metadata instance to a metadata schema. This paper also presents an experimental system that extracts metadata using the extended DSP.
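The linking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' system or the actual DSP syntax: the `PROFILE` list is a hypothetical stand-in for an extended DSP, the `PAGE` markup and the `extract` function are invented for illustration, and the page is assumed to be well-formed XHTML so that Python's standard `xml.etree.ElementTree` (which supports only a subset of XPath) can parse it.

```python
import xml.etree.ElementTree as ET

# Hypothetical well-formed XHTML page with embedded metadata.
PAGE = """<html>
  <head>
    <title>Sample page</title>
    <meta name="DC.title" content="A Sample Document"/>
    <meta name="DC.creator" content="Jane Doe"/>
  </head>
  <body><p>Body text.</p></body>
</html>"""

# Hypothetical stand-in for an extended DSP: each statement template
# pairs a property URI with an XPath that locates the value in the page.
# ElementTree's XPath subset cannot select attributes directly, so the
# attribute to read is given separately (an assumption of this sketch).
PROFILE = [
    {"property": "http://purl.org/dc/terms/title",
     "xpath": ".//meta[@name='DC.title']", "attribute": "content"},
    {"property": "http://purl.org/dc/terms/creator",
     "xpath": ".//meta[@name='DC.creator']", "attribute": "content"},
]


def extract(page, profile):
    """Return (property URI, value) pairs found by each template's XPath."""
    root = ET.fromstring(page)
    statements = []
    for template in profile:
        for node in root.findall(template["xpath"]):
            value = node.get(template["attribute"])
            if value is not None:
                statements.append((template["property"], value))
    return statements


if __name__ == "__main__":
    for prop, value in extract(PAGE, PROFILE):
        print(prop, value)
```

The point of the sketch is only that the XPath addresses the instance while the profile entry supplies the interpretation (here, a property URI); real-world HTML would typically need a more tolerant parser and a full XPath engine.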

Author information

Tsunagu Honma
Graduate School of Library, Information and Media Studies, University of Tsukuba, JP
Mitsuharu Nagamori
University of Tsukuba, JP
Shigeo Sugimoto
University of Tsukuba, JP

Cite this article

Honma, T., Nagamori, M., & Sugimoto, S. (2012). A Model to Support Interpretation of Embedded Metadata without Formal Schema by Linking a Metadata Instance to DCMI Description Set Profiles. International Conference on Dublin Core and Metadata Applications, 2012. https://doi.org/10.23106/dcmi.952135962

DOI : 10.23106/dcmi.952135962

The metadata and citations of this article are published under the Creative Commons Zero Universal Public Domain Dedication (CC0), allowing unrestricted reuse. Anyone can freely use the metadata from DCPapers articles for any purpose without limitations.
The full text of this article is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license allows use, sharing, adaptation, distribution, and reproduction in any medium or format, provided that appropriate credit is given to the original author(s) and the source is cited.