Automatic Literature Metadata Extraction from DataCite Services

Author(s): Kun Ma*.

Journal Name: Recent Patents on Computer Science

Volume 11 , Issue 1 , 2018

Graphical Abstract:


Abstract:

Background: Generally, it is difficult to obtain the literature metadata in a unified way because the source data of literature is heterogeneous. Researchers developed a series of systems marked by digital object to manage them and obtained a good effect. Though there are several DOI systems, we face with some problems in promoting the use of them.

Objective: To address this issue of promoting literature identifier extraction, this paper has proposed automatic literature metadata extraction from DataCite services.

Method: This paper describes Patent Publication Number CN103279361A, titled "Method and System for Bookmark-triggered Literature Sharing", issued by State Intellectual Property Office of the P.R.C. on January 27, 2016. A literature metadata extraction system supporting both personal computer and mobile terminal is developed using the integration of DataCite content negotiation, DataCite metadata search, and HTML template extraction. The architecture of this system is divided into model, view, service and controller. An important contribution of this article is to design a cross-platform and universal way to extract digital literature with/without DOI.

Results: The analysis of application's effect and piratical test case show the ability to verify the authenticity of automatic literature metadata extraction from DataCite services. The contributions of our method are literature identifier extraction from DOI proxy, template extraction using Roadrunner, and bookmarklet-based literature sharing.

Conclusion: The idea and a disclosed embodiment of a patent (Patent CN103279361A, issued by State Intellectual Property Office of the P.R.C.) are presented, which is based on the distribution of literature metadata extraction. In one disclosed embodiment, this method contains literature identifier extraction from DOI proxy, template extraction using Roadrunner, and bookmarklet-based literature sharing.

Keywords: Digital object identifier, literature extraction, metadata, literature sharing, bookmarklet, proxy, roadrunner, template.

Rights & PermissionsPrintExport Cite as

Article Details

VOLUME: 11
ISSUE: 1
Year: 2018
Page: [25 - 31]
Pages: 7
DOI: 10.2174/2213275911666180627093515

Article Metrics

PDF: 6