Project Details
Automatic Harvesting, Indexing and Provision of Open Access Figures from the fields of Engineering and Technology Using the Infrastructure of Wikimedia Commons and Wikidata
Applicants
Professorin Dr. Ina Blümel, since 7/2018; Professor Dr. Christian Wartena
Subject Area
Security and Dependability, Operating-, Communication- and Distributed Systems
Term
from 2016 to 2019
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 315976924
Multimedial objects, especially images and figures, are essential for the visualization and interpretation of research findings. The distribution and reuse of these scientific objects is significantly improved under open access conditions, for instance in Wikipedia articles, in research literature, as well as in education and knowledge dissemination, where licensing of images often represents a serious barrier.Whereas scientific publications are retrievable through library portals or other online search services due to standardized indices there is no targeted retrieval and access to the accompanying images and figures yet. Consequently there is a great demand to develop standardized indexing methods for these multimedial open access objects in order to improve the accessibilty to this material. To address the objective it is necessary to develop a process for automatic harvesting and indexing of multimedial open access objects. Wikimedia Commons, operated by the Wikimedia Foundation, is one of the most important open access media collections and a platform and service for the harvesting and distribution thereof. Wikimedia Commons collects freely licensed objects, mainly images and figures, including their metadata and provides them to Wikipedia and all other web users.It is the aim of this project to develop a process for the automatic harvesting and indexing of articles including their images and figures derived from quality controlled open access journals in the fields of engineering and technology, using the infrastructure of Wikimedia Commons and Wikidata. To reach this goal it is planned to harvest scientific publications and subsequently extract and enrich the metadata of the accompanying images and figures in order to generate a standardized index.The main outcome of the project will be the provision of an information service which improves the retrieval of and the access to freely licenced images and figures from the fields of engineering and technology.
DFG Programme
Science Communication, Research Data, eResearch (Scientific Library Services and Information Systems)
Ehemaliger Antragsteller
Dr. Peter Löwe, until 7/2018