Sigal Arie Erez, Tobias Blanke, Mike Bryant, Kepa Rodriguez, Reto Speck and Veerle Vanden Daelen
Abstract
Purpose
This paper aims to describe the European Holocaust Research Infrastructure (EHRI) project's ongoing efforts to virtually integrate trans-national archival sources via the reconstruction of collection provenance as it relates to copy collections (material copied from one archive to another) and the co-referencing of subject and authority terms across material held by distinct institutions.
Design/methodology/approach
This paper is a case study of approximately 6,000 words in length. The authors describe the scope of the problem of archival fragmentation from both cultural and technical perspectives, with particular focus on Holocaust-related material, and present, with graph-based visualisations, two ways in which EHRI seeks to better integrate information about fragmented material.
Findings
As a case study, this paper's principal contributions are reports on the authors' experience of extracting provenance-based connections between archival descriptions from encoded finding aids, and on the challenges of co-referencing access points in the absence of domain-specific controlled vocabularies.
Originality/value
Record linking in general is an important technique in computational approaches to humanities research and one that has rightly received significant attention from scholars. In the context of historical archives, however, the material itself is in most cases not digitised, meaning that computational attempts at linking must rely on finding aids, which constitute much less rich data sources. The EHRI project’s work in this area is therefore pioneering and has implications for archival integration on a larger scale, where the disruptive potential of Linked Open Data is most obvious.
Sheila Anderson and Tobias Blanke
Abstract
Purpose
The purpose of this paper is to analyse the steps taken to produce new kinds of integrated documentation on the Holocaust in the European Holocaust Research Infrastructure project. The authors present the user investigation methodology as well as the novel data design to support this complex field.
Design/methodology/approach
The paper is based on the scholarly primitives framework. From here, it proceeds with two empirical studies of Holocaust archival research and the implementation steps taken. The paper employs key insights from large technology studies in how to organise such work. In particular, it uses the concepts of social-technical assemblages and intermediation.
Findings
The paper offers a number of findings. First, from the empirical studies, it presents how Holocaust researchers and archivists perceive the way they currently do research in archives. It then presents how the intermediation and digital transformation of such research can be enabled without violating its foundations. The second major insight comes from the technical research into how graph databases can be used to integrate heterogeneous research collections, and into the analysis opportunities this integration opens up.
Originality/value
The paper is based on existing work by the authors but takes this work forward into the world of real-life historical research on archives. It demonstrates how the theoretical foundations of primitives are fit for purpose. The paper presents a completely new approach to (re)organising archives as research infrastructures and offers a flexible way of implementing it. Alongside these major insights, it presents a range of new solutions for arranging the socio-technical assemblages of research infrastructures.
Tobias Blanke, Michael Bryant and Reto Speck
Abstract
Purpose
In 2010 the European Holocaust Research Infrastructure (EHRI) was funded to support research into the Holocaust. The project follows on from significant efforts in the past to develop and record the collections of the Holocaust in several national initiatives. The purpose of this paper is to introduce the efforts by EHRI to create a flexible research environment using graph databases. The authors concentrate on the added features and design decisions to enable efficient processing of collection information as a graph.
Design/methodology/approach
The paper concentrates on the specific customisations EHRI had to develop, as the graph database approach is new, and the authors could not rely on existing solutions. The authors describe the serialisations of collections in the graph to provide for efficient processing. Because the EHRI infrastructure is highly distributed, the authors also had to invest a lot of effort into reliable distributed access control mechanisms. Finally, the authors analyse the user-facing work on a portal and a virtual research environment (VRE) in order to discover, share and analyse Holocaust material.
Findings
Using the novel graph database approach, the authors first present how collection information can be modelled as graphs and why this is effective. Second, they show how collection information is made persistent and describe the complex access management system they have developed. Third, they outline how user interaction with the data is integrated through a VRE.
Originality/value
Scholars require specialised access to information. The authors present the results of their work to develop integrated research with Holocaust collections for researchers, together with proposals for a socio-technical ecosystem based on graph database technologies. The use of graph databases is new, and the authors needed to work on several innovative customisations to make them work in the domain.
Tobias Blanke, Michael Bryant and Mark Hedges
Abstract
Purpose
This paper aims to present an evaluation of open source OCR for supporting research on material in small‐ to medium‐scale historical archives.
Design/methodology/approach
The approach was to develop a workflow engine to support the easy customisation of the OCR process towards the historical materials using open source technologies. Commercial OCR often fails to deliver sufficient results here, as its processing is optimised towards large‐scale commercially relevant collections. The approach presented here allows users to combine the most effective parts of different OCR tools.
Findings
The authors demonstrate their application and its flexibility and present two case studies, which demonstrate how OCR can be embedded into wider digitally enabled historical research. The first case study produces high‐quality research‐oriented digitisation outputs, utilising services that the authors developed to allow for the direct linkage of digitisation image and OCR output. The second case study demonstrates what becomes possible if OCR can be customised directly within a larger research infrastructure for history. In such a scenario, further semantics can be added easily to the workflow, significantly enhancing the research browsing experience.
Originality/value
There has been little work on the use of open source OCR technologies for historical research. This paper demonstrates that the authors' workflow approach allows users to combine commercial engines' ability to read a wider range of character sets with the flexibility of open source tools in terms of customisable pre‐processing and layout analysis. All this can be done without the need to develop dedicated code.
Abstract
Purpose
The purpose of this paper is to conduct a retrospective bibliometric analysis of documents about digital humanities, an emerging but interdisciplinary movement. It examines the distribution of research outputs and languages, identifies the active journals and institutions, dissects the network of categories and cited references, and interprets the hot research topics.
Design/methodology/approach
The source data are derived from the Web of Science (WoS) core collection. To reveal the holistic landscape of this field, two popular visualization tools, VOSviewer and CiteSpace, are employed to process the bibliographic data, including author, category, reference, and keyword. Furthermore, the parameter design of the visualization tools follows the general procedures and methods for bibliometric analysis.
Findings
Digital humanities research is growing rapidly. English is still the leading academic language in this field. The most influential authors all come from or have scientific relationships with Europe and North America, with the UK and the USA as the two leading countries. Digital humanities is the result of a dynamic dialogue between humanistic exploration and digital means. This research field is closely associated with history, literary and cultural heritage, and information and library science.
Research limitations/implications
This analysis relies on the metadata information extracted from the WoS database; however, some valuable literature in the field of digital humanities may not be retrieved from the database owing to the inherent challenge of topic search. This study is also restricted by the scope of publications: the WoS database may underrepresent publications in this domain.
Originality/value
The output of this paper could be a valuable reference for researchers and practitioners interested in the knowledge domain of digital humanities. Moreover, the conclusions of this retrospective analysis can serve as a basis for comparison in future studies.
Mattie Tops, Jesús Montero-Marín and Markus Quirin
Abstract
Engagement, motivation, and persistence are usually associated with positive outcomes. However, in excess they can overtax our psychophysiological system and put it at risk. On the basis of a neuro-dynamic personality and self-regulation model, we explain the neurobehavioral mechanisms presumably underlying engagement and how engagement, when overtaxing the individual, becomes automatically inhibited for reasons of protection. We explain how different intensities and patterns of engagement may relate to personality traits such as Self-directedness, Conscientiousness, Drive for Reward, and Absorption, which we conceive of as functions or strategies of adaptive neurobehavioral systems. We describe how protective inhibitions and personality traits contribute to phenomena such as disengagement and increased effort-sense in chronic fatigue conditions, which often affect professions involving high socio-emotional interactions. In doing so, we adduce evidence on hemispheric asymmetry of motivation, neuromodulation by dopamine, self-determination, task engagement, and physiological disengagement. Not least, we discuss educational implications of our model.