Search results

1 – 3 of 3

View access options

Article

Publication date: 8 May 2017

Working framework of semantic interoperability for CRIS with heterogeneous data sources

Amed Leiva-Mederos, Jose A. Senso, Yusniel Hidalgo-Delgado and Pedro Hipola

Information from Current Research Information Systems (CRIS) is stored in different formats, in platforms that are not compatible, or even in independent networks. It would be…

HTML

PDF (1.6 MB)

Downloads

1172

Abstract

Purpose

Information from Current Research Information Systems (CRIS) is stored in different formats, in platforms that are not compatible, or even in independent networks. It would be helpful to have a well-defined methodology to allow for management data processing from a single site, so as to take advantage of the capacity to link disperse data found in different systems, platforms, sources and/or formats. Based on functionalities and materials of the VLIR project, the purpose of this paper is to present a model that provides for interoperability by means of semantic alignment techniques and metadata crosswalks, and facilitates the fusion of information stored in diverse sources.

Design/methodology/approach

After reviewing the state of the art regarding the diverse mechanisms for achieving semantic interoperability, the paper analyzes the following: the specific coverage of the data sets (type of data, thematic coverage and geographic coverage); the technical specifications needed to retrieve and analyze a distribution of the data set (format, protocol, etc.); the conditions of re-utilization (copyright and licenses); and the “dimensions” included in the data set as well as the semantics of these dimensions (the syntax and the taxonomies of reference). The semantic interoperability framework here presented implements semantic alignment and metadata crosswalk to convert information from three different systems (ABCD, Moodle and DSpace) to integrate all the databases in a single RDF file.

Findings

The paper also includes an evaluation based on the comparison – by means of calculations of recall and precision – of the proposed model and identical consultations made on Open Archives Initiative and SQL, in order to estimate its efficiency. The results have been satisfactory enough, due to the fact that the semantic interoperability facilitates the exact retrieval of information.

Originality/value

The proposed model enhances management of the syntactic and semantic interoperability of the CRIS system designed. In a real setting of use it achieves very positive results.

Details

Journal of Documentation, vol. 73 no. 3

Type: Research Article

DOI:

ISSN: 0022-0418

Keywords

View access options

Article

Publication date: 10 June 2014

Ontology-based text summarization. The case of Texminer

Pedro Hípola, José A. Senso, Amed Leiva-Mederos and Sandor Domínguez-Velasco

The purpose of this paper is to look into the latest advances in ontology-based text summarization systems, with emphasis on the methodologies of a socio-cognitive approach, the…

HTML

PDF (434 KB)

Downloads

666

Abstract

Purpose

The purpose of this paper is to look into the latest advances in ontology-based text summarization systems, with emphasis on the methodologies of a socio-cognitive approach, the structural discourse models and the ontology-based text summarization systems.

Design/methodology/approach

The paper analyzes the main literature in this field and presents the structure and features of Texminer, a software that facilitates summarization of texts on Port and Coastal Engineering. Texminer entails a combination of several techniques, including: socio-cognitive user models, Natural Language Processing, disambiguation and ontologies. After processing a corpus, the system was evaluated using as a reference various clustering evaluation experiments conducted by Arco (2008) and Hennig et al. (2008). The results were checked with a support vector machine, Rouge metrics, the F-measure and calculation of precision and recall.

Findings

The experiment illustrates the superiority of abstracts obtained through the assistance of ontology-based techniques.

Originality/value

The authors were able to corroborate that the summaries obtained using Texminer are more efficient than those derived through other systems whose summarization models do not use ontologies to summarize texts. Thanks to ontologies, main sentences can be selected with a broad rhetorical structure, especially for a specific knowledge domain.

Details

Library Hi Tech, vol. 32 no. 2

Type: Research Article

DOI:

ISSN: 0737-8831

Keywords

View access options

Article

Publication date: 2 September 2013

AUTHORIS: a tool for authority control in the semantic web

Amed Leiva-Mederos, José A. Senso, Sandor Domínguez-Velasco and Pedro Hípola

The purpose of this paper is to propose a tool that generates authority files to be integrated with linked data by means of learning rules. AUTHORIS is software developed to…

HTML

PDF (452 KB)

Downloads

1185

Abstract

Purpose

The purpose of this paper is to propose a tool that generates authority files to be integrated with linked data by means of learning rules. AUTHORIS is software developed to enhance authority control and information exchange among bibliographic and non-bibliographic entities.

Design/methodology/approach

The article analyzes different methods previously developed for authority control as well as IFLA and ALA standards for managing bibliographic records. Semantic Web technologies are also evaluated. AUTHORIS relies on Drupal and incorporates the protocols of Dublin Core, SIOC, SKOS and FOAF. The tool has also taken into account the obsolescence of MARC and its substitution by FRBR and RDA. Its effectiveness was evaluated applying a learning test proposed by RDA. Over 80 percent of the actions were carried out correctly.

Findings

The use of learning rules and the facilities of linked data make it easier for information organizations to reutilize products for authority control and distribute them in a fair and efficient manner.

Research limitations/implications

The ISAD-G records were the ones presenting most errors. EAD was found to be second in the number of errors produced. The rest of the formats – MARC 21, Dublin Core, FRAD, RDF, OWL, XBRL and FOAF – showed fewer than 20 errors in total.

Practical implications

AUTHORIS offers institutions the means of sharing data with a high level of stability, helping to detect records that are duplicated and contributing to lexical disambiguation and data enrichment.

Originality/value

The software combines the facilities of linked data, the potency of the algorithms for converting bibliographic data, and the precision of learning rules.

Details

Library Hi Tech, vol. 31 no. 3

Type: Research Article

DOI:

ISSN: 0737-8831

Keywords

Access

Year

All dates (3)

Content type

Article (3)

1 – 3 of 3

Working framework of semantic interoperability for CRIS with heterogeneous data sources

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Details

Keywords

Ontology-based text summarization. The case of Texminer

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Details

Keywords

AUTHORIS: a tool for authority control in the semantic web

Abstract

Purpose

Design/methodology/approach

Findings

Research limitations/implications

Practical implications

Originality/value

Details

Keywords

Access

Year

Content type

All feedback is valuable

Report an issue or find answers to frequently asked questions