Search results
1 – 10 of 21Lidija Ivanovic, Bojana Dimic Surla, Dusan Surla, Dragan Ivanovic, Zora Konjovic and Gordana Rudic
Dissertations from the University of Novi Sad (UNS) are integrated with the research information system called the current research information system (CRIS) UNS. This paper aims…
Abstract
Purpose
Dissertations from the University of Novi Sad (UNS) are integrated with the research information system called the current research information system (CRIS) UNS. This paper aims to present a proposal for an extension of this system to enable the storage of student papers as prescribed by PhD study exam obligations. The proposed extension enables preservation and improves discoverability of scientific and technical works produced by students during their PhD studies.
Design/methodology/approach
An analysis of examination modes in accredited PhD study programs has been conducted. It was noted that students in examination modes verify the obtained results in the form of scientific and technical work. The main idea of this paper is to enable the preservation of those student results and to implement electronic services for retrieving those results by current and future PhD students to empower the development of science.
Findings
The proposal includes an extension of the CRIS UNS to store and publish student papers as prescribed as a PhD study exam obligation; an extension of the CERIF data model to enable storing of student papers; cataloguing student papers in the MARC 21 format; and a way to represent student papers in the Dublin Core format.
Practical implications
This paper can be a starting point for initiatives for the creation of institutional, regional, national and international Web portals for searching and browsing papers by PhD students.
Social implications
This system offers the improvement of cooperation between PhD students from different institutions and countries.
Originality/value
The paper presents an extension of institutional, national and international current research information system (CRIS) systems which will enable the preservation and improve discoverability of student papers produced during PhD studies. The proposed extension has been verified by its implementation within the CRIS UNS system, which also supports monitoring of the scientific competencies of students based on an automatic evaluation of published scientific results.
Details
Keywords
Bojana Dimic Surla, Dusan Ilija Surla and Dragan Ivanovic
The purpose of this article is to describe a proposition for the evaluation of citations of scientific papers, which could serve as a supplement to the existing Rule Book of the…
Abstract
Purpose
The purpose of this article is to describe a proposition for the evaluation of citations of scientific papers, which could serve as a supplement to the existing Rule Book of the Ministry of the Republic of Serbia, which is used in the procedure of electing candidates for particular academic and research titles. The evaluation and quantitative presentation of the results and evaluation of citations were carried out on data taken from the database of the Current Research Information System of the University of Novi Sad (CRIS UNS), which is harmonized with the Rule Book of the Ministry with respect to the evaluation of published scientific results of researchers.
Design/methodology/approach
There are different criteria to evaluate the quality of scientific papers based on their citations. The pertinent parameters can be the total number of citations, the number of citations in a defined time period and by assigning the appropriate weighting values to the citations. This work proposes a procedure of assigning the citation weighting values based on the evaluation of the scientific results in which the citation appeared according to the Rule Book in the Republic of Serbia. Based on this, the authors introduced the impact factor of researchers as the ratio of the number of points of the evaluated citations and the number of points of the evaluated papers of the researcher.
Findings
Results showed that the research information system CRIS UNS can be extended to the evaluation of citations for a single researcher, groups of researchers and institutions.
Practical implications
The proposed solution enables the evaluation of citations in the process of election and promotion of academic staff. In this way, there is a means for measuring the scientific influence of a researcher in the relevant scientific area.
Social implications
The evaluation of citations may be included in the national strategies of scientific development, funding and evaluation of research projects; for promotions of academic staff at the universities and other academic institutions; and ranking of researchers and research organizations.
Originality/value
The main idea presented in the paper is the definition of a rule book (or several rule books) for the evaluation of citations. Based on the evaluation of citations, the authors proposed the term “the impact factor of researcher”.
Details
Keywords
Miroslav Zarić, Danijela Boberić Krstićev and Dušan Surla
The aim of the research is modelling and implementation of a client application that enables parallel search and retrieval of bibliographic records from multiple servers. The…
Abstract
Purpose
The aim of the research is modelling and implementation of a client application that enables parallel search and retrieval of bibliographic records from multiple servers. The client application supports simultaneous communication over Z39.50 and SRW/SRU protocols. The application design is flexible and later addition of other communication protocols for search/retrieval is envisioned and supported.
Design/methodology/approach
Object‐oriented approach has been used for modelling and implementation of client application. CASE tool, Sybase PowerDesigner, supporting Unified Modelling Language (UML 2.0), was used for modelling. Java programming language and Eclipse environment were used for implementation.
Findings
The result of the research is a client application that enables parallel search and retrieval of multiple Z39.50 and SRW/SRU servers. Additionally, the application supports conversion from type‐1 query language, defined by Z39.50 standard, to CQL query language required for search/retrieval from SRW/SRU servers. The application was verified by performing parallel search and retrieval from several publicly accessible Z39.50 and SRW/SRU servers.
Research limitations/implications
The application supports only the use of bib‐1 attribute set for type‐1 queries created according to Z39.50 standard. Hence, only such queries can be converted to CQL notation. The use of other attribute sets is not supported.
Practical implications
The client application is integrated into the BISIS software system, version 4. This enables the cataloguing of bibliographic records retrieved over Z39.50 and SRW/SRU protocol.
Originality/value
The contribution of this work is in client application architecture that enables parallel communication with multiple servers, which can use different communication protocols, Z39.50 or SRW/SRU. Search/retrieval from servers using some other protocol is also supported. This can be achieved by adding new classes that implement protocol specification, and classes for query transformation into notation required by that new protocol, if required.
Details
Keywords
Lidija Ivanović, Dragan Ivanović and Dušan Surla
The aim of this research is to define a data model of theses and dissertations that enables data exchange with CERIF‐compatible CRIS systems and data exchange according to OAI‐PMH…
Abstract
Purpose
The aim of this research is to define a data model of theses and dissertations that enables data exchange with CERIF‐compatible CRIS systems and data exchange according to OAI‐PMH protocol in different metadata formats (Dublin Core, EDT‐MS, etc.).
Design/methodology/approach
Various systems that contain metadata about theses and dissertations are analyzed. There are different standards and protocols that enable the interoperability of those systems: CERIF standard, AOI‐PMH protocol, etc. A physical data model that enables interoperability with almost all of those systems is created using the PowerDesigner CASE tool.
Findings
A set of metadata about theses and dissertations that contain all the metadata required by CERIF data model, Dublin Core format, EDT‐MS format and all the metadata prescribed by the University of Novi Sad is defined. Defined metadata can be stored in the CERIF‐compatible data model based on the MARC21 format.
Practical implications
CRIS‐UNS is a CRIS which has been developed at the University of Novi Sad since 2008. The system is based on the proposed data model, which enables the system's interoperability with other CERIF‐compatible CRIS systems. Also, the system based on the proposed model can become a member of NDLTD.
Social implications
A system based on the proposed model increases the availability of theses and dissertations, and thus encourages the development of the knowledge‐based society.
Originality/value
A data model of theses and dissertations that enables interoperability with CERIF‐compatible CRIS systems is proposed. A software system based on the proposed model could become a member of NDLTD and exchange metadata with institutional repositories. The proposed model increases the availability of theses and dissertations.
Details
Keywords
The purpose of this paper is to model and implement an extensible markup language (XML)‐based editor for library cataloguing. The editor model should support data input in the…
Abstract
Purpose
The purpose of this paper is to model and implement an extensible markup language (XML)‐based editor for library cataloguing. The editor model should support data input in the form of free text with interactive control of structure and content validity of records specified in the UNIMARC and MARC 21 formats. The editor is implemented in the Java programming language in the form of a software package.
Design/methodology/approach
The unified modelling language (UML 2.0) is used for the specification of both the information requirements and the model architecture. The object oriented methodology is used for design and implementation of the software packages, as well as the corresponding CASE tools.
Findings
The result is an editor for UNIMARC and MARC 21 cataloguing. The editor is based on the XML technologies by which the two basic characteristics are achieved as follows: a possibility of integrating the editor into different library software systems and, moving to another format requires only the changes of the module for bibliographic record data control.
Research limitations/implications
A basic limitation of the system is related to the subsystem that controls validation of the bibliographic records and its expansion for work with other bibliographic formats. In the proposed solution, a part of the control of data input is included into the implementation itself and it is related to the UNIMARC format. That is, a part of data by which the control is done, such as repeatability of the record elements and the codebooks, is contained in the XML document of the format that is input information in the editor. However, the control that is related to validation of the format of content in record elements cannot be performed for any other format without modification in the implementation. Therefore, the research could be continued by considering the separation of data used for content control as input information for the application. In that way, this segment would also become implementation independent. One of the solutions should be extending the XML document of the format by this data. Some other solution should mean creating a totally separate system for the content validation. Moreover, the proposed editor supports processing of a bibliographic record only in the UNIMARC and MARC 21 formats. Processing of records in other formats requires considerable changes in the model.
Practical implications
The model of a new editor is developed on the basis of the experience and needs of electronic management in city and special libraries. Based on the given model a new editor is implemented and integrated into the BISIS software system used by the mentioned libraries. Testing and verification are performed on the bibliographic records of the public city libraries.
Originality/value
The contribution of this work is in the system architecture that is based on the XML documents and is independent of the bibliographic format. The XML document that contains data about the bibliographic format represents the editor input information. After a bibliographic record is created in this editor, the record is stored into an XML document that represents the editor output information. This XML document can be stored into various software systems for data storage and retrieval.
Details
Keywords
Bojana Dimić, Branko Milosavljević and Dušan Surla
The purpose of this paper is to create a model for an XML document that will carry information about bibliographic formats. The model will be given in the form of an XML schema…
Abstract
Purpose
The purpose of this paper is to create a model for an XML document that will carry information about bibliographic formats. The model will be given in the form of an XML schema describing two bibliographic formats, UNIMARC and MARC 21.
Design/methodology/approach
The description of bibliographic formats using the XML schema language may be discussed in two ways. The first one relates to creating an XML schema in a way that all elements of the bibliographic format are described separately. The second way, used in this paper, is creating an XML schema as a set of elements that presents concepts of bibliographic formats. A schema created in the second way is appropriate for use in implementation of cataloguing software.
Findings
The result is an XML schema that describes MARC 21 and UNIMARC formats. The instance of that schema is an XML document describing a bibliographic format that will be used in software systems for cataloguing. An XML document that is an instance of the proposed XML schema is applied in the development of the editor for cataloguing in the BISIS library information system. This XML document represents input information for that editor. In this way, the implementation of the editor becomes independent of the bibliographic format.
Practical implications
The created XML schema cannot serve as an electronic manual because there is some information about the format that is not included in it. In order to overcome this shortcoming an additional XML schema that will contain remaining format data may be provided.
Originality/value
The originality lies in the idea of creating one XML schema for two bibliographic formats. The schema contains elements that are models for data used in cataloguing tools. On the basis of that XML schema, the object model of bibliographic formats is implemented as well as software component for manipulating format data. This component can be used in development of library software systems.
Details
Keywords
Dragan Ivanović, Dušan Surla and Zora Konjović
The purpose of this research is to observe all data from the Common European Research Information Format (CERIF) data model that can be described using bibliographic standards and…
Abstract
Purpose
The purpose of this research is to observe all data from the Common European Research Information Format (CERIF) data model that can be described using bibliographic standards and move those data to a data model of bibliographic standard.
Design/methodology/approach
Analysis of the CERIF data model and the MARC 21 format has shown that some elements of the CERIF data model could be mapped to the MARC 21 bibliographic record. A CERIF compatible data model based on the MARC 21 format is proposed. The data model was created using PowerDesigner CASE tool. The proposed data model is represented using a physical data model in the conceptual notation that is adopted in the literature for representing the CERIF data model.
Findings
A CERIF compatible data model based on the MARC 21 format is proposed. The proposed model contains all the data from the CERIF2008 data model. The central part of the proposed model is MARC 21 data model that is used as a replacement for 27 entities of the CERIF data model, including all their attributes as well as part of the attributes in entities related to organisational unit. The mappings between attributes of entities of the CERIF data model and the data model of the MARC 21 format are described.
Research limitations/implications
The CERIF compatible data model based on the MARC 21 format does not support all restrictions on data types, which are defined by the CERIF data model. This means that such restrictions have to be controlled by software.
Practical implications
The central part of the proposed CERIF compatible data model is a data model of MARC 21 format. It means that most of the data are modelled according to bibliographic standard, which is very widespread worldwide. This implies that the proposed CERIF model can be easily implemented within the existing library infrastructure. In addition, the proposed model can be used for other purposes, such as the evaluation of scientific research results, generating bibliographies of researchers, and institutions, the citations etc. A research management system based on the proposed model is implemented. Also, this system is verified and tested on data about published results of researchers employed at University of Novi Sad, Serbia.
Originality/value
A new data model compatible with the CERIF data model is proposed. The basic idea is to map part of the CERIF data model related to published results of scientific research to some well‐known bibliographic standard. It was shown that this part of the data model could be mapped to the MARC 21 data model. It can be mapped to data models of any other MARC standards in a similar way.
Branko Milosavljević, Danijela Boberić and Dušan Surla
The aim of the research is modeling and implementing a software component for the retrieval of bibliographic records using the Apache Lucene retrieval engine.
Abstract
Purpose
The aim of the research is modeling and implementing a software component for the retrieval of bibliographic records using the Apache Lucene retrieval engine.
Design/methodology/approach
Object‐oriented methodology is used for modeling and implementation of the bibliographic record retrieval engine. Modeling is carried out in the CASE tool that supports the unified modeling language (UML 2.0), while the implementation is using the Java programming language and open source components.
Findings
The result is a software component for the retrieval of bibliographic records that are independent of the bibliographic format used in cataloging. It features great flexibility in terms of configuring search types without the need to change the software implementation.
Research limitations/implications
One of the constraints of this system relates to the problem of searching linking entry fields. UNIMARC format defines fields used to link the item being cataloged to another bibliographic item, so those fields may contain other fields, which can be termed secondary fields. In this proposed solution, secondary fields are treated as all other fields and there is no information whether the search term belongs to the secondary or a regular field.
Practical implications
The proposed solution is integrated into library information system BISIS, version 4. This version of the BISIS system is in use at university, public and special libraries. By introducing this version, system performance as well as flexibility of the indexing process are improved and at the same time librarians are able to perform sophisticated and effective retrieval of bibliographic records.
Originality/value
The contribution of this work is in the design of a customizable record retrieval component. It is configured by means of an XML document for specifying mapping rules between subfields of the bibliographic record format and search types. By using XML it is possible to add new mapping rules without additional programming. In addition, great attention has been paid to the indexing of subfields that contain punctuation marks having special semantic meanings for librarians and the transliteration between Cyrillic and Latin scripts. Also, originality of this work lies in using the Apache Lucene search engine, which facilitates building highly flexible and efficient retrieval systems.
Details
Keywords
The aim of this research is the conversion of the bibliographic records between the following different formats for bibliographic material processing – the YUMARC (which is a…
Abstract
Purpose
The aim of this research is the conversion of the bibliographic records between the following different formats for bibliographic material processing – the YUMARC (which is a variant of the UNIMARC format in which the Serbian BISIS system operates), UNIMARC and MARC 21 format.
Design/methodology/approach
The CASE tools that support the information system developing methodology based on the XML technologies are used.
Findings
The result is the specification and implementation of information requirements for the conversion of the bibliographic records created in the BISIS system into the UNIMARC or MARC 21 format.
Research limitations/implications
The specification of the rules for bibliographic record conversion is not formalized, so the implementation of these rules cannot be done automatically. If the rules could be formalized, then a generator of the programming code could be developed for the implementation of the rules for the bibliographic record conversion.
Practical implications
The research result is applied for the conversion of the YUMARC bibliographic records in the Library of the Department for Mathematics and Informatics of Novi Sad University. The conversion of the records is made at first into the UNIMARC format and subsequently from the UNIMARC format into the records of the MARC 21 format. The task of conversion of the bibliographic records formed in the BISIS software system in the UNIMARC or the MARC 21 format is solved in that way.
Originality/value
The originality of the work is contained in the application of the XML technologies for the conversion of the bibliographic records between the different bibliographic formats (YUMARC, UNIMARC and MARC 21). For each of the formats an XML schema is formed and record conversion between the different formats is done by the XSLT transformations.
Details
Keywords
Aleksandar Kovačević, Dragan Ivanović, Branko Milosavljević, Zora Konjović and Dušan Surla
The aim of this paper is to develop a system for automatic extraction of metadata from scientific papers in PDF format for the information system for monitoring the scientific…
Abstract
Purpose
The aim of this paper is to develop a system for automatic extraction of metadata from scientific papers in PDF format for the information system for monitoring the scientific research activity of the University of Novi Sad (CRIS UNS).
Design/methodology/approach
The system is based on machine learning and performs automatic extraction and classification of metadata in eight pre‐defined categories. The extraction task is realised as a classification process. For the purpose of classification each row of text is represented with a vector that comprises different features: formatting, position, characteristics related to the words, etc. Experiments were performed with standard classification models. Both a single classifier with all eight categories and eight individual classifiers were tested. Classifiers were evaluated using the five‐fold cross validation, on a manually annotated corpus comprising 100 scientific papers in PDF format, collected from various conferences, journals and authors' personal web pages.
Findings
Based on the performances obtained on classification experiments, eight separate support vector machines (SVM) models (each of which recognises its corresponding category) were chosen. All eight models were established to have a good performance. The F‐measure was over 85 per cent for almost all of the classifiers and over 90 per cent for most of them.
Research limitations/implications
Automatically extracted metadata cannot be directly entered into CRIS UNS but requires control of the curators.
Practical implications
The proposed system for automatic metadata extraction using support vector machines model was integrated into the software system, CRIS UNS. Metadata extraction has been tested on the publications of researchers from the Department of Mathematics and Informatics of the Faculty of Sciences in Novi Sad. Analysis of extracted metadata from these publications showed that the performance of the system for the previously unseen data is in accordance with that obtained by the cross‐validation from eight separate SVM classifiers. This system will help in the process of synchronising metadata from CRIS UNS with other institutional repositories.
Originality/value
The paper documents a fully automated system for metadata extraction from scientific papers that was developed. The system is based on the SVM classifier and open source tools, and is capable of extracting eight types of metadata from scientific articles of any format that can be converted to PDF. Although developed as part of CRIS UNS, the proposed system can be integrated into other CRIS systems, as well as institutional repositories and library management systems.
Details