A.R.D. Prasad and Nabonita Guha
The purpose of this paper is to show that concept naming alone in document annotation is not sufficient to convey the thought content of the information resource. The paper…
Abstract
Purpose
The purpose of this paper is to show that concept naming alone in document annotation is not sufficient to convey the thought content of the information resource. The paper presents an outline of semantic document annotation which combines two major processes: facet analysis and concept categorisation. This is also an effort to show how RDF schema can be designed and implemented so that the properties of the schema are able to express the basic structure of the subject matter of the resource.
Design/methodology/approach
This paper presents a methodology for representing the subject matter of a document in terms of RDF. For the purposes of faceted subject annotation, it has developed an extended RDF schema for simple knowledge organisation system (SKOS). The facets and relationships of the faceted subject indexing language postulate‐based permuted subject indexing system (POPSI) have been transformed into RDFS classes. The elementary categories of POPSI form the property classes in the POPSI/RDF Schema. These property classes have been used to formulate the subject description of a document.
Findings
The subject annotation of a document using this schema expresses all the components of the thought content of an information resource.
Practical implications
The examples given in this paper show the applicability of this schema in describing resources in web directories and annotating scholarly documents in digital libraries. In a broader perspective, this provides a methodology for formulating the subject metadata of web resources. This schema helps in formulating the subject string(s) for a resource outlining the skeleton structure of its thought content.
Originality/value
SKOS has been developed as an RDF schema representation of the traditional knowledge organisation systems. But the schema has limited room to accommodate subject indexing languages. The present schema extends the SKOS schema to accommodate the representation of faceted subject indexing languages. The faceted subject annotation system has been adopted for the very reason that it has precedence over the enumerated classification systems, controlled vocabulary lists, etc. The potential to describe the specific subject of the document with more accuracy and representation of context gives the faceted subject indexing languages strength to make the subject description explicit and machine processible.
Details
Keywords
Devika P. Madalli and Aparajita Suman
The purpose of this paper is to propose the use of unified modelling language (UML) based ontologies to develop a faceted model to facilitate semantic browsing and searching in…
Abstract
Purpose
The purpose of this paper is to propose the use of unified modelling language (UML) based ontologies to develop a faceted model to facilitate semantic browsing and searching in digital libraries.
Design/methodology/approach
The approach is to understand the way information gets logically organised in the human mind in terms of mutually defined concepts. Ontologies will be used to model the semantic structure of individual information sources using the faceted classification schema of Dr S.R. Ranganathan, though with improvisation to accommodate new concepts relating to digital resources.
Findings
UML can be the best option for knowledge representation if the target system is a digital library where the required type of reasoning about ontologies is more in terms of answering specific and specialised questions. Here, ontologies can be used to facilitate a distributed information retrieval system.
Originality/value
The proposed model will not only facilitate semantic searching using UML concept maps but will also enable domain experts, even those with little coding expertise, to build ontologies for their resources while submitting their documents to the digital library.
Details
Keywords
The Third National Meeting of CDROM/Online Users was sponsored by National Information System for Science and Technology (NISSAT), Department of Scientific and Industrial Research…
Abstract
The Third National Meeting of CDROM/Online Users was sponsored by National Information System for Science and Technology (NISSAT), Department of Scientific and Industrial Research (DSIR), New Delhi; organised by the National Information Centre for Leather & Allied Industries (NICLAI); and held at the Central Leather Research Institute, Madras during 9–10 August 1994. The previous two Meetings were held at New Delhi in 1992 and 1993. The Meeting's objective was to assess the current national and international situation, the utility of CDROM services, and their popularisation within the country, and to provide a forum for exchange of experiences on this new technique. The Meeting focused its attention on databases available, demand for search services, target users, problem areas and possible solutions. The Meeting was attended by 205 participants from 91 institutions all over the country, representing a galaxy of R&D, academic, corporate, entrepreneurial and so forth institutions. Eleven industries engaged in information technology products participated in the Meeting. The Meeting included technical sessions where 39 presentations were made.
Giovanni Tummarello, Christian Morbidoni, Paolo Puliti and Francesco Piazza
The purpose of this paper is to investigate and prove the feasibility of a semantic web (SW) based approach to textual encoding. It aims to discuss benefits and novel…
Abstract
Purpose
The purpose of this paper is to investigate and prove the feasibility of a semantic web (SW) based approach to textual encoding. It aims to discuss benefits and novel possibilities with respect to traditional XML‐based approaches.
Design/methodology/approach
The markup process can be seen as a task of knowledge representation where elements such as words, sentences and pages are instances of conceptual classes forming a semantic network. An ontology web language ontology for textual encoding has been developed, capturing structural and grammatical aspects. Different approaches and tools to query the encoded text are investigated.
Findings
resource description framework (RDF) is powerful and expressive enough to fulfil tasks traditionally done in XML as well as to enable new possibilities such as collaborative and distributed textual encoding and the use of ontology‐based reasoning in text processing and querying. While the encoding of overlapping hierarchies through the use of existing approaches is often complex and leads to idiosyncratic solutions, this problem is naturally solved using SW languages.
Research limitations/implications
To make the approach suitable for widespread adoption, further work is required both in ontologies modelling and in applications (e.g. markup editing).
Practical implications
The prototype implementation imports existing encoded texts, transforms them into RDF‐based markups and uses SW query languages to answer cross‐hierarchy queries. Existing tools (reasoners, search and query engines, etc.) can be used immediately.
Originality/value
This methodology enables distributed interoperability and reuse of previous encoded results and opens the way to novel collaborative textual markup scenarios.
Details
Keywords
Abstract
Details
Keywords
The purpose of this paper is to discuss the pros and cons of computing the h‐index using Scopus.
Abstract
Purpose
The purpose of this paper is to discuss the pros and cons of computing the h‐index using Scopus.
Design/methodology/approach
The paper looks at the content features and the software capabilities of Scopus from the perspective of computing a reasonable h‐index for scholars.
Findings
Although there are limitations in the content, and even in the mostly excellent, swift, powerful and innovative software of Scopus, it can produce a much more reliable and reproducible h‐index – at least for relatively junior researchers – than Google Scholar.
Originality/value
The paper adds insight into computing the h‐index using Scopus.
Details
Keywords
Durga Sankar Rath and A.R.D. Prasad
This paper presents a methodology for automatic identification of bibliographic data elements from the title pages of books. Also enumerates the various steps like scanning the…
Abstract
This paper presents a methodology for automatic identification of bibliographic data elements from the title pages of books. Also enumerates the various steps like scanning the title pages, running optical character recognition (OCR) software, generating HTML files out of title pages and applying heuristics to identify the bibliographic data elements. Much of the paper deals with the surveys undertaken to analyze the characteristics of various bibliographic descriptive elements like title, author, publisher and other elements. The first survey deals with the sequence of the bibliographic data in the title pages. The second survey deals with the font size, font type and the proximity of each bibliographic element on the title pages. The survey results are then used to develop heuristics, in order to develop a rule‐based expert system which can identify the bibliographic elements on the title pages. The results of the system are presented, along with problems encountered.
Details
Keywords
A.R.D. Prasad and Devika P. Madalli
The paper aims to argue that digital library retrieval should be based on semantic representations and propose a semantic infrastructure for digital libraries.
Abstract
Purpose
The paper aims to argue that digital library retrieval should be based on semantic representations and propose a semantic infrastructure for digital libraries.
Design/methodology/approach
The approach taken is formal model based on subject representation for digital libraries.
Findings
Search engines and search techniques have fallen short of user expectations as they do not give context based retrieval. Deploying semantic web technologies would lead to efficient and more precise representation of digital library content and hence better retrieval. Though digital libraries often have metadata of information resources which can be accessed through OAI‐PMH, much remains to be accomplished in making digital libraries semantic web compliant. This paper presents a semantic infrastructure for digital libraries, that will go a long way in providing them and web based information services with products highly customised to users needs.
Research limitations/implications
Here only a model for semantic infrastructure is proposed. This model is proposed after studying current user‐centric, top‐down models adopted in digital library service architectures.
Originality/value
This paper gives a generic model for building semantic infrastructure for digital libraries. Faceted ontologies for digital libraries is just one approach. But the same may be adopted by groups working with different approaches in building ontologies to realise efficient retrieval in digital libraries.
Details
Keywords
This paper aims to analyse whether, and how far, library and information services (LIS) are involved in e‐Science and grid computing projects funded by authorities in the European…
Abstract
Purpose
This paper aims to analyse whether, and how far, library and information services (LIS) are involved in e‐Science and grid computing projects funded by authorities in the European Union and Germany. It explains and emphasises the relevance of LIS‐based information services and expertise in e‐Science activities.
Design/methodology/approach
Projects were analysed to determine the extent to which institutions or persons with LIS expertise were involved and information services provided.
Findings
Very few of the e‐Science projects made use of LIS‐based services. This omission could reduce the quality of e‐Science‐related research in the long term and LIS experts could lose their role and expertise in serving scientists. Further research is needed to verify this. Additionally research is needed to determine whether e‐Science projects in other countries are also lacking LIS services.
Research limitations/implications
The research was undertaken during the preliminary phase of the projects.
Practical implications
Funding should emphasise the inclusion and provision of LIS services.
Originality/value
The findings show a lack of LIS services in e‐Science projects, which could otherwise enhance the projects' development and the distribution of their results.