C. Min Han and Hyojin Nam
The purpose of this paper is to examine how consumer ethnocentrism (CET) and cosmopolitanism (COS) may affect Asian consumers’ perceptions of out-group countries and their…
Abstract
Purpose
The purpose of this paper is to examine how consumer ethnocentrism (CET) and cosmopolitanism (COS) may affect Asian consumers’ perceptions of out-group countries and their products, doing so by examining similar vs dissimilar countries across countries of origin. Given the strong inter-country rivalries that exist among Asian countries, the authors propose two alternative hypotheses, drawing from social identity theory and realistic group conflict theory.
Design/methodology/approach
To test the hypotheses, the authors examine consumer perceptions of both Western countries (dissimilar out-groups) and Asian countries (similar out-groups) within China (Study 1). In addition, the authors investigate how CET and COS affect consumer perceptions of Asian countries in Japan and in non-Asian dissimilar countries, and compare the effects between the two regions (Study 2).
Findings
The findings indicate that CET shows greater negative effects on perceptions of a country and its products, when the country is from a similar out-group than when it is from a dissimilar one. On the other hand, COS showed equally strong positive effects among consumers for both similar and dissimilar out-group countries.
Research limitations/implications
The results suggest that Asian consumers feel a sense of intergroup rivalry with other Asian countries, and, as a result, exhibit a greater degree of ethnocentric biases toward these countries and their products than they do toward Western countries and products. Also, the results suggest that COS may transcend national differences and inter-country rivalries in consumer consumption tendencies.
Originality/value
The study examines inter-country similarities as a moderator of CET and COS effects, which has not been extensively researched in the past. In addition, the study discusses the concept of intergroup rivalry among neighboring countries and examines how it affects consumer perceptions of out-group countries and their products in Asia, where strong inter-country rivalries exist.
Details
Keywords
Lunyan Wang, Qing Xia, Huimin Li and Yongchao Cao
The fuzziness and complexity of evaluation information are common phenomenon in practical decision-making problem, interval neutrosophic sets (INSs) is a power tool to deal with…
Abstract
Purpose
The fuzziness and complexity of evaluation information are common phenomenon in practical decision-making problem, interval neutrosophic sets (INSs) is a power tool to deal with ambiguous information. Similarity measure plays an important role in judging the degree between ideal and each alternative in decision-making process, the purpose of this paper is to establish a multi-criteria decision-making method based on similarity measure under INSs.
Design/methodology/approach
Based on an extension of existing cosine similarity, this paper first introduces an improved cosine similarity measure between interval neutosophic numbers, which considers the degrees of the truth membership, the indeterminacy membership and the falsity membership of the evaluation values. And then a multi-criteria decision-making method is established based on the improved cosine similarity measure, in which the ordered weighted averaging (OWA) is adopted to aggregate the neutrosophic information related to each alternative. Finally, an example on supplier selection is given to illustrate the feasibility and practicality of the presented decision-making method.
Findings
In the whole process of research and practice, it was realized that the application field of the proposed similarity measure theory still should be expanded, and the development of interval number theory is one of further research direction.
Originality/value
The main contributions of this paper are as follows: this study presents an improved cosine similarity measure under INSs, in which the weights of the three independent components of an interval number are taken into account; OWA are adopted to aggregate the neutrosophic information related to each alternative; and a multi-criteria decision-making method using the proposed similarity is developed under INSs.
Details
Keywords
Manjula Wijewickrema, Vivien Petras and Naomal Dias
The purpose of this paper is to develop a journal recommender system, which compares the content similarities between a manuscript and the existing journal articles in two subject…
Abstract
Purpose
The purpose of this paper is to develop a journal recommender system, which compares the content similarities between a manuscript and the existing journal articles in two subject corpora (covering the social sciences and medicine). The study examines the appropriateness of three text similarity measures and the impact of numerous aspects of corpus documents on system performance.
Design/methodology/approach
Implemented three similarity measures one at a time on a journal recommender system with two separate journal corpora. Two distinct samples of test abstracts were classified and evaluated based on the normalized discounted cumulative gain.
Findings
The BM25 similarity measure outperforms both the cosine and unigram language similarity measures overall. The unigram language measure shows the lowest performance. The performance results are significantly different between each pair of similarity measures, while the BM25 and cosine similarity measures are moderately correlated. The cosine similarity achieves better performance for subjects with higher density of technical vocabulary and shorter corpus documents. Moreover, increasing the number of corpus journals in the domain of social sciences achieved better performance for cosine similarity and BM25.
Originality/value
This is the first work related to comparing the suitability of a number of string-based similarity measures with distinct corpora for journal recommender systems.
Details
Keywords
Atefeh Momeni, Mitra Pashootanizadeh and Marjan Kaedi
This study aims to determine the most similar set of recommendation books to the user selections in LibraryThing.
Abstract
Purpose
This study aims to determine the most similar set of recommendation books to the user selections in LibraryThing.
Design/methodology/approach
For this purpose, 30,000 tags related to History on the LibraryThing have been selected. Their tags and the tags of the related recommended books were extracted from three different recommendations sections on LibraryThing. Then, four similarity criteria of Jaccard coefficient, Cosine similarity, Dice coefficient and Pearson correlation coefficient were used to calculate the similarity between the tags. To determine the most similar recommended section, the best similarity criterion had to be determined first. So, a researcher-made questionnaire was provided to History experts.
Findings
The results showed that the Jaccard coefficient, with a frequency of 32.81, is the best similarity criterion from the point of view of History experts. Besides, the degree of similarity in LibraryThing recommendations section according to this criterion is equal to 0.256, in the section of books with similar library subjects and classifications is 0.163 and in the Member recommendations section is 0.152. Based on the findings of this study, the LibraryThing recommendations section has succeeded in introducing the most similar books to the selected book compared to the other two sections.
Originality/value
To the best of the authors’ knowledge, itis for the first time, three sections of LibraryThing recommendations are compared by four different similarity criteria to show which sections would be more beneficial for the user browsing. The results showed that machine recommendations work better than humans.
Details
Keywords
Narasimhulu K, Meena Abarna KT and Sivakumar B
The purpose of the paper is to study multiple viewpoints which are required to access the more informative similarity features among the tweets documents, which is useful for…
Abstract
Purpose
The purpose of the paper is to study multiple viewpoints which are required to access the more informative similarity features among the tweets documents, which is useful for achieving the robust tweets data clustering results.
Design/methodology/approach
Let “N” be the number of tweets documents for the topics extraction. Unwanted texts, punctuations and other symbols are removed, tokenization and stemming operations are performed in the initial tweets pre-processing step. Bag-of-features are determined for the tweets; later tweets are modelled with the obtained bag-of-features during the process of topics extraction. Approximation of topics features are extracted for every tweet document. These set of topics features of N documents are treated as multi-viewpoints. The key idea of the proposed work is to use multi-viewpoints in the similarity features computation. The following figure illustrates multi-viewpoints based cosine similarity computation of the five tweets documents (here N = 5) and corresponding documents are defined in projected space with five viewpoints, say, v1,v2, v3, v4, and v5. For example, similarity features between two documents (viewpoints v1, and v2) are computed concerning the other three multi-viewpoints (v3, v4, and v5), unlike a single viewpoint in traditional cosine metric.
Findings
Healthcare problems with tweets data. Topic models play a crucial role in the classification of health-related tweets with finding topics (or health clusters) instead of finding term frequency and inverse document frequency (TF–IDF) for unlabelled tweets.
Originality/value
Topic models play a crucial role in the classification of health-related tweets with finding topics (or health clusters) instead of finding TF-IDF for unlabelled tweets.
Details
Keywords
Naoki Shibata, Yuya Kajikawa and Ichiro Sakata
This paper seeks to propose a method of discovering uncommercialized research fronts by comparing scientific papers and patents. A comparative study was performed to measure the…
Abstract
Purpose
This paper seeks to propose a method of discovering uncommercialized research fronts by comparing scientific papers and patents. A comparative study was performed to measure the semantic similarity between academic papers and patents in order to discover research fronts that do not correspond to any patents.
Design/methodology/approach
The authors compared structures of citation networks of scientific publications with those of patents by citation analysis and measured the similarity between sets of academic papers and sets of patents by natural language processing. After the documents (papers/patents) in each layer were categorized by a citation‐based method, the authors compared three semantic similarity measurements between a set of academic papers and a set of patents: Jaccard coefficient, cosine similarity of term frequency‐inverse document frequency (tfidf) vector, and cosine similarity of log‐tfidf vector. A case study was performed in solar cells.
Findings
As a result, the cosine similarity of tfidf was found to be the best way of discovering corresponding relationships.
Social implications
This proposed approach makes it possible to obtain candidates of unexplored research fronts, where academic researches exist but patents do not. This methodology can be immediately applied to support the decision making of R&D investment by both R&D managers in companies and policy makers in government.
Originality/value
This paper enables comparison of scientific outcomes and patents in more detail by citation analysis and natural language processing than previous studies which just count the direct linkage from patents to papers.
Details
Keywords
Renu Devi, Mohammad Firoz and R. Saravanan
This study aims to investigate redundant information in mandatory non-financial reports (NFRs) demanded by regulators, focusing primarily on overlapping disclosures in a new…
Abstract
Purpose
This study aims to investigate redundant information in mandatory non-financial reports (NFRs) demanded by regulators, focusing primarily on overlapping disclosures in a new Indian sustainability reporting (SR) framework.
Design/methodology/approach
The study sample comprised NIFTY100 listed entities that published SR voluntarily during 2021–2022. The authors used content analysis and cosine similarity techniques to conceptually compare redundancy in SR disclosures with non-financial disclosures.
Findings
The findings reveal an information overlap in SR disclosure with other NFRs disclosures. The disclosures of Directors’ Report have higher cosine similarity scores at the firm level with SR, followed by the Management Discussion and Analysis report, Corporate Governance report and Corporate Social Responsibility report. The additional analysis reveals that qualitative disclosures and disclosures comprising governance factors overlap more in SR.
Practical implications
Policymakers should look to establish relevant disclosure guidelines in the SR system, and thereby, shed light on fundamental issues to enhance future SR framework reforms.
Social implications
The study highlight the need for integration and amendment in the disclosure guidelines of NFRs to improve the overall transparency of the reports.
Originality/value
Previous studies have examined the redundancy in annual reports and SRs from the point of view of overlapping information. To the best author’s knowledge, this is possibly among the first studies to offer insights into the repetition of disclosures required by regulators in statutory NFRs based on environmental, social, and governance factors through the lenses of the institutional theory.
Details
Keywords
Adamu Garba, Shah Khalid, Irfan Ullah, Shah Khusro and Diyawu Mumin
There have been many challenges in crawling deep web by search engines due to their proprietary nature or dynamic content. Distributed Information Retrieval (DIR) tries to solve…
Abstract
Purpose
There have been many challenges in crawling deep web by search engines due to their proprietary nature or dynamic content. Distributed Information Retrieval (DIR) tries to solve these problems by providing a unified searchable interface to these databases. Since a DIR must search across many databases, selecting a specific database to search against the user query is challenging. The challenge can be solved if the past queries of the users are considered in selecting collections to search in combination with word embedding techniques. Combining these would aid the best performing collection selection method to speed up retrieval performance of DIR solutions.
Design/methodology/approach
The authors propose a collection selection model based on word embedding using Word2Vec approach that learns the similarity between the current and past queries. They used the cosine and transformed cosine similarity models in computing the similarities among queries. The experiment is conducted using three standard TREC testbeds created for federated search.
Findings
The results show significant improvements over the baseline models.
Originality/value
Although the lexical matching models for collection selection using similarity based on past queries exist, to the best our knowledge, the proposed work is the first of its kind that uses word embedding for collection selection by learning from past queries.
Details
Keywords
Shuhei Yamamoto, Kei Wakabayashi, Noriko Kando and Tetsuji Satoh
Many Twitter users post tweets that are related to their particular interests. Users can also collect information by following other users. One approach clarifies user interests…
Abstract
Purpose
Many Twitter users post tweets that are related to their particular interests. Users can also collect information by following other users. One approach clarifies user interests by tagging labels based on the users. A user tagging method is important to discover candidate users with similar interests. This paper aims to propose a new user tagging method using the posting time series data of the number of tweets.
Design/methodology/approach
Our hypothesis focuses on the relationship between a user’s interests and the posting times of tweets: as users have interests, they will post more tweets at the time when events occur compared with general times. The authors assume that hashtags are labeled tags to users and observe their occurrence counts in each timestamp. The authors extract burst timestamps using Kleinberg’s burst enumeration algorithm and estimate the burst levels. The authors manage the burst levels as term frequency in documents and calculate the score using typical methods such as cosine similarity, Naïve Bayes and term frequency (TF) in a document and inversed document frequency (IDF; TF-IDF).
Findings
From the sophisticated experimental evaluations, the authors demonstrate the high efficiency of the tagging method. Naïve Bayes and cosine similarity are particular suitable for the user tagging and tag score calculation tasks, respectively. Some users, whose hashtags were appropriately estimated by our methods, experienced higher the maximum value of the number of tweets than other users.
Originality/value
Many approaches estimate user interest based on the terms in tweets and apply such graph theory as following networks. The authors propose a new estimation method that uses the time series data of the number of tweets. The merits to estimating user interest using the time series data do not depend on language and can decrease the calculation costs compared with the above-mentioned approaches because the number of features is fewer.
Details
Keywords
Qingting Wei, Xing Liu, Daming Xian, Jianfeng Xu, Lan Liu and Shiyang Long
The collaborative filtering algorithm is a classical and widely used approach in product recommendation systems. However, the existing algorithms rely mostly on common ratings of…
Abstract
Purpose
The collaborative filtering algorithm is a classical and widely used approach in product recommendation systems. However, the existing algorithms rely mostly on common ratings of items and do not consider temporal information about items or user interests. To solve this problem, this study proposes a new user-item composite filtering (UICF) recommendation framework by leveraging temporal semantics.
Design/methodology/approach
The UICF framework fully utilizes the time information of item ratings for measuring the similarity of items and takes into account the short-term and long-term interest decay for computing users’ latest interest degrees. For an item to be probably recommended to a user, the interest degrees of the user on all the historically rated items are weighted by their similarities with the item to be recommended and then added up to predict the recommendation degree.
Findings
Comprehensive experiments on the MovieLens and KuaiRec datasets for user movie recommendation were conducted to evaluate the performance of the proposed UICF framework. Experimental results show that the UICF outperformed three well-known recommendation algorithms Item-Based Collaborative Filtering (IBCF), User-Based Collaborative Filtering (UBCF) and User-Popularity Composite Filtering (UPCF) in the root mean square error (RMSE), mean absolute error (MAE) and F1 metrics, especially yielding an average decrease of 11.9% in MAE.
Originality/value
A UICF recommendation framework is proposed that combines a time-aware item similarity model and a time-wise user interest degree model. It overcomes the limitations of common rating items and utilizes temporal information in item ratings and user interests effectively, resulting in more accurate and personalized recommendations.