Search results

1 – 8 of 8

Article

Publication date: 6 February 2017

Developing a novel recommender network-based ranking mechanism for library book acquisition

Downloads

657

Abstract

Purpose

Most academic libraries provide book recommendation services to enable readers to recommend books to the libraries. To facilitate decision-making in book acquisition, this study aimed to develop a method to determine the ranking of the recommended books based on the recommender network.

Design/methodology/approach

The recommender network was constructed to establish relationships among book recommenders and their similar readers by using circulation records. Furthermore, social computing techniques were used to evaluate the degree of representativeness of the recommenders, which was subsequently applied as a criterion to rank the recommended books. Empirical studies were performed to demonstrate the effectiveness of the proposed ranking system. The Spearman’s correlation coefficients between the proposed ranking system and the ranking obtained using reader circulation statistics were used as the performance measure.
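
As an illustration of the performance measure named above, the following is a minimal sketch of Spearman's correlation between two rankings of the same recommended books. The scores, the circulation counts and all function names are made up for illustration; they are not data or code from the study.

```python
def average_ranks(values):
    """Rank values from 1 (smallest); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values for five recommended books.
proposed_scores = [0.9, 0.4, 0.7, 0.1, 0.5]   # assumed output of a ranking mechanism
circulation_counts = [30, 12, 25, 3, 8]       # assumed later circulation statistics
print(round(spearman(proposed_scores, circulation_counts), 3))
```

A coefficient close to 1 would indicate that the two rankings order the books in nearly the same way.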

Findings

The ranking calculated using the proposed ranking mechanism was highly and moderately correlated to the ranking obtained using reader circulation statistics. The ranking of recommended books by the librarians was moderately and poorly correlated to the ranking calculated using reader circulation statistics.

Practical implications

The book recommender can be used to improve the accuracy of book recommendations.

Originality/value

This study is the first to consider the recommender network in library book acquisition. The results also show that the proposed ranking mechanism can facilitate effective book-acquisition decisions in libraries.

Details

The Electronic Library, vol. 35 no. 1

Type: Research Article

ISSN: 0264-0473

Article

Publication date: 14 January 2025

Predicting the churn patterns of monetizers and non-monetizers: exploring the influence of behavioral variability in churn prediction

Ruei-Yan Wu, Ya-Han Hu and En-Yi Chou

Abstract

Purpose

Although prior research has employed various variables to predict player churn, the dynamic evolution of the behavioral patterns of players has received limited attention. In this study, churn prediction models are developed by incorporating the progress level, in-game purchase, social interaction, behavioral pattern and behavioral variability (BV) of players in social casino games (SCGs). The study distinguishes churn prediction between two player groups: monetizers and non-monetizers.

Design/methodology/approach

This study employs three machine learning techniques—logistic regression, decision trees and random forests—using real-world player data from an SCG company to construct churn prediction models. Two experiments were conducted. In Experiment 1, BV was combined with four other variable categories to effectively predict churn behaviors across all players (n = 52,246). In Experiment 2, churn prediction models were developed separately for monetizers (n = 16,628) and non-monetizers (n = 35,618).

Findings

The findings from Experiment 1 indicate that incorporating BV significantly improves the overall performance of churn prediction models. Experiment 2 demonstrates that churn prediction models achieve better performance and predictive accuracy for monetizers and non-monetizers when BV is calculated over the 3-day to 7-day and 7-day to 14-day windows, respectively.

Originality/value

This study introduces BV as a novel variable category for churn prediction, emphasizing within-person variability and demonstrating its effectiveness in enhancing model performance. Churn prediction models were independently constructed for monetizers and non-monetizers, utilizing different time windows for variable extraction. This approach improves predictive performance and highlights key differences in critical variables influencing churn across the two player groups. The findings provide valuable insights into churn management strategies tailored for monetizers and non-monetizers.
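
The abstract describes BV as within-person variability but does not give its formula. The sketch below therefore assumes one plausible measure, the coefficient of variation of a single player's daily activity over a day window. The function name, the day-numbering convention and the sample data are all hypothetical.

```python
from statistics import mean, pstdev


def behavioral_variability(daily_values, start_day, end_day):
    """Coefficient of variation of one player's daily activity.

    Days are numbered from 1, and the window covers start_day through
    end_day inclusive. This is an assumed measure, not the study's definition.
    """
    window = daily_values[start_day - 1:end_day]
    avg = mean(window)
    if avg == 0:
        return 0.0  # no activity in the window, so no measurable variability
    return pstdev(window) / avg


# Hypothetical daily spin counts for one player over 14 days.
spins = [40, 35, 20, 60, 10, 55, 15, 30, 30, 28, 32, 30, 29, 31]
print(round(behavioral_variability(spins, 3, 7), 3))    # days 3 to 7
print(round(behavioral_variability(spins, 7, 14), 3))   # days 7 to 14
```

Computing the same measure over different windows gives one feature per window, which is one way separate windows could be tried for different player groups.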

Details

Internet Research, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 1066-2243

Article

Publication date: 16 October 2018

Considering online consumer reviews to predict movie box-office performance between the years 2009 and 2014 in the US

Ya-Han Hu, Wen-Ming Shiau, Sheng-Pao Shih and Cho-Ju Chen

Downloads

1217

Abstract

Purpose

The purpose of this paper is to combine basic movie information factors, external factors and review factors, to predict box-office performance and identify the most crucial factor of influence for box-office performance.

Design/methodology/approach

Data for five movie genres and first-week movie reviews found on IMDb were collected. The movie reviews were quantified using the sentiment analysis tools SentiStrength and Stanford CoreNLP, and the quantified data were combined with basic movie information and external environment factors to predict movie box-office performance. A movie box-office performance prediction model was then developed using data mining (DM) technologies with M5 model trees (M5P), linear regression (LR) and support vector regression (SVR), after which movie box-office performance predictions were made.

Findings

The results of this paper showed that the inclusion of movie reviews generated more accurate prediction results. Concerning movie review-related factors, the one that exhibited the greatest effect on box-office performance was the number of movie reviews made, whereas movie review content only displayed an effect on box-office performance for specific movie genres.

Research limitations/implications

Because this paper collected movie data from the IMDb, the data were limited and primarily consisted of movies released in the USA; data pertaining to less popular movies or those released outside of the USA were, thus, insufficient.

Practical implications

This paper helps to verify whether considering features extracted from movie reviews can improve the prediction of movie box-office performance.

Originality/value

Through various DM technologies, this paper shows that movie reviews enhanced the accuracy of box-office performance predictions and the content of movie reviews has an effect on box-office performance.

Details

The Electronic Library, vol. 36 no. 6

Type: Research Article

ISSN: 0264-0473

Article

Publication date: 29 July 2014

Two-stage credit rating prediction using machine learning techniques

Hsu-Che Wu, Ya-Han Hu and Yen-Hao Huang

Downloads

1071

Abstract

Purpose

Credit ratings have become one of the primary references for financial institutions to assess credit risk. Conventional credit rating approaches mainly concentrated on two-class classification (i.e. good or bad credit), which lacks adequate precision to perform credit risk evaluations in practice. In addition, most previous research focussed directly on employing various data mining techniques, but few studies discussed the influence of data preprocessing before classifier construction. The paper aims to discuss these issues.

Design/methodology/approach

This study applies nine-class classification (i.e. nine credit risk levels) to credit rating prediction. For the development of more accurate classifiers, the paper adopts two-stage analysis, which integrates multiple data preprocessing and supervised learning techniques. Specifically, the first stage applies feature selection, data clustering, and data resampling methods to preprocess the data, and then the second stage utilizes several classification techniques and classifier ensembles to construct prediction models.

Findings

The results show that Bagging-DT with data resampling method achieves excellent accuracy (82.96 percent), indicating that the proposed two-stage prediction model is better than conventional one-stage models.

Originality/value

In practice, the results of this study can lower credit rating expenses and also allow corporations to obtain credit rating information instantly.

Details

Kybernetes, vol. 43 no. 7

Type: Research Article

ISSN: 0368-492X

Article

Publication date: 8 August 2016

Research impact of general and funded papers: A citation analysis of two ACM international conference proceeding series

Cheng-Che Shen, Ya-Han Hu, Wei-Chao Lin, Chih-Fong Tsai and Shih-Wen Ke

Downloads

595

Abstract

Purpose

The purpose of this paper is to focus on examining the research impact of papers written with and without funding. Specifically, the citation analysis method is used to compare the general and funded papers published in two leading international conferences, which are ACM SIGIR and ACM SIGKDD.

Design/methodology/approach

The authors investigate the number of general and funded papers to see whether the number of funded papers is larger than the number of general papers. In addition, the total citations and the number of highly cited papers with and without funding are also compared.

Findings

The analysis results of ACM SIGIR papers show that in most cases the number of funded papers is larger than the number of general papers. Moreover, the total citations, the average number of citations per paper, and the number of highly cited papers all reveal the superiority of funded papers over general papers. However, the findings are somewhat different for the ACM SIGKDD papers. This may be because ACM SIGIR began much earlier than ACM SIGKDD, which relates to the maturity of the research problems addressed in these two conferences.

Originality/value

The value of this paper lies in being the first attempt to examine the research impact of general and funded research papers by the citation analysis method. The research impact of other research areas can be further investigated by other analysis methods.

Details

Online Information Review, vol. 40 no. 4

Type: Research Article

ISSN: 1468-4527

Article

Publication date: 9 September 2014

Citation impact analysis of research papers that appear in oral and poster sessions: A case study of three computer science conferences

Shih-Wen Ke, Wei-Chao Lin, Chih-Fong Tsai and Ya-Han Hu

Downloads

576

Abstract

Purpose

Conference publications are an important aspect of research activities. There are generally both oral presentations and poster sessions at large international conferences. One can hypothesise that, for the same conferences, the papers presented in oral sessions should have a higher research impact than the papers presented in poster sessions. However, there has been no related study examining the validity of this hypothesis. In other words, the difference in research impact between papers presented orally and those presented during poster sessions has not been discussed in the literature. Therefore, the purpose of this paper is to conduct a citation analysis to compare the research impact of papers presented in oral and poster sessions.

Design/methodology/approach

In this paper, data from three leading conferences in the field of computer vision are examined, namely CVPR (2011 and 2012), ICCV (2011) and ECCV (2012). Several types of citation-related statistics are collected, including the number of highly cited papers (i.e. high number of citations) presented in oral and poster sessions, the total citations of both types of papers, the average citations of oral and poster papers, and the average citations of each frequently cited paper of both types.
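
The citation statistics listed above can be sketched as follows. The citation counts and the threshold of 100 citations for a "highly cited" paper are assumptions for illustration only; the abstract states neither the data nor the cut-off used.

```python
from statistics import mean


def citation_summary(citations, high_threshold):
    """Summarise citation counts for one session type (oral or poster)."""
    highly_cited = [c for c in citations if c >= high_threshold]
    return {
        "papers": len(citations),
        "total": sum(citations),
        "average": mean(citations) if citations else 0.0,
        "highly_cited": len(highly_cited),
        "highly_cited_share": len(highly_cited) / len(citations) if citations else 0.0,
        "highly_cited_average": mean(highly_cited) if highly_cited else 0.0,
    }


# Made-up citation counts, not data from the study.
oral = [120, 45, 80, 10]
poster = [200, 5, 12, 30, 8, 15]
HIGH = 100  # hypothetical threshold for "highly cited"

for name, counts in (("oral", oral), ("poster", poster)):
    print(name, citation_summary(counts, HIGH))
```

Comparing the share of highly cited papers rather than the raw count keeps the comparison fair when one session type has many more papers than the other.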

Findings

There are three main findings. First, a larger proportion of highly cited papers are from oral sessions than poster sessions. Second, the average number of citations per paper is larger for those presented in oral sessions than poster sessions. Third, the average number of citations for highly cited papers presented in oral sessions is not necessarily greater than for the ones presented in poster sessions.

Originality/value

The originality of this paper is that it is the first attempt to examine the differences in citation impact between conference papers presented in oral and poster sessions. The findings of this study will allow future bibliometrics research to further explore this related issue for longer periods and different fields.

Details

Online Information Review, vol. 38 no. 6

Type: Research Article

ISSN: 1468-4527

Article

Publication date: 12 June 2014

A Borda count approach to combine subjective and objective based MIS journal rankings

Chih-Fong Tsai, Ya-Han Hu and Shih-Wen George Ke

Downloads

495

Abstract

Purpose

Ranking relevant journals is critical for researchers choosing their publication outlets, which can affect their research performance. In the management information systems (MIS) field, many related studies conducted surveys as the subjective method for identifying MIS journal rankings. However, very few consider objective methods, such as journals’ impact factors and h-indexes. The paper aims to discuss these issues.

Design/methodology/approach

In this paper, the top 50 ranked journals identified by researchers’ perceptions are examined in terms of their correlation with the rankings by impact factor and h-index. Moreover, a hybrid method based on Borda count is used to combine these different rankings and produce new MIS journal rankings.
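
A minimal sketch of a Borda count that merges several rankings of the same journals is given below. The journal names, the three input rankings and the alphabetical tie-break are hypothetical; the abstract does not state how the paper handles ties or journals missing from a ranking.

```python
def borda_count(rankings):
    """Combine rankings (best first) of the same items with a Borda count.

    With n items, each ranking awards n - 1 points to its first item,
    n - 2 to its second, and so on down to 0 for its last item.
    """
    items = rankings[0]
    n = len(items)
    scores = {item: 0 for item in items}
    for ranking in rankings:
        if set(ranking) != set(items) or len(ranking) != n:
            raise ValueError("every ranking must order the same set of items")
        for position, item in enumerate(ranking):
            scores[item] += n - 1 - position
    # Highest total first; ties broken alphabetically (an assumed convention).
    combined = sorted(scores, key=lambda item: (-scores[item], item))
    return combined, scores


# Hypothetical rankings of four journals from three sources.
by_survey = ["Journal A", "Journal B", "Journal C", "Journal D"]
by_impact_factor = ["Journal B", "Journal A", "Journal D", "Journal C"]
by_h_index = ["Journal B", "Journal C", "Journal A", "Journal D"]

combined, scores = borda_count([by_survey, by_impact_factor, by_h_index])
print(combined)
print(scores)
```

Because the method uses only positions, it can merge a survey-based ranking with metric-based rankings without putting their underlying scores on a common scale.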

Findings

The results show that there are low correlations between the subjective and objective based MIS journal rankings. In addition, the new MIS journal rankings produced by the Borda count approach can also be considered in future research.

Originality/value

The contribution of this paper is to apply the Borda count approach to combine different MIS journal rankings produced by subjective and objective methods. The new MIS journal rankings and previous studies can be complementary to allow researchers to determine the top-ranked journals for their publication outlets.

Details

Online Information Review, vol. 38 no. 4

Type: Research Article

ISSN: 1468-4527

Article

Publication date: 22 March 2013

A comparative study of hybrid machine learning techniques for customer lifetime value prediction

Chih‐Fong Tsai, Ya‐Han Hu, Chia‐Sheng Hung and Yu‐Feng Hsu

Downloads

2530

Abstract

Purpose

Customer lifetime value (CLV) has received increasing attention in database marketing. Enterprises can retain valuable customers by correctly predicting which customers are valuable. In the literature, many data mining and machine learning techniques have been applied to develop CLV models. Specifically, hybrid techniques have shown their superiority over single techniques. However, it is unknown which hybrid model performs best in customer value prediction. Therefore, the purpose of this paper is to compare two types of commonly‐used hybrid models, based on the classification+classification and clustering+classification hybrid approaches, respectively, in terms of customer value prediction.

Design/methodology/approach

To construct a hybrid model, multiple techniques are usually combined in a two‐stage manner, in which the first stage is based on either clustering or classification techniques, which can be used to pre‐process the data. Then, the output of the first stage (i.e. the processed data) is used to construct the second stage classifier as the prediction model. Specifically, decision trees, logistic regression, and neural networks are used as the classification techniques and k‐means and self‐organizing maps for the clustering techniques to construct six different hybrid models.

Findings

The experimental results over a real case dataset show that the classification+classification hybrid approach performs the best. In particular, combining two stages of decision trees provides the highest rate of accuracy (99.73 percent) and the lowest rates of Type I/II errors (0.22 percent/0.43 percent).

Originality/value

The contribution of this paper is to demonstrate that hybrid machine learning techniques perform better than single ones. In addition, this paper allows us to find out which hybrid technique performs best in terms of CLV prediction.

Details

Kybernetes, vol. 42 no. 3

Type: Research Article

ISSN: 0368-492X