Providing consumers with a representative subset from online reviews
Abstract
Purpose
The purpose of this paper is to find a representative subset from large-scale online reviews for consumers. The subset is significantly small in size, but covers the majority amount of information in the original reviews and contains little redundant information.
Design/methodology/approach
A heuristic approach named RewSel is proposed to successively select representatives until the number of representatives meets the requirement. To reveal the advantages of the approach, extensive data experiments and a user study are conducted on real data.
Findings
The proposed approach has the advantage over the benchmarks in terms of coverage and redundancy. People show preference to the representative subsets provided by RewSel. The proposed approach also has good scalability, and is more adaptive to big data applications.
Research limitations/implications
The paper contributes to the literature of review selection, by proposing a heuristic approach which achieves both high coverage and low redundancy. This study can be applied as the basis for conducting further analysis of large-scale online reviews.
Practical implications
The proposed approach offers a novel way to select a representative subset of online reviews to facilitate consumer decision making. It can also enhance the existing information retrieval system to provide representative information to users rather than a large amount of results.
Originality/value
The proposed approach finds the representative subset by adopting the concept of relative entropy and sentiment analysis methods. Compared with state-of-the-art approaches, it offers a more effective and efficient way for users to handle a large amount of online information.
Keywords
Acknowledgements
The work is supported by Fundamental Research Funds for the Central Universities, and the Research Funds of Renmin University of China (14XNI012).
Citation
Zhang, J., Ren, M., Xiao, X. and Zhang, J. (2017), "Providing consumers with a representative subset from online reviews", Online Information Review, Vol. 41 No. 6, pp. 877-899. https://doi.org/10.1108/OIR-05-2016-0125
Publisher
:Emerald Publishing Limited
Copyright © 2017, Emerald Publishing Limited