Rajat Kumar Mudgal, Rajdeep Niyogi, Alfredo Milani and Valentina Franzoni
The purpose of this paper is to propose and experiment a framework for analysing the tweets to find the basis of popularity of a person and extract the reasons supporting the…
Abstract
Purpose
The purpose of this paper is to propose and experiment a framework for analysing the tweets to find the basis of popularity of a person and extract the reasons supporting the popularity. Although the problem of analysing tweets to detect popular events and trends has recently attracted extensive research efforts, not much emphasis has been given to find out the reasons behind the popularity of a person based on tweets.
Design/methodology/approach
In this paper, the authors introduce a framework to find out the reasons behind the popularity of a person based on the analysis of events and the evaluation of a Web-based semantic set similarity measure applied to tweets. The methodology uses the semantic similarity measure to group similar tweets in events. Although the tweets cannot contain identical hashtags, they can refer to a unique topic with equivalent or related terminology. A special data structure maintains event information, related keywords and statistics to extract the reasons supporting popularity.
Findings
An implementation of the algorithms has been experimented on a data set of 218,490 tweets from five different countries for popularity detection and reasons extraction. The experimental results are quite encouraging and consistent in determining the reasons behind popularity. The use of Web-based semantic similarity measure is based on statistics extracted from search engines, it allows to dynamically adapt the similarity values to the variation on the correlation of words depending on current social trends.
Originality/value
To the best of the authors’ knowledge, the proposed method for finding the reason of popularity in short messages is original. The semantic set similarity presented in the paper is an original asymmetric variant of a similarity scheme developed in the context of semantic image recognition.
Details
Keywords
Alfredo Milani, Niyogi Rajdeep, Nimita Mangal, Rajat Kumar Mudgal and Valentina Franzoni
This paper aims to propose an approach for the analysis of user interest based on tweets, which can be used in the design of user recommendation systems. The extract topics are…
Abstract
Purpose
This paper aims to propose an approach for the analysis of user interest based on tweets, which can be used in the design of user recommendation systems. The extract topics are seen positively by the user.
Design/methodology/approach
The proposed approach is based on the combination of sentiment extraction and classification analysis of tweet to extract the topic of interest. The proposed hybrid method is original. The topic extraction phase uses a method based on semantic distance in the WordNet taxonomy. Sentiment extraction uses NLPcore.
Findings
The algorithm has been extensively tested using real tweets generated by 1,000 users. The results are quite encouraging and outperform state-of-the-art results and confirm the suitability of the approach combining sentiment and categorization for the topic of interest extraction.
Research limitations/implications
The hybrid method combining sentiment extraction and classification for user positive topics represents a novel contribution with many potential applications.
Practical implications
The functionality of positive topic extraction is very useful as a component in the design of a recommender system based on user profiling from Twitter user behaviors.
Social implications
The application of the proposed method in short-text social network can be massive and beyond the applications in tweets.
Originality/value
There are few works that have considered both sentiment analysis and classification to find out users’ interest. The algorithm has been extensively tested using real tweets generated by 1,000 users. The results are quite encouraging and outperform state-of-the-art results.
Details
Keywords
Valentina Franzoni and Alfredo Milani
In this work, a new general framework is proposed to guide navigation over a collaborative concept network, in order to discover paths between concepts. Finding semantic chains…
Abstract
Purpose
In this work, a new general framework is proposed to guide navigation over a collaborative concept network, in order to discover paths between concepts. Finding semantic chains between concepts over a semantic network is an issue of great interest for many applications, such as explanation generation and query expansion. Collaborative concept networks over the web tend to have features such as large dimensions, high connectivity degree, dynamically evolution over the time, which represent special challenges for efficient graph search methods, since they result in huge memory requirements, high branching factors, unknown dimensions and high cost for accessing nodes. The paper aims to discuss these issues.
Design/methodology/approach
The proposed framework is based on the novel notion of heuristic semantic walk (HSW). In the HSW framework, a semantic proximity measure among concepts, reflecting the collective knowledge embedded in search engines or other statistical sources, is used as a heuristic in order to guide the search in the collaborative network. Different search strategies, information sources and proximity measures, can be used to adapt HSW to the collaborative semantic network under consideration.
Findings
Experiments held on the Wikipedia network and Bing search engine on a range of different semantic measures show that the proposed HSW approach with weighted randomized walk strategy outperforms state-of-the-art search methods.
Originality/value
To the best of the authors' knowledge, the proposed HSW model is the first approach which uses search engine-based proximity measures as heuristic for semantic search.