Search results
1 – 10 of 33Yuzhuo Wang, Chengzhi Zhang, Min Song, Seongdeok Kim, Youngsoo Ko and Juhee Lee
In the era of artificial intelligence (AI), algorithms have gained unprecedented importance. Scientific studies have shown that algorithms are frequently mentioned in papers…
Abstract
Purpose
In the era of artificial intelligence (AI), algorithms have gained unprecedented importance. Scientific studies have shown that algorithms are frequently mentioned in papers, making mention frequency a classical indicator of their popularity and influence. However, contemporary methods for evaluating influence tend to focus solely on individual algorithms, disregarding the collective impact resulting from the interconnectedness of these algorithms, which can provide a new way to reveal their roles and importance within algorithm clusters. This paper aims to build the co-occurrence network of algorithms in the natural language processing field based on the full-text content of academic papers and analyze the academic influence of algorithms in the group based on the features of the network.
Design/methodology/approach
We use deep learning models to extract algorithm entities from articles and construct the whole, cumulative and annual co-occurrence networks. We first analyze the characteristics of algorithm networks and then use various centrality metrics to obtain the score and ranking of group influence for each algorithm in the whole domain and each year. Finally, we analyze the influence evolution of different representative algorithms.
Findings
The results indicate that algorithm networks also have the characteristics of complex networks, with tight connections between nodes developing over approximately four decades. For different algorithms, algorithms that are classic, high-performing and appear at the junctions of different eras can possess high popularity, control, central position and balanced influence in the network. As an algorithm gradually diminishes its sway within the group, it typically loses its core position first, followed by a dwindling association with other algorithms.
Originality/value
To the best of the authors’ knowledge, this paper is the first large-scale analysis of algorithm networks. The extensive temporal coverage, spanning over four decades of academic publications, ensures the depth and integrity of the network. Our results serve as a cornerstone for constructing multifaceted networks interlinking algorithms, scholars and tasks, facilitating future exploration of their scientific roles and semantic relations.
Details
Keywords
Qingqing Zhou and Chengzhi Zhang
The development of social media has led to large numbers of internet users now producing massive amounts of user-generated content (UGC). UGC, which shows users’ opinions about…
Abstract
Purpose
The development of social media has led to large numbers of internet users now producing massive amounts of user-generated content (UGC). UGC, which shows users’ opinions about events directly, is valuable for monitoring public opinion. Current researches have focused on analysing topic evolutions in UGC. However, few researches pay attention to emotion evolutions of sub-topics about popular events. Important details about users’ opinions might be missed, as users’ emotions are ignored. This paper aims to extract sub-topics about a popular event from UGC and investigate the emotion evolutions of each sub-topic.
Design/methodology/approach
This paper first collects UGC about a popular event as experimental data and conducts subjectivity classification on the data to get subjective corpus. Second, the subjective corpus is classified into different emotion categories using supervised emotion classification. Meanwhile, a topic model is used to extract sub-topics about the event from the subjective corpora. Finally, the authors use the results of emotion classification and sub-topic extraction to analyze emotion evolutions over time.
Findings
Experimental results show that specific primary emotions exist in each sub-topic and undergo evolutions differently. Moreover, the authors find that performance of emotion classifier is optimal with term frequency and relevance frequency as the feature-weighting method.
Originality/value
To the best of the authors’ knowledge, this is the first research to mine emotion evolutions of sub-topics about an event with UGC. It mines users’ opinions about sub-topics of event, which may offer more details that are useful for analysing users’ emotions in preparation for decision-making.
Details
Keywords
Qiangbing Wang, Shutian Ma and Chengzhi Zhang
Based on user-generated content from a Chinese social media platform, this paper aims to investigate multiple methods of constructing user profiles and their effectiveness in…
Abstract
Purpose
Based on user-generated content from a Chinese social media platform, this paper aims to investigate multiple methods of constructing user profiles and their effectiveness in predicting their gender, age and geographic location.
Design/methodology/approach
This investigation collected 331,634 posts from 4,440 users of Sina Weibo. The data were divided into two parts, for training and testing . First, a vector space model and topic models were applied to construct user profiles. A classification model was then learned by a support vector machine according to the training data set. Finally, we used the classification model to predict users’ gender, age and geographic location in the testing data set.
Findings
The results revealed that in constructing user profiles, latent semantic analysis performed better on the task of predicting gender and age. By contrast, the method based on a traditional vector space model worked better in making predictions regarding the geographic location. In the process of applying a topic model to construct user profiles, the authors found that different prediction tasks should use different numbers of topics.
Originality/value
This study explores different user profile construction methods to predict Chinese social media network users’ gender, age and geographic location. The results of this paper will help to improve the quality of personal information gathered from social media platforms, and thereby improve personalized recommendation systems and personalized marketing.
Details
Keywords
Chenglei Qin, Chengzhi Zhang and Yi Bu
To better understand the online reviews and help potential consumers, businessmen and product manufacturers effectively obtain users’ evaluation on product aspects, this paper…
Abstract
Purpose
To better understand the online reviews and help potential consumers, businessmen and product manufacturers effectively obtain users’ evaluation on product aspects, this paper aims to explore the distribution regularities of users’ attention and sentiment on product aspects from the temporal perspective of online reviews.
Design/methodology/approach
Temporal characteristics of online reviews (purchase time, review time and time intervals between purchase time and review time), similar attributes clustering and attribute-level sentiment computing technologies are used based on more than 340k smartphone reviews of three products from JD.COM (a famous online shopping platform in China) to explore the distribution regularities of users’ attention and sentiment on product aspects in this paper.
Findings
The empirical results show that a power-law distribution can fit users’ attention on product aspects, and the reviews posted in short time intervals contain more product aspects. Besides, the results show that the values of users’ sentiment on product aspects are significantly higher/lower in short time intervals which contribute to judging the advantages and weaknesses of a product.
Research limitations/implications
This paper cannot acquire online reviews for more products with temporal characteristics to verify the findings because of the restriction on reviews crawling by the shopping platforms.
Originality/value
This work reveals the distribution regularities of users’ attention and sentiment on product aspects, which is of great significance in assisting decision-making, optimizing review presentation and improving the shopping experience.
Details
Keywords
Lei Zhao, Yingyi Zhang and Chengzhi Zhang
To understand the meaning of a sentence, humans can focus on important words in the sentence, which reflects our eyes staying on each word in different gaze time or times. Thus…
Abstract
Purpose
To understand the meaning of a sentence, humans can focus on important words in the sentence, which reflects our eyes staying on each word in different gaze time or times. Thus, some studies utilize eye-tracking values to optimize the attention mechanism in deep learning models. But these studies lack to explain the rationality of this approach. Whether the attention mechanism possesses this feature of human reading needs to be explored.
Design/methodology/approach
The authors conducted experiments on a sentiment classification task. Firstly, they obtained eye-tracking values from two open-source eye-tracking corpora to describe the feature of human reading. Then, the machine attention values of each sentence were learned from a sentiment classification model. Finally, a comparison was conducted to analyze machine attention values and eye-tracking values.
Findings
Through experiments, the authors found the attention mechanism can focus on important words, such as adjectives, adverbs and sentiment words, which are valuable for judging the sentiment of sentences on the sentiment classification task. It possesses the feature of human reading, focusing on important words in sentences when reading. Due to the insufficient learning of the attention mechanism, some words are wrongly focused. The eye-tracking values can help the attention mechanism correct this error and improve the model performance.
Originality/value
Our research not only provides a reasonable explanation for the study of using eye-tracking values to optimize the attention mechanism but also provides new inspiration for the interpretability of attention mechanism.
Details
Keywords
Ziling Chen, Chengzhi Zhang, Heng Zhang, Yi Zhao, Chen Yang and Yang Yang
The composition of author teams is a significant factor affecting the novelty of academic papers. Existing research lacks studies focusing on institutional types and measures of…
Abstract
Purpose
The composition of author teams is a significant factor affecting the novelty of academic papers. Existing research lacks studies focusing on institutional types and measures of novelty remained at a general level, making it difficult to analyse the types of novelty in papers and to provide a detailed explanation of novelty. This study aims to take the field of natural language processing (NLP) as an example to analyse the relationship between team institutional composition and the fine-grained novelty of academic papers.
Design/methodology/approach
Firstly, author teams are categorized into three types: academic institutions, industrial institutions and mixed academic and industrial institutions. Next, the authors extract four types of entities from the full paper: methods, data sets, tools and metric. The novelty of papers is evaluated using entity combination measurement methods. Additionally, pairwise combinations of different types of fine-grained entities are analysed to assess their contributions to novel papers.
Findings
The results of the study found that in the field of NLP, for industrial institutions, collaboration with academic institutions has a higher probability of producing novel papers. From the contribution rate of different types of fine-grained knowledge entities, the mixed academic and industrial institutions pay more attention to the novelty of the combination of method indicators, and the industrial institutions pay more attention to the novelty of the combination of method tools.
Originality/value
This paper explores the relationship between the team institutional composition and the novelty of academic papers and reveals the importance of cooperation between industry and academia through fine-grained novelty measurement, which provides key guidance for improving the quality of papers and promoting industry–university–research cooperation.
Details
Keywords
Yuzhuo Wang and Chengzhi Zhang
Citation content in academic papers and academic monographs promotes the knowledge flow among different publications. However, existing citation content analysis (CCA) focuses on…
Abstract
Purpose
Citation content in academic papers and academic monographs promotes the knowledge flow among different publications. However, existing citation content analysis (CCA) focuses on academic papers and monographs have not received much research attention. We want to know if monographs are appropriate objects of CCA and whether existing methods of analyzing citation in papers are suitable for citation in monographs. Therefore, this paper aims to learn more about features of cited references and citation content in monographs and compare the characteristic of citation pattern between monographs and papers.
Design/methodology/approach
The authors manually annotate the references and syntactic citation content in academic monographs published by Morgan & Claypool and automatically extracted the references and citation content from academic papers published by Public Library of Science. Five features in two types citation pattern, namely, pattern of cited reference (including year, source and mention frequency of reference) and pattern of citation content (including location, length of citation content) are used to examine similarities and differences between monographs and papers.
Findings
The results indicate that between monographs and papers, differences are shown in location, length of citation content and year, source of reference, whereas frequency of mention of reference is similar.
Originality/value
Previous studies have explored the patter of citation content in academic papers. However, none of the existing literature, as far as the authors know, has considered the citation content in academic monographs and the similarities or differences among academic documents when studying the citation pattern.
Details
Keywords
Zhongyi Wang, Haihua Chen, Chengzhi Zhang, Wei Lu and Jian Wu