Search results
1 – 10 of 14Jui-Long Hung, Kerry Rice, Jennifer Kepka and Juan Yang
For studies in educational data mining or learning Analytics, the prediction of student’s performance or early warning is one of the most popular research topics. However…
Abstract
Purpose
For studies in educational data mining or learning Analytics, the prediction of student’s performance or early warning is one of the most popular research topics. However, research gaps indicate a paucity of research using machine learning and deep learning (DL) models in predictive analytics that include both behaviors and text analysis.
Design/methodology/approach
This study combined behavioral data and discussion board content to construct early warning models with machine learning and DL algorithms. In total, 680 course sections, 12,869 students and 14,951,368 logs were collected from a K-12 virtual school in the USA. Three rounds of experiments were conducted to demonstrate the effectiveness of the proposed approach.
Findings
The DL model performed better than machine learning models and was able to capture 51% of at-risk students in the eighth week with 86.8% overall accuracy. The combination of behavioral and textual data further improved the model’s performance in both recall and accuracy rates. The total word count is a more general indicator than the textual content feature. Successful students showed more words in analytic, and at-risk students showed more words in authentic when text was imported into a linguistic function word analysis tool. The balanced threshold was 0.315, which can capture up to 59% of at-risk students.
Originality/value
The results of this exploratory study indicate that the use of student behaviors and text in a DL approach may improve the predictive power of identifying at-risk learners early enough in the learning process to allow for interventions that can change the course of their trajectory.
Details
Keywords
Mingyan Zhang, Xu Du, Kerry Rice, Jui-Long Hung and Hao Li
This study aims to propose a learning pattern analysis method which can improve a predictive model’s performance, as well as discover hidden insights into micro-level learning…
Abstract
Purpose
This study aims to propose a learning pattern analysis method which can improve a predictive model’s performance, as well as discover hidden insights into micro-level learning pattern. Analyzing student’s learning patterns can help instructors understand how their course design or activities shape learning behaviors; depict students’ beliefs about learning and their motivation; and predict learning performance by analyzing individual students’ learning patterns. Although time-series analysis is one of the most feasible predictive methods for learning pattern analysis, literature-indicated current approaches cannot provide holistic insights about learning patterns for personalized intervention. This study identified at-risk students by micro-level learning pattern analysis and detected pattern types, especially at-risk patterns that existed in the case study. The connections among students’ learning patterns, corresponding self-regulated learning (SRL) strategies and learning performance were finally revealed.
Design/methodology/approach
The method used long short-term memory (LSTM)-encoder to process micro-level behavioral patterns for feature extraction and compression, thus the students’ behavior pattern information were saved into encoded series. The encoded time-series data were then used for pattern analysis and performance prediction. Time series clustering were performed to interpret the unique strength of proposed method.
Findings
Successful students showed consistent participation levels and balanced behavioral frequency distributions. The successful students also adjusted learning behaviors to meet with course requirements accordingly. The three at-risk patten types showed the low-engagement (R1) the low-interaction (R2) and the non-persistent characteristics (R3). Successful students showed more complete SRL strategies than failed students. Political Science had higher at-risk chances in all three at-risk types. Computer Science, Earth Science and Economics showed higher chances of having R3 students.
Research limitations/implications
The study identified multiple learning patterns which can lead to the at-risk situation. However, more studies are needed to validate whether the same at-risk types can be found in other educational settings. In addition, this case study found the distributions of at-risk types were vary in different subjects. The relationship between subjects and at-risk types is worth further investigation.
Originality/value
This study found the proposed method can effectively extract micro-level behavioral information to generate better prediction outcomes and depict student’s SRL learning strategies in online learning. The authors confirm that the research in their work is original, and that all the data given in the paper are real and authentic. The study has not been submitted to peer review and not has been accepted for publishing in another journal.
Details
Keywords
Xu Du, Juan Yang, Brett Shelton and Jui-Long Hung
Online learning is well-known by its flexibility of learning anytime and anywhere. However, how behavioral patterns tied to learning anytime and anywhere influence learning…
Abstract
Purpose
Online learning is well-known by its flexibility of learning anytime and anywhere. However, how behavioral patterns tied to learning anytime and anywhere influence learning outcomes are still unknown.
Design/methodology/approach
This study proposed concepts of time and location entropy to depict students’ spatial-temporal patterns. A total of 5,221 students with 1,797,677 logs, including 485 on-the-job students and 4,736 full-time students, were analyzed to depict their spatial-temporal learning patterns, including the relationships between identified patterns and students’ learning performance.
Findings
Analysis results indicate on-the-job students took more advantage of anytime, anywhere than full-time students. Students with a higher tendency for learning anytime and a lower level of learning anywhere were more likely to have better outcomes. Gender did not show consistent findings on students’ spatial-temporal patterns, but partial findings could be supported by evidence in neural science or by cultural and geographical differences.
Research limitations/implications
A more accurate approach for categorizing position and location might be considered. Some findings need more studies for further validation. Finally, future research can consider connections between other well-known performance predictors (such as financial situation, motivation, personality and major) and the type of learning patterns.
Practical implications
The findings gained from this study can help improve the understandings of students’ learning behavioral patterns and design as well as implement better online education programs.
Originality/value
This study proposed concepts of time and location entropy to identify successful spatial-temporal patterns of on-the-job and full-time students.
Details
Keywords
Xu Du, Juan Yang, Jui-Long Hung and Brett Shelton
Educational data mining (EDM) and learning analytics, which are highly related subjects but have different definitions and focuses, have enabled instructors to obtain a holistic…
Abstract
Purpose
Educational data mining (EDM) and learning analytics, which are highly related subjects but have different definitions and focuses, have enabled instructors to obtain a holistic view of student progress and trigger corresponding decision-making. Furthermore, the automation part of EDM is closer to the concept of artificial intelligence. Due to the wide applications of artificial intelligence in assorted fields, the authors are curious about the state-of-art of related applications in Education.
Design/methodology/approach
This study focused on systematically reviewing 1,219 EDM studies that were searched from five digital databases based on a strict search procedure. Although 33 reviews were attempted to synthesize research literature, several research gaps were identified. A comprehensive and systematic review report is needed to show us: what research trends can be revealed and what major research topics and open issues are existed in EDM research.
Findings
Results show that the EDM research has moved toward the early majority stage; EDM publications are mainly contributed by “actual analysis” category; machine learning or even deep learning algorithms have been widely adopted, but collecting actual larger data sets for EDM research is rare, especially in K-12. Four major research topics, including prediction of performance, decision support for teachers and learners, detection of behaviors and learner modeling and comparison or optimization of algorithms, have been identified. Some open issues and future research directions in EDM field are also put forward.
Research limitations/implications
Limitations for this search method include the likelihood of missing EDM research that was not captured through these portals.
Originality/value
This systematic review has not only reported the research trends of EDM but also discussed open issues to direct future research. Finally, it is concluded that the state-of-art of EDM research is far from the ideal of artificial intelligence and the automatic support part for teaching and learning in EDM may need improvement in the future work.
Details
Keywords
Wu He, Jui-Long Hung and Lixin Liu
The paper aims to help enterprises gain valuable knowledge about big data implementation in practice and improve their information management ability, as they accumulate…
Abstract
Purpose
The paper aims to help enterprises gain valuable knowledge about big data implementation in practice and improve their information management ability, as they accumulate experience, to reuse or adapt the proposed method to achieve a sustainable competitive advantage.
Design/methodology/approach
Guided by the theory of technological frames of reference (TFR) and transaction cost theory (TCT), this paper describes a real-world case study in the banking industry to explain how to help enterprises leverage big data analytics for changes. Through close integration with bank's daily operations and strategic planning, the case study shows how the analytics team frame the challenge and analyze the data with two analytic models – customer segmentation (unsupervised) and product affinity prediction (supervised), to initiate the adoption of big data analytics in precise marketing.
Findings
The study reported relevant findings from a longitudinal data analysis and identified some key success factors. First, non-technical factors, for example intuitive analytics results, appropriate evaluation baseline, multiple-wave implementation and selection of marketing channels critically influence big data implementation progress in organizations. Second, a successful campaign also relies on technical factors. For example, the clustering analytics could promote customers' response rates, and the product affinity prediction model could boost efficient transaction and lower time costs.
Originality/value
For theoretical contribution, this paper verified that the outstanding characteristics of online mutual fund platforms brought up by Nagle, Seamans and Tadelis (2010) could not guarantee organizations' competitive advantages from the aspect of TCT.
Details
Keywords
Lizhao Zhang, Xu Du, Jui-Long Hung and Hao Li
The purpose of this study is to conduct a systematic review to understand state-of-art research related to learning preferences from the aspects of impacts, influential factors…
Abstract
Purpose
The purpose of this study is to conduct a systematic review to understand state-of-art research related to learning preferences from the aspects of impacts, influential factors and evaluation methods.
Design/methodology/approach
This paper uses the systematic synthesis method to provide state-of-the-art knowledge on learning preference research by summarizing published studies in major databases and attempting to aggregate and reconcile the scientific results from the individual studies. The findings summarize aggregated research efforts and improve the quality of future research.
Findings
After analyzing existing literature, this study proposed three possible research directions in the future. First, researchers might focus on how to use the real-time tracking mechanism to further understand other impacts of learning preferences within the learning environments. Second, existing studies mainly focused on the influence of singular factors on learning preferences. The joint effects of multiple factors should be an important topic for future research. Finally, integrated algorithms might become the most popular evaluation method of learning preference in the era of smart learning environments.
Research limitations/implications
This review used the search results generated by Google Scholar and Web of Science databases. There might be published papers available in other databases that have not been taken into account.
Originality/value
The research summarizes the state-of-art research related to learning preferences. This paper is one of the first to discuss the development of learning preference research in smart learning environments.
Details
Keywords
Lizhao Zhang, Jui-Long Hung, Xu Du, Hao Li and Zhuang Hu
Student engagement is a key factor that connects with student achievement and retention. This paper aims to identify individuals' engagement automatically in the classroom with…
Abstract
Purpose
Student engagement is a key factor that connects with student achievement and retention. This paper aims to identify individuals' engagement automatically in the classroom with multimodal data for supporting educational research.
Design/methodology/approach
The video and electroencephalogram data of 36 undergraduates were collected to represent observable and internal information. Since different modal data have different granularity, this study proposed the Fast–Slow Neural Network (FSNN) to detect engagement through both observable and internal information, with an asynchrony structure to preserve the sequence information of data with different granularity.
Findings
Experimental results show that the proposed algorithm can recognize engagement better than the traditional data fusion methods. The results are also analyzed to figure out the reasons for the better performance of the proposed FSNN.
Originality/value
This study combined multimodal data from observable and internal aspects to improve the accuracy of engagement detection in the classroom. The proposed FSNN used the asynchronous process to deal with the problem of remaining sequential information when facing multimodal data with different granularity.
Details
Keywords
Juan Yang, Xu Du, Jui-Long Hung and Chih-hsiung Tu
Critical thinking is considered important in psychological science because it enables students to make effective decisions and optimizes their performance. Aiming at the…
Abstract
Purpose
Critical thinking is considered important in psychological science because it enables students to make effective decisions and optimizes their performance. Aiming at the challenges and issues of understanding the student's critical thinking, the objective of this study is to analyze online discussion data through an advanced multi-feature fusion modeling (MFFM) approach for automatically and accurately understanding the student's critical thinking levels.
Design/methodology/approach
An advanced MFFM approach is proposed in this study. Specifically, with considering the time-series characteristic and the high correlations between adjacent words in discussion contents, the long short-term memory–convolutional neural network (LSTM-CNN) architecture is proposed to extract deep semantic features, and then these semantic features are combined with linguistic and psychological knowledge generated by the LIWC2015 tool as the inputs of full-connected layers to automatically and accurately predict students' critical thinking levels that are hidden in online discussion data.
Findings
A series of experiments with 94 students' 7,691 posts were conducted to verify the effectiveness of the proposed approach. The experimental results show that the proposed MFFM approach that combines two types of textual features outperforms baseline methods, and the semantic-based padding can further improve the prediction performance of MFFM. It can achieve 0.8205 overall accuracy and 0.6172 F1 score for the “high” category on the validation dataset. Furthermore, it is found that the semantic features extracted by LSTM-CNN are more powerful for identifying self-introduction or off-topic discussions, while the linguistic, as well as psychological features, can better distinguish the discussion posts with the highest critical thinking level.
Originality/value
With the support of the proposed MFFM approach, online teachers can conveniently and effectively understand the interaction quality of online discussions, which can support instructional decision-making to better promote the student's knowledge construction process and improve learning performance.
Details
Keywords
Cheng Yang, Jui‐long Hung and Zhangxi Lin
In December 2011, the National Computer Network Emergency Response Technical Team/Coordination Center of China reported the most serious user data leak in history which involved…
Abstract
Purpose
In December 2011, the National Computer Network Emergency Response Technical Team/Coordination Center of China reported the most serious user data leak in history which involved 26 databases with 278 million user accounts and passwords. After acquiring the user data from this massive information leak, this study has two major research purposes: the paper aims to reveal similarities and differences of password construction among four companies; and investigate how culture factors shape user password construction in China.
Design/methodology/approach
This article analyzed real‐life passwords collected from four companies by comparing the following attributes: password length, password constitution, top 20 frequent passwords, character frequency distributions, string similarity, and password reuse.
Findings
Major findings include that: general users in China have a weaker sense of security than those in Western countries, which reflected in the password lengths, the character combinations and the content structures; password constitution preferences are different between users in Western countries and in China, where passwords are more similar to the Pinyin context and Chinese number homonym; and password reuse is very common in China. General users tend to reuse the same passwords and IT professionals tend to engage in Seed Password reuse.
Research limitations/implications
Due to the rapid growth of Internet users and e‐commerce markets in China, many online service providers may not pay enough attention to security issues, but focus instead on market expansion. Employees in these companies may not be well trained in information security, resulting in carelessness when handling security issues.
Originality/value
This is the first study which attempts to consider culture influences in password construction by analyzing real‐life datasets.
Details