Financial distress prediction based on ensemble feature selection and improved stacking algorithm
Abstract
Purpose
While the Chinese securities market is booming, the phenomenon of listed companies falling into financial distress is also emerging, which affects the operation and development of enterprises and also jeopardizes the interests of investors. Therefore, it is important to understand how to accurately and reasonably predict the financial distress of enterprises.
Design/methodology/approach
In the present study, ensemble feature selection (EFS) and improved stacking were used for financial distress prediction (FDP). Mutual information, analysis of variance (ANOVA), random forest (RF), genetic algorithms, and recursive feature elimination (RFE) were chosen for EFS to select features. Since there may be missing information when feeding the results of the base learner directly into the meta-learner, the features with high importance were fed into the meta-learner together. A screening layer was added to select the meta-learner with better performance. Finally, Optima hyperparameters were used for parameter tuning by the learners.
Findings
An empirical study was conducted with a sample of A-share listed companies in China. The F1-score of the model constructed using the features screened by EFS reached 84.55%, representing an improvement of 4.37% compared to the original features. To verify the effectiveness of improved stacking, benchmark model comparison experiments were conducted. Compared to the original stacking model, the accuracy of the improved stacking model was improved by 0.44%, and the F1-score was improved by 0.51%. In addition, the improved stacking model had the highest area under the curve (AUC) value (0.905) among all the compared models.
Originality/value
Compared to previous models, the proposed FDP model has better performance, thus bridging the research gap of feature selection. The present study provides new ideas for stacking improvement research and a reference for subsequent research in this field.
Keywords
Acknowledgements
This work was supported by the Philosophy and Social Science Research Planning Project of Heilongjiang Province (No. 20JYB039) and the National Natural Science Foundation of China (No. 72131005 and 72121001).
Citation
Wu, C., Chen, X. and Jiang, Y. (2024), "Financial distress prediction based on ensemble feature selection and improved stacking algorithm", Kybernetes, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/K-08-2023-1428
Publisher
:Emerald Publishing Limited
Copyright © 2024, Emerald Publishing Limited