Garbage in, Garbage out: A Theory-Driven Approach to Improve Data Handling in Supervised Machine Learning
ISBN: 978-1-80455-365-7, eISBN: 978-1-80455-364-0
Publication date: 18 January 2023
Abstract
Machine learning (ML) has recently gained momentum as a method for measurement in strategy research. Yet, little guidance exists regarding how to appropriately apply the method for this purpose in our discipline. We address this by offering a guide to the application of ML in strategy research, with a particular emphasis on data handling practices that should improve our ability to accurately measure our constructs of interest using ML techniques. We offer a brief overview of ML methodologies that can be used for measurement before describing key challenges that exist when applying those methods for this purpose in strategy research (i.e., sample sizes, data noise, and construct complexity). We then outline a theory-driven approach to help scholars overcome these challenges and improve data handling and the subsequent application of ML techniques in strategy research. We demonstrate the efficacy of our approach by applying it to create a linguistic measure of CEOs' motivational needs in a sample of S&P 500 firms. We conclude by describing steps scholars can take after creating ML-based measures to continue to improve the application of ML in strategy research.
Keywords
Citation
Hyde, S.J., Bachura, E. and Harrison, J.S. (2023), "Garbage in, Garbage out: A Theory-Driven Approach to Improve Data Handling in Supervised Machine Learning", Hill, A.D., McKenny, A.F., O'Kane, P. and Paroutis, S. (Ed.) Methods to Improve Our Field (Research Methodology in Strategy and Management, Vol. 14), Emerald Publishing Limited, Leeds, pp. 101-132. https://doi.org/10.1108/S1479-838720220000014006
Publisher
:Emerald Publishing Limited
Copyright © 2023 by Emerald Publishing Limited