Finding predictors of corruption from European firm level survey data: a random forest approach
DOI: https://doi.org/10.3846/jbem.2026.25748Abstract
Corruption remains a significant constraint for firms in Europe, despite ongoing institutional reforms. The main goal of this paper is to obtain a list of firm-level variables that can serve as predictors of corruption perception using a machine learning approach. Drawing on agency and institutional theory, we analyse firm-level data from European firms from the World Bank Enterprise Survey (WBES). We employ a Random Forest classifier, which is well-suited for high-dimensional, categorical survey data, capturing non-linear relationships and interactions often missed by traditional models. The model achieves strong predictive performance (ROC AUC = 0.755; Accuracy = 79%). Results show that the most important prediction factors of corruption perception include firm age, size, ownership concentration, legal form, external financial audits, bribery experiences, sector, country group (EU vs. WB), innovation activity, and informal sector competition. The findings support the design of risk-based audits and encourage reforms to reduce informality through streamlined registration processes. The study contributes methodologically by applying machine learning to the field of political economy and expands theoretical insights into firm-level institutional barriers. It is one of the first research to apply Random Forest to firm-level corruption perception in both EU and Western Balkans.
Keywords:
corruption, Europe, institutions, firms, machine learning, Random ForestHow to Cite
Share
License
Copyright (c) 2026 The Author(s). Published by Vilnius Gediminas Technical University.

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
Adam, I., & Fazekas, M. (2021). Are emerging technologies helping win the fight against corruption? A review of the state of evidence. Information Economics and Policy, 57, Article 100950. https://doi.org/10.1016/j.infoecopol.2021.100950
Amin, M., & Soh, Y. C. (2019). Corruption and country size: evidence using firm-level survey data (World Bank Policy Research Working Paper, 8864). SSRN. https://doi.org/10.1596/1813-9450-8864
Audretsch, D. B., Belitski, M., Chowdhury, F., & Desai, S. (2022). Necessity or opportunity? Government size, tax policy, corruption, and implications for entrepreneurship. Small Business Economics, 58(4), 2025–2042. https://doi.org/10.1007/s11187-021-00497-2
Bartlett, W. (2023). The performance of politically connected firms in South East Europe: State capture or business capture? Post-Communist Economies, 35(4), 351–367. https://doi.org/10.1080/14631377.2023.2188694
Belitski, M., Grigore, A. M., & Bratu, A. (2021). Political entrepreneurship: Entrepreneurship ecosystem perspective. International Entrepreneurship and Management Journal, 17(4), 1973–2004. https://doi.org/10.1007/s11365-021-00750-w
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1257/jep.31.2.87
Burke, R. J., & Cooper, C. L. (Eds.). (2009). Research companion to corruption in organizations. Edward Elgar Publishing. https://doi.org/10.4337/9781849801928
Campos, N. F., Dimova, R. D., & Saleh, A. (2010). Whither corruption? A quantitative survey of the literature on corruption and growth (CEPR Discussion Paper No. DP8140). SSRN. https://doi.org/10.2139/ssrn.1716129
Cieślik, A., & Goczek, Ł. (2018). Corruption, privatisation and economic growth in post-communist countries. Europe-Asia Studies, 70(8), 1303–1325. https://doi.org/10.1080/09668136.2018.1511771
Cieślik, A., & Goczek, Ł. (2022). Who suffers and how much from corruption? Evidence from firm-level data. Eurasian Business Review, 12(3), 451–473. https://doi.org/10.1007/s40821-021-00185-x
Colonnelli, E., & Prem, M. (2022). Corruption and firms. The Review of Economic Studies, 89(2), 695–732. https://doi.org/10.1093/restud/rdab040
Colonnelli, E., Gallego, J., & Prem, M. (2022). What predicts corruption?. In P. Buonanno, P. Vanin & J. Vargas (Eds.), A modern guide to the economics of crime (pp. 345–373). Edward Elgar Publishing. https://doi.org/10.4337/9781789909333.00020
d’Agostino, G., Dunne, J. P., & Pieroni, L. (2016). Government spending, corruption and economic growth. World Development, 84, 190–205. https://doi.org/10.1016/j.worlddev.2016.03.011
Dávid-Barrett, E., & Fazekas, M. (2020). Grand corruption and government change: An analysis of partisan favoritism in public procurement. European Journal on Criminal Policy and Research, 26(4), 411–430. https://doi.org/10.1007/s10610-019-09416-4
Decarolis, F., & Giorgiantonio, C. (2022). Corruption red flags in public procurement: New evidence from Italian calls for tenders. EPJ Data Science, 11(1), Article 16. https://doi.org/10.1140/epjds/s13688-022-00325-x
Dimant, E., & Tosato, G. (2018). Causes and effects of corruption: What has past decade’s empirical research taught us? A survey. Journal of Economic Surveys, 32(2), 335–356. https://doi.org/10.1111/joes.12198
Djankov, S., La Porta, R., Lopez-de-Silanes, F. & Shleifer, A. (2002). The regulation of entry. The Quarterly Journal of Economics, 117(1), 1–37. https://doi.org/10.1162/003355302753399436
Doria, L. M., Doria, F. F., Figueiredo, P., Sampaio, A., & Sampaio, R. R. (2022). A machine learning approach on the problem of corruption. International Journal of Advanced Engineering Research and Science, 9(3), 277–282. https://dx.doi.org/10.22161/ijaers.93.33
Egger, P., & Winner, H. (2005). Evidence on corruption as an incentive for foreign direct investment. European Journal of Political Economy, 21(4), 932–952. https://doi.org/10.1016/j.ejpoleco.2005.01.002
European Commission. (2023). 2023 Eurobarometer survey: Business’ attitudes towards corruption.
European Commission. (2024). Directorate-General for Migration and Home Affairs. High-risk areas of corruption in the EU – A mapping and in-depth analysis. Publications Office of the European Union. https://data.europa.eu/doi/10.2837/5907939
Enste, D. H., & Heldman, C. (2018). The consequences of corruption. In B. Warf (Ed.) Handbook on the geographies of corruption (pp. 106–119). Edward Elgar Publishing. https://doi.org/10.4337/9781786434753.00011
Farooq, O., & Shehata, N. F. (2018). Does external auditing combat corruption? Evidence from private firms. Managerial Auditing Journal, 33(3), 267–287. https://doi.org/10.1108/MAJ-08-2017-1634
Fazekas, M., & Ferrali, R. (2023). Advances in measuring corruption and agenda for the future (Working Paper GTI-WP/2023:01). Government Transparency Institute. https://www.govtransparency.eu/wp-content/uploads/2023/03/Fazekas-Ferrali_Corr-measurement_article_WPformatted_final.pdf
Fazekas, M., Sberna, S., & Vannucci, A. (2022). The extra‐legal governance of corruption: Tracing the organization of corruption in public procurement. Governance, 35(4), 1139–1161. https://doi.org/10.1111/gove.12648
Fisman, R., Guriev, S., Ioramashvili, C., & Plekhanov, A. (2024). Corruption and firm growth: Evidence from around the world. The Economic Journal, 134(660), 1494–1516. https://doi.org/10.1093/ej/uead100
García‐Gómez, C. D., Bilyay‐Erdogan, S., Demir, E., & Díez‐Esteban, J. M. (2025). A new piece in the puzzle: Corruption and financial constraints – evidence from European firms. Business Ethics, the Environment & Responsibility, 35(2), 693–716. https://doi.org/10.1111/beer.12815
Glaeser, E. L., & Saks, R. E. (2006). Corruption in America. Journal of Public Economics, 90(6–7), 1053–1072. https://doi.org/10.1016/j.jpubeco.2005.08.007
Glaeser, E. L., La Porta, R., Lopez-de-Silanes, F., & Shleifer, A. (2004). Do institutions cause growth?. Journal of Economic Growth, 9, 271–303. https://doi.org/10.1023/B:JOEG.0000038933.16398.ed
Goel, R. K., Mazhar, U., & Ram, R. (2021). Size matters: corruption perceptions versus corruption experiences by firms (CESifo Working Paper, No. 9221). SSRN. https://doi.org/10.2139/ssrn.3898319
Graeff, P., & Svendsen, G. T. (2013). Trust and corruption: The influence of positive and negative social capital on the economic development in the European Union. Quality & Quantity, 47, 2829–2846. https://doi.org/10.1007/s11135-012-9693-4
Gray, C. W., Hellman, J. S., & Ryterman, R. (2004). Anticorruption in transition 2: Corruption in enterprise-state interactions in Europe and Central Asia, 1999–2002 (Vol. 2). The World Bank. https://openknowledge.worldbank.org/server/api/core/bitstreams/cfcbf274-136b-58e6-a587-048e18310f9f/content
Grzymala-Busse, A. (2007). Rebuilding leviathan party competition and state exploitation in post-communist democracies. Cambridge University Press. https://doi.org/10.1017/CBO9780511618819
Gugiu, M. R., & Gugiu, P. C. (2016). Economic crisis and corruption in the European Union. Journal of Methods and Measurement in the Social Sciences, 7(1), 1–22. https://doi.org/10.2458/v7i1.19398
Gupta, S., Davoodi, H., & Alonso-Terme, R. (2002). Does corruption affect income inequality and poverty?. Economics of Governance, 3, 23–45. https://doi.org/10.1007/s101010100039
Huang, C. J. (2016). Is corruption bad for economic growth? Evidence from Asia-Pacific countries. The North American Journal of Economics and Finance, 35, 247–256. https://doi.org/10.1016/j.najef.2015.10.013
Jackson, E. (2023). Informality as a driving force for corruption in economy: A neoclassical simulation. Economic Analysis Letters, 2, 60–65. https://doi.org/10.58567/eal02020008
Jaggi, B., Allini, A., Ginesti, G., & Macchioni, R. (2021). Determinants of corporate corruption disclosures: Evidence based on EU listed firms. Meditari Accountancy Research, 29(1), 21–38. https://doi.org/10.1108/MEDAR-11-2019-0616
Jensen, N. M., Li, Q., & Rahman, A. (2010). Understanding corruption and firm responses in cross-national firm-level surveys. Journal of International Business Studies, 41, 1481–1504. https://doi.org/10.1057/jibs.2010.8
Khan, S. (2022). Investigating the effect of income inequality on corruption: New evidence from 23 emerging countries. Journal of the Knowledge Economy, 13(3), 2100–2126. https://doi.org/10.1007/s13132-021-00761-6
Klitgaard, R. E. (1988). Controlling corruption. University of California Press.
Knack, S. F. (2006). Measuring corruption in Eastern Europe and Central Asia: A critique of the cross-country indicators (World Bank Publications, Vol. 3968). https://doi.org/10.1596/1813-9450-3968
Köbis, N., Starke, C., & Rahwan, I. (2022). The promise and perils of using artificial intelligence to fight corruption. Nature Machine Intelligence, 4(5), 418–424. https://doi.org/10.1038/s42256-022-00489-1
Lambsdorff, J. G. (2006). Causes and consequences of corruption: What do we know from a cross-section of countries. International Handbook on the Economics of Corruption, 1, 3–51. https://doi.org/10.4337/9781847203106.00007
Lambsdorff, J. G. (2007). The institutional economics of corruption and reform: Theory, evidence and policy. Cambridge University Press. https://doi.org/10.1017/CBO9780511492617
Lima, M. S. M., & Delen, D. (2020). Predicting and explaining corruption across countries: A machine learning approach. Government Information Quarterly, 37(1), Article 101407. https://doi.org/10.1016/j.giq.2019.101407
Mullainathan, S., & Spiess, J. (2017). Machine learning: An applied econometric approach. Journal of Economic Perspectives, 31(2), 87–106. https://doi.org/10.1257/jep.31.2.87
Mungiu-Pippidi, A. (2013). The good, the bad and the ugly: Controlling corruption in the European Union. Advanced Policy Paper for Discussion in the European Parliament, 9, 61–63. https://www.againstcorruption.eu/wp-content/uploads/2013/03/ANTICORRP-Policy-Paper-on-Lessons-Learnt-1_protected1.pdf
Murray, G. & Scime, A. (2010). Microtargeting and electorate segmentation: Data mining the American National Election Studies. Journal of Political Marketing, 9, 143–166. https://doi.org/10.1080/15377857.2010.497732
Nguyen, T. D. (2020). Does firm growth increase corruption? Evidence from an instrumental variable approach. Small Business Economics, 55, 237–256. https://doi.org/10.1007/s11187-019-00160-x
Organisation for Economic Co-operation and Development. (2008). Corruption: A glossary of international standards in criminal law. OECD Publishing. www.oecd.org/daf/antiribery/41194428.pdf
Poltoratskaia, V., & Fazekas, M. (2024). Data analytics for anti-corruption in public procurement. In S. Williams, & J. Tillipman (Eds.), Routledge handbook of public procurement corruption (pp. 42–59). Routledge. https://doi.org/10.4324/9781003220374-6
Riaz, M. F., Cherkas, N., & Leitão, J. (2018). Corruption and innovation: Mixed evidences on bidirectional causality. Journal of Applied Economic Sciences, 13(2(56)), 378–384.
Rose-Ackerman, S. (2017). When is corruption harmful?. In A. J. Heidenheimer & M. Johnston (Eds.), Political corruption: Concepts and contexts (pp. 353–372). Routledge. https://doi.org/10.4324/9781315126647-32
Rusch, J. (2021). Is there a role for machine learning in anti-corruption risk and compliance?. Faculty of Law Blogs, University of Oxford. https://blogs.law.ox.ac.uk/business-law-blog/blog/2021/06/there-role-machine-learning-anti-corruption-risk-and-compliance
Saha, S., & Gounder, R. (2013). Corruption and economic development nexus: Variations across income levels in a non-linear framework. Economic Modelling, 31, 70–79. https://doi.org/10.1016/j.econmod.2012.11.012
Shleifer, A., & Vishny, R. W. (1993). Corruption. The Quarterly Journal of Economics, 108(3), 599–617. https://doi.org/10.2307/2118402
Svensson, J. (2003). Who must pay bribes and how much? Evidence from a cross section of firms. Quarterly Journal of Economics, 118(1), 207–230. https://doi.org/10.1162/00335530360535180
Tanzi, V. (1998). Corruption around the world: Causes, consequences, scope, and cures. Staff Papers, 45(4), 559–594. https://doi.org/10.2307/3867585
Transparency International. (2021). Addressing corruption as a driver of democratic decline. Policy position. https://files.transparencycdn.org/images/2021-Addressing-corruption-as-driver-of-democratic-decline-Summit-for-Democracy-PositionPaper-EN.pdf
Uslaner, E. M. (2017). Political trust, corruption, and inequality. In S. Zmerli & T. W. Van der Meer (Eds.), Handbook on political trust (pp. 302–315). Edward Elgar Publishing. https://doi.org/10.4337/9781782545118.00030
Varian, H. R. (2014). Big data: New tricks for econometrics. Journal of Economic Perspectives, 28(2), 3–28. https://doi.org/10.1257/jep.28.2.3
Warner, C. (2011). The best system money can buy: Corruption in the European Union. Cornell University Press. https://doi.org/10.1093/bjc/azn061
World Bank. (1997). Helping countries combat corruption: The role of the World Bank. https://documents1.worldbank.org/curated/en/799831538245192753/pdf/Helping-Countries-Combat-Corruption-The-Role-of-the-World-Bank.pdf
World Bank. (2023). Evidence on public procurement from firm-level surveys: Global statistics from the World Bank Enterprise Surveys and a novel public procurement survey module. Equitable Growth, Finance & Institutions Insight. https://documents1.worldbank.org/curated/en/099122723173512944/pdf/P17503317d58130361a8391945d653eb092.pdf
World Bank. (n.d.). World Bank enterprise surveys [database]. https://www.enterprisesurveys.org/en/enterprisesurveys
View article in other formats
Published
Issue
Section
Copyright
Copyright (c) 2026 The Author(s). Published by Vilnius Gediminas Technical University.
License

This work is licensed under a Creative Commons Attribution 4.0 International License.