Home icon
SME Loan Default Prediction with Machine Learning: An Empirical Study on Bank-Branch-Level Data
by Augoustinos Dimitras | Konstantinos Gkillas | Georgios Peppas | Maria Tantoula

In the context of post-crisis economy, where small and medium-sized enterprises (SMEs) remain central to economic recovery, commercial banks face persistent challenges in accurately and efficiently assessing credit risk. Machine learning models offer strong predictive potential, yet their adoption in real-world lending is hindered by issues such as severe class imbalance, limited interpretability, and difficulties integrating them into existing workflows. This study addresses these challenges by providing empirical evidence that ensemble learning methods, when combined with resampling and cost-sensitive techniques, can improve loan default detection in highly imbalanced settings. Using a unique, proprietary dataset of SME loans collected from the branch of a large systemic Greek bank, we evaluate Random Forests, Extra Trees, AdaBoost, CatBoost, XGBoost and Multi-Layer Perceptron (MLP), within a stratified k-fold cross-validation framework. Our findings show that hybrid configurations, specifically Random Forest with ADASYN and XGBoost with SMOTEENN, deliver superior predictive performance. Complementary SHAP analysis identifies the most influential borrower, loan, and sector characteristics, thus enabling branch managers to make more accurate, transparent, and context
sensitive lending decisions.

Jel Classification: C45, C53, G21

Conference registration is required in order to view papers !
HELLENIC 
OPEN
UNIVERSITY
The International Conference on Business & Economics of the Hellenic Open University (ICBE - HOU) aims to bring together leading scientists and researchers, affiliated with the HOU, to present, discuss and challenge their ideas opinions and research findings about all disciplines of Business Administration and Economics.
menuchevron-downcross-circle