Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis - Archive ouverte HAL Access content directly
Master Thesis Year : 2021

Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis

Abstract

In this report, we will attempt to improve a failure prediction model that is currently used by the French Ministry of Economy and Finance. First, we studied several models and benchmarked them in order to compare our results with those of the articles studied. As a result, we were able to select four models that stood out from the rest, and that we should improve as much as possible. Secondly, we decided to look at the data itself. We realized that our dataset was static. That is, for each row in our table, we had data only for a time T. We therefore decided to add variables, which we will call temporal features, in order to take temporality into account in the model. This addition was more than conclusive, because it allowed us to obtain excellent results that had not been achieved until then. Afterwards, we will proceed with this new dataset. In order to further improve our results, we have started to search by sector of activity. We separated our dataset into several datasets, the separation being done on the sector of activity of the companies. In doing so, we realized that if we applied different models for each business sector, we would get much better results. Depending on the operational needs, we conclude that this is an area to consider seriously. Finally, we decided to look at the importance of features in our models. To do this, we looked at the importance of the variables in the classifiers, and we realized that only a fraction of the input variables were actually useful, and among those, our temporal features that we added. It would therefore be appropriate to reduce the number of input variables, and to go even further in temporalizing the model on the small remaining dataset.
Fichier principal
Vignette du fichier
Article_SF (2).pdf (970.99 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03523010 , version 1 (12-01-2022)

Licence

Public Domain

Identifiers

  • HAL Id : hal-03523010 , version 1

Cite

Thomas Meunier. Automatic or semi-automatic detection of companies in difficulty or weakened by the crisis. Artificial Intelligence [cs.AI]. 2021. ⟨hal-03523010⟩
49 View
33 Download

Share

Gmail Facebook Twitter LinkedIn More