Skip to content

atlasrule/Lending-Risk-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Customer Lending Risk Classification



Introduction

This is a personal study made on credit dataset. The problem is binary classification, to classify bank customers as their lending ratings.


Data Preprocessing

First I've analysed the dataset by reading through the CSV. Checked for any unknown values, the data had no empty cells. Used label encoding


Dataset Observations

Examined the correlation matrix, this helped tweaking the bayesian network nodes.

The target class is imbalanced, while there is 700 'good' credit samples, there is only 300 'bad' credit samples included.

The data didn't have any null/NaN/? values.

All data were categorical, while evaluation, I took that into account.

The dataset was lack of any single females. This particular data must be gathered in case of any single female prediction need.


Tested algorithms:

  • Categorical Naive Bayes
  • DecisionTreeClassifier
  • GaussianNB
  • KNeighborsClassifier
  • LGBMClassifier
  • LinearDiscriminantAnalysis
  • LogisticRegression
  • RandomForestClassifier
  • Support Vector Classifier
  • XGBClassifier
  • Bayesian Network

About

Customer Lending Risk Classification

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published