Dataset informations
For this project we have been working on a dataset from kaggle called “Home Credit Default Risk” that can be found at :
https://www.kaggle.com/c/home-credit-default-risk
The dataset contains various informations about previous loans and the if the loans has been repayed by the borrower.
Files
We were working on two separates files :
application_train that contains numerous data about previous loans and a Target feature that will be use to train our model and that represents the ability to repay the loan.
application_test that also contains data about previous loans but without the target feature, it will be used by our models to make predictions.
Importing the files
We didn’t implement a way to automatically import the files from kaggle.
To acces the data, we simply downloaded them from kaggle and placed them in the data/raw folder