CSV File Description
The Data folder contains the following data:
- Loan.csv and Borrower.csv: data sets with 100K rows of the simulated data used to build the end-to-end Loan Credit Risk solution.
- Loan_Prod.csv and Borrower_Prod.csv: data sets with about 20 rows of the simulated data used in the Production pipeline.
Loan.csv and Loan_Prod.csv contain the following fields:
Field | Type | Description |
---|---|---|
loanId | Integer | Unique Id of the loan |
memberId | Integer | Unique Id of the borrower |
date | Date | a) Historical data: the loan approval date b) Production data: the loan application date. Format: M/D/YYYY (e.g. 3/9/2016, 3/13/2016) |
purpose | String | Purpose of the loan e.g., debtconsolidation |
isJointApplication | String | Flag about the nature of the application (joint or individual) |
loanAmount | Float | Total amount to be borrowed |
term | String | Number of months of payments on the loan e.g., 36 months |
interestRate | String | Interest Rate on the loan e.g., 7.21 % |
monthlyPayment | Float | Monthly payment owed by the borrower |
grade | String | Loan grade (risk-related) e.g. A2 |
loanStatus | String | Status of the loan (Label) Values taken: Current, Charged Off (This field is not present in the Loan_Prod.csv file) |
Borrower.csv and Borrower_Prod.csv contain the following fields:
Field | Type | Description |
---|---|---|
memberId | Integer | Unique Id of the borrower |
residentialState | String | Residential state of the borrower e.g., MA |
yearsEmployment | String | Number of years of employment of the borrower e.g., 10+ years |
homeOwnership | String | Home ownership status of the borrower Values taken: own, rent, mortgage |
annualIncome | Float | Annual income of the borrower |
incomeVerified | String | Flag indicating if the income was verified or not |
dtiRatio | Float | Debt to income ratio: borrower’s total monthly debt payments (without mortgage and the requested loan) divided by the monthly income. It is expressed in percentage |
lengthCreditHistory | Integer | Length of the credit history in terms of years |
numTotalCreditLines | Integer | Total number of credit lines in the borrower's credit file |
numOpenCreditLines | Integer | Number of open credit lines in the borrower's credit file |
numOpenCreditLines1Year | Integer | Number of credit lines in the borrower's credit file that were opened in the past year |
revolvingBalance | Float | Total credit revolving balance |
revolvingUtilizationRate | Float | Amount of credit the borrower is using relative to all available revolving credit e.g., 7.30% |
numDerogatoryRec | Integer | Number of derogatory public records (includes tax liens, bankruptcies, and other judgements such as civil lawsuits) |
numDelinquency2Years | Integer | Number of 30+ days past-due incidences of delinquency in the borrower's credit file for the past 2 years |
numChargeoff1year | Integer | Number of charge-offs within 1 year |
numInquiries6Mon | Integer | Number of inquiries in past 6 months |