Academia.eduAcademia.edu

Credit Scoring Techniques: A Survey

2014

Credit scoring is a numerical expression of the credit worthiness of an individual. A Value with a specific creditworthiness associated is assigned to an individual. Overall objective is to determine the creditworthiness of an individual. Ability of an individual to repay is determined in the credit scoring process. The credit scoring process looks at specific criteria such as income, credit history and many others. All this is done with the intent to reduce the overall default rate thereby decreasing the overall risk of financial institutions such as banks and micro lending institutions. Several credit scoring methodologies have been proposed and implemented and are varied from statistical based methods to Artificial Intelligence based techniques.

International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 Credit Scoring Techniques: A Survey Thabiso Peter Mpofu1, Macdonald Mukosera2 1 M.Tech. Student, Department of Computer Science, School of Information Technology, Jawaharlal Nehru Technological University Hyderabad, India 2 Jawaharlal Nehru Technological University Hyderabad, School of Information Technology, Kukatpally, Hyderabad 500085, India Abstract: Credit scoring is a numerical expression of the credit worthiness of an individual. A Value with a specific creditworthiness associated is assigned to an individual. Overall objective is to determine the creditworthiness of an individual. Ability of an individual to repay is determined in the credit scoring process. The credit scoring process looks at specific criteria such as income, credit history and many others. All this is done with the intent to reduce the overall default rate thereby decreasing the overall risk of financial institutions such as banks and micro lending institutions. Several credit scoring methodologies have been proposed and implemented and are varied from statistical based methods to Artificial Intelligence based techniques. Keywords: credit scoring, credit worthiness, default rate, risk, credit scoring techniques. 1. Introduction Financial Institutions such as banks and micro lending institutions issue out loans to individuals and corporate institutions. The general public still forms the core of the financial institutions clients and thus the need for credit scoring. For the financial institutions to remain competitive and as go on as a going concern, the loans issued out have to be paid back. It is not everyone who borrows that has the capacity or ability to repay back the loans they have taken. There is therefore a certain risk associated in the issuance of loans by the financial institutions. Due diligence is therefore supposed to be observed by the financial institutions issuing out the loans. Credit scoring is one of the most successful applications and operations research techniques used in banking and finance, and is also one of the earliest financial risk management tools developed [1]. Credit scoring was developed by Fair and Isaac in the early 1960s and in simple terms corresponds to producing a score that can be used to classify customers into two separate groups: the creditworthy or”good” group (likely to repay the loan), and the non credit-worthy or”bad” group (rejected due to its high probability of defaulting) [4]. A risk assessment and analysis is supposed to be carried out in the issuance of loans. The creditworthiness of the individual to whom the loan is being issued out to is supposed to be assessed and analysed by the issuance of a credit score. The credit score issued to an individual will determine on whether to issue out the loan to the person or not. Several credit scoring techniques have been proposed and implemented over the years. The credit scoring techniques vary from statistical based techniques to Artificial Intelligence based techniques. Statistical based techniques range from Linear Discriminant Analysis to Logistic regression. Artificial Intelligence (AI) based techniques have a learning and memory ability. AI techniques are able to learn from specific data and pick up trends. They are also able to remember what has been previously learnt. AI techniques have proven to be more flexible compared to statistical based techniques which operate on a rigid set of conditions. AI based techniques such as Artificial Neural Networks, Genetic Algorithm and Paper ID: 02015243 Artificial Immune Systems will be looked at in this paper. With the advent of the global financial crisis which was largely as a result of defaulting loans or bad loans where people were unable to repay loans that they took, the area of credit scoring is therefore of the essence. 2. Factors considered in Credit Scoring In the credit scoring process, there are certain and specific criteria that are looked at. The main determinants of whether a default will take place or not can be classified into the following four areas [3], [13]. 1. Financial Indicators 2. Demographic Indicators 3. Employment Indicators 4. Behavioral Indicators 2.1 Financial Indicators Financial indicators essentially indicate financial status or position of the loan applicant in repaying the loan. Cash inflows and outflows give a realistic position or potential maximum possible monthly payment the loan applicant can pay. 2.1.1 Total assets of borrower The total assets of borrower reflect the borrower’s repayment ability. If the borrower is hit with financial troubles, assets can be used to cover the loan taken. 2.1.2 Gross income of borrower Cash inflows should be known in order to calculate the effective capacity of repayment 2.1.3 Gross income of household If borrower is unable to repay, family member can assist in the repayment. The higher the household income is the better your chances of loan acceptance Volume 3 Issue 8, August 2014 www.ijsr.net Licensed Under Creative Commons Attribution CC BY 165 International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 2.1.4 Monthly costs of household Cash outflows should be known in order to calculate the effective capacity of repayment 2.2 Demographic Indicators 2.4.2 Average balance on CA What is your average account balance over a period of time? If the average balance per month is higher than the monthly installment, the more likely for the loan to be approved. These variables typically do not have the highest consideration in the loan consideration process. 2.4.3 Loans outstanding Your current load of loans is looked at. If u have other loans that you are servicing it then decreases your chances or ability to pay back other loans 2.2.1 Age of borrower Age to determine the maturity levels 2.2.2 Sex of borrower The gender of the applicant is hardly ever looked at anymore. 2.4.4 Loans defaulted or delinquent Loans defaulted are loans taken but never fully paid back or never paid back at all. Your loan history will give an indication of the likelihood of repayment in the loan currently being applied for. 2.2.3 Marital status of borrower Depending on the purpose of the loan marital status can also be considered. If married, it can be coupled with analysis of house hold income. 2.4.5 Number of payments per year At what intervals do you make payments? 2.2.4 Number of dependants The more kids, the more the load and the more likely to default. 2.4.6 Collateral/ guarantee If loan applicant has collateral, the more likely for the loan to be approved. The loan applicant will work diligently to repay for fear of losing the asset that has been put up as collateral. 2.2.5 Home status Is the house rented or owned, if the house is owned it increases the likelihood of getting a loan relative to when house is rented 3. Credit scoring Techniques 2.2.6 District of address The area you stay also signifies your social status, the higher your social status the more likely you are to repay 3.1 Statistical based methods 2.3 Employment Indicators Employment details are analysed in this segment 2.3.1 Type of employment Are you self employed or employed somewhere. Such factors are such as the nature of work that you do are looked at. 2.3.2 Length of current employment The more the years the more it signifies on your stability. More stable people are more likely to repay. 2.2.3 Number of employments over the last x years Depending with the organizations worked for, if the change in employment moves up the ladder, the more likely you are to repay. 2.4 Behavioral Indicators The financial behavioral patterns are looked at in this segment 2.4.1 Checking account (CA) Does the applicant have an account with the financial institution issuing the loan or not? If a client has some history with a financial institution, then that bank can easily verify the behavioral activities of the loan applicant. Paper ID: 02015243 Credit Scoring techniques are divide into statistical and Artificial intelligence based methods. 3.1.1 Linear regression Linear regression is a credit scoring technique concerned with describing the relationship between a response variable and one or more independent variables.[3] 3.1.2 Discriminant analysis Discriminant analysis is a credit scoring technique developed to discriminate between two groups. It is widely agreed that the discriminant approach is still one of the most widely established techniques to classify customers as good credit or bad credit.[3] 3.1.3 Probit analysis Probit analysis is a credit scoring technique that finds coefficient values, this in turn is the probability of a unit value of a dichotomous coefficient. A linear combination of the independent variables is transformed into its cumulative probability value from a normal distribution [3]. 3.1.4 Decision trees A classification tree is a non-parametric method to analyse dependent and/or categorical variables as a function of continuous explanatory variables (Breiman et al. 1984; Arminger et al, 1997). In a classification tree, a dichotomous tree is built by splitting the records at each node based on a function of a single input. The system considers all possible splits to find the best one, and the winning sub-tree is selected based on its overall error rate or lowest cost of misclassification (Zekic-Susac et al, 2004) [3] Volume 3 Issue 8, August 2014 www.ijsr.net Licensed Under Creative Commons Attribution CC BY 166 International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 3.1.5 Logistic Regression Logistic regression (LR) is a probabilistic statistical classification model.Logistic regression is one of the most widely used techniques in credit scoring like discriminant analysis. The main differences between a logistic regression model from a linear regression model are that the outcome variable in logistic regression is dichotomous (a 0/1outcome) [3]. LR model can fit various kinds of distribution functions such as Gamble, Poisson, and normal distributions, unlike other statistical tools (e.g. discriminant analysis or ordinary linear regression). It is more suitable for the fraud detection problems. In addition, in order to increase its accuracy and flexibility, several methods have been proposed to extend the traditional binary logistic regression model including multinomial logistic regression model and logistic regression model for ordered categories [5]. 3.2 Artificial Intelligence based techniques 3.2.1 Artificial Neural Networks (ANNs) ANNs are computational models inspired by an animal's central nervous systems which are capable of machine learning as well as pattern recognition [12]. ANNs are inspired by the functionality of the nerve cells in the brain. Just like humans, ANNs can learn to recognise patterns by repeated exposure to many different examples. They are non-linear models that can classify based on pattern recognition capabilities. This gives them an advantage over conventional statistical techniques used in industry which are primarily linear. In the field of credit scoring, studies have shown that neural networks perform significantly better than statistical techniques. [1], [5]. ANN have been used in credit rating and credit scoring quite extensively as illustrated in the following papers : “Artificial Neural Networks for Corporation Credit Rating Analysis”[7], “Personal Credit Rating Assessment for the National Student Loans based on Artificial Neural Network”[8], “Personal Credit Rating Using Artificial Intelligence Technology for the National Student Loans” where a Back Propagation neural network was used [9], “Research of electronic commercial credit rating based on Neural Network with Principal Component Analysis” [10] 3.2.2 Genetic Algorithms (GAs) GAs try and replicate the natural selection process where genes are passed from one generation to the next generation.. GAs are inspired by biological evolution and offer efficient problem-solving mechanisms. A problem’s solution is evolved over many processing cycles, each time producing better solutions. Application of GAs is rapidly expanding with successful applications in finance trading, fraud detection and other areas of credit risk. Desai et al. investigated the use of GAs as a credit scoring model in a credit-union environment while Yobas et al. compared the predictive performances of four techniques, one of which is GAs, GA faired quite well coming in second place[1][5]. GAs can perform at least as good or better than traditional techniques [2]. 3.2.3 Artificial Immune Systems (AIS) AIS are an Artificial intelligence technique inspires natural immune system of the body. AIS have a learning and Paper ID: 02015243 memory component, and perform pattern recognition. AIS were implemented in a paper titled “An Artificial Immune System for Extracting Fuzzy Rules in Credit Scoring” [11]. Weka data mining software was used to classify and in turn compared with other well-known classifiers. They used the clonal selection algorithm to implement the AIS. Competitive results with high accuracy were obtained. 4. Future Work The area of credit scoring is a very critical area for the financial sector. For the financial sector to survive formidable credit scoring techniques are required.. The area of credit scoring is a very interesting area with a lot of research still going on. Algorithms and techniques which attain higher detection rates continue to be found. Particularly in the area of Artificial Intelligence (AI). AI techniques are an area to look out for particularly in the area of credit card scoring. 5. Conclusion Various credit scoring techniques were looked at. The first review looked at statistical based methods and the rest were nature inspired. The nature inspired methods were ANN, based on the model of brain, GA, based on natural selection modelled on the evolution of species, and the last one is Artificial Immune System. Some studies found statistical techniques to perform better than AI techniques, while others concluded just the opposite. References [1] Kevin Leung, France Cheong, Christopher Cheong “Consumer Credit Scoring using an Artificial Immune System Algorithm,” IEEE, pp. 3377-3384, 2007 [2] David J. Fogarty, “USING GENETIC ALGORITHMS FOR CREDIT SCORING SYSTEM MAINTENANCE FUNCTIONS,” International Journal of Artificial Intelligence & Applications (IJAIA), Vol.3, No.6, pp 18, November 2012 [3] Abdou, H. & Pointon, J. (2011) 'Credit scoring, statistical techniques and evaluation criteria: a review of the literature ', Intelligent Systems in Accounting, Finance & Management, 18 (2-3), pp. 59-88. [4] Antonio I. S. Nascimento and Germano C. Vasconcelos ,“An Experimental Investigation of Artificial Immune System Algorithms for Credit Risk Assessment Applications” In Proceedings of the WCCI IEEE World Congress on Computational Intelligence, pp 1-8, 2012 [5] Antariksha Bhaduri “Credit Scoring using Artificial Immune System Algorithms: A Comparative Study,” World Congress on Nature & Biologically Inspired Computing, pp. 1540-1540, 2009 [6] Yang Zong-chang, Kuang Hong “Credit Evaluation for Mobile Customers Using Artificial Immune [7] Liu Yijun, Cai Qiuru, Luo Ye, Qian Jin, Ye Feiyue “Artificial Neural Networks for Corporation Credit Rating Analysis,” International Conference on Networking and Digital Society,pp. 81-84, 2009\ Volume 3 Issue 8, August 2014 www.ijsr.net Licensed Under Creative Commons Attribution CC BY 167 International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Impact Factor (2012): 3.358 [8] Xiao_ jie Zhang, Jian Hu “Personal Credit Rating Assessment for the National Student Loans based on Artificial Neural Network,” International Conference on Business Intelligence and Financial Engineering, pp. 5356, 2009 [9] Jian HU “Personal Credit Rating Using Artificial Intelligence Technology for the National Student Loans,” In Proceedings of the 4th International Conference on Computer Science & Education, pp. 103106, 2009 [10] XUE Xiang-hong, XUE Xiao-feng “Research of electronic commercial credit rating based on Neural Network with Principal Component Analysis,” IEEE [11] , pp. 1-4, 2010 [12] Ehsan Kamalloo ,Mohammad Saniee Abadeh “An Artificial Immune System for Extracting Fuzzy Rules in Credit Scoring;” IEEE, pp 1-8, 2010 [13] Wikipidea, wikipedia.org, [Online]. Available: http://en.wikipedia.org/wiki/Artificial_neural_network,[ Accessed: July.15, 2014] [14] Martin VOJTEK, Evžen KO ČENDA www.risknet.de [Online]. Available: https://www.risknet.de/uploads/tx_bxelibrary/CreditSco ring.pdf, [Accessed: July.15, 2014] Author Profile Thabiso Peter Mpofu received B. Tech degree in Computer Science at Harare Institute of Technology (HIT), Zimbabwe in 2010. He is currently pursuing M. Tech Computer Science at JNTUH, India. He is a HIT staff development research fellow. His research interests are in the area of Data Mining, Network Security and Mobile Computing. Macdonald Mukosera received the B.Tech Hons degree in Computer Science from Harare Institute of Technology in 2010. During 2011-2012, he worked as a Teaching assistant at Harare Institute of Technology in the Software Engineering Department. He is now studying M Tech Computer Science at Jawaharlal Technological University Hyderabad in School of Information Technology. Paper ID: 02015243 Volume 3 Issue 8, August 2014 www.ijsr.net Licensed Under Creative Commons Attribution CC BY 168