Metodološki zvezki, Vol. 1, No. 1, 2004, 109-118 Maximum Likelihood Estimation of Lorenz Curves using Alternative Parametric Model Ibrahim M. Abdalla1 and Mohamed Y. Hassan2 Abstract In this paper the Lorenz curve proposed by Abdalla and Hassan is fitted to grouped income data of Abu-Dhabi Emirate family expenditure survey, 1997, using Maximum likelihood estimation method and assuming that income shares follow a Dirichlet distribution. Employing Abdalla and Hassan's together with some known parametric Lorenz models, estimates based on the maximum likelihood are compared with those based on nonlinear least squares techniques. Given the nature of the distribution of income and the distinct characteristics of Abu-Dhabi Emirate, it is evident that the maximum likelihood estimation approach produces comparable parameter estimates to the non-linear least squares techniques, but higher standard errors and less goodness of fit. Under the two estimation techniques, the model proposed by Abdalla and Hassan performed well better than some well known parametric models in the literature. 1 Introduction The Lorenz curve is a graphical representation, usually adopted to depict the distribution of income and wealth in a population. Horizontally, it displays the cumulative proportion of the population, say p, arranged in increasing order of income. Vertically, it measures the proportion of income, ?, accruing to any particular fraction of the population thus arranged. Direct estimation of the Lorenz curve is based on linear interpolation using empirical data presented in income group format. This representation assumes a 1 College of Business and Economics, United Arab Emirates University, P. O. Box 17555; i.abdalla@uaeu.ac.ae 2 College of Business and Economics, United Arab Emirates University, P. O. Box 17555; Myusuf@uaeu.ac.ae 110 Ibrahim M. Abdalla and Mohamed Y. Hassan complete uniformity of income distribution within each group. Consequently the estimated curve converges to the true Lorenz curve as the number of income groups increases. However, various models for parametric Lorenz curves have been suggested in the literature, including Kakwani (1980), Basmann et al. (1990), Ortega et al. (1991) and Chotikapanich (1993). Other models are based on well known income distribution models such as lognormal and gamma. Kakwani and Podder (1976) noted that Lorenz curves, driven from known density functions, hardly give reasonably good fit to actual data. By means of goodness of fit tests, Gini ratios , and estimated income shares, it is evident that results of employing different models on different income data for different countries are in sharp contrast (Chotikapanich, 1993, Sarabia et al., 1999 and Cheong, 1999). Some models give a good approximation to the data over the middle of the distribution, but not well over the tails. The opposite happens when analyzing some other models. Arguably, these differences may be attributable to differences in the nature of income distributions and distinct characteristics of different countries. The objective of this paper is to estimate the Lorenz curve for income data from the most recent Abu-Dhabi Emirate family expenditure survey, Ministry of Planning (1997), using the maximum likelihood approach suggested by Chotikapanich and Griffiths (2002) and compare estimates with those based on the non-linear least square techniques. Along with the model proposed by Abdalla and Hassan (2004), additional models are evaluated and corresponding Gini concentration ratios are assessed in terms of accuracy of fit. The plan of this paper is as follows: In Section 2 some known parametric Lorenz curves (models), including the one proposed by Abdalla and Hassan (2004) are presented. In Sections 3 and 4 estimation methods are outlined. Goodness of fit and empirical results are discussed in Sections 5 and 6. Section 7 is dedicated to some concluding remarks. 2 Parametric Lorenz Curves Suppose that population units, individuals or families, are ordered according to increasing income. Thus income distribution data can take the form ( pi ,hi ) for each of T income groups, where pi is the cumulative proportions of population units associated with group i , with pT = 1, and hi is the corresponding cumulative proportions of income, with hT = 1. Dropping the subscripts and utilizing these observations, a parametric Lorenz curve is given as h= L( p;q), where q is a vector of unknown parameters. A function is a Lorenz curve if it satisfies the following conditions: Maximum Likelihood Estimation of Lorenz Curves… 111 (i) L(0; q) = 0, (ii) L(1; q) = 1, (iii) L(p;q) is twice continuously differentiable monotone increasing function; therefore: L¢(p;q)³0, L¢¢(p;q)³0. Abdalla and Hassan (2004) suggest the model defined by L(p) = pa[1 - (1 -p) de ] a ³0, 0 £ b £d £ 1 (21) The Gini concentration ratio (RG) associated with the different parametric models are calculated based on the definition 1 RG = 1- 2? L( p;q)dp 0 (2.2) In particular, the Gini ratio associated with Abdalla and Hassan model is reduced to the following closed form RG= 1 -2\pa[ 1-(1-p) ebpdp 0 = a-1 ¥ G(a+ j + 1)G(d +1)b j (23) a+1 j=0G(j + 1)G(a + d + j + 2) Using Abu-Dhabi Emirate income data set, 1997, the performance of the parametric model suggested by Abdalla and Hassan, LAH, is compared with different alternatives. The first alternative is the model LC introduced by Chotikapanich (1993). It is a simple one parameter model: eap-1 (2.4) LC( p;a) =------ a>0 e a - 1 Ortega et al. (1991) suggested the model LO which is a special case of Abdalla and Hassan model LAH when b = 0; LO(p;a,d) = pa[1 - (1 - p) ] a³0, 00, 0