Scientific paper Randi} Index, Irregularity and Complex Biomolecular Networks Ernesto Estrada Department of Mathematics & Statistics, Department of Physics, and Institute of Complex Systems at Strathclyde, University of Strathclyde, Glasgow G1 1XH, UK * Corresponding author: E-mail: Ernesto.estrada@strath.ac.uk Received: 21-06-2010 This paper is dedicated to Professor Milan Randi} on the occasion of his 80th birthday Abstract A new formulation of Randic index is carried out as a minimization of a quadratic form which involves the Laplacian matrix of a graph. Using this formulation it is easy to realize that Randic index is useful in defining a new index of irregularity, which is then formulated here. A new context for the study of irregularity of graphs is advanced as a necessity for studying complex (biological) networks. We analyze both Randic and irregularity indices for random networks with Poisson and power-law degree distributions. Then, we analyze the irregularity of 10 protein-protein interaction networks in different organisms ranging from 50 to 3000 nodes. Finally, some 'ruminations' about the elegance and applicability of Randic index are remarked. Keywords: Complex networks; scale-free networks; protein-protein interaction networks; graph irregularity; quadratic forms; Laplacian matrix 1. Introduction Chemistry is defined as the science that studies the "composition, structure, properties, and reactions of matter, especially of atomic and molecular systems".1 A great deal of efforts was put forward in the 20th century for understanding the structure and properties of isolated molecular species. Quantum chemistry, molecular mechanics and molecular graph theory are examples of these fruitful attempts to express molecular structure in an abstract way. Such (mathematical) abstractions allow the description, and more importantly the prediction, of many molecular properties, ranging from physico-chemical to complex biological behaviours. However, 21th century has presented a new and more difficult challenge to chemistry. The availability of new experimental techniques allows mapping the interactions of thousands of molecules in cellular environments. As a consequence systems biology was born.2 Systems biology is an interdisciplinary approach to study complex interactions in biological systems, resting basically on the study of emergent properties in complex biological networks. Among their targets we can mention genetic networks, transcription net- works, protein-protein interaction networks, and metabolic networks. More recently, the term system chemistry3 has been coined, which goes beyond biological networks to cover the whole spectrum of emergent properties in chemical networks. Among the approaches developed in the 20th century to study properties of 'isolated' molecules only a tiny fraction is useful to study the new problems posted by systems biology and chemistry. It goes beyond any imagination to ask how quantum chemistry or even molecular mechanics can deal with more than 6,000 interactions between more than 2,700 proteins in the main connected component of the human protein-protein interaction network. In contrast, methods based on network theory, which rest basically on the study of topological properties of these interaction networks, has become the standard analysis in systems biology and chemistry. In 1975 Milan Randic proposed one of such topolo-gical invariants to study properties of small isolated organic molecules.4 This index was proposed as a branching index and it is nowadays known as the Randi} index of a graph.4 The Randic index 1R-1/2 of a graph having |£] = m undirected links is given by4 (1) Here we designate the degree of a node, i.e., number of links attached to it, by k. This index was then generalized by Bollobas and Erdos by changing the exponent -1/2 to any real value exponent a.5 The Randic index has been intensively studied in mathematics and widely applied in chemistry and biomolecular sciences.6-10 Here we propose a new interpretation for this index and a new context for applications in systems biology and chemistry. irregular ones. A comparison between these indices has been published by Gutman et al.11 Consequently, we are urged to define a new irregularity index from some sort of 'first-principles'. We need an index minimized by regular graphs and maximized for starlike ones. The reason for this necessity will become clear in the next section. Let us restate the problem of finding an irregularity index as that of minimizing the quantity i i i y (5) 2. Network Irregularity and Randic Index The study of network irregularity can be traced back to the seminal paper of Collatz and Sinogowitz in 1951.11 These authors raised the question of finding the most irregular graphs of a given size by using the following measure (2) where is the principal eigenvalue of the adjacency matrix and k is the average degree. Clearly this index is zero for regular graphs and Collatz and Sinogowitz conjectured that stars maximize this index.11 They showed that in fact this is the case for graphs with up to 5 nodes. However, Cvetkovic and Rowlinson12 rejected the conjecture of Collatz and Sinogowitz11 by constructing several families of graphs having larger values of CS(G) than the corresponding stars of the same size. Another irregularity index was proposed in 1992 by Bell as the variance of node degrees k;:13 (k-kj. (3) n This index was proposed for the first time as a measure of centralization by Snijders14 based on the intuition that "centralization is synonymous with the dispersion or heterogeneity" of a node. It takes the minimum value for regular graphs as there is no heterogeneity at all in their degrees. However, the maximum value depends on the number of nodes in the network. Another measure of graph irregularity was proposed by Albertson as:15 A(G)= Y[V iiJkk (4) Recently, Hansen and Melot have shown that the Albertson index is not always maximized for star graphs but it is in a particular class of split graphs consisting of a clique, an independent set, and some links joining a node in the clique to another one in the independent set. Although the three indices are minimized for regular graphs they identify different classes of graphs as the most The intuition behind this definition of irregularity is that for regular networks this quantity is equal to zero as in the case of the other irregularity indices previously defined in the literature. However, as the difference in the degrees of adjacent nodes increases, the index increases. For instance, let us consider a node of degree one connected to a node of degree k. Then, as k increases the term inside the bracket tends to one. If we consider the general case I^r-k/J, (6) we can see these indices as a generalization of the Albert-son index of irregularity15 in which instead of using a modulus function we use the square one. Now, let us define the Laplacian matrix L of a network whose entries are given by18 ' k, for i = j, -1 for i~j, 0 otherwise, LV = where i ~j stands for pairs of adjacent nodes. The matrix L can be obtained as L = K-A, where K is a diagonal matrix of degrees and A is the adjacency matrix of the network, whose A., is one if, and only if, the corresponding nodes are joined by a link or zero otherwise. This discrete Laplacian has been intensively studied in algebraic graph theory and the reader is referred to the excellent reviews for details.19, 20 Let |ka } = (k", kav k") represents a column vector where ki is the degree of the node i. Then, it is easy to realize that expression (6) can be stated as a quadratic form of the Laplacian matrix: (7) The second term in the right hand side of eq. (1) is twontimes the generalized Randic index Ra.8-10 Let 0Ra = X (kj)a be designated as the generalized zeroth-or-der Randic index.s~w It is straightforward to realize that the quantity to be minimized is the difference between the R2a+1 index and twice the Randic index lRa, R2a + x - 21Ra. I i.jzE 1 W v = n-TR (8) Now, let us consider some known facts about the 1R-1/2 index. Among connected networks with n nodes, the star Sn attains the minimum 1R-1/2 index and the regular ones have maximum 1R-1/2. Then, among connected networks of size n, the original Randic index is bounded as: (9) where the lower bound is reached for the star Sn and the upper bound is attained for any regular network with n nodes irrespective of its degree. Then, we can normalize the Randic index as follows (10) which gives a zero value for regular graphs and one for the star. It is straightforward to realize that (11) which clearly identifies p(G) as a normalized irregularity index for a network. We would like to close this section by developing a little bit further the connection between the Randic index and the graph Laplacian. Let fyj be an orthonormal eigenvector of the Laplacian matrix associated with the ^ eigenvalue. We recall that the Laplacian matrix is positive semidefinite. Let COSÖ, = „ 1 I. -Mi (12) be the angle formed between the orthonormal eigenvector fyj and the vector k-1/2 previously defined. The Euclidean norm ||ka|| can be written as ||ka|| = V0R2a. Then, the generalized Randic index can be written as ~2 (13) In particular, the original Randic index is expressed as follows ]R-\i2 ~2 n-%^MjCOS2 ffj (14) The term cos2 0j represents the "contribution" of the normalized degree to the corresponding eigenvector (or vice versa). For instance, cos2 6j = 0 means that the vector ka is perpendicular to the corresponding eigenvector, and no "duplicated" information is contained in both vectors. 3. Why Does Irregularity Matter? The study of complex networks representing systems in disparate real-world contexts ranging from biological to technological systems has become an important area of interdisciplinary research.21-23 These complex networks share many universal topological characteristics such as small-worldness,24 scale-freeness,25 the existence of network motifs26 and self-similarity characteristics.27 In particular, the power-law degree distribution of many real-world networks contrasts with the regularity observed in random models like the one proposed by Erdos and Renyi.28 Let us consider the property of scale-freeness in more detail. Let p(k) = n(k)/n be the probability of randomly selecting a node of degree k in a network, where n(k) is the number of nodes having degree k in a network of size n. Then, a plot of p(k) versus k represents the degree distribution for the network.22,23 A random network displays a Poisson degree distribution of the form P(kh k\ (15) where k is the average degree of the network. Most biological networks behave in a completely different way. Instead of having such kind of 'democratic' distribution of node degrees, they display a very 'egoistic' one, in which very few nodes have very large degrees and most of the nodes has relatively low degrees. In mathematical terms, the degree distribution of these networks follows a power-law of the form:22, 23 p{k)~k- (16) In Fig. 1 we illustrate the plots of Poisson and power-law degree distributions for a hypothetical network. The term 'scale-free' describes the fact that by scaling the degree by a constant factor c, only produces a proportionate scaling of the function: p{k,c)= A(ckY = Ac-r-pik). (17) Power-law relations are usually represented in a logarithmic scale, where we obtain a straight line, lnp(k) = -fink + lnA, where -/is the slope and lnA the intercept of the function. However, strictly speaking there are not very many networks displaying a 'perfect' power-law degree distri- Poisson degree distribution Power-law degree distribution Figure 1. Illustration of Poisson and power-law degree distributions for hypothetical networks. bution. Instead, a complete zoo of 'fat-tail' distributions, such as lognormal, Burr, logGamma, Pareto, etc., can be found in complex (biological) networks. Then, the more generic term of 'fat-tail' degree distribution has been proposed to globalize those distributions. The idea is that networks with 'fat-tail' distributions display a large heterogeneity, or in our terms a large irregularity. The identification of fat-tail degree distribution in networks has become an intense area of research in system biology and other interdisciplinary areas of research. The consequences that a network displays such kind of degree distributions are remarkable.21-23 For instance, let us consider a protein-protein interaction network, in which nodes represent proteins and links represent interactions between pairs of proteins in a cell. If such network displays a fat-tail degree distribution it will be robust to random failures of their nodes. That is, due to the fact that about 80% of proteins have low number of interactions, their failure does not produce the collapse of the whole system. However, a targeted attack to the most connected proteins, the very few with very large degrees, disconnects the network into isolated chunks, which are unable to function as a system. In other words, if a network is very irregular, i.e., it is starlike, it will be robust to random failures but very vulnerable to intentional attacks. The consequences of the previously described relationship between vulnerability and irregularity are tremendous. Irregular biological networks can be very vulnerable to the failure of some specific nodes indicating possible sources of diseases as well as possible targets for drug design. The question that remains is whether we need an index of irregularity for such networks instead of analyzing their degree distributions. Identifying the degree distribution of a network is not a trivial question. There are hundreds of possible distributions to test. Sometimes the differences in the statistical fittings between several distributions are very small. More difficult is the situation for relatively small networks, where the number of data points is not enough for having a good fit of any of the candidate distributions. Consequently, having an index such as p(G), which has its minimum for regular networks and its maximum for starlike ones, is an urgent necessity. 4. Irregularity in Random Networks Now, we are going to investigate random graph models used to model real-world complex networks. The Erdos-Renyi graphs28 Gnp are generated by considering n nodes which are linked by pairs according to a probability p, 0 < p < 1. It is well-known that when p ~ (n)log n/n, where Gin) ^ Gnp is almost surely connected, and the degrees of almost all nodes are asymptotically equal. Then, in this regime R-1/2 (ER) ~ n/2 and p(G) « 0. Figure 2. Change of Randic index as a function of average degree for networks generated with Barabasi-Albert model of preferential attachment. Another popular model for generating random graphs is the Barabasi-Albert (BA) model,25 which produces random graphs with power-law degree distributions. In this model the graphs are generated from an initial set of m0 > 2 nodes. The new nodes are added one at a time by connecting them to m of the existing nodes according to the probability pi = kt/ Xjk;, We have generated BA graphs with 5,000; 10,000 and 20,000 nodes and average degrees ranging from 2 to 16. Then, we have averaged the values of 1R-1/2 for 100 realizations of each graph. In Fig. 2 we illustrate the plots of 1R-1/2 versus (k>. The plots can be approximated by using the following power-law expression, 2.27 + k ' (18) Accordingly, the irregularity index for BA networks is approximated by the following expression: p{bá)» f 0.27 + r 2.27 +k' (19) which for very large networks, n ^ with very large average degree approximates to a constant value, p(BA) ~ 0.1189. The first conclusion that can be extracted from these results is that the scale-free networks are more irregular than random networks with homogeneous degree distributions, which is a trivial conclusion based on the proper definition of both kinds of networks. However, in particular the BA model is a poor generator of such irregularity in graphs, which is able to produce only about 12% of the irregularity of a star-like graph. In addition, it can be clearly seen that the increase of the average degree in scale-free graphs reduces the irregularity. Thus, both degree distribution and average degree play important roles in understanding the irregularity of a network. All networks display significantly larger irregularity than the regular random networks. That is, all networks have p(G)>> 0, which indicates that they have some kind of fat-tail degree distributions characterized by very few hubs that keep the whole network connected. In comparing these networks with those generated at random using BA preferential attachment we see that most of them display larger irregularity than the BA ones. In particular the PPI of yeast, S. cerevisiae, displays a ratio p(G) / p(BA) = 1.74, followed by the human PPI with p(G) / p(BA) = 1.49. Table 1. Irregularity of complex protein-protein interaction networks of several organisms. No. Network n m pG) pB/A) ratio 1 D. melanogaster 3039 3687 0.148 0.246 0.602 2 KSHV 50 118 0.151 0.246 0.614 3 P. falciparum 229 604 0.174 0.196 0.888 4 VZV 53 154 0.278 0.227 1.225 5 Human 2783 6007 0.283 0.190 1.489 6 S. cerevisiae 2224 6829 0.294 0.169 1.740 7 A. fulgidus 32 37 0.308 0.372 0.828 8 H. pylori 710 1396 0.323 0.205 1.576 9 C. elegans 314 363 0.383 0.273 1.403 10 B. subtilis 84 98 0.386 0.309 1.249 Stumpf and Ingram40 have carried out a detailed statistical analysis of the empirical data existing for the yeast PPI showing that the best fit for the degree distributions are the stretched exponential followed by a lognormal distribution. The same distribution is observed for H. pylori.40 The stretched exponential distribution is also known as the Weibull distribution and has the following form: \(H) A>iï. (20) 5. Irregularity in Protein-protein Interaction Networks Here we study 10 protein-protein interaction (PPI) networks. In these networks, nodes represent proteins and undirected links represent the interaction between two proteins determined experimentally. The networks studied correspond to following organisms: D. melanogaster (fruit fly),29 Kaposi sarcoma herpes virus (KSHV),30 P. falciparum (malaria parasite),31 varicella zoster virus (VZV),30 human,32 S. cerevisiae (yeast),33 A. fulgidus3 H. pylori,35 C. elegans36 and B.subtilis?1 We study only the largest (main) connected component of each of these networks, which ranges from 50 to 3039 proteins. In Table 1 we give the values of irregularities of all these PPIs together with the values calculated for random networks with preferential attachment using BA model. The PPI of C. elegans displayed a power-law degree distribution and that of D. melanogaster was fit to a gamma distribution.40 Not all networks analyzed here were previously discussed by Stumpf and Ingram. In addition these authors do not study an exhaustive list of all possible distributions for the networks analyzed. Then, it is difficult to obtain a parallel between both approaches. According to our irregularity measure, the PPI of D. melanoga-ster appears as the least heterogeneous one while that of C. elegans is the most irregular among those considered by Stumpf and Ingram. What is clear here is that the current approach allows a classification of networks according to their irregularity, which is not possible by using degree distributions when a variety of fits are considered. In Fig. 3 we illustrate two of the smallest networks with close to extreme values of irregularity, i.e., that of KSHV and that of B.subtilis. 6. Ruminations Since the seminal paper introducing the Randic index was published in 1975, there have been many advances in the study and applications of networks and graph theory to molecular sciences. Milan Randic himself has pioneered several of these forefronts of research in areas such as characterization of molecular structure, chemin-formatics and bioinformatics. However, the most important contribution that the mentioned 1975 paper has done is as a source of inspiration for several generations of scientists. The current author is one of such examples who found his way in sciences by studying that paper and others that Milan Randic published on molecular graph theory. These contributions, together with those of other pioneers of molecular graph theory, have paved the road of the current development of network theory. Network theory is one of the most important 'new' areas of interdisciplinary research in 21th century, which is called to solve important public health, technological, environmental as well as socio-economical problems that confront humankind today.38 There is a vast literature in mathematics and in chemistry about the Randic index. Eminent mathematicians like Erdos and Bollobas5 already paid attention to this index at the end of the 20th century. Gutman and others810 have found many extremal properties and bounds for this index. Many chemists and biologists have found an immeasurable amount of applications for this index. The simplicity of the Randic index is astonishing: Graham Farmelo has edited a selection of "Elegant Formulae",39 which includes Einstein's E = mc2, Schrodinger's HW = EW, Shannon's I = -p log2pt, among others. No doubt about the elegance, simplicity and importance of all these formulae. Randic equation is elegant, simple and fundamental for understanding many properties of interaction networks. Only time will tell whether 'Randic equation' deserves a place among Farmelo's "elegant formulae". In the meantime we continue exploring it to discover new facts that support its inclusion. 7. Acknowledgement The author thanks Editor Marjana Novic for the kind invitation to write this paper. Partial financial support from the Principal, University of Strathclyde through the New Professors' Fund is also thanked. 8. References 1. Chemistry. Merriam-Webster's Medical Dictionary, http://dictionary.reference.com/browse/Chemistry, (accessed: July 20, 2010). 2. U. Alon. An Introduction to Systems Biology: Design Principles of Biological Circuits. CRC Press, 2006. 3. R. F. Ludlow, S. Otto, Chem. Soc. Rev. 2008, 57, 101-108. 4. M. Randic, J. Amer. Chem. Soc. 1975, 97, 6609-6615. 5. B. Bollobas, P. Erdos, Ars Combin. 1998, 50, 225-233. 6. L. B. Kier, L. H. Hall, Molecular Connectivity In Chemistry and Drug Research, Academic Press, New York, 1976. 7. L. B. Kier, L. H. Hall, Molecular Connectivity in Structure Activity Analysis, John Wiley Publ, London, 1986. 8. X. Li, Y. Shi, MATCH: Comm. Math. Comput. Chem. 2008, 59, 127-156. 9. X. Li, I. Gutman, Mathematical Aspects of Randic-Type Molecular Structure Descriptors, Mathematical Chemistry Monographs No.1, Kragujevac, 2006. 10. I. Gutman, B. Furtula (Eds.) Recent Results in the Theory of Randic Index, Mathematical Chemistry Monographs No.6, Kragujevac, 2008. 11. L. Collatz, U. Sinogowitz, Abh. Math. Sem. Univ. Hamburg 1957, 21, 63-77. 12. D. Cvetkovic, P. Rowlinson, Publ. Inst. Math. (Beograd), 1988, 44, 29-34. 13. F. K. Bell, Linear Algebra Appl. 1992, 161, 45-54. 14. T. A. B. Snijders, Social Networks 1981, 3, 163-174. 15. M. O. Albertson, Ars Combinatoria 1997, 46, 219-225. 16. P. Hansen, H. Mélot, in S. Fajtlowicz (Eds.), Graphs and Discovery, DIMACS Series in Discrete Mathematics and Theoretical Computer Science, Providence, American Mathematical Society, 2005, 69, 253-264. 17. I. Gutman, P. Hansen, H. Mélot, J. Chem. Inf. Model. 2005, 45, 222-230. 18. N. L. Briggs, Algebraic Graph Theory, Cambridge University Press, 1974. 19. B. Mohar, in G. Hahns, G. Sabidussi (Eds.), Graph Symmetry: Algebraic Methods and Applications, NATO ASI Ser. C 497, Kluwer, 1997, 225-275. 20. T. Bitikoölu, J. Leydold, P. F. Stadler, Laplacian Eigenvector of Graphs, Springer-Verlag, Berlin, 2007. 21. S. H. Strogatz, Nature 2001, 410, 268-276. 22. R. Albert, A.-L. Barabási, Rev. Mod. Phys. 2002, 74, 47-97. 23. M. E. J. Newman, SIAMRev. 2003, 45, 167-256. 24. D. J. Watts, H. Strogatz, Nature 1998, 393, 440-442. 25. A.-L. Barabási, R. Albert, Science 1999, 286, 509-512. 26. R. Milo, S. Shen-Orr, S. Itzkovitz, N. Kashtan, D. Chklov-skii, U. Alon, Science 2002, 298, 824-827. 27. C. Song, S. Havlin, H. A. Makse, Nature 2005, 433, 392395. 28. P. Erdös, A. Rényi, Publ. Math. 1959, 6, 290-297. 29. L. Giot, J. S. Bader, C. Brouwer, A. Chaudhuri, B. Kuang, Y. Li, Y. L. Hao, C. E. Ooi, B. Godwin, E. Vitols, G. Vijayada-modar, P. Pochart, H. Machineni, M. Welsh, Y. Kong, B. Zerhusen, R. Malcolm, Z. Varrone, A. Collis, M. Minto, S. Burgess, L. McDaniel, E. Stimpson, F. Spriggs, J. Williams, K. Neurath, N. Ioime, M. Agee, E. Voss, K. Furtak, R. Renzulli, N. Aanensen, S. Carrolla, E. Bickelhaupt, Y. Lazovatsky, A. DaSilva, J. Zhong, C. A. Stanyon, R. L. Finley Jr., K. P. White, M. Braverman, T. Jarvie, S. Gold, M. Leach, J. Knight, R. A. Shimkets, M. P. McKenna, J. Chant, J. M. Rothberg, Science 2003, 302, 1727-1736. 30. P. Uetz, Y.-A. Dong, C. Zeretzke, C. Atzler, A. Baiker, B. Berger, S. Rajagopala, M. Roupelieva, D. Rose, E. Fossum, J. Haas, Science 2006, 311, 239-242. 31. D. J. LaCount, M. Vignali, R. Chettier, A. Phansalkar, R. Bell, J. R. Hesselberth, L. W. Schoenfeld, I. Ota, S. Sahasra-budhe, C. Kürschner, S. Fields, R. E. Hughes, Nature 2005, 438, 103-107. 32. J.-F. Rual, K. Venkatesan, T. Hao, T. Hirozane-Kishikawa, A. Dricot, N. Li, G. F. Berriz, F. D. Gibbons, M. Dreze, N. Ayi-vi-Guedehoussou, N. Klitgord, C. Simon, M. Boxem, S. Milstein, J. Rosenberg, D. S. Goldberg, L. V. Zhang, S. L. Wong, G. Franklin, S. Li, J. S. Albala, J. Lim, C. Fraughton, E. Llamosas, S. Cevik, C. Bex, P. Lamesch, R. S. Sikorski, J. Vandenhaute, H. Y. Zoghbi, A. Smolyar, S. Bosak, R. Se-querra, L. Doucette-Stamm, M. E. Cusick, D. E. Hill, F. P. Roth, M. Vidal, Nature 2005, 437, 1173-1178. 33. C. von Mering, R. Krause, B. Snel, M. Cornell, S. G. Oliver, S. Fields, P. Bork, Nature 2002, 417, 399-403. 34. M. Motz, I. Kober, C. Girardot, E. Loeser, U. Bauer, M. Albers, G. Moeckel, E. Minch, H. Voss, C. Kilger, M. Koegl, J. Biol. Chem. 2002, 277, 16179-16188. 35. C.-Y. Lin, C.-L. Chen, C.-S. Cho, L.-M. Wang, C.-M. Chang, P.-Y. Chen, C.-Z. Lo, C. A. Hsiung, Bioinformatics 2005, 21, 1288-1290. 36. A. David, P. Bello, N. Thierry-Mieg, J. Vaglio, J. Hitti, L. Doucette-Stamm, D. Thierry-Mieg, J. Reboul, S. Boulton, A, J, Walhout, O. Coux, M. Vidal, EMBO Rep. 2001, 2, 821828. 37. P. Noirot, N. F. Noirot-Gross, Curr. Op. Microb. 2004, 7, 505-512. 38. E. Estrada, M. Fox, D. Higham, G.-L. Oppo (Eds.) Network Science. Complexity in Nature and Technology, Springer, 2010 in press. 39. G. Farmelo (Ed.), It Must be Beautiful: Great Equations of Modern Science. Granta, U.K. 2003. 40. M. P. H. Stumpf, P. J. Ingram, Europhys. Lett. 2005, 71, 152158. Povzetek Nova formulacija Randicevega indeksa je izpeljana kot minimizacija kvadratne oblike, ki vključuje Laplaceovo matriko grafa. Iz te formulacije je enostavno razvidno, da Randicev indeks lahko uporabimo za definicijo novega indeksa »nepravilnosti«, ki je tukaj predstavljen. Potreba po novem načinu študija nepravilnosti grafov se je pokazala pri obravnavi kompleksnih bioloških sistemov, predstavljenih v obliki mrež. Analizirali smo Randiceve in »nepravilnostne« indekse za naključne mreže s Poissonovo in potenčno porazdelitvijo. Nadalje smo analizirali nepravilnost desetih mrež, ki ponazarjajo interakcije med proteini v različnih organizmih, v območju med 50 in 3000 vozlišči. V zaključnem razmisleku je poudarjena eleganca in uporabnosti Randicevega indeksa.