Strojniški vestnik - Journal of Mechanical Engineering 64(2018)6, 393-400 © 2018 Journal of Mechanical Engineering. All rights reserved. D0l:10.5545/sv-jme.2017.5189 Original Scientific Paper Received for review: 2017-12-30 Received revised form: 2018-02-19 Accepted for publication: 2018-04-16 New Hybrid System of Machine Learning and Statistical Pattern Recognition for a 3D Visibility Network Matej Babič1* - Karolj Skala2 - Dookhitram Kumar3 - Roman Šturm4 1 Jožef Stefan Institute, Ljubljana, Slovenia 2 Ruder Boškovic Institute, Croatia 3 University of Technology, Mauritius 4 University of Ljubljana, Faculty of Mechanical Engineering, Slovenia Intelligent systems are an excellent tool to use for solving complex problems in the field of industrial applications. We use the mathematical method of fractal geometry and network theory when laser-hardening techniques are applied. The microstructure of the robot-laser-hardened specimens is very complex; however, we can present it by using a 3D visibility network. We convert the scanning electron microscope (SEM) images of the microstructure to a 3D graph and calculate the density of the visibility network of these 3D networks. We have analyzed the topographical properties of the hardened specimens by using the algorithm for the construction of a visibility network in a 3D space. We develop a new hybrid system of machine learning for predicting carbide content of the hardened specimens by using multiple regression, neural networks, and a genetic algorithm. We find the statistical significance of the relationship between attributes of the hardened specimens, the topological properties of visibility graphs, and carbide content of the hardened specimens._ Keywords: fractal geometry, hybrid system, laser hardened specimens, visibility network, statistical pattern recognition Highlights • We have calculated the statistical properties of the data of the parameters of the hardened specimens. • We have described the carbide content of the hardened specimens using the topological properties. • We have presented a new intelligent hybrid system model to predict the carbide content of the hardened specimens. • Our new method has many applications in pattern recognition, computer graphics, computational geometry, and so on. 0 INTRODUCTION Robot laser hardening [RLH] [1] is a heat treatment similar to inductive or conventional flame hardening. We can analyze the microstructures of RLH specimens using 3D visibility networks (graphs). A visibility network [2] is a graph of visible areas, which presents a set of nodes and obstacles in the Euclidean plane or space. Fractal geometry [3] was developed by Mandelbrot, who built the Mandelbrot set y=z2 + c with the help of a computer. He provided a new approach in the scientific discipline as he set out and designed a new way of thinking about structures and shapes. The Hurst parameter H [4] is the correlation between random steps X1 and X2, which is followed by the time-to-time difference At. Hurst parameter occurs in many areas of applied mathematics, including fractals and chaos theory, and is used in many fields ranging from biophysics to network computers. The parameter was originally developed in hydrology. However, modern techniques to estimate the Hurst parameter H come from fractal mathematics. The fractal dimension has been used to measure the roughness of sea coasts. The relationship between the fractal dimension D and the Hurst parameter H is given by the equation D = 2 - H for 2D objects and D=3 - H for 3D objects. We developed a new method to estimate the Hurst parameter H of a 3D object. In this paper, we introduce a new hybrid intelligent system to predict the carbide content of RLH specimens from the topological property of the density of the visibility network and fractal dimension. 1 PREPARATION OF MATERIAL SPECIMENS The study was done on the standard tool steel labeled as DIN 1.7225 [5]. The tool steel was surface hardened by laser at different speeds and different powers. We use a robot laser cell RV60-40 (Reis Robotics Company). The maximum power of the robot-laser cell is 3000 W. We hardened specimens with an output power of 1500 W. Therefore, we modified the speed parameter v £ [2, 5] mm/s and the temperature parameter T £ [800, 2000] °C. Each sample was prepared by etching and polishing (IMT, Institute of Metals and Technology Ljubljana, Slovenia) for a microscope evaluation (IJS, Jozef Stefan Institute). Fig. 1 was made by field emission scanning electron microscopy, JMS-7600F, JEOL. We wanted to know whether the microstructure of the RLH fractal patterns found a structure from which the Hurst parameter H could be estimated. Fig. 2 presents a 3D graph of the *Corr. Author's Address: Jožef Stefan Institute, Ljubljana, Slovenia, , babicster@gmail.com 393 microstructure of the RLH specimen. It is converted from Fig. 1 by using Fig. 1 color depth. Fig. 1. SEM picture of microstructure of the RLH specimen Fig. 2. 3D graph of microstructure of the RLH specimen 2 DESCRIPTION OF METHOD We used fractal geometry and a visibility network to determine the complex microstructure of the RLH specimens. Fig. 3 presents the random vertices of the 3D graph. For better visual presentation, we use 5x5 vertices of the 3D graph. 3D graph presents microstructure of the RLH specimens. Babic et al. [6] present a solution for constructing a visibility network in a 3D space. Fig. 4 presents the results of the problem of constructing a visibility network in a 3D space. We use the statistical topological property of the density of the visibility Fig. 3. Vertices in a 3D graph Fig. 4. Visibility network of vertices in a 3D graph network for pattern recognition from SEM images of the RLH specimens. The density q was calculated for each visibility network by Eq. (1): q = - 2m n x (n -1) (1) where m is the number of edges, n is the number of vertices in the visibility network. We present a method of estimating the Hurst exponent H for 3D objects [7]. First, we use the program Image J to find all the coordinates (x, y, z) of the SEM picture. Secondly, we estimate the Hurst exponent H by using the z-coordinates, which present a long continuous graph (Fig. 5). Also, all points (xz, y0, zz) present a space component on a 2D graph for all points (xz, zz). All points (xi,j1, zz) present a second space component on a 2D graph for all points (xz, zz). We made a space component for all y, Vz. Then we combined all these space components into one space component. For this long space component, we can estimate the Hurst exponent H. We use the fractal dimension for pattern recognition from SEM images. Fig. 5. Long continuous graph We use the method of a visibility network and fractal geometry for statistical pattern recognition. To model the results, we use intelligent system methods, that is, multiple regression, neural network (NN), and genetic programming (GP). NNs [8] have the ability to solve a variety of problems. The sophistication of NNs is primarily due to their ability to imitate the principle of the functioning of the biological brain, which means that they solve problems similarly to humans. We used the Neuralyst program to create a model with NNs. Neuralyst is a software tool used within Excel. It has the ability to model NNs. We used a multi-tasking neural system with backpropagation and no back links. We had the option of setting different attributes. Table 1 presents the attributes of the NNS. Table 1. Attributes of the NN Learning speed [-] 0.6 Inertial coefficient [-] 0.5 Test mass tolerance [-] 0.02 Tolerance of the learning set [-] 0.03 Number of layers [-] 4 Fig. 6 presents a general multi-layer NN system. Table 2. Attributes of the GP Size of the population of organisms 500 Maximum number of generations 100 Reproduction probability 0.4 Crossover probability 0.6 Maximum permissible depth in the creation of the population 10 Maximum permissible depth after the operation of crossover of two organisms Smallest permissible depth of organisms in generating new organisms Tournament size used for selection of organisms Genetic programming [9] is similar to genetic algorithms and differs only in terms of the presentation method. Individual component in genetic algorithms is presented by a sequence of numbers, and the individual component in genetic programming is presented by a computer program. GP automatic writing of programs according to the nature of natural selection (evolution). At the beginning, we have some randomly written programs, which represent the initial Fig. 6. General multi-layer NN system 6 2 7 population. Then by crossing and selection, we get the next generation. Table 2 presents the attributes of the genetic programming. We used the genetic operations of reproduction and crossover. Fig. 7 presents an example of an organism in genetic programming. (22 -(-^-))*(l7+eos(Y)) Fig. 7. Organism in GP Multiple regression (MR) [10] is designed to investigate linear causes the relationship between a single dependent variable and one or more independent variables. With it, we determine the statistical feature in the power of connection and we predict the values of the dependent variable. The impact of each of the independent variables is estimated to be independent of the interactions between independent variables. Hybrid evolutionary computation [11] is a generic, flexible, robust, and versatile method for solving complex global optimization problems and could be used in practical applications. We present a new intelligent hybrid systems model in Fig. 8. Data information is presented in Table 3. Table. 3. Attributes of the hardened specimens S x1 x2 x3 x4 x5 x6 Y S1 1000 2 1.91 2.30 0.191 85 39 S2 1000 3 1.96 2.26 0.224 85 45 S3 1000 4 1.95 2.26 0.210 85 43 S4 1000 5 1.94 2.34 0.235 85 41 S5 1400 2 1.92 2.22 0.246 85 36 S6 1400 3 1.98 2.39 0.228 85 49 S7 1400 4 1.95 2.25 0.201 85 45 S8 1400 5 1.98 2.29 0.215 85 48 S9 1000 2 1.97 2.18 0.247 39 46 S10 1000 3 1.86 2.18 0.232 45 32 S11 1000 4 1.98 2.41 0.219 43 45 S12 1000 5 1.94 2.21 0.241 41 42 S13 1400 2 1.98 2.26 0.225 36 28 S14 1400 3 1.58 2.27 0.238 49 19 S15 1400 4 1.97 2.43 0.208 45 41 S16 1400 5 1.81 2.29 0.197 48 38 S17 800 0 1.97 2.23 0.289 85 47 S18 1400 0 1.98 2.24 0.277 85 52 S19 2000 0 1.97 2.26 0.245 85 50 S20 950 0 1.96 2.28 0.217 85 66 S21 850 0 1.95 2.32 0.212 85 80 S22 0 0 1.91 2.30 0.195 85 39 3 RESULTS AND DISCUSSION The attributes of the hardened specimens influence on the carbide content (Table 3). The specimens are labeled as S1 to S22. Attribute x1 represents the temperature [°C] and x2 represents the speed of RLH [mm/s]. Attributes x3, x4, and x5 represent the keys for pattern recognition. Parameter x3 represents the complexity in 2D, x4 represents the complexity in 3D, x5 represents the density of the visibility networks in a 3D space, and x6 represents the carbides in specimens. The last attribute Y is the measured surface carbide Genetic Programming Butiding hybrid system of machine learning Fig. 8. Intelligent system model and new hybrid intelligent system content of the RLH specimens. Specimen S22 presents the material before the RLH process. In Table 4, we denote measurement (M) data (D) by MD, prediction data obtained by MR, prediction data obtained by neural network (NN), prediction data obtained by GP, and prediction data obtained by the hybrid system by H. Specimen S14 has a minimal carbide content after hardening, that is, 19 %. Table 4 present the statistical properties of the experimental and predicted patterns. Table 5 present topological properties of 3D visibility network. The measured and predicted carbide content of the RLH specimens is presented in the Fig. 9. The MR model is presented by Eq. (1), GP model is presented by Eq. (2), and hybrid model is presented by Eq. (3). We calculated precision of the GP model, NN and of the MR model by calculating average of absolute difference between measured and predicted data divided by measured data. The GP model has 85.57 % precision, the NN has 90.57 % precision, MR has 80.87 % precision and hybrid system has 62.98 % precision. Table 5 presents the topological properties of the 3D visibility network. Measured and predicted parts of carbides of the LHR specimens depend on attributes X], X2, X3, X4, X5 and x6 are presented in Fig. 9. We use the statistical topological property of the visibility networks to describe the carbide content in the microstructures. Image analysis of the SEM images of the RLH specimens is an interesting approach. With MR, GP and NN, we predict the carbide content in the microstructures. Finally, we present a new hybrid system of intelligent systems. For measured and predicted parts of carbides of the LHR specimens data, we calculated Kendall correlation coefficient. The best results for prediction give us NN, because the Kendall correlation coefficient (0.021) is most close to experimental data (0.131). Table 3 presents the topological properties of the 3D visibility network and attributes of RLH specimens. In this way, we can see how the attributes of speed and temperature influence the topological structures of visibility graphs in 3D space. Table 6 presents the statistical properties of the topological properties of the extreme number, number of edges, and triadic census type 16 to 300 of the 3D visibility network for RLH specimens. Firstly, we calculated the basic statistical properties of the mean, standard deviation, standard error, median, geometric mean, and harmonic mean of the topological properties of visibility graphs in 3D space of RLH specimens. We found significant positive relationships between the kurtosis, Fisher's G2, the coefficient of variation, the coefficient of dispersion, Table. 4. Measured and predicted data S MD MR NN GP H S1 39 55.28 38.98 38.7 34.77 S2 45 50.93 44.53 45.0 37.32 S3 43 47.84 42.25 45.0 40.61 S4 41 45.89 42.27 41.4 44.78 S5 36 42.41 36.98 40.8 44.18 S6 49 49.69 48.32 44.3 43.32 S7 45 40.97 44.86 46.7 46.45 S8 48 40.00 48.06 44.6 48.86 S9 46 42.74 45.97 46.2 40.85 S10 32 35.98 32.08 18.8 47.98 S11 45 48.01 44.98 40.0 44.99 S12 42 33.04 42.26 42.0 55.08 S13 28 39.91 46.48 27.7 47.05 S14 19 19.88 18.99 22.8 67.35 S15 41 42.15 41.54 45.2 51.32 S16 38 27.23 37.08 38.0 60.50 S17 47 59.78 47.86 47.4 30.50 S18 52 50.58 50.11 49.7 39.58 S19 50 42.10 51.01 50.1 48.54 S20 66 62.98 66.65 47.5 28.93 S21 80 65.82 79.25 46.4 27.35 S22 39 79.59 84.60 84.9 58.25 Table. 5. Topological properties of 3D visibility network S Extreme number Number of edges Triadic census type 16 to 300 S1 120823 3500351 4865624 S2 125787 3308776 4191425 S3 123943 3335861 4267175 S4 124833 3355735 4353872 S5 124626 3314397 4212248 S6 131540 3190001 3796016 S7 126962 3311163 4196282 S8 130799 3173601 3741603 S9 123393 3355056 4256560 S10 126395 3386391 4483986 S11 124296 3315948 4207031 S12 123829 3355735 4353872 S13 128143 3451450 4862060 S14 122500 3685175 5877473 S15 120818 3338595 4199754 S16 116812 3733624 5848517 S17 133031 3178192 3774789 S18 130974 3182544 3819193 S19 131043 3170121 3746658 S20 95090 4151533 7284078 S21 106916 5653616 1764141 S22 86871 5735036 1466536 Model of multiple regression: Y=-99.0509 - 0.0178 x x1 - 3.25717 x x2 + 46.65489 x x3 + 38.06208 x x4 - 63.3748 x x5 + 0.165097 x x6. (2) Model of genetic programming Y = 0.129654 x (-x6 - x22 - 2 + 2 + X X^ Xr :(-x 2 + x22) x3 + x2 J X2 X X6 X ( O X + — X2 Xi + X2 X (X2 + XX2 X X3 ) + X6 (3 X (x2 + X3 ) + X5 )X (3 + 2X X3 X (2 + X3 ) + X6)) (X2 + x3 + X6) X Y 7.71283 + 2 x x2 - x6 — + - v x2 - x3 X V 3 J (3) x 6 X 6 2 X 6 4 V 3 J X 6 + X 4 4 x6 Hybrid model Y2 + Y(A + B) + AB - C — 0, A — 0.129654 ;(-x2 + x2 ) - X3 +1—+-2 x3 + x2 B — 99.0509 + 0.0178x1 + 3.25717x2 - 46.65489x3 + 63.3748x5 -0.165097X6, CC — X2 X3 X6 , x1 + x2 (x2 + x2 x3 ) + x6 2 + X6 x2 - x3 (x3 (x2 + x3 ) + x6 )(x3 + 2x3 x (x2 + x3 ) + x6)) f \ X6 X6 + — X (x3 x2 + x6 X 7.71283 + 2x2 - x6 + 2 6 +- X2 X3 X2 X1 X 6 X 2 X X 6 + 90 80 70 60 50 40 30 20 10 0 & & $ & rip' rt> £ & & & J? n$> A" o> o> o> o> o> o> o> o> o> o> o> o> o> o> o> /¿> ¿V .0S> Ä . ♦ Experimental data y = -0.1417X + 45.766 T = 0,131 ■ Prediction with MR y = -0.4789X + 51.998 T = - 0,08 A Prediction with NN y = -0.5965X + 53.91 T = 0,021 x Prediction with GP y = -0.4445X + 48.439 T = - 0.008 X Prediction with H y = -0.1457X + 46.61 T = -0,11 Density of the visibility networks in a 3D space of RLH specimens Fig. 9. Measured and predicted carbide content of the LHR specimens depend on attributes xh x2, x3, x4, x5 and x6 and the topological properties of visibility graphs in 3D space of RLH specimens. Table. 6. Statistical properties of topological properties of 3D visibility network SP Extreme number Number of edges Triadic census type 16 to 300 Mean 121792 3599223 4253132 Standard deviation 11531.35 714391.3 1201353 Standard error 2458.49 152308.8 256129.4 Median 124461 3346826 4209640 Geometric mean 121194 3546497 4055020 Harmonic mean 120509 3505535 3790200 Kurtosis 5.90 7.454659 4.719475 Fisher's G2 4.03 5.993685 2.517122 Coefficient of variation 0.094 0.1984849 0.2824632 Coefficient of dispersion 0.05 0.1024648 0.1701942 4 CONCLUSIONS The paper presents a new method of constructing visibility networks in a 3D space, a new method of describing the complexity of 3D space, and a new hyper-hybrid system of machine learning for the use in mechanical engineering to predict the topographical properties of materials. The paper presents a method of using visibility graphs in 3D space and fractal geometry to analyze the complexity of RLH specimens. Analyzing the complexity of RLH surfaces is a very hard problem. This new method has many applications in pattern recognition, computer graphics, computational geometry, and so on. The main findings are: 1. We use the method of network theory and fractal geometry to analyse the microstructure. 2. For prediction of the carbide content of hardened specimens, we use intelligent system methods, namely a neural network, multiple regression, and a genetic algorithm. The best results for prediction give us neural network. 3. We present the new hybrid spiral sequences. 4. The paper introduces a new method of machine learning in metallurgy. 5. We find the statistical significance of the relationship between attributes of the hardened specimens and the experimental and predicted pattern data. 6. The paper compares three methods, namely multiple regression, neural network and genetic programming, with a hybrid system of intelligent systems. 5 REFERENCES [1] Petrovič, S., D., Šturm, R. (2014). Fine-structured morphology of a silicon steel sheet after laser surface alloying of Sb powder. Strojniški vestnik - Journal of Mechanical Engineering, vol. 60, no. 1, p. 5-11, DOI:10.5545/sv-jme.2013.1347. [2] Ghosh, S.K. (1997). On recognizing and characterizing visibility graphs of simple polygons. Discrete & Computational Geometry, vol. 17, no. 2, p. 143-162, DOI:10.1007/ BF02770871. ISSN 0179-5376. [3] Mandelbrot, B.B. (1983). The Fractal Geometry of Nature. W.H. Freeman & Co., San Francisco, DOI:10.1119/1.13295. [4] Stoev, S., Pipiras, V., Taqqu, M.S. (2002). Estimation of the self-similarity parameter in linear fractional stable motion. Signal Processing, vol. 82, no. 12, p. 1873-1901, DOI:10.1016/ S0165-1684(02)00317-1. [5] Alta steel (2017). from http://www.altaspecialsteel.com/DIN-1-7225-AISI-4140-38.html, accessed on 2017-12-30. [6] Babič, M., Hluchy, L., Krammer, P., Matovič, B., Kumar, R., Kovač, P. (2017). New method for constructing a visibility graph-network in 3D space and new hybrid system of modeling. Journal of Computing and Informatics, vol. 36, no. 5, p. 1107-1126, DOI:10.4149/cai.2017.5.1107. [7] Babic, M., Kokol, P., Guid, N., Panjan, P. (2014). A new method for estimating the Hurst exponent H for 3D objects. Materials and Technology, vol. 48, no. 2, p. 203-208. [8] Graves, A., Schmidhuber, J. (2009). Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks, in Bengio, Y., Schuurmans, D.,; Lafferty, J., Williams, C.K.I., Culotta, A. (eds.). Advances in Neural Information Processing Systems, vol. 22, p. 545-552. [9] Koza, J.R. (1992). Genetic Programming: On the Programming of Computers by Means of Natural Selection, MIT Press. [10] Armstrong, J.S. (2012). Illusions in regression analysis. International Journal of Forecasting, vol. 28, no. 3, p. 689694, DOI:10.1016/j.ijforecast.2012.02.001. [11] Ravi, V., Naveen, N., Pandey, M. (2013). Hybrid classification and regression models via particle swarm optimization auto associative neural network based nonlinear PCA. International Journal of Hybrid Intelligent Systems, vol. 10, p. 137-149, DOI:10.3233/HIS-130173.