Paper received: 08.06.2010 Paper accepted: 09.09.2010 Fault Detection of an Industrial Heat-Exchanger: A Model-Based Approach Dejan Dragan* University of Maribor, Faculty of Logistics, Slovenia One of the key issues in modelling for fault detection is how to accommodate the level of detail of the model description to suit the diagnostic requirements. The paper addresses a two-stage modelling concept to an industrial heat exchanger, which is located in a tyre factory. Modelling relies on both, prior knowledge and recorded data. During the identification procedure, the estimates of continuous model parameters are calculated by the least squares method and the state variable filters (SVF). It is shown that the estimates are largely invariant of the bandwidth of the SVFs. This greatly reduces the overall modelling effort and makes the whole concept applicable even for less experienced users. The main issues of the modelling procedure are emphasized. Based on the process model, a simple detection system is derived. An excerpt of the results obtained on operating records is given. ©2010 Journal of Mechanical Engineering. All rights reserved. Keywords: industrial heat exchanger, fault detection, condition monitoring, model-based detection, modelling, identification 0 INTRODUCTION Model-based condition monitoring of industrial processes aims at early revelation of degradations in process equipment and instrumentation. A sensible process model acts as an additional virtual instrument, which contributes to a higher quality of production and better safety. There are many papers and books dealing with model-based techniques for detecting, isolating and identifying faults [1] to [6]. However, in many real applications deriving a proper model still takes a bulk of overall design effort. Moreover, proper shaping of the model precision with respect to the diagnostic requirements remains to be rather an art. This paper deals with the design of a fault detector for a heat exchanger as a possible alternative to some other approaches [7] to [11]. The work represents part of the prototyping design of a condition monitoring system for the process of incineration of vulcanisation gasses located in a tyre factory [6]. A brief idea of the process can be grasped from Fig. 1. Vulcanisation gas (VU gas), which is one of hand products of vulcanisation, is generated on vulcanisation lines. Prior to the emission into the atmosphere, all the carbon particles contained in the gas need to be destroyed by incineration in a combustion chamber. The entire system consists of three major parts [6]: • pre-heating of the VU gas in a gas-gas heat exchanger; • incineration of the VU gas in a combustion chamber; • transportation of the flue gasses to the chimney. The focus of this paper is on model-based condition monitoring of the cold part of heat exchanger. The goal is to improve the support to the maintenance team through permanent monitoring of the condition of sensors and detection of fouling. The project aims at designing a detector with the highest precision possible. For that purpose, the extra redundancy is achieved by means of an analytical model of the plant. The modelling concept consists of two stages in which prior knowledge and recorded data are combined (grey-box modelling concept). In the first stage, the model structure is derived up to unknown parameters by strongly relying on reasoning from first principles. After taking the available instrumentation into consideration, the set of prior assumptions and the diagnostic requirements the modelling procedure ended up with a continuous-time model linear in parameters. For the purpose of parameter estimation, the least squares method (LSM) combined with state variable filters (SVF) is adopted in the second *Corr. Author's Address: University of Maribor, Faculty of Logistics, Mariborska c. 7, 3000 Celje, Slovenia, dejan.dragan@fl.uni-mb.si stage. It is shown that the use of LSM leads to the identification results, which are largely invariant of the bandwidth of the SVFs. This observation deserves attention since the choice of bandwidth, as a design parameter, turns out to be quite an easy task. This is believed to be one of the contributions of the paper. Fig. 1. The incineration system The derived model of the cold part of an exchanger employs the temperatures (Tco, Tho) and flow of the VU gas (^VU). The measured signals (c.f. Fig. 1) are collected, displayed and stored by the FactoryLink™ SCADA system (USDATA Corp.) Finally, the model-based diagnostic algorithm is designed to run on-line as an external C module of SCADA. The proposed design of the condition monitoring system is very simple and the suggested detector is a helpful indicator for the operator to take corrective action. The derivation of the model structure and the parameter estimation approach is described in the first section. An excerpt of the experimental results is given in the second section. Finally, the diagnostic procedure is overviewed in the third section. 1 SYSTEM IDENTIFICATION 1.1 Determination of Model Structure The purpose of this section is to emphasize the importance of prior knowledge in deriving the model structure. This knowledge is essential in early modelling steps, in particular in defining causal relationships between process variables. Depending on the diagnostic requirements, these relationships can range from loose (qualitative) to precise (quantitative) descriptions. In case of poor prior knowledge and/or loose diagnostic requirements, model precision can be restricted to qualitative relationships defined on sign, interval or fuzzy sets. Perfect prior knowledge allows for precise expressions that are fully defined on the set of real numbers. Since in many practical cases prior knowledge is incomplete, process data that represent the carrier of additional information needed to complete the model description need to be employed. Here, the finest level of detailed description is observed. The process of the heat exchange between (cool) VU gas and (hot) flue-gas is illustrated in Fig. 2. The modelling procedure starts with setting the energy balance equation for an infinitesimally small piece of the VU gas channel and pipe wall. This results in the following partial differential equations respectively [1]: dTC (x, t) c dt ^ •«W (t) • dTC (x, t) dx (1) Pvu • AC1 dT (x, t) W \ ' dt •a2(t) •[(xt) - tc (xt)] *,(t) -\th (x, t) - TW (x, t) pwAwcw (2) pwAwcw ■a2(t) \Tw(x, t) -Tc(x, t)], where TC(x,t), TH(x,t) and TW(x,t) represent spacetime behaviour of the temperatures of the cold part of the exchanger, hot part and pipe wall, respectively, while (o1 = 2rcrj, o2 = 2nr2). In order to identify the relationship between the measured variables (Tco, Tho, @VU), the distributed parameter models in Eqs. (1) and (2) need to be converted into the lumped models. The lumping procedure is based on the following set of assumptions: 1. specific mass (pVU) and specific heat (cVU) of the VU gas are assumed to be constant, 2. flow of the VU gas is space independent (®VU(x,t) = ^VU(t)), c o 2 3. convective heat exchange coefficients are independent of time and space (a^x,t) = a1, a2(x,t) = a2), 4. as the wall is only 4 mm thin Aw ~ 0 and dTW(x,t) / dt ~ 0 can be assumed. 5. The retention time of vulcanisation gasses in the heat exchanger is fairly low, as the speed of VU gasses is relatively high while the length of the heat exchange channel is quite short. This implies an almost momentary formation of temperature profiles along the heat exchange channel. Assumption 4 implies short time constant for the dynamics of the wall temperature TW(x,t) in Eq. (2). Transients can be neglected so that the following static relation emerges: Tw ( x, ? ) = m^- Th ( x, ? ) + (1 - m )• TC ( x, ? ), (3) o1 •a1 + o2 • a2 In the next step let us combine Eqs. (3) and (1) at x = L, which results in: dTC (x, t) | d t x = L 1 ^ / \ dTC (x, t) + — -®VU (t ) ■ C. A, d x x = L Tco(t) 1 (4) = °2 a ^m L,t)--(L,t)]. PVU ■ 4ct ■ CTJ 1-T T , v -' '-2,---Tho (t) - TCo(t) Then, according to assumption 5, the temperature profiles along the heat exchange channels exhibit almost static behaviour. From the steady-state condition at x = L, Eq. (4) results in the following expression: dTc ( x) d x = ■ ( - Tco )=£■(( - Tco ), x = L b ® (5) where bars over symbols denote stationary values. However, a number of identification runs, carried out on model structures based on Eq. (5), turned to produce one-step-ahead predictor with relatively poor performance. Careful examination of the unmodelled effects related to the derivation of Eq. (5), suggests the approximation of the gradient dTC(x,t) / dx at x = L by a richer structure in order to improve the predictive power of the final process model. It can be shown that the gradient is related with the measured input and ouput temperatures and flow in a very complicated manner. Instead of an exact solution a black-box structure is sought so that experimental data are fitted as well as possible while keeping the number of unknown parameters at a minimum. Among many candidates (polynomials, neural networks) it turned out that already a simple structure with only two free parameters can significantly raise the quality of fit of the model Eq. (4). The approximation reads very similarly to Eq. (5) i.e. dTc(x,t) dx x = L "t: ■ TJt)-e2 ■ TJt). (6) If Eq. (6) is entered to Eq. (4), the following expression is obtained: Tco(t) + b^vu(t)•( • T (t)-s2 • T (t)) = ho co ! (/) = a ,(Tho (t ) - To (t ) ), i.e. o1 •a1 Tco (t ) + V • [Om(t)-TJt)] + TT^t) + V • HV (0 T (t) ] = a • [Tho (t) - Tco (t)]. ^T -F« (8) Eq. (8) represents the lumped model of the cold part of the heat exchanger. The same modelling procedure can be applied to derive a model of the hot part. Unfortunately, the task is not that easy due to a lack of a flow sensor in the flue-gas channel (hot part). Nevertheless, a careful analysis indicates that the unmeasurable flow mostly depends on flow of the vulcanisation gasses, roughly in a linear way. With this in mind a model of the hot part being entirely similar to that of the cold part is achieved. Due to a lack of space the hot part will not be treated here. In the sequel emphasis is on the cold part only. 1.2 Parameter Estimation In order to identify the model parameters batch the least squares method is chosen. To estimate the parameters of the continuous time model in Eq. (8), the derivative of Tco is needed. As direct differentiation is prone to significant errors due to measurement noise, the problem is alleviated by using state variable filters. In this case, any stable filter with relative degree > 1 and appropriate bandwidth would suit. A simple transfer function is employed to filter the signals in Eq. (8): Gf ( S ) = 1 T-S +1 (9) where 1/t is the bandwidth of filter (t is the time constant). If Eq. (9) is applied to both sides of Eq. (8), filtering preserves the original model structure. To improve the numerical properties of the algorithm, the filtered signals from Eq. (8) are transformed in the next step as for example: ATco_mf(t) = Tco_mf(t) Tco_mf(t-1) , where index "mf refers to the value of the filtered signal. Differentiated data preserve the original model structure while eliminating the problem of offsets in prediction error. After filtering all the signals in Eq. (8) and after transformation of filtered data, an appropriate form for identification procedure is obtained. For the data collected by the SCADA system, an over-determined system of equations, which can be represented in the following form, is obtained: Y = V • 0 + An, where: (10) Y = ATco _mf a) -ATho_ mf a) AT AN)-AT , co _ mf \ ' ho _ mf (N ) -A TCo_mf(1) AFlmf (1) AF2mf (1) 1 -A Tco_mf (N) AF, mf (N) AF2 mf (N) 1 b b2 K (11) An, " Anfi(l) Anf i(N ) The unknown constant K is added in order to take into account the bias in the prediction error. In case of perfect model structure (i.e. free of modelling errors) the estimated K should be zero. Vector An^ is added to encounter the noise effects. The unknown parameters of the system Eq. (10) result as follows [12] and [13]: ê = [VT-t]-1 •V T • Y. (12) 2 PRACTICAL IDENTIFICATION RESULTS Process identification was carried out on a batch of 14000 samples taken during normal operating regime (interval [1, 14000] [min]). Outliers and intervals with non-informative data are carefully eliminated from further processing. The results of the estimation achieved at various bandwidths of SVF's (1/t) are shown in Fig. 3. It can be seen that the results of estimation do not vary significantly with respect to SVF bandwidth changing over two decades. This statement includes the variations of parameter K(r)/al(r), which are small compared to the elements of vector Y (Eq. (11)) and can be 1 ui a a neglected. Identification results largely invariant to pre-filtering imply great freedom in choosing the SVF bandwidth. This is an advantage for every practitioner. The purpose of the designed model is to predict the difference between temperatures of cold and hot part at the end of exchanger. Thus, by taking into account Eq. (8) process output can be represented as follows: yf(t) = Tco_mf(t) - Thomf(t) and the predicted output as: (13) 1 ft A 02 ai yf (t) = 1^ l-l -Tco_mf (t) 1 + ft ^ bi ai Fi mf (t) + 2_ mf (t) + yf, (14) respectively. Similarly to Eq. (11), estimated constant Kyf is added to avoid bias in modelling error. A comparison between process output Eq. (13) and predicted output Eq. (14) on validation set containing 5000 samples (interval [14000, 19000] [min]) is shown in Fig. 4. The model fits the process reasonably well so that the prediction error does not exceed 10% of the dynamic range of the signal Eq. (13). In other words, the underlying mathematical model can be viewed as an additional (virtual) instrument, which obviously brings extra redundancy into the system. 3 DIAGNOSTIC RESULTS Inference about faults is made on the basis of the residual signal defined as follows: ah) bh) i.W 30 40 50 EC 70 80 SO 10G 0 10 ¡0 30 40 9] EC /0 00 90 1tt) 0 10 20 J) (0 SO 80 /0 1l2]30«9]e0 70 «090 Fig. 3. The estimated parameters in dependence of different bandwidths (1/t) of the SVF 10 1.4 1.45 1.5 1.55 1.6 1.65 1.7 1.75 1.8 1.85 1 3 t [min] x104 Fig. 4. Model validation: measured output, predicted output and prediction error; a) y(t); b) prediction error + r (t ) = yf (t ) - yf (t ), (15) where yf (t) and y f (t) are process output Eq. (13) and predicted output Eq. (14), respectively. Based on the residual Eq. (15) the presence of a fault can be detected. This means that if the residual is near zero, there is no evidence that a fault is present. On the contrary, if the residual departs significantly far from zero, the presence of a fault can be inferred. Generally, purely on the basis of the residual Eq. (15) it is not possible to determine the location of the fault. The exception is discussed in the literature [3] and [5]. Indeed, provided the parameters 0 of the process model bear physical meaning, the regressor form of the model Eq. (10) reads: Y = ¥ • 0 + Anf and Y = Y-0, for the true process and the mathematical model. Symbols Y and Y denote true and predicted outputs, while 0 and 8 denote true and estimated process parameters, respectively. By differentiating the two equations the following is obtained: e = Y - Y = Y-0 + An„ - Y-0 = f i = Y-(0-0 ) + An f 1 = TAe + An V i' where e denotes the vector of residuals. Since e results in a straightforward manner from the recorded data and the nominal process model, the estimate of the vector A0 can easily be calculated Any non-zero term in A0 indicates a variation in the process parameters due to the fault. Knowing the physical origin of such a term in the vector 0 the position of the fault can be inferred. An important technical requirement is that the data matrix ¥ is full rank (persistent excitation). However, the idea is not applicable in the present case for three reasons: 1. The temperature signals Tho and Tco are generally too poor from the point of view of information content, which means that with recursive parameter estimation it would not be possible to unambiguously estimate all the parameters in Eq. (14); 2. Signal to noise ratio in certain intervals of process operation can be rather low. 3. Not all the parameters in Eq. (14) reflect the physical properties of the system (the model Eq. (8) is semi-physical); consequently such parameteres would be of little use in fault localisation. Having residual Eq. (15) it is neccessary to draw the decision about alarm. If pure Boolean logic were applied, then frequent, even small deviations in residuals in the vicinity of the threshold value would lead to large variations in the diagnostic results [14] and [15]. In order to smooth the diagnostic output, approximate reasoning techniques seem to be a better alternative. A residual is no longer qualified as zero (0) or non-zero (1) but is associated a degree of being zero, which is a number between 0 and 1. In this way, incremental changes in the residual Eq. (15) result in incremental changes in the diagnostic results. For the sake of detection the belief mass (Fig. 5) is introduced, which can be represented in the following form [14]: bel(r) = 1 1 + 1 -Sd (hfYd (16) Sd h and yd are threshold and the adjustable smoothing parameter respectively, while ( yd = bel(r=h) ). Fig. 6 shows the response of the detector, when two consecutive faults are injected into the system. Firstly, the temperature sensor related to Tho is stuck high. This fault is emulated by fixing the sensor output to 560 °C for the period of 700 minutes. The next fault is emulated drift in the sensor related to Tco. Obviously, the detector is quite sensitive to the occurrence of both faults. Fig. 5. Belief assignment function Since belief in a non-zero residual also significantly increases, the detector provides clear evidence of the presence of both faults (c.f. Fig. 6). Though obvious, the first fault remains undetected by the classical alarm system. Indeed, the alarm is set at 600 °C as lower values can be easily reached during normal operating conditions (not presented in Fig. 6). On the contrary, the model-based detector reacts quite quickly and accurately. This detector reacts relatively promptly in the case of a drift type of fault as well. For example, it reports the presence of fault at approx. t = 4100 min with 70% belief. The classical alarm system would react, but much later (at approx. 5200 min). To sum up, by running the suggested detector on the recorded data sets it has been possible to reveal several temporary erroneous sensor readings during operation of the real plant. These were completely overlooked by the existing alarm system (missed alarms). Fig. 6. Detection of sticking high of sensor Tho and detection of drift in sensor Tco 4 CONCLUSIONS A two-stage modelling procedure is presented and applied to a heat exchanger in an incineration unit. It relies on blending prior knowledge with the information contained in data records. Prior knowledge serves to derive the model structure, while data are used to identify unknown model parameters. The paper makes a significant contribution in two ways. Firstly, a set of prior (heuristic) assumptions provides a means for determining the model structure. Secondly, it is shown that the results of estimation, obtained using the least squares method, are largely invariant of the bandwidth of the SVF. This greatly reduces the overall modelling effort and is an advantage for every practitioner. The suggested detection algorithm is very simple for execution in real time. The diagnostic results show that the module is able to accurately indicate the presence of incipient faults and thus facilitate timely on-condition maintenance. Traditional alarm systems based on thresholding are insensitive to such faults, i.e. do not react until large deviations and failures occur. 5 REFERENCES [1] Bogaerts, P., Castillo, J., Hanus, R. (1997). Analytical solution of the non uniform heat exchange in a reactor cooling coil with constant fluid flow. Mathematics and Computers in Simulation, vol. 43, no. 2, p. 101-113. [2] Weyer, E., Hangos, K.M. (1997). Grey box fault detection in heat exchanger networks. Prepr. IFAC Symp. Safeprocess, vol. 1, p. 187-192. [3] Gertler, J. (1998). Fault detection and diagnosis in engineering systems. Marcel Dekker, New York. [4] Krishnan, R.A., Pappa, N. (2005). Real time fault diagnosis for a heat exchanger - a model based approach. INDICON, Annual IEEE, p. 78-82. [5] Klancar, G., Juricic, B., Karba, R. (2002). Robust fault detection based on compensation of the modelling error. International Journal of Systems Science, vol. 33, no. 2, p. 97-105. [6] Dragan, D., Juricic, B., Strmcnik, S. (2000). Modelling for condition monitoring application to a heat transfer process. Process control qual., vol. 11, p. 419-431. [7] Lalot, S., Mercere, G. (2008). Detection of fouling in a heat exchanger using a recursive subspace identification algorithm. Proceedings of the 19th International Symposium on Transport Phenomena, Paper #37. [8] Jonsson, G.R., Lalot, S., Palsson, O.P., Desmet, B. (2007). Use of extended Kalman filtering in detecting fouling in heat exchangers. International Journal of Heat and Mass Transfer, vol. 50, no. 13-14, p. 2643-2655. [9] Ingimundardóttir, H., Lalot, S. (2009). Detection of fouling in a cross-flow heat exchanger using wavelets. Proceedings of International Conference on Heat Exchanger Fouling and Cleaning, p. 484-491. [10] Ramasamy, M., Shahid, A., Zabiri, H. (2008). Drift analysis on neural network model of heat exchanger fouling. Journal of Engineering Science and Technology, vol. 3, no. 1, p. 40-47. [11] Juan, C., Martínez, T., Morales-Menendez, R., Garza-Castañón, L.E. (2010). Fault diagnosis in a heat exchanger using process history based-methods, computer aided chemical engineering. 20th European Symposium on Computer Aided Process Engineering, Pierucci, S., Buzzi, G. (eds.), Elsevier B.V., vol. 28, p. 169-174. [12] Garnier, H., Wang, L. (2008). Identification of continuous-time models from sampled data. Springer-Verlag, London. [13] Ljung, L. (1999). System identification: theory for the user. 2nd ed., Prentice Hall, New York. [14] Rakar, A., Juricic, B., Balle, P. (1999). Transferable belief model in fault diagnosis. Engineering Applications of Artificial Intelligence, vol. 12, p. 555-567. [15] Rakar, A., Juricic, B. (2002). Diagnostic reasoning under conflicting data: the application of the transferable belief model. Journal of Process Control, vol. 12, p. 55-67.