Scientific and professional papers published in Informacije MIDEM are indexed in the COBISS and INSPEC databases. The journal is also indexed by ISI for SciSearch®, Research Alert® and Materials Science Citation Index™.

PROFESSIONAL SCIENTIFIC PAPERS

A. Dziedzic, A.A. Snarskii: Percolation Theory and its Application in Materials Science and Microelectronics; Part I: Theoretical Description
D. Grgec, Ž. Butkovič: Shifter Designs for ASICs
D. Strle: Capacitor-area and Power-consumption Optimization of High Order Δ-Σ Modulators
B. Peršič, N. Basarič: Frequency Warping and Chaotic Behaviour Generated by SPICE
S. Plevel, T. Javornik, I. Ozimek, R. Trobec, G. Kandus: Viterbi Algorithm for DSP Processors
B. Kotnik, T. Rotovnik, Z. Kačič, B. Horvat, B. Horvat, I. Kramberger: The Design of Mobile Multimodal Communication Device - Personal Navigator
D. Čeh-Ambruš, I. Kramberger, Z. Kačič: Development of Expansion Card with DSP Processor for PCI bus
R. Osredkar, B. Gspan: Študija močno dopiranih plasti za BiCMOS kolektorske vložke (A Study of Heavily Doped Layers for BiCMOS Collector Plugs)
PERCOLATION THEORY AND ITS APPLICATION IN MATERIALS SCIENCE AND MICROELECTRONICS (Part I - Theory)

A. Dziedzic 1), A.A. Snarskii 2)
1) Institute of Microsystem Technology, Wroclaw University of Technology, Poland
2) Department of General and Theoretical Physics, National Technical University of Ukraine, Kiev, Ukraine

Keywords: percolation, percolation system, percolation threshold, critical exponent, effective conductivity, 1/f noise, effective noise intensity, weak nonlinearity, strong nonlinearity, voltage susceptibility, third harmonic, normalised amplitude of third harmonic, temperature dependence of resistance, percolation-like system, exponentially wide spectrum of resistances, continuum percolation, Swiss-cheese model

Abstract: Percolation theory makes it possible to characterise (calculate) the effective properties of random inhomogeneous two-phase systems in which the concentrations of both phases are comparable (near the percolation threshold) but their properties differ significantly. This paper presents the critical behaviour of various kinetic phenomena (conductivity, 1/f noise, weak and strong nonlinearity, third harmonic generation, and temperature dependence of resistivity). These quantities can be described analytically using, for example, the hierarchical model of percolation structure. The characteristic critical indices depend on the critical exponents of the conductivity and of the correlation length. The possible application of the percolation description to systems with an exponentially broad or disordered continuum spectrum of properties is also presented. Non-electrical effective properties can be analysed by the methods of percolation theory as well, because of the analogy between the quasistatic electrical field and other physical fields.
Percolation Theory and its Application in Materials Science and Microelectronics (Part I - Theory)

Keywords (translated from Slovenian): physics, chemistry, percolation, percolation threshold, percolation system, percolation theory, materials science, microelectronics, critical exponents, effective electrical conductivity, 1/f noise, effective noise intensity, weak nonlinearity, strong nonlinearity, voltage susceptibility, third harmonics, normalised amplitudes of third harmonics, temperature dependence of electrical resistance, percolation-like system, exponentially wide spectrum of electrical resistances, continuum percolation, Swiss-cheese model

Abstract (translated from Slovenian): Percolation theory allows the calculation of the properties of random two-phase systems with comparable concentrations of both phases (near the percolation threshold), where the two phases have different properties. In this paper we present the behaviour of various kinetic parameters, such as conductivity, 1/f noise, nonlinearity, third harmonic generation and the temperature dependence of resistance. These quantities can be expressed in analytical form using the hierarchical model of percolation structure. We also present the possible use of percolation theory for describing systems with an exponentially broad or disordered continuous spectrum of properties. Non-electrical properties can be analysed with the methods of percolation theory because of the analogy between the quasistatic electric field and other physical fields.

Introduction

The percolation problem was first formulated almost 45 years ago by Broadbent and Hammersley /1/. Since then the ideas and methods of percolation theory have been applied in many areas of physics and chemistry as well as in other basic and applied sciences. Original results based on percolation theory can be found in numerous papers, so compiling a complete bibliography on this topic is practically impossible.
However, a beginner in this area can find some useful books and review papers, for example /2-9/. The so-called hierarchical model of percolation structure (HMPS) has appeared in recent years. This model makes it possible to describe analytically various properties of macroscopically disordered media near the percolation threshold, for example resistivity (also the Hall effect), 1/f noise, electrical breakdown, nonlinear properties of composites and many others. This review is devoted to the above-mentioned phenomena. Note that we discuss experimental, analytical and numerical results obtained very recently, which means that they have not been summarised in the books and papers mentioned above.

1. Effective conductivity near $p_c$

Experimental and numerical investigations have shown that the effective conductivity $\sigma_e$ is an analogue of the order parameter in the theory of phase transitions, where the temperature $T$ is replaced by the concentration $p$ of the well-conducting phase and the critical temperature $T_c$ is replaced by the percolation threshold $p_c$. Based on this analogy, Efros and Shklovskii /10,11/ used the scaling formula for $\sigma_e$

$\sigma_e(\tau, h) = \sigma_1 h^{s} F(\tau / h^{u})$   (1.1)

where $h = \sigma_2/\sigma_1$, $\tau$ is the distance from the percolation threshold, $\sigma_2 \ll \sigma_1$, $\sigma_i$ is the local conductivity of the $i$-th phase, and $F(z)$ is a scaling function with the limits

$F(z \to +\infty) \propto z^{t}$,   $F(z \to -\infty) \propto |z|^{-q}$,   $F(z \to 0) \to 1$   (1.2)

where $t$ and $q$ are the critical conductivity exponents; only the leading terms of the expansion of the scaling function are given in (1.2).

Informacije MIDEM 31(2001)1, str. 1-9. A. Dziedzic, A.A. Snarskii: Percolation Theory and its Application in Materials Science and Microelectronics...
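As a consistency check on (1.1) and (1.2), the exponents $s$ and $u$ can be fixed by a short matching argument. The sketch below assumes the limiting forms $F(z) \propto z^{t}$ for $z \to +\infty$ and $F(z) \propto |z|^{-q}$ for $z \to -\infty$, and requires that for $h \to 0$ the conductivity does not depend on $\sigma_2$ above the threshold and is proportional to $\sigma_2$ below it. This is the usual scaling reasoning, not a derivation taken from the paper.

```latex
\begin{align*}
% above the threshold (z -> +infinity): no dependence on h is allowed
\sigma_e &\propto \sigma_1 h^{s}\left(\frac{\tau}{h^{u}}\right)^{t}
          = \sigma_1\, h^{\,s-ut}\, \tau^{t}
          &&\Rightarrow\quad s = ut \\
% below the threshold (z -> -infinity): sigma_e must scale as sigma_2 = sigma_1 h
\sigma_e &\propto \sigma_1 h^{s}\left(\frac{|\tau|}{h^{u}}\right)^{-q}
          = \sigma_1\, h^{\,s+uq}\, |\tau|^{-q}
          &&\Rightarrow\quad s + uq = 1 \\
% solving the two conditions
u &= \frac{1}{t+q}, \qquad s = \frac{t}{t+q}
\end{align*}
```

With these values the argument of $F$ is $\tau / h^{1/(t+q)}$, so the crossover between the two limits takes place at $|\tau|$ of the order of $h^{1/(t+q)}$.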
According to (1.1) and (1.2) there are three ranges of universal behaviour of the effective conductivity, each described by a separate equation: above ($p > p_c$), below ($p < p_c$) and in the vicinity ($p \approx p_c$) of the percolation threshold

$\sigma_e = \sigma_1 \tau^{t} (A_0 + A_1 h \tau^{-(t+q)} + \dots)$,   $p > p_c$, $\tau \gg \Delta$   (1.3a)
$\sigma_e = \sigma_2 |\tau|^{-q} (B_0 + B_1 h |\tau|^{-(t+q)} + \dots)$,   $p < p_c$, $|\tau| \gg \Delta$   (1.3b)
$\sigma_e = \sigma_1^{q/(t+q)} \sigma_2^{t/(t+q)} (D_0 + D_1 h^{-1/(t+q)} \tau + \dots)$,   $|\tau| \ll \Delta$   (1.3c)

As can be seen, these equations contain not only the leading terms but also smaller ones. $A_i$, $B_i$ and $D_i$ denote constants whose absolute values are of the order of 1. An important concept is the smearing region $\Delta = (\sigma_2/\sigma_1)^{1/(t+q)}$: this is the value of $|\tau|$ at which the well and the poorly conducting phase contribute equally to the effective conductivity, $\sigma_1 \Delta^{t} = \sigma_2 \Delta^{-q}$ (the qualitative behaviour of $\sigma_e$ is shown in Fig. 1).

Fig. 1. Normalised conductivity versus good conductor concentration in two-phase percolation system

Many different models, generally based on the percolation idea, have been used to explain the shape of $\sigma_e$. In the first one (which happens to be the simplest) it is assumed that for $p > p_c$ it is enough to consider the number of singly connected bonds (SCB) on the scale of the correlation length $\xi \propto |\tau|^{-\nu}$ /12,13/. This is the so-called "bridge" with resistance $R_1$, consisting of series-connected unit resistances of the first phase $r_1 = 1/(\sigma_1 a_0^{d-2})$, where $a_0$ is the minimal dimension in the system (for example the mean size of composite grains or the bond length in the bond problem) and $d = 2, 3$ (Fig. 2, left). For the analogous model below the percolation threshold ($p < p_c$) [...] bridge ($p > p_c$) and interlayer ($p < p_c$) [...]

Much more reliable results can be obtained with the so-called HMPS, whose idea has been presented in /18-20/. In this approach the values of $\alpha_1$ and $\alpha_2$ are calculated from $t$ and $q$ (which are treated as known) both below and above the percolation threshold. It is assumed that conduction takes place in both the well-conducting and the poorly conducting phase (Fig. 3). When $\sigma_2/\sigma_1 \to 0$ the HMPS reduces to the standard model discussed above.

Fig. 3. [panels: $p > p_c$, $p < p_c$; elements $R_1$, $R_2$]

On approaching the percolation threshold the bridge resistance $R_1$ increases whereas the interlayer resistance $R_2$ decreases. Both resistances are equal in the smearing region, but where Eq. (1.3a) or (1.3b) holds they differ strongly. This means that the dielectric interlayer gives only a small contribution to the effective conductivity $\sigma_e$ above $p_c$, whereas the bridge does so below $p_c$. It might therefore seem that taking such small contributions into account, especially outside the smearing region, is needless because it does not lead to important properties of the percolation system. However, the reader will find below some examples of how elements that affect $\sigma_e$ only insignificantly can determine other properties of percolation systems.

2. 1/f noise

1/f noise is a universal phenomenon, characteristic of many physical (and not only physical) processes. Its amplitude is especially important for composites /21,22/. The magnitude of 1/f noise is usually characterised by the effective noise intensity

$C_e = \Omega S$   (2.1)

where $\Omega$ is the volume of the analysed sample and $S$ is the relative power spectral density

$S = \frac{S_R}{R^2} = \frac{\{\delta R\,\delta R\}}{R^2}$   (2.2)

Here $S_R = \{\delta R\,\delta R\}$ is the power spectral density and $\{\dots\}$ denotes the Fourier transform of the time correlation function. Since the time fluctuations of the resistance $\delta R$ are spatially uncorrelated, the effective noise intensity can be expressed in terms of the Joule power dissipated in the inhomogeneous medium

$C_e = \frac{\langle C\,(\mathbf{E}\cdot\mathbf{j})^2 \rangle}{\langle \mathbf{E}\cdot\mathbf{j} \rangle^2}$   (2.3)

where $\langle\dots\rangle$ denotes volume averages.
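The Joule-power weighting behind Eq. (2.3) is easiest to see for a network of discrete resistors. The following is a minimal sketch, assuming uncorrelated fluctuations of the individual elements; the function name `relative_noise` and its arguments are illustrative and do not come from the paper. Each element contributes its own relative noise weighted by the square of its share of the total dissipated power.

```python
def relative_noise(resistances, currents, noise):
    """Relative spectral density S = S_R / R^2 of a resistor network.

    resistances[i] -- resistance r_i of element i
    currents[i]    -- current i_i flowing through element i
    noise[i]       -- relative noise s_i = {dr_i dr_i} / r_i^2 of element i

    Fluctuations of different elements are assumed to be uncorrelated, so
    S = sum_i s_i * (P_i / P)^2 with P_i = i_i^2 * r_i the Joule power
    dissipated in element i and P the total power.
    """
    powers = [i * i * r for r, i in zip(resistances, currents)]
    total = sum(powers)
    return sum(s * (p / total) ** 2 for s, p in zip(noise, powers))


if __name__ == "__main__":
    # Two equal resistors in series: the relative noise is halved.
    print(relative_noise([1.0, 1.0], [1.0, 1.0], [1.0, 1.0]))
    # A "hot spot": the element dissipating most of the power dominates.
    print(relative_noise([1.0, 9.0], [1.0, 1.0], [1.0, 1.0]))
```

The second call illustrates why elements that carry a large share of the dissipated power, such as a bridge or an interlayer, can dominate the noise even when the network contains many other resistors.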
Investigations of 1/f noise in percolation systems began with the work of Rammal /23/ (the reader interested in this topic can find a more detailed bibliography of papers dealing with 1/f noise in /24/). For the case of finite conductivity of both phases ($h = \sigma_2/\sigma_1 \neq 0$), $C_e$ near the percolation threshold can be written as

$C_e(\tau > 0, \tau \gg \Delta) = C_1 \tau^{-k} + C_2 h^{2} \tau^{-w}$   (2.4a)
$C_e(|\tau| \ll \Delta) = C_1 h^{-k/(t+q)} + C_2 h^{-k'/(t+q)}$   (2.4b)
$C_e(\tau < 0, |\tau| \gg \Delta) = C_2 |\tau|^{-k'} + C_1 h^{2} |\tau|^{-w'}$   (2.4c)

where $C_1$ and $C_2$ are the noise intensities of the first and second phase, and the values of $k$ and $k'$, i.e. the critical exponents of 1/f noise, are given in Table 1.

Table 1. Numerical estimates of the noise critical exponents $k$ and $k'$ (/23/ and references therein)

Critical index    Numerical simulations    Rigorous bounds
k                 1.47-1.58                1.53-1.60
k'                0.55-0.74                0.38-1.02

According to the HMPS, the critical exponents $w$ and $w'$ can be expressed through $k$ and $k'$ in a simple manner as

$w = k' + 2(t + q)$,   $w' = k + 2(t + q)$   (2.5)

For example, it follows directly from (2.4) that above the percolation threshold but inside the smearing region ($|\tau| < \Delta$) the second phase can give the larger contribution to the total 1/f noise of the composite when $C_2$ is higher than $C_1$ /18/.

3. Weak nonlinearity

Deviation from the linear Ohm's law is possible at large current densities. In the case of the so-called weak nonlinearity (or weak cubic nonlinearity) the dependence between the current density and the electric field is given by

$\mathbf{j} = \sigma(\mathbf{r})\mathbf{E} + \chi(\mathbf{r})|\mathbf{E}|^2\mathbf{E}$   (3.1)

where $\chi$ is the local nonlinear susceptibility. Of course (3.1) is a power-series description in the field in which the second term is significantly smaller than the first one.
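Returning to the noise exponents of Section 2, the relation (2.5) can be checked numerically: with $w$ and $w'$ chosen this way, the leading terms of (2.4) join continuously at the edge of the smearing region $|\tau| = \Delta = h^{1/(t+q)}$. The sketch below sets all numerical constants to 1 and assumes the $h^{2}$ prefactors in the second terms of (2.4a) and (2.4c). The exponent values in the usage lines are illustrative only ($k$ and $k'$ inside the ranges of Table 1, $t$ and $q$ picked as plausible three-dimensional values), and the function names are hypothetical.

```python
def smearing_width(h, t, q):
    """Width of the smearing region, Delta = h**(1/(t+q))."""
    return h ** (1.0 / (t + q))


def noise_exponents(k, k_prime, t, q):
    """Exponents w and w' from Eq. (2.5)."""
    return k_prime + 2.0 * (t + q), k + 2.0 * (t + q)


def effective_noise(tau, h, c1, c2, t, q, k, k_prime):
    """Leading terms of Eq. (2.4), numerical constants set to 1."""
    w, w_prime = noise_exponents(k, k_prime, t, q)
    delta = smearing_width(h, t, q)
    if abs(tau) <= delta:                       # smearing region, (2.4b)
        return c1 * h ** (-k / (t + q)) + c2 * h ** (-k_prime / (t + q))
    if tau > 0:                                 # above threshold, (2.4a)
        return c1 * tau ** (-k) + c2 * h ** 2 * tau ** (-w)
    # below threshold, (2.4c)
    return c2 * abs(tau) ** (-k_prime) + c1 * h ** 2 * abs(tau) ** (-w_prime)


if __name__ == "__main__":
    params = dict(h=1e-4, c1=1.0, c2=5.0, t=2.0, q=0.73, k=1.5, k_prime=0.6)
    delta = smearing_width(params["h"], params["t"], params["q"])
    for tau in (-10 * delta, -delta, 0.0, delta, 10 * delta):
        print(tau, effective_noise(tau, **params))
```

Evaluating the function just inside and just outside $|\tau| = \Delta$ gives matching values on both sides, which is a quick way to confirm that the exponents in the three ranges are mutually consistent.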
The effective properties are used to describe a weakly nonlinear system in the same manner as for a linear system, that is

$\langle \mathbf{j} \rangle = \sigma_e \langle \mathbf{E} \rangle + \chi_e |\langle \mathbf{E} \rangle|^2 \langle \mathbf{E} \rangle$   (3.2)

As has been shown in /25,26/, there is an analogy between the behaviour of the effective noise intensity $C_e$ and the effective nonlinear susceptibility. The problem becomes mathematically equivalent to the estimation of the effective 1/f noise intensity, $\chi_e \sim C_e \sigma_e^2$, for a system with the local noise intensity $C(r) = \chi(r)/\sigma^2(r)$. Thus the critical behaviour of $\chi_e$ follows immediately from the equations describing the effective noise intensity and the effective conductivity

$\chi_e(\tau > 0) = C_e(\tau > 0)\,\sigma_e^2(\tau > 0) = \chi_1 \tau^{2t-k} + \chi_2 \tau^{-(2q+k')}$
$\chi_e(\tau < 0) = C_e(\tau < 0)\,\sigma_e^2(\tau < 0) = \chi_2 |\tau|^{-(2q+k')} + \chi_1 h^{4} |\tau|^{-(w'+2q)}$
$\chi_e(|\tau| \le \Delta) = C_e(|\tau| \le \Delta)\,\sigma_e^2(|\tau| \le \Delta) = \chi_1 h^{(2t-k)/(t+q)} + \chi_2 h^{-(2q+k')/(t+q)}$   (3.3)

An important question in the analysis of the effective properties of nonlinear media concerns the range of applicability of Eq. (3.2). It is most often assumed /27-30/ that formula (3.2) is valid for

$\langle E \rangle \ll E_c$ and $\langle j \rangle \ll j_c$   (3.4)

where the so-called critical electric field [...]

[...] (4.1)

can be expressed with the aid of the 1/f noise amplitude $C_e$. The normalised amplitude of the third harmonic $B_{3\omega}$ agrees with $\rho_e^2 C_e$ (up to inessential numerical factors) when the factor $C_i$ in the formula for $C_e$ is replaced by the temperature coefficient of resistivity of the $i$-th phase, $\beta_i$. The generalisation of the expression for $B_{3\omega}$ given in /32-35/ to the case $h = \sigma_2/\sigma_1 \neq 0$ is presented in /36/. It gives $B_{3\omega}$ separately for $\tau > 0$, $|\tau| < \Delta$ and $\tau < 0$ as a sum of contributions proportional to $\beta_1$ and $\beta_2$

[...] (4.3)

where the dependence on $\tau$ in (4.3) enters through the effective resistivity $\rho_e = 1/\sigma_e$, which of course is different in the various regions.

5. Strong nonlinearity

In contrast to the weakly nonlinear case, the current-voltage characteristics of a strongly nonlinear medium are not linear even for very weak fields. The medium with the following current-voltage relation

$\mathbf{j} = \chi |\mathbf{E}|^{\beta-1} \mathbf{E}$   (5.1)

[...] (5.2)

then three field regions can be distinguished for appropriate $\beta$ and $\gamma$ (Fig. 5).

Fig. 5. Current-voltage characteristic of first (1) and second (2) phase in strongly nonlinear system. The percolation treatment is possible in region I ("strongly nonuniform" medium)

[...] (5.4)

The size of the smearing region $\Delta$ in a strongly nonlinear system is field-dependent

[...] (5.5)

Moreover, note that the current-voltage characteristic of a system composed of strongly nonlinear phases becomes linear for specific values of $\beta$ and $\gamma$ [...].

6. Temperature dependence of resistance

The temperature dependence of resistivity (resistance), usually characterised by the differential temperature coefficient of resistivity (TCρ) or resistance (TCR), is one of the most important features of composite materials and of devices based on them. The effective temperature coefficient of resistivity (resistance), i.e. $TC\rho_e = \frac{1}{\rho_e}\frac{d\rho_e}{dT}$ or $TCR_e = \frac{1}{R_e}\frac{dR_e}{dT}$ ($\rho_e = 1/\sigma_e$), for a percolation system formed by a two-phase medium with a finite conductivity ratio ($h = \rho_1/\rho_2 \neq 0$) has been found in /42/. As for the other properties considered above, analytical formulas have been worked out for three concentration subranges.

$TC\rho_e = TC\rho_1 + (\rho_1/\rho_2)\,\tau^{-(t+q)}\,TC\rho_2$,   $p > p_c$   (6.1)
$TC\rho_e = A \cdot TC\rho_1 + B \cdot TC\rho_2 - D\,(TC\rho_1 - TC\rho_2)\,(\rho_1/\rho_2)^{-1/(t+q)}\,\tau$,   $|\tau| \le \Delta$   (6.2)
$TC\rho_e = TC\rho_2 + (\rho_1/\rho_2)\,|\tau|^{-(t+q)}\,TC\rho_1$,   $p < p_c$   (6.3)

In the above equations $TC\rho_i = \frac{1}{\rho_i}\frac{d\rho_i}{dT}$ ($i = 1, 2$) is the temperature coefficient of resistivity of the $i$-th phase and $A$, $B$, $D$ are constants (equal to about 1).

7. Continuum problems

In all cases analysed so far it has been assumed that the problem of current distribution in the system can be reduced to a model in which the random distribution of the resistances $r_1$ and $r_2$ of the first and second phase is given as

$f(r) = p\,\delta(r - r_1) + (1 - p)\,\delta(r - r_2)$   (7.1)

($p$ is the concentration of the first phase and $\delta(\dots)$ is the Dirac function). The case in which the distribution function can be written as

$f(r) = p(1-\alpha)\,r^{-\alpha}$ [...]   (7.2)

has been examined in /43/. It has been shown that the critical index ceases to be universal (it is said to pass to the second universality class)

$t = t_0 + \alpha/(1-\alpha)$   (7.3)

where $t_0$ is the standard critical conductivity index above the percolation threshold. No less interesting is the case in which the spectrum of resistances is continuous and exponentially broad /44/

$r = r_0 e^{-\lambda x}$,   $\lambda \gg 1$   (7.4)

where $x \in (0, 1)$ is a random variable with a smooth probability distribution $D(x)$. The problem with a continuous spectrum of resistances is not a straightforward percolation problem: it does not exhibit a percolation threshold at which one of the two phases forms an infinite percolating cluster, because the phases themselves do not exist. However, there is a method that reduces the problem of exponentially distributed resistances to the standard two-phase percolation problem /45-47/ and makes it possible to determine the main regularity of the system, namely the critical index of the effective percolation conductance. The general assumption of this method is that all resistances with a random variable between $x$ and 1 are considered as one phase. In a crude approximation the effective conductivity of the network is determined by the largest resistance at which this phase becomes infinite.
This corresponds to the percolation threshold in classical percolation, i.e. from

$\int_{x_c}^{1} D(x)\,dx = p_c$   (7.5)

it is possible to calculate $x_c$ and then to find the largest resistance, which determines (up to the pre-exponential factor in $\sigma_e$) the resistance of the whole system,

$r_c = r_0 e^{-\lambda x_c}$   (7.6)

The above problem can also be considered analogously but starting from the opposite side. Let us take a system with an exponentially broad spectrum of resistances and remember the position of each resistance in the network. We then replace all of them by "zero-resistivity" connections and put the resistances back into their previous positions one by one, in order, starting from the smallest one. This process is continued until a resistance appears that interrupts the current flow through the "zero-resistivity" phase. This critical resistance determines the resistance of the system (up to the pre-exponential factor). The details of this treatment are presented in /48-51/. It might seem that the two methods of searching for the critical resistance give opposing results /50,51/. However, this contradiction is removed by the assumption that the system is in the smearing region at the moment when the critical resistance is included in the network.
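The first of the two procedures above can be tried out on a small lattice. The sketch below is an illustration under stated assumptions, not the method of /45-51/ itself: it uses a square lattice of bonds, a uniform distribution $D(x) = 1$, resistances $r = r_0 e^{-\lambda x}$, and a union-find structure to detect the moment when the left and right edges become connected. Bonds are switched on in order of increasing resistance (decreasing $x$), and the $x$ of the bond that completes the connection is returned as the estimate of $x_c$. All function names are hypothetical.

```python
import math
import random


def find(parent, i):
    """Union-find root lookup with path halving."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent, a, b):
    """Merge the clusters containing nodes a and b."""
    parent[find(parent, a)] = find(parent, b)


def critical_x(n, seed=0):
    """Estimate x_c on an n x n square lattice with uniform x in (0, 1)."""
    rng = random.Random(seed)
    left, right = n * n, n * n + 1          # two virtual electrode nodes
    parent = list(range(n * n + 2))
    for row in range(n):
        union(parent, row * n, left)            # first column -> left electrode
        union(parent, row * n + n - 1, right)   # last column -> right electrode
    bonds = []
    for row in range(n):
        for col in range(n):
            site = row * n + col
            if col + 1 < n:
                bonds.append((rng.random(), site, site + 1))
            if row + 1 < n:
                bonds.append((rng.random(), site, site + n))
    bonds.sort(reverse=True)                # largest x = smallest resistance first
    for x, a, b in bonds:
        union(parent, a, b)
        if find(parent, left) == find(parent, right):
            return x                        # bond that completes the spanning path
    raise RuntimeError("electrodes never became connected")


def critical_resistance(r0, lam, x_c):
    """Largest resistance on the spanning path, r_c = r0 * exp(-lam * x_c)."""
    return r0 * math.exp(-lam * x_c)


if __name__ == "__main__":
    x_c = critical_x(64, seed=1)
    print("x_c estimate:", x_c)
    print("r_c for r0 = 1, lambda = 20:", critical_resistance(1.0, 20.0, x_c))
```

For bond percolation on the square lattice the threshold is $p_c = 1/2$, so with $D(x) = 1$ Eq. (7.5) gives $x_c = 1 - p_c = 1/2$. The finite-lattice estimate scatters around this value and approaches it as the lattice size grows.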
The generalisation of the two-phase percolation model in the smearing region /52/ to systems with an exponentially broad resistance spectrum leads to the following expression for the effective conductivity (for simplicity it has been assumed that $D(x) = 1$)

$\sigma_e = \frac{A}{a_0^{d-2}\, r(x_c)}\, \lambda^{-y}$   (7.7)

where $a_0$ is the minimal characteristic dimension in the system (of the order of the lattice cell), $A$ is a quantity with a weak dependence on $\lambda$ ($A \sim (\ln \lambda)^{\alpha_1+\alpha_2+\nu(d-2)}$), $d$ is the dimensionality of the problem, and the critical exponent $y$ is equal to

$y = \frac{\alpha_1 - \alpha_2 + 2\nu(d-2)}{2}$   (7.8)

For the widely accepted values $\alpha_1 = \zeta_R = 1$ and $\alpha_2 = \zeta_G = 1$, Eq. (7.8) reduces to

$y = \nu(d - 2)$   (7.9)

This result was shown for the first time in /48/. The choice $\alpha_1 = \zeta_R = t - \nu(d-2)$ and $\alpha_2 = \zeta_G = q + \nu(d-2)$ gives very similar numerical results (for more details see /53,54/). The model described in /51/, which uses a network with an exponential distribution of properties, makes it possible to find the behaviour of many other physical quantities. Moreover, even if the resistance distribution is not exponential but of power type, $r = r_0 x^{-\lambda}$, and the formula for the effective conductivity is somewhat different,

$\sigma_e \propto x_c^{\lambda}\, \lambda^{-y}$   (7.10)

the critical index $y$ in Eq. (7.10) is still given by Eq. (7.9).

There are no basic difficulties in characterising quantities more complex than the effective conductivity with the percolation-like model. However, a supplementary assumption about the local properties of these quantities is necessary. For example, the calculation of 1/f noise in exponentially distributed systems requires a generalisation of the Hooge hypothesis /21/, according to which $C = \alpha/\sigma$ ($\alpha$ is the so-called Hooge parameter). It is logical to assume that for the considered system with local conductivity $\sigma(x) \propto e^{\lambda x}$

$C(x) = \alpha/\sigma(x)$   (7.11)

This is in agreement with the empirical Hooge law: a system (device) with higher resistivity (more precisely, with a lower concentration of charge carriers) is characterised by a larger noise intensity. The effective noise intensity of a system with an exponentially wide spectrum of resistances has the form

$C_e \sim \lambda^{m} e^{-\lambda x_c}$   (7.12)

where the exponent $m$ is given as

$m = y + 2\nu$   (7.13)

As mentioned earlier, the exponent $y$ is related to the correlation length exponent $\nu$ by Eq. (7.9). Therefore

$m = d\nu$   (7.14)

The above calculations have been generalised in /24,54,55/ to the situation in which

$C(x) = \alpha/\sigma^{\theta}(x)$   (7.15)

(for $\theta = 1$ we have the standard Hooge formula (7.11)). A very interesting feature of the exponent $m$ has been observed for $0 < \theta < 2$: $m$ is independent of the parameter $\theta$, that is

[...]   $m_\theta = \nu d$   (7.16)

Note that even if the phenomenological Hooge formula holds locally, i.e. $C(x)\sigma(x) = \mathrm{const}$, it is violated for the whole system, that is $C_e \sigma_e \neq \mathrm{const}$.

Besides the effective conductivity and noise intensity, the temperature behaviour /42/ and third harmonic generation /36/ have also been analysed in systems with an exponentially broad spectrum of resistances. It has been shown that the normalised amplitude of the third harmonic $B_{3\omega}$ for such systems is related very simply to the effective conductivity

[...] (7.17)

The next model with a disordered continuum spectrum of resistances was presented for the first time in /56,57/. This is the so-called Swiss-cheese model, i.e. a disordered continuum system where spherical holes are randomly placed in a uniform transport medium. The distance between spherical voids can be arbitrarily small. This means that the so-called microgeometry, in other words the current distribution in the narrow necks between the spherical holes, becomes very important.
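Returning to the exponents of the exponentially broad spectrum, the arithmetic linking Eqs. (7.8), (7.9), (7.13) and (7.14) is easy to verify. The sketch below assumes that Eq. (7.8) reads $y = (\alpha_1 - \alpha_2 + 2\nu(d-2))/2$ and that $m = y + 2\nu$. The function names are hypothetical, and the numerical values of $\nu$, $t$ and $q$ in the usage lines are illustrative choices (with the exact two-dimensional value $\nu = 4/3$), not data from the paper.

```python
def conductivity_exponent_y(alpha1, alpha2, nu, d):
    """Exponent y of the pre-exponential factor, assumed form of Eq. (7.8)."""
    return (alpha1 - alpha2 + 2.0 * nu * (d - 2)) / 2.0


def noise_exponent_m(y, nu):
    """Noise exponent m = y + 2*nu, Eq. (7.13)."""
    return y + 2.0 * nu


if __name__ == "__main__":
    for d, nu in ((2, 4.0 / 3.0), (3, 0.88)):
        # Choice alpha1 = alpha2 = 1: y reduces to nu*(d-2) and m to d*nu.
        y = conductivity_exponent_y(1.0, 1.0, nu, d)
        print(d, y, noise_exponent_m(y, nu), d * nu)

    # HMPS-type choice alpha1 = t - nu*(d-2), alpha2 = q + nu*(d-2):
    # the nu-dependent terms cancel and y becomes (t - q)/2.
    t, q, nu, d = 2.0, 0.73, 0.88, 3
    print(conductivity_exponent_y(t - nu * (d - 2), q + nu * (d - 2), nu, d))
```

In two dimensions the first choice gives $y = 0$, so the pre-exponential power of $\lambda$ in the conductivity disappears, while the noise exponent remains $m = 2\nu$.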
Such a model has been analysed using the percolation approach, and it has been proved in the already mentioned papers /56,57/ that the critical conductivity exponents for the Swiss-cheese model and the corresponding indices for a discrete lattice differ in value and depend on the details of the microgeometry (the shape of the inclusions). For example, when $p > p_c$ and $\sigma_2 = 0$ then

$\bar{t} = t + y$   (7.18)

where the exponent $y$ depends on the kind of voids. For the random-void model with $\sigma_2 = 0$ we have $y = 0$ for a 2D system and $y = 1/2$ for a 3D medium. Moreover, there are other classes of continuum models, namely the potential model (where the space between voids is bounded not by a spherical but by a hyperboloidal surface /57/), the blue-cheese model /58/ and so on. Microstructure in fact affects not only the effective conductivity but also other properties such as dielectric ones (e.g. for $d = 3$ the critical exponent of the effective dielectric constant in the Swiss-cheese model differs from the standard one by 5/2 /56/), electrical and mechanical breakdown /58/ and the like. Microgeometry also influences the behaviour of 1/f noise near the percolation threshold /59/.

Last but not least, one has to be aware of the analogies between various physical fields (presented for example in /60/). Percolation theory and analysis can therefore be applied to calculate not only electrical effective properties but also electrostatic, magnetic, thermal, fluidic and mechanical ones.

Selected examples of percolation and percolation-like systems that have been studied experimentally, as well as some numerical simulations performed with the approaches given in this paper, will be presented and discussed in the second part of this article /61/.

Acknowledgement

This work was supported in part by the Polish State Committee for Scientific Research, Grant no 8T11B 055 19.

References

/1/ S.R. Broadbent, J.M. Hammersley, Percolation processes. I. Crystals and mazes, Proc. Cambridge Philos. Soc., vol.53 (1957), p.629-641
/2/ B.I. Shklovskii and A.L.
Efros, Electronic Properties of Doped Semiconductors, Springer-Verlag, Berlin, 1984
/3/ A.L. Efros, Fizika i gieometria biezporiadka, Moscow, Nauka 1982 (in Russian)
/4/ D. Stauffer, A. Aharony, Introduction to Percolation Theory, Taylor and Francis, London-Washington, 1992
/5/ M. Sahimi, Application of Percolation Theory, Taylor and Francis, London-Washington 1994
/6/ M. Sahimi, Flow phenomena in rocks: from continuum models to fractals, percolation, cellular automata, and simulated annealing, Rev. Mod. Phys., vol.65 (1993), p.1395
/7/ D.J. Bergman, D. Stroud, Physical properties of macroscopically inhomogeneous media, Sol. St. Phys., vol.46 (1992), p.147-269
/8/ J.P. Clerc, G. Giraud, J.M. Laugier, J.M. Luck, The electrical conductivity of binary disordered systems, percolation clusters, fractals and related models, Adv. Physics, vol.39 (1990), p.191
/9/ T. Nakayama, K. Yakubo, R.L. Orbach, Dynamical properties of fractal networks: scaling, numerical simulation, and physical realizations, Rev. Mod. Phys., vol.66 (1994), p.381
/10/ A.L. Efros, B.I. Shklovskii, Critical behaviour of conductivity and dielectric constant near the metal-non-metal transition threshold, Phys. Stat. Sol. (b), vol.76 (1976), p.475-485
/11/ J.P. Straley, Critical exponents for the conductivity of random resistor lattices, Phys. Rev. B, vol.15 (1977), p.5733-5737
/12/ A.S. Skal, B.I. Shklovskii, Topology of infinite cluster in percolation theory and hopping conductivity theory, Sov. Phys. Semicond., vol.8 (1974), p.1029
/13/ P.G. de Gennes, On a relation between percolation theory and the elasticity of gels, J. de Physique, vol.37 (1976), p.L1
/14/ T. Ohtsuki, T. Keyes, Conduction in random networks of supernormal conductors: Geometrical interpretation and enhancement of nonlinearity, J. Phys. A, vol.17 (1984), p.L559-L563
/15/ A. Coniglio, Thermal phase transition of the dilute s-state Potts and n-vector models at the percolation threshold, Phys. Rev. Lett., vol.46 (1981), p.250-253
/16/ A.
Coniglio, H.E. Stanley, Screening of the deeply invaginated clusters and the critical behaviour of the random superconductivity network, Phys. Rev. Lett., vol.52 (1984), p.1069-71
/17/ D.C. Wright, D.J. Bergman, Y. Kantor, Resistance fluctuations in random resistor networks above and below the percolation threshold, Phys. Rev. B, vol.33 (1986), p.396
/18/ A.E. Morozovsky, A.A. Snarskii, Critical behavior of the 1/f noise in percolation systems, Sov. Phys. JETP, vol.68 (1989), p.1066
/19/ A.E. Morozovsky, A.A. Snarskii, Critical behavior of fracture stress in randomly inhomogeneous composites near the percolation threshold, JETP Lett., vol.52 (1990), p.244
/20/ A.E. Morozovsky, A.A. Snarskii, Multiscaling in randomly inhomogeneous media: effective conductivity, relative spectral density of 1/f noise, and higher-order moments, Sov. Phys. JETP, vol.75 (1992), p.366
/21/ S.M. Kogan, Nizkochastotnyj tokovyj shum so spektrom tipa 1/f v tviordych tielach, Uspiechi Fizicheskich Nauk, vol.145 (1985), p.285-328 (in Russian)
/22/ M.B. Weissman, 1/f noise and other slow, nonexponential kinetics in condensed matter, Rev. Mod. Phys., vol.60 (1987), p.537
/23/ R. Rammal, C. Tannous, P. Brenton, A.-M.S. Tremblay, Flicker (1/f) noise in percolation networks: a new hierarchy of exponents, Phys. Rev. Lett., vol.54 (1985), p.1718; 1/f noise in random resistor networks: fractals and percolating systems, Phys. Rev. A, vol.31 (1985), p.2662
/24/ A.A. Snarskii, A.E. Morozovsky, A. Kolek, A. Kusy, 1/f noise in percolation and percolation-like systems, Phys. Rev. E, vol.53 (1996), p.5596-5605
/25/ D. Stroud, P.M. Hui, Nonlinear susceptibilities of granular matter, Phys. Rev. B, vol.37 (1988), p.8719-8724
/26/ A. Aharony, Crossover from linear to nonlinear resistance near percolation, Phys. Rev. Lett., vol.58 (1987), p.2726
/27/ R. Blumenfeld, D.J. Bergman, Comment on "Nonlinear susceptibilities of granular matter", Phys. Rev. B, vol.43 (1991), p.13682
/28/ P.M.
Hui, Crossover electric field in percolating perfect-conductor - nonlinear-normal metal composites, Phys. Rev. B, vol.49 (1994), p.15344
/29/ K.W. Yu, P.M. Hui, Percolation effects in two-component nonlinear composites: crossover from linear to nonlinear behavior, Phys. Rev. B, vol.50 (1994), p.13327
/30/ G.M. Zhang, Critical behavior of non-linear susceptibility in random non-linear resistor networks, J. Phys. C, vol.8 (1996), p.6933
/31/ A.A. Snarskii, S.I. Buda, Critical fields and currents in a weakly nonlinear medium near the percolation threshold, Technical Physics, vol.43 (1998), p.619-621
/32/ M.B. Weissman, C.D. Dollinger, J.C. Garland, J. Appl. Phys., vol.52 (1981), p.3059
/33/ M.A. Dubson, Y.C. Hui, M.B. Weissman, J.C. Garland, Measurement of the fourth moment of the current distribution in two-dimensional random resistor networks, Phys. Rev. B, vol.39 (1989), p.6807-6815
/34/ Y. Yagil, G. Deutscher, Third-harmonic generation in semicontinuous metal films, Phys. Rev. B, vol.46 (1992), p.16115-16121
/35/ Y. Yagil, G. Deutscher, D.J. Bergman, The role of microgeometry in the electrical breakdown of metal-insulator mixtures, Int. J. Mod. Phys. B, vol.7 (1993), p.3353-3374
/36/ A.A. Snarskii, Generation of third harmonics in highly heterogeneous composites near flowing threshold, Tech. Phys. Lett., vol.21 (1995), p.1-3
/37/ A.A. Snarskii, K.V. Slipchenko, A.M. Satanin, The generation of the third harmonic in highly anisotropic media near the percolation threshold, Technical Physics, vol.45 (2000), p.194-198
/38/ S.W. Kenkel, J.P. Straley, Percolation theory of nonlinear circuit elements, Phys. Rev. Lett., vol.49 (1982), p.767-770
/39/ J.P. Straley, S.W. Kenkel, Percolation theory for nonlinear conductors, Phys. Rev. B, vol.29 (1984), p.6299-6305
/40/ A.E. Morozovsky, A.A.
Snarskii, Effective properties of high anisotropic and non-linear metal/non-metal thin films, Int. J. Electr., vol.73 (1992), p.925-927
/41/ A.A. Snarskii, K.V. Slipchenko, V.A. Sevrukov, Critical behavior in two-phase, highly inhomogeneous composites, JETP, vol.89 (1999), p.788-796
/42/ A.A. Snarskii, A. Dziedzic, B. Licznerski, Temperature behaviour of percolation and percolation-like systems, Int. J. Electr., vol.81 (1996), p.363-370
/43/ P.M. Kogut, J. Straley, J. Phys. C, vol.12 (1979), p.2151
/44/ A. Miller, E. Abrahams, Impurity conduction of low concentrations, Phys. Rev., vol.120 (1960), p.745
/45/ V. Ambegaokar, B.I. Halperin, J.S. Langer, Hopping conductivity in disordered systems, Phys. Rev. B, vol.4 (1971), p.2612
/46/ M. Pollak, A percolation threshold of d.c. hopping conduction, J. Non-Cryst. Solids, vol.11 (1972), p.1
/47/ B.I. Shklovskii, A.L. Efros, Primiesnaja zona i provodimost kompensirovanych poluprovodnikov, Zh. Eksperimentalnoi i Teoreticheskoi Fiziki, vol.60 (1971), p.867 (in Russian)
/48/ P. Le Doussal, Percolationlike exponent for the conductivity of highly disordered resistor networks, Phys. Rev. B, vol.39 (1989), p.881-884
/49/ S. Tyc, B.I. Halperin, Random resistor network with an exponentially wide distribution of bond conductances, Phys. Rev. B, vol.39 (1989), p.877-880
/50/ A.E. Morozovskii, A.A. Snarskii, Conductivity critical exponent of exponentially distributed resistances, JETP Lett., vol.56 (1992), p.268-272
/51/ A.E. Morozovskii, A.A. Snarskii, Percolation description of the conductivity of random networks with a broad spectrum of the distribution of resistances, JETP, vol.77 (1993), p.959-965
/52/ A.A. Snarskii, A.E. Morozovskii, Percolation structure model in the smearing region, Int. J. Electronics, vol.78 (1995), p.135-137
/53/ A. Kolek, A.A. Snarskii, A.E.
Morozovskii, Structure of the percolation cluster and excess 1/f noise in systems with an exponentially broad spectrum of resistances, JETP, vol.81 (1995), p.490-495
/54/ A.A. Snarskii, A. Kolek, Excess 1/f noise in system with an exponentially wide spectrum of resistance and dual universality of the percolation-like noise exponent, JETP Lett., vol.63 (1996), p.651-656
/55/ A.A. Snarskii, A. Kolek, Double universality of 1/f noise percolation-like exponent in systems with exponentially wide spectrum of resistances, Physica A, vol.241 (1997), p.355-359
/56/ B.I. Halperin, S. Feng, P.N. Sen, Difference between lattice and continuum percolation transport exponents, Phys. Rev. Lett., vol.54 (1985), p.2391-2394
/57/ S. Feng, B.I. Halperin, P.N. Sen, Transport properties of continuum systems near the percolation threshold, Phys. Rev. B, vol.35 (1987), p.197-214
/58/ D. Sornette, Critical transport and failure in continuum crack percolation, J. Phys. France, vol.46 (1988), p.1365-1377
/59/ A.-M.S. Tremblay, S. Feng, P. Brenton, Exponents for 1/f noise near a continuum percolation threshold, Phys. Rev. B, vol.33 (1986), p.2077-2080
/60/ A. Dziedzic, B.W. Licznerski, A.A. Snarskii, Application of percolation theory in description of composite materials, Proc. Conf. "Kompozyty'98-Teoria i praktyka", Warszawa 1998, p.25-35 (in Polish)
/61/ A. Dziedzic, A.A. Snarskii, Percolation theory and its application in materials science and microelectronics (Part II - Experiments and numerical simulations), to be published in Informacije MIDEM

Andrzej Dziedzic
Institute of Microsystem Technology, Wroclaw University of Technology, Wybrzeze Wyspianskiego 27, 50-370 Wroclaw, Poland, e-mail: adziedzic@pwr.wroc.pl

Andrei A.
Snarskii
Department of General and Theoretical Physics, National Technical University of Ukraine, Prospect Peremoga 37, 252 056 Kiev, Ukraine, e-mail: asnar@phys.carrier.kiev.ua

Arrived: 05.02.01 Accepted: 07.03.01

UDK621.3:(53+54+621+66), ISSN0352-9045 Informacije MIDEM 31(2001)1, Ljubljana

SHIFTER DESIGNS FOR ASICs

Dalibor Grgec* and Zeljko Butkovic
Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia

Keywords: computer science, shifters, phase shifters, logarithmic shifters, data buses, data paths, arithmetic operations, decoders, ALU, Arithmetic-Logic Unit, FPU, Fast Processing Units, ASIC, Application Specific Integrated Circuits, VLSI circuits, Very Large Scale of Integration circuits, CAD, Computer Aided Design, logical simulations, electrical simulations, MAGIC computer tools, IRSIM computer tools, SPICE computer tools, Simulation Program with Integrated Circuit Emphasis computer tools, transmission gates, restoring buffers, propagation delay, power dissipation

Abstract: This paper presents four versions of 32-bit shifter designs that can be used in ASICs, namely a barrel shifter and a logarithmic shifter, each implemented with pass transistors and with transmission gates. The circuits are designed in the standard MOSIS scalable CMOS n-well technology with the 0.8 µm minimal feature size fabrication process. The design procedure is thoroughly explained. The designs are logically and electrically simulated. They are compared according to functionality, wafer area, number of transistors, delay and power dissipation. Usage and optimization guidelines are given.
Načrtovanje premikalnih podsklopov za integrirana vezja po naročilu (Design of Shifter Subcircuits for ASICs)

Keywords: computer science, shifters, phase shifters, logarithmic shifters, data buses, arithmetic operations, decoders, ALU arithmetic-logic unit, FPU fast processing units, ASIC application-specific integrated circuits, VLSI very-large-scale integration circuits, CAD computer-aided design, logic simulations, electrical simulations, MAGIC computer tools, IRSIM computer tools, SPICE computer tools, transmission gates, restoring buffers, propagation delay, power dissipation

Abstract: In this paper we present four types of 32-bit shifter circuits that can be used in the design of application-specific integrated circuits. These are the barrel (matrix) and the logarithmic shifter, each of which can be implemented with pass transistors or with transmission gates. The circuits are designed in the standard MOSIS n-well CMOS technology with minimal drawn dimensions of 0.8 µm. The design procedure is explained in detail. Logic and electrical simulations of the circuits were performed, and the circuits were compared with respect to functionality, chip area, number of transistors, speed and power consumption. Guidelines for the use and optimization of the circuits are given.

1. INTRODUCTION

Shifting of binary numbers is an arithmetic operation required in many operations such as multiplication, division and bit manipulation. Shifting is performed in specially designed circuits. Shifters are part of every contemporary datapath, usually located at the output of the Arithmetic-Logical Unit (ALU). Shift operations can be classified into left-right, logical, arithmetical or circular shift (rotating). Usually shifting is implemented only in one direction, i.e., right. Shifting left by m bits is realized with a shift right of n-m bits in an n-bit machine /1/. During the logical shift the LSB takes the value of a predefined input (usually 0/1 or a bit-stream from an outer source).
Arithmetical shift is a shift operation where the MSB, which represents the sign of the binary number, is preserved. Circular shift puts the LSB in the place of the MSB and vice versa. All sorts of shift operations are required in modern processing units /2, 3/.

According to implementation, shifters can be classified into shift-register (sequential logic) and flow-through (combinatorial logic) types. In the shift-register type, a shift by one bit requires at least one machine cycle. In the flow-through type, the time required for shifting depends only on the circuit combinatorial delay and is usually shorter than one machine cycle. The dominant shifter implementation in modern datapaths is the flow-through type.

The flow-through shifters can further be classified into /4, 6/:
• binary shifter,
• crossbar switch,
• barrel shifter,
• logarithmic shifter,
• other shifter implementations.

The common requirements set upon shifter implementations in modern datapaths are:
1. bus width of 32 or 64 bits (n bits in general),
2. performing an n×n shift in one clock cycle,
3. performing many types of shift operations according to control signals (left-right, logical, arithmetic or circular shift, masking, etc.),
4. separate control signals, usually perpendicular to the direction of data,
5. coded control signals,
6. low propagation delay and no degradation of output signal electrical characteristics,
7. compatibility with the rest of the datapath.

Barrel and logarithmic shifter implementations satisfy these requirements best, and thus are the two most frequently used shifters /4, 5, 6/.

* Now with Institute for Theoretical Electrical Engineering and Microelectronics, University of Bremen, Germany

D. Grgec, Z. Butkovic: Shifter Designs for ASICs, Informacije MIDEM 31(2001)1, pp. 10-20

The binary shifter performs only a one-place left-right shift, and the crossbar switch is a universal circuit and a basis for a barrel shifter design.
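The shift types described in the introduction can be illustrated with a short behavioural sketch. It is only a bit-level model on Python integers, not a description of the circuits in this paper; the function names and the default word length of 32 bits are assumptions chosen for illustration. It also shows the rule quoted from /1/: a left rotation by m bits equals a right rotation by n-m bits.

```python
def rotate_right(x: int, m: int, n: int = 32) -> int:
    """Circular right shift of an n-bit word by m places."""
    mask = (1 << n) - 1
    x &= mask
    m %= n
    return ((x >> m) | (x << (n - m))) & mask


def rotate_left(x: int, m: int, n: int = 32) -> int:
    """Left rotation expressed as a right rotation by n - m places."""
    return rotate_right(x, (n - m) % n, n)


def logical_shift_right(x: int, m: int, n: int = 32, fill: int = 0) -> int:
    """Right shift; the m vacated upper bits take the predefined value `fill`."""
    mask = (1 << n) - 1
    m = min(m, n)
    shifted = (x & mask) >> m
    if fill:
        shifted |= mask & ~(mask >> m)  # set the m vacated upper bits
    return shifted


def arithmetic_shift_right(x: int, m: int, n: int = 32) -> int:
    """Right shift that preserves the sign bit (MSB) of an n-bit word."""
    sign = (x >> (n - 1)) & 1
    return logical_shift_right(x, m, n, fill=sign)


if __name__ == "__main__":
    print(format(rotate_right(0b0001, 1, n=4), "04b"))
    print(format(rotate_left(0b1000, 1, n=4), "04b"))
    print(format(arithmetic_shift_right(0b1000, 1, n=4), "04b"))
```

Because only the right rotation touches the data, a datapath that implements one direction in hardware can obtain the other direction purely by recoding the shift amount.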
In this paper, designs of barrel and logarithmic shifters in the most common logic styles and comparisons of their performance are presented. The general block schematic of the designed shifters is shown in Fig. 1.

2. SHIFTER DESIGN

2.1. Circuit design

Barrel shifters

All versions of barrel shifters are based on the crossbar switch. A version of the crossbar switch with pass transistors is shown in Fig. 2.

Fig. 3a. Main field of the barrel shifter: with pass transistors.

Fig. 3b. Main field of the barrel shifter: with transmission gates.

In order to comply with the requirements set upon the shifter designs, and to have coded control input signals, a decoder must be added to the circuit. In order to minimize the layout area, a NOR decoder in dynamic logic with p-type precharge transistors is chosen /4, 6/. The schematic of this decoder is shown in Fig. 4.

Fig. 4. NOR decoder in dynamic logic (4-bit example).

During the precharge phase the clock φ is low, and during the evaluation phase the clock is high. Outputs of the decoder are the control inputs Si(0) to Si(n-1) of the barrel shifter main field. In order to ensure the correct logic value of control inputs Si and Si' during the precharge phase /4/, interface clocked buffers, specially devised for this purpose, are used. The circuitry is shown in Fig. 5. The buffer exists in two versions, inverting and non-inverting, depending on the output used. Both buffer versions are used in the decoder for the barrel shifter with transmission gates and can be seen in Fig. 4. Only the non-inverting buffer version is used in the decoder for the barrel shifter with pass transistors.
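The interplay of the decoder and the main field can be sketched behaviourally. The model below assumes that the decoder turns the coded shift amount into one-hot control lines and that each active control line connects every output to the input lying a fixed number of places away, which is how a crossbar-based barrel rotator is commonly described; the exact wiring of Fig. 3 is not reproduced here, and all names are hypothetical.

```python
from typing import List


def decode_one_hot(shift: int, n: int) -> List[int]:
    """Model of the decoder: coded shift amount -> n one-hot control lines."""
    if not 0 <= shift < n:
        raise ValueError("shift amount must be in the range 0..n-1")
    return [1 if i == shift else 0 for i in range(n)]


def barrel_rotate_right(bits: List[int], control: List[int]) -> List[int]:
    """Model of the main field; bits[0] is the LSB.

    Each (output j, control line s) pair stands for one switch that, when
    closed, connects output j to input (j + s) mod n.
    """
    n = len(bits)
    if len(control) != n or sum(control) != 1:
        raise ValueError("exactly one of the n control lines must be active")
    out = [0] * n
    for j in range(n):          # one row of switches per output
        for s in range(n):      # one switch per control line
            if control[s]:
                out[j] = bits[(j + s) % n]
    return out


if __name__ == "__main__":
    word = [1, 0, 0, 0]  # 0b0001, LSB first
    print(barrel_rotate_right(word, decode_one_hot(1, 4)))
```

The n×n loop mirrors the n² switches of the main field, which is the reason the transistor count of a barrel shifter grows quadratically with the bus width.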
Both decoders require an inverted clock signal φ'. It should be noted that the shifters are fully functional even without the interface buffers /8/, but the power dissipation is then too large, due to the current leakage during the evaluation phase.

Fig. 5. Interface clocked buffers.

An interface is required to connect the barrel shifter main field with the rest of the datapath. A column of inverters at the input and at the output of the barrel shifter main field is used for this purpose. Being in pairs, these inverters do not influence the logic function of the shifter. When integrating these shifters in an actual datapath, the inverters could be replaced by latches or some other form of interface.

Degradation of the electrical characteristics of output signals is a common problem in pass transistor logic. In the nMOS pass transistor the output high voltage level is lowered by the amount of the transistor threshold voltage. This effect can lead to increased power dissipation and lower noise margins of the output signals /4/. The signal level can be restored using the buffer shown in Fig. 6. The feedback transistor Tr pulls the inverter input high when the output of the buffer goes low, and prevents a lasting intermediate voltage value in the buffer. This buffer is used at the outputs of the barrel shifter with pass transistors instead of regular inverters.

Fig. 6. Output level-restoring buffer with adjacent nMOS pass transistor /4/.

Logarithmic shifters

While the barrel shifter implements the whole shifter as a single switch field, the logarithmic shifter uses a multistage approach. The total shift value is decomposed into stages. Each stage shifts the data by some fixed amount, usually by a power of two /4/. The shift stages can be represented with columns of multiplexers.
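The multistage decomposition can be sketched in a few lines. The model assumes an n-bit word with n a power of two, so that stage k is a column of 2:1 multiplexers that either passes the word unchanged or rotates it right by 2^k places, selected directly by bit k of the coded shift amount. The function name and the list-based representation are illustrative choices, not taken from the paper.

```python
from typing import List, Sequence


def log_rotate_right(bits: Sequence, shift: int) -> List:
    """Rotate right by `shift` places using log2(n) multiplexer stages.

    bits[0] is the LSB. Stage k is controlled by bit k of `shift`.
    """
    n = len(bits)
    stages = n.bit_length() - 1
    if n == 0 or (1 << stages) != n:
        raise ValueError("word length must be a power of two")
    word = list(bits)
    for k in range(stages):
        if (shift >> k) & 1:
            step = 1 << k
            # each multiplexer selects the input lying `step` places higher
            word = [word[(j + step) % n] for j in range(n)]
        # otherwise the multiplexers pass the word through unchanged
    return word


if __name__ == "__main__":
    # a rotation by 5 = 4 + 1 uses stages 0 and 2 only
    print(log_rotate_right(list(range(8)), 5))
```

Since the control bits are used as they are, no decoder is involved, and a 32-bit word needs only five multiplexer columns instead of a 32×32 switch field.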
The basis for such a logarithmic shifter in base 2 is shown in Fig. 7. Such a logarithmic shifter has log₂ n stages for an n-bit data bus and is usually smaller than the equivalent barrel shifter /4/. Note that the control signals S for this shifter are already encoded, i.e. no decoder is required.

Fig. 7. Logarithmic shifter basic schematic (only first two stages).

The design of the multiplexer determines the logarithmic shifter type. Two multiplexer designs are implemented: with nMOS pass transistors and with transmission gates. The logarithmic shifter basic cells are shown in Fig. 8. An inverting buffer is placed at the output of the multiplexer. The odd number of stages in the 32-bit shifter (log₂ 32 = 5), combined with the input inverter column, ensures that the output signal is in phase with the input signal. As in the case of a barrel shifter, the restoring buffer is used at the output of the logarithmic shifter cell with pass transistors.

Fig. 8a. Logarithmic shifter basic cells: multiplexer with pass transistors.

Fig. 8b. Logarithmic shifter basic cells: multiplexer with transmission gates.

2.2. Layout design

The shifter layouts are designed with MAGIC, a widely accepted software tool /9/. The layouts are designed so that they follow both the standard scalable CMOS and the submicron scalable CMOS set of design rules from MOSIS /7/. The chosen fabrication process is the HP CMOS26G 0.8 µm n-well process with 3 metallization layers (of which only 2 are used in the design). This fabrication process was chosen because it supports both the standard and the submicron set of scalable CMOS design rules, as well as a supply voltage of 5 V, and because of the availability of parameter files for many computer tools /10/. The main features of this fabrication process are listed in Table I.
The ratio of p to n channel widths in a layout is usually determined from the requirement for a symmetrical output characteristic, equal rise and fall times, or minimal propagation delay /4, 6/. This ratio is chosen to be 4, which, according to Table I, corresponds to the ratio of MOSFET K constants. Besides this requirement, the layouts are designed with the primary goal of achieving minimal chip area.

Table I. Parameters of MOS fabrication process HP CMOS26G, run N680 /10/.

  Minimal feature size:              0.8 µm
  Supported sets of design rules:    SCMOS (_TM), SCMOS_SUB
  Lambda:                            0.5 µm (SCMOS), 0.4 µm (SCMOS_SUB)
  Density (header partly illegible;
  likely gates per mm²):             6200
  Operating voltage:                 5.0 V
  Normalized MOSFET K constants:     K'n = 1.2704·10⁻⁴ A/V², K'p = 3.2077·10⁻⁵ A/V²
  Ratios of MOSFET K constants:      K'n/K'p = 3.960, √(K'n/K'p) = 1.990
  Threshold voltages:                U_GS0n = 0.7086 V, U_GS0p = -0.8446 V
  Minimum inverter delay:            290 ps

The shifter layouts are composed of many basic shifter cell layouts. A basic cell is designed in such a way that it can be easily connected with identical adjoining cells. After arranging arrays of basic cells, the final layout is obtained by adding contacts, external connections and power supplies.

Barrel shifters

The layout of the barrel shifter consists of two main parts: main field and decoder. Both are composed of basic cells. The layouts of the barrel shifter basic cells are shown in Fig. 9. The schematics of the same basic cells can be seen shaded in Fig. 3 (the contacts k are implemented in only some cells when the layout is complete).

Fig. 9a. Barrel shifter basic cells layouts: cell with pass transistors.

Fig. 9b. Barrel shifter basic cells layouts: cell with transmission gates.

The same decoder is used for both shifters. The layout of its basic cell is shown in Fig. 10. The layout of the output level-restoring buffer (Fig. 6) is adjusted to the vertical pitch of the basic shifter cell in Fig. 9a and shown in Fig. 11.

Fig. 10. Decoder basic cell layout.

Fig. 11. The output level-restoring buffer layout.

The layouts of other inverters and buffers are not shown separately, as their design is traditional. The vertical buffers, which drive the control signals into the decoder field (Fig. 4), have channel widths of twice the minimal size, because of the large capacitive load of the polysilicon control lines in the decoder field. The precharge p-type transistors in the NOR decoder also have larger widths, determined by the horizontal pitch of the basic decoder cell.

The final layouts of the barrel shifters are shown in Fig. 12. Note the different aspect ratio of these two layouts (the barrel shifter layout with transmission gates is much taller due to the difference in basic cell size).

Fig. 12a. Barrel shifters layouts: barrel shifter with pass transistors.

Fig. 12b. Barrel shifters layouts: barrel shifter with transmission gates.

Logarithmic shifters

The layout of a logarithmic shifter consists of two parts: columns of multiplexers and the switch field between the columns. The layouts of the logarithmic shifter basic cells are shown in Fig. 13. The vertical buffers, which are twice the minimal size, are added in the path of the control signals at the top of each multiplexer column. These buffers are shown together with the multiplexer cells in Fig. 8. This is necessary due to the large capacitance of the polysilicon multiplexer control lines.

Fig. 13a. Logarithmic shifter basic cells layouts: multiplexer with pass transistors.
The switch field connects the output of cell m in stage k with the input of cell (m - 2^k) modulo 2^n in stage k+1, and is typical of logarithmic shifters. It is made in two metal layers. The minimal distance between the vertical metal lines determines the size of the switch field. The width of the switch field increases exponentially with the stage shift value 2^k /4/. The complete layouts of the logarithmic shifters are shown in Fig. 14. The difference in layout aspect ratio is also visible in this case.

Fig. 13b. Logarithmic shifter basic cells layouts: multiplexer with transmission gates.

Fig. 14a. Logarithmic shifters layouts: logarithmic shifter with pass transistors.

Finally, the layout complexity of the designed circuits is presented in Table II by comparing the number of transistors, the linear dimensions in units of λ, and the chip area for implementation in the HP CMOS26G 0.8 µm fabrication process.

Table II. Layout parameters of designed circuits.

  Shifter circuit    Number of transistors     Linear size,          Chip area S
  layout*            nMOS    pMOS    Total     height x width (λ)    (λ = 0.4 µm), mm²
  Barrel 1           1364    180     1544      848 x 989             0.134
  Barrel 2           1428    1204    2632      1522 x 894            0.218
  Logarithmic 1      522     362     884       645 x 1080            0.111
  Logarithmic 2      522     522     1024      1635 x 829            0.217

  * 1 corresponds to the circuit version with nMOS pass transistors, 2 corresponds to the circuit version with transmission gates

3. SIMULATIONS

In order to evaluate circuit performance, logical and electrical simulations are used. The simulations are performed with circuit models extracted from the layout by using MAGIC and additional tools that come with the software /9/. All circuit models are extracted with the parameters of the HP CMOS26G 0.8 µm n-well fabrication process.

For logic validation, debugging of the design and initial electrical simulation, the event-driven electrical simulator IRSIM /11/ is used.
It enables easy handling of wide data buses and logical states. The results obtained with this program include logical values, propagation delays and power dissipation /8/. However, due to the simple resistance-based MOS transistor model, this program can give only approximate values of the electrical parameters: the range of values and their relationships. Simulations in IRSIM showed that all designed shifters were fully functional at a chosen clock frequency of 50 MHz. The results of one typical simulation are shown in Fig. 15. IRSIM simulations determine the combinations of input and control signals for which the propagation delay is the largest.

Fig. 14b. Logarithmic shifters layouts: logarithmic shifter with transmission gates.

After the initial simulations in IRSIM, additional electrical simulations for the chosen input and control signal combinations are performed with SPICE /12/, in order to determine the circuit electrical parameters: voltage levels, propagation delay and power dissipation. The MOS transistors are modeled with the SPICE Level 3 MOSFET model obtained from MOSIS /10/. This model offers the advantage of faster convergence compared to the more sophisticated BSIM (Level 4 and 5) SPICE MOSFET models also available from MOSIS. The accuracy of the model is satisfactory for the minimal feature size of 0.8 µm used here.

Summarized results of the SPICE simulations are given in Table III. Two types of propagation delays are determined from the SPICE simulations: the delay from control input to output (S→O, t_DSO) and the delay from input to output (I→O, t_DIO). The control input to output delay for barrel shifters, which have a decoder in dynamic logic and require clock signals, is defined starting from the rising edge of the clock signal φ, i.e., from the beginning of the circuit evaluation phase. Only the longest delay t_DSO is shown in Table III, regardless of the transition direction. The input to output delays are evaluated for both high to low (HL) and low to high (LH) transitions of the output signal for various input and output signals. The longest delays t_DIO are given separately for HL and LH transitions. Fig. 16 shows the simulated electric signals from a typical SPICE simulation.

Table III. Summarized results of SPICE simulations.

  Simulated          Propagation delay, ns               Average power      Specific power
  shifter circuit    t_DSO    t_DIO H→L    t_DIO L→H     dissipation P,     dissipation,
                                                         mW                 W/cm²
  Barrel 1           7.1      1.7          1.1           6.4                4.8
  Barrel 2           1.4      0.6          0.5           6.0                2.8
  Logarithmic 1      2.6      1.8          1.6           5.8                5.2
  Logarithmic 2      2.4      1.2          1.1           5.2                2.4

Fig. 15. Signals simulated by IRSIM (logic validation for barrel shifter with pass transistors).

Note the difference in the propagation delays for HL and LH transitions. The high to low transition time is always longer. This is caused by the lower output voltage of the nMOS pass transistor and the lower current drive of the pMOS transistor in the transmission gate. Both of these effects cause slower rising of the signal at the output of the pass logic (which is then inverted in the output buffer). The level-restoring buffer, shown in Fig. 6, can only partially alleviate this problem. The difference in delay times is, however, smaller for the circuits with transmission gates, which are therefore more often used in present pass logic circuits /4, 5/.

The total power dissipation of the circuits is evaluated at the clock frequency of 50 MHz (and the corresponding input signal frequency of 25 MHz, see Fig. 15) for a selected signal pattern by simulations over a longer time period.
The input and control signal patterns were chosen to achieve the maximum dynamic power dissipation by a permanent change of logic states at every clock cycle at as many circuit nodes as possible. The final signal pattern is determined by test simulations. The calculated specific power dissipation, i.e. the ratio of dissipated power to chip area, is given in Table III. This parameter is an indicator of thermal flux in VLSI chips and limits the integration density as well as the clock frequency for the circuit implemented in the chosen fabrication process.

Fig. 16. Signals simulated by SPICE (propagation delay t_DIO for barrel shifter with pass transistors). Panel title: Barrel shifter 1 delay test I→O; horizontal axis: Time (ns).

4. COMPARISONS AND CONCLUSIONS

The comparison of the designed circuits can be made according to many criteria: number of transistors, chip area, speed, power dissipation. All designed circuits offer the same functionality: they perform a circular right shift at a clock frequency of at least 50 MHz and satisfy all the requirements listed in the introductory section. One must also note, according to the delay times shown in Table III, that the majority of the circuits could support significantly higher clock frequencies.

The main design difference is the existence of the dynamic logic decoder in the barrel shifters, and therefore the requirement for an independently generated non-overlapping clock pair φ and φ' in these shifters. This requirement need not be considered a deficiency, since in the datapath where this shifter would be integrated, a clock would be present anyway. By using the decoder in dynamic logic, the chip area and transistor count are minimized. Dynamic logic circuits usually have a higher speed due to the lower capacitive load /4/.
In the case where the decoded control signals are already available in the chip, one would implement only the barrel shifter's main field, which would lead to significant circuit simplification, decreased chip area and increased speed.

The logarithmic shifters presented in this work are designed in combinatorial pass and static logic and do not require clock signals. When integrating these shifters in the datapath, one can add latches at the input and output of the shifter. Pipelined versions of logarithmic shifters with multiplexers in dynamic logic and with latches have appeared in the literature, and are used mostly in high-performance chips /3/.

The comparison of the main shifter parameters is given in Table IV. The comparison is performed according to three criteria: complexity, delay and power dissipation. According to Table IV, no shifter emerges as a clear winner. The application of a particular shifter will depend on the requirements set upon the chip as a whole.

The shifters with nMOS pass transistors have in general lower complexity. The logarithmic shifter has lower complexity than the barrel shifter in the equivalent logic style. In particular, the logarithmic shifter with pass transistors has the lowest transistor count and the smallest chip area.

The lowest propagation delay is achieved in the barrel shifter with transmission gates. The logarithmic shifter with transmission gates has the smallest power dissipation. However, the total power dissipated in the other circuits is not substantially larger. The difference in the specific power dissipation is much larger and is determined by the difference in the chip area used. The specific power is not a deciding design criterion except in some special cases (low-power electronics with reduced cooling possibilities).

The barrel shifter with transmission gates, due to its high speed and low power demands, seems to be the best choice for implementation in general purpose ASIC designs if no limits to the chip area are set.
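The figures in Tables II to IV are related by simple arithmetic, which the following sketch reproduces as a cross-check. It assumes that the chip area is the product of the linear dimensions scaled by λ = 0.4 µm, that the specific power is the average power divided by that area, and that each column of Table IV is divided by its smallest entry. The data are copied from the tables; the names are hypothetical and only two of the Table IV columns are shown.

```python
LAMBDA_UM = 0.4  # lambda in micrometres, as stated in the Table II header

# name: (height in lambda, width in lambda, total transistors,
#        t_DSO in ns, average power in mW), copied from Tables II and III
DESIGNS = {
    "Barrel 1":      (848, 989, 1544, 7.1, 6.4),
    "Barrel 2":      (1522, 894, 2632, 1.4, 6.0),
    "Logarithmic 1": (645, 1080, 884, 2.6, 5.8),
    "Logarithmic 2": (1635, 829, 1024, 2.4, 5.2),
}


def chip_area_mm2(height: float, width: float, lam_um: float = LAMBDA_UM) -> float:
    """Area in mm^2 from linear dimensions given in units of lambda."""
    return height * width * lam_um ** 2 * 1e-6  # um^2 -> mm^2


def specific_power_w_per_cm2(power_mw: float, area_mm2: float) -> float:
    """Dissipated power per unit chip area in W/cm^2."""
    return (power_mw * 1e-3) / (area_mm2 * 1e-2)  # mW -> W, mm^2 -> cm^2


def normalise(column: dict) -> dict:
    """Divide every entry by the smallest one, so the best design scores 1."""
    best = min(column.values())
    return {name: value / best for name, value in column.items()}


if __name__ == "__main__":
    for name, (h, w, _, _, p_mw) in DESIGNS.items():
        area = chip_area_mm2(h, w)
        print(f"{name}: {area:.3f} mm^2, {specific_power_w_per_cm2(p_mw, area):.1f} W/cm^2")
    for name, score in normalise({k: v[3] for k, v in DESIGNS.items()}).items():
        print(f"{name}: normalised t_DSO = {score:.3f}")
```

Normalising each criterion to its best value makes the trade-offs directly comparable, at the cost of hiding the absolute numbers, so the normalised table is best read together with Tables II and III.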
In the applications where a circuit with low power dissipation is needed, the best choice is the logarithmic shifter with transmission gates. The shifters with pass transistors should be preferred if the chip area is limited. The barrel shifter has the disadvantage of larger power dissipation and, especially, a substantially larger control signal delay t_DSO.

Further improvements of the shifters are possible. For example, it is possible to enlarge or replace the interface buffers or decoders. With no or only a small increase in chip area, one can also enlarge the otherwise minimal channel width of the pass transistors. With such a modification one could equalize the LH and HL propagation delays in shifters with transmission gates or lower the propagation delay in shifters with pass transistors. By adding peripheral and control circuitry one can enhance the shifter functionality to support more shift operations.

Finally, it is worth noting that the shifter design procedures presented in this work can be used as application guidelines for integrated circuits in general.

Table IV. The comparison of shifter designs and performance.

  Designed           Complexity               Delay                            Power
  shifter circuit    Tot. # Tran.  Chip Area  t_DSO    t_DIO HL   t_DIO LH    Tot. Power   Spec. Power
  Barrel 1           1.747         1.207      5.071    2.833      2.200       1.231        2.000
  Barrel 2           2.977         1.964      1.000    1.000      1.000       1.154        1.167
  Logarithmic 1      1.000         1.000      1.857    3.000      3.200       1.115        2.167
  Logarithmic 2      1.158         1.955      1.714    2.000      2.200       1.000        1.000

Acknowledgments

This paper is supported by the "Ministry for Science and Technology" of the Republic of Croatia within the scientific project "036001 Research on VLSI/ULSI semiconductor structures". We are grateful to Dr. Adrijan Baric for many useful suggestions regarding the paper.

REFERENCES

/1/ R.S. Lim, "A Barrel Switch Design", Computer Design, August 1972, pp. 76-79
/2/ S.M.
Kang, "Domino-CMOS Barrel Switch for 32-Bit VLSI Processors", IEEE Circuits and Devices Magazine, May 1987, pp. 3-8
/3/ R. Pereira, J.A. Michell, J.M. Solana, "Fully Pipelined TSPC Barrel Shifter for High-Speed Applications", IEEE Journal of Solid-State Circuits, Vol. 30, No. 6, June 1995, pp. 686-690
/4/ J.M. Rabaey, "Digital Integrated Circuits - a Design Perspective", Prentice Hall, 1996
/5/ N.H.E. Weste, K. Eshraghian, "Principles of CMOS VLSI Design", Addison Wesley, 1993
/6/ E.D. Fabricius, "Introduction to VLSI Design", McGraw-Hill, 1990
/7/ The MOSIS Service, "MOSIS Scalable CMOS (SCMOS) Design Rules", Revision 7.2, 1996
/8/ D. Grgec, Z. Butkovic, "Barrel Shifter Design and Simulations", Proceedings of MIPRO '99 Conference, Opatija, Croatia, pp. 62-65, 1999
/9/ Digital Western Research Laboratory: "WRL Research Report 90/7 - 1990 DECWRL/Livermore Magic Release", September 1990, "Magic Addendum: Version 6.5 differences", 1994
/10/ MOSIS Service Web Site: http://www.mosis.org
/11/ University of California at Berkeley, IRSIM ver. 9.4, "IRSIM User's Manual", 1993
/12/ MicroSim Corporation, "MicroSim PSpice A/D & Basics+ Circuit Analysis Software User's Guide", Version 6.3, April 1996

Dalibor Grgec
Institute for Electromagnetic Theory and Microelectronics, University of Bremen
Kufsteiner Str., Postfach 33 04 40, 28334 Bremen, GERMANY
Phone: +49 421 218 2204, Fax: +49 421 218 4434
E-mail: grgec@item.uni-bremen.de

Zeljko Butkovic
Department of Electronics, Microelectronics, Computer and Intelligent Systems, Faculty of Electrical Engineering and Computing
Unska 3, HR-10000 Zagreb, CROATIA
Phone: +385 1 6129 924, Fax: +385 1 6129 653
E-mail: Zeljko.Butkovic@fer.hr

Arrived: 17.07.00 Accepted: 22.11.00

UDK621.3:(53+54+621+66), ISSN0352-9045 Informacije MIDEM 31(2001)1, Ljubljana

CAPACITOR-AREA AND POWER-CONSUMPTION OPTIMIZATION OF HIGH ORDER Δ-Σ MODULATORS

Drago Strle
University of Ljubljana, Faculty for electrical engineering, Slovenia.
Keywords: computer science, microelectronics, SP, Signal Processing, DELTA-SIGMA modulators, Δ-Σ modulators, noise reduction, area optimization, power consumption optimization, design

Abstract: A design methodology for the power consumption and capacitor area optimisation of high-order high-resolution single-bit switched-capacitor type Δ-Σ modulators is described. The main reasons for the power consumption are determined and appropriate steps for its reduction are proposed. The algorithm is coded in MATLAB and gives the unit capacitor size of each integrator stage for an arbitrary topology and the required specification of each opamp used in the integrator to achieve the required S/N ratio, minimize the power consumption and the silicon area and preserve matching accuracy. A 5th-order modulator was built and the results prove the effectiveness of the approach.

Optimizacija moči in površine Δ-Σ modulatorjev visoke stopnje (Power and Area Optimization of High-Order Δ-Σ Modulators)

Keywords: computer science, microelectronics, SP signal processing, DELTA-SIGMA modulators, Δ-Σ modulators, noise reduction, area optimization, energy consumption optimization, design

Abstract: In this article we discuss a design methodology for single-bit and multi-bit S-C Δ-Σ modulators that enables optimization of the silicon area and the power consumption. The main reasons for the power consumption and the steps needed to reduce it are shown. The algorithm was implemented in MATLAB. The result is a table of capacitors and amplifier specifications for each integrator stage for an arbitrary modulator topology. The optimization goals are to reduce the area of the capacitor array and the power consumption subject to boundary conditions: the required S/N ratio, the accuracy of the capacitor ratios, etc. An optimization example for a 5th-order modulator demonstrates the effectiveness of the algorithm and the methodology.

1. INTRODUCTION

The power-consumption optimisation methodology for high-order high-resolution Δ-Σ S-C modulators is presented.
The optimisation procedure tries to minimize the unit capacitances of the integrators in the loop filter in such a way that the noise requirements are fulfilled, the area is minimized and the matching accuracy is maintained. The contribution of each noise source depends on the architecture, the coefficients, the capacitances and the noise generated in the opamps. Usually this is calculated using the linear model of the modulator /3/, which is an adequate method if approximate results are sufficient. Since modulators are non-linear systems, the linear model is not accurate enough and a real non-linear time-domain simulation and optimisation is needed. The kT/C noise simulation principle was presented in /5/, assuming that the unit capacitor sizes are given. In this work the unit capacitor sizes are optimised in such a way that their contributions to the total noise are approximately the same. At the same time the requirements for the opamps regarding 1/f noise, thermal noise and slew-rate are determined and can be used as specifications for the circuit design. A model of chopping and more realistic models for the opamps are added to the existing algorithm given in /5/. The algorithm is coded in MATLAB and is based on a state-space description of an arbitrary-topology S-C Δ-Σ modulator. The constraints are the minimum possible noise and, at the same time, appropriate accuracy of the capacitor ratios, which depends on the area and the technology.

To illustrate the methodology one example is given. It presents the power-consumption optimisation of a 5th-order single-loop ML-FT (multiple-loop feedback topology) modulator with poles optimised for stability and S/N > 110 dB. The improvement in the power efficiency and the area of the modulator proves the correctness of the approach.

In section 2 the reasons for power consumption in an arbitrary S-C modulator are analysed and the algorithm for noise optimisation is presented.
Since most of the power (>80%) is consumed by the opamps driving the capacitive loads, these loads must be minimized, taking the area and the matching of the capacitor ratios as constraints. The algorithm also defines the required noise level of the opamps, while the design of the opamps is beyond the scope of this article and the algorithm. A short example illustrates the procedure and gives some additional information. Section 3 briefly repeats the published state-space description of an arbitrary modulator with an additional noise model of the S-C stages and opamps and gives some optimisation results as a table of capacitances and noise requirements. In section 4 the conclusions are presented.

Informacije MIDEM 31(2001)1, str. 21-25
D. Strle: Capacitor-area and Power-consumption Optimization of High Order Δ-Σ Modulators

2. REASONS FOR POWER CONSUMPTION

The main reason for power consumption in any switched-capacitor-type modulator is the power needed by the opamps to drive capacitive loads and to maintain the required noise level. To simplify the description of the power-optimisation procedure, the 5th-order modulator is taken as an example, for which the power-optimisation results are presented at the end. The procedure works well for any S-C-type modulator.

The system design considerations for the 5th-order modulator have been presented in an internal report /1/ (figure 1). The coefficients are realised with appropriate capacitor ratios according to figure 3. The coefficient $a_1$, for instance, is given by the ratio of the corresponding switched capacitor to the integrating capacitor of the first stage. In reality, S-C integrators are fully differential to maintain good rejection of noise coming from the substrate and to gain 3 dB in S/N ratio. Most of the power in a modulator is consumed by the opamps, which must be capable of driving capacitive loads and have sufficiently low noise to maintain the S/N ratio of the modulator.
The power consumption of the transconductance amplifier, whose simplified AC model is shown in figure 2, is proportional to the expression in equation (1) /2/:

$$P \propto kT\,B_w\,(DR)^2\,\frac{V_{GS}-V_{TH}}{V_{DD}} \qquad (1)$$

where k is Boltzmann's constant, T is the absolute temperature, DR is the dynamic range, $B_w$ is the bandwidth, $V_{GS}-V_{TH}$ is the gate-source over-drive voltage and $V_{DD}$ is the supply voltage.

Figure 2: Simplified model of the opamp

Figure 3: Single-ended S-C integrator

The capacitive loads of the opamps are defined by the coefficients of the loop-filter characteristics, by stability constraints /1/ and by the thermal kT/C noise generated by the switched capacitors. For higher S/N-ratio requirements the noise must be smaller at the same supply voltage. This can be realised by increasing the unit (usually sampling) capacitance of the integrator; the capacitive load of the opamp is thereby increased and more power is required to maintain the bandwidth ($B_w$). At the same time, a larger S/N ratio requires a smaller thermal noise of the amplifier, which can only be realised by an increased current and/or an increased area of the differential-stage transistors. Fortunately only the first amplifier is critical for the low-pass modulator, because all the other contributions are attenuated by the noise-transfer functions of the loop filter.

The slew-rate must be larger than the maximum slope of the settling of $v_{out}$ at maximum input signal to maintain linear settling behavior /3/. This requires the tail current of the differential stage to satisfy $i_{tail} > S_r C_{load}$ /4/. If this limit is not respected, the distortion and the in-band noise will increase.

Chopping translates the 1/f noise and the DC offset around multiples of the chopping frequency ($f_{ch} = f_s/2$), where they are later attenuated by the decimation filter together with the shaped quantization noise.
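The slew-rate condition above can be turned into a quick sizing check. The following Python sketch evaluates the lower bound $i_{tail} > S_r C_{load}$; the function name and the numeric values (a 100 V/µs slew rate and a 15 pF load) are illustrative assumptions and are not taken from the design described here.

```python
def min_tail_current(slew_rate_v_per_s, c_load_farad):
    """Lower bound on the differential-stage tail current: i_tail > Sr * Cload."""
    if slew_rate_v_per_s <= 0 or c_load_farad <= 0:
        raise ValueError("slew rate and load capacitance must be positive")
    return slew_rate_v_per_s * c_load_farad


# Illustrative values only (assumed, not figures from the paper).
slew_rate = 100e6   # 100 V/us expressed in V/s
c_load = 15e-12     # 15 pF
i_min = min_tail_current(slew_rate, c_load)
print(f"tail current must exceed {i_min * 1e3:.2f} mA")
```

Because the bound is linear in the load, halving the capacitive load of a stage halves the tail current needed for the same slew-rate, which is one reason why smaller unit capacitors reduce power.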
The only important parameter concerning 1/f noise is its corner frequency, usually between 10 kHz and 100 kHz, which in our case is out of the signal bandwidth after chopping.

In any S-C circuit the coefficients are defined by capacitor ratios and calculated from the loop-filter poles and zeroes. Each switched capacitor produces a noise power proportional to kT/C, where C is the capacitance, T is the absolute temperature and k is Boltzmann's constant /4/. According to this model the noise power spectral density is flat from 0 to $f_s/2$. Every kT/C noise source contributes a certain amount of thermal noise, which depends on the capacitance value and on the noise-transfer function $g_i$. Different noise sources are uncorrelated, so the noise contributions at the output of the loop filter can be added according to equation (2). This is only possible for a simplified linear model of the whole modulator.

$$N_{loop} = \sum_{i=1}^{M} N_{sc_i}\int_0^{f_N} g_{sc_i}^2(f)\,df + \sum_{j=1}^{N} N_{op_j}\int_0^{f_N} g_{op_j}^2(f)\,df \qquad (2)$$

where M is the number of S-C stages, N is the number of integrator stages, which is equal to the order of the loop filter, $N_{sc_i}$ is the noise-power density of S-C stage i (independent of the frequency), $N_{op_j}$ is the noise-power density of opamp j (independent of the frequency), $g_{sc_i}(f)$ is the frequency-dependent noise-transfer function of each S-C stage, $g_{op_j}(f)$ is the frequency-dependent noise-transfer function of each opamp, $N_{loop}$ is the noise-power density at the output of the loop filter and $f_N$ is the Nyquist rate after decimation.

We can see that, assuming uncorrelated noise sources, the noise powers of the switched-capacitor stages and opamps, multiplied by the corresponding noise-transfer functions, are added at the output of the loop filter. In addition, the quantization noise power (which is assumed to be random) is also added.
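The summation in equation (2) can be illustrated with a few lines of Python. This is a minimal sketch of the linear model only: the noise-transfer functions are passed in as ordinary callables, the integrals are approximated with the trapezoidal rule, and the kT/C power is spread evenly from 0 to $f_s/2$ as stated above. All function names and the example transfer functions are assumptions made for illustration.

```python
K_BOLTZMANN = 1.380649e-23  # J/K


def ktc_noise_density(c_farad, f_s, temp_k=300.0):
    """kT/C noise power spread flat over 0..fs/2, returned in V^2/Hz."""
    return (K_BOLTZMANN * temp_k / c_farad) / (f_s / 2.0)


def band_integral(g, f_n, steps=2000):
    """Trapezoidal estimate of the integral of g(f)^2 from 0 to f_n."""
    df = f_n / steps
    total = 0.5 * (g(0.0) ** 2 + g(f_n) ** 2)
    for k in range(1, steps):
        total += g(k * df) ** 2
    return total * df


def loop_noise(sources, f_n):
    """Sum density * integral(g^2) over uncorrelated sources, as in eq. (2).

    sources: iterable of (noise_power_density, noise_transfer_function) pairs.
    """
    return sum(density * band_integral(g, f_n) for density, g in sources)


# Hypothetical two-source example: a flat path and a high-pass-like path.
f_s, f_n = 1.0e6, 4.0e3
sources = [
    (ktc_noise_density(15e-12, f_s), lambda f: 1.0),
    (ktc_noise_density(0.3e-12, f_s), lambda f: f / 1.0e5),
]
print(loop_noise(sources, f_n))
```

The second source uses a much smaller capacitor, yet its in-band contribution stays small because its assumed transfer function is tiny inside the band; this is the effect that the optimisation exploits.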
The summation is possible because at the beginning we assumed a linear model for the modulator, and so the theorem of superposition holds up to the output of the modulator. This simplified linear model is used just to get an insight into the behavior and is later replaced by a time-domain method.

[Plot: Intermediate Noise TF, $g_i = y/x_i$, versus freq [Hz]]

The 1/f noise powers of all the integrator stages, except the first, are negligible because of the high-pass characteristics of the noise-transfer functions $g_{op_j}$ (figure 4). Figure 5 shows AC noise-simulation results for the first opamp. Its thermal and 1/f noise components can be easily recognised. The AC simulation results were taken as the input for the noise-generation procedure /5/ and then time-domain simulations were performed, including chopping by $f_s/2$. The 1/f noise component has been translated around $f_s/2$.

The thermal noise at the output of the loop filter is composed of contributions from all S-C stages and opamps. The contributions are different for every S-C stage and opamp, and depend on the noise-transfer functions and the noise powers of the corresponding sources; the total noise can be optimised by calculating appropriate capacitor sizes for the switched capacitors and the thermal noise of the opamps. The optimum is reached when the contributions at the output of the loop filter are the same. Figure 6 shows the input-referred noise power spectral density of the first integrator: (a) without and (b) with chopping. It is thus possible to reduce the capacitor sizes and the noise requirements for the opamps, except for the first one. Unfortunately the unit capacitor size cannot be made smaller than, for example, $C_{unit} = 0.3$ pF; the smallest dimension is constrained by the capacitor-ratio mismatch requirements, which are obtained from the stability and gain-accuracy requirements of the modulator. The limit is a result of the synthesis procedure defined in /1/.
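The equal-contribution idea with a matching floor can be sketched as follows. This is a simplified illustration under the linear model, not the time-domain algorithm of the paper: each stage is described by one assumed dimensionless weight (its squared noise-transfer function integrated over the band), the noise budget is split evenly, and any capacitor that would fall below the floor is clamped to it. The weights, the budget and the function name are hypothetical.

```python
K_BOLTZMANN = 1.380649e-23  # J/K


def equalised_unit_caps(noise_gains, noise_budget, c_min=0.3e-12, temp_k=300.0):
    """Unit capacitors giving each kT/C source the same share of the budget.

    noise_gains[i] is the assumed in-band weight of stage i, so that
    (kT/C_i) * noise_gains[i] is its contribution at the loop-filter output.
    Capacitors below c_min are clamped to the matching floor.
    """
    if noise_budget <= 0 or not noise_gains:
        raise ValueError("need a positive budget and at least one stage")
    share = noise_budget / len(noise_gains)
    caps = []
    for gain in noise_gains:
        ideal = K_BOLTZMANN * temp_k * gain / share  # solves kT/C * gain == share
        caps.append(max(ideal, c_min))               # enforce matching floor
    return caps


# Hypothetical weights for a five-stage loop filter (not from the paper).
gains = [1.0, 0.08, 1e-3, 1e-4, 1e-5]
budget = len(gains) * K_BOLTZMANN * 300.0 / 15e-12  # chosen so stage 1 gets 15 pF
for stage, cap in enumerate(equalised_unit_caps(gains, budget), start=1):
    print(f"stage {stage}: Cu = {cap * 1e12:.2f} pF")
```

Stages that hit the floor contribute less than their share, so the total noise ends up below the budget; in that regime the area is set by matching rather than by noise.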
The unit capacitor size of the first integrator must satisfy Cu > 15 pF to achieve S/N > 110 dB at an oversampling ratio of D > 256, taking into consideration all noise sources. The unit capacitor size in the following stages drops drastically: in the third and following stages the unit capacitor size is Cu > 0.3 pF (see the table in figure 9). The lower limit is calculated from the matching-accuracy parameters for a particular process and its requirements.

Figure 4: Noise transfer function of the 5th-order modulator

Figure 5: AC noise characteristics of the first opamp

Figure 6: Spectrum at the output of the first integrator a) no chopping; b) with chopping

Despite the fact that the loop filter is linear, the whole modulator is a highly non-linear device because of the comparator and the 1-bit D/A converter. The quantization noise is assumed to be random but in reality it is not, and the signal is not small, as the AC small-signal-analysis method assumes. For this reason the Monte-Carlo time-domain method is used in our simulations. Figure 7 shows the noise simulation result for the 5th-order modulator: (a) before optimisation (without chopping) and (b) after optimisation (with chopping). The unit capacitor sizes and the thermal noise requirements for the opamps after optimisation, if an S/N ratio of 110 dB is required, are given in figure 9.
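To give a feeling for what a Monte-Carlo time-domain noise estimate looks like, the sketch below draws Gaussian kT/C noise samples and measures the power that remains after averaging blocks of consecutive samples. It is a deliberately crude stand-in: the block average replaces the decimation filter, there is no loop filter or quantizer, and it is not the noise-generation procedure of /5/. The function name, the seed and the parameter values are assumptions.

```python
import random

K_BOLTZMANN = 1.380649e-23  # J/K


def simulate_inband_ktc_noise(c_farad, osr, blocks=2000, temp_k=300.0, seed=0):
    """Monte-Carlo estimate of the kT/C noise power (V^2) left after
    averaging `osr` consecutive samples (a crude decimation stand-in)."""
    rng = random.Random(seed)
    sigma = (K_BOLTZMANN * temp_k / c_farad) ** 0.5  # rms of a single sample
    acc = 0.0
    for _ in range(blocks):
        mean = sum(rng.gauss(0.0, sigma) for _ in range(osr)) / osr
        acc += mean * mean
    return acc / blocks


estimate = simulate_inband_ktc_noise(15e-12, osr=64)
analytic = K_BOLTZMANN * 300.0 / 15e-12 / 64  # white noise: power / osr
print(estimate, analytic)
```

For white noise the estimate should approach kT/C divided by the averaging length, which shows in the simplest possible setting why both a larger capacitor and a higher oversampling ratio lower the in-band noise.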
By optimising the capacitor sizes, the area of the capacitor arrays can be reduced by almost 10 times and the power consumption can be drastically reduced because the capacitive loads are lower. This is also shown in figure 8, which represents the layout of the realized modulator. It is evident that the area of the capacitor array is the biggest for the first integrator and then smaller for the second, while the third and the following integrators have a unit capacitor size of 0.3 pF and thus need a very small area.

Figure 7: Spectrum of the bit-stream for the 5th-order modulator a) no optimisation, no chopping; b) optimisation with chopping

Integ   Cu [pF]   ΣC [pF]   Opamp noise [nV/√Hz]
1       15        66        <9
2       1.2       24        <85
3       0.3       4.4       <1000
4       0.3       2.8       <1000
5       0.3       2.0       <1000

Figure 9: Table of capacitors before and after optimisation and opamp noise requirements

Figure 8: Layout of the realized 5th-order modulator

3. OPTIMIZATION ALGORITHM

The block diagram of a general Δ-Σ modulator is presented in figure 10 /3/. Its loop filter is generally an nth-order FIR or IIR analog or sampled-data filter. The behavior of the whole modulator is most generally and efficiently described by a combination of a linear time-domain multi-input single-output state-space description of the loop filter (the state variables are the outputs of the integrators) and of a non-linear part, which describes the single-bit or multi-bit quantizer (equation 3).

Figure 10: Block diagram of Δ-Σ modulator

$$\begin{aligned} x(n) &= A\,x(n-1) + B\,v(n-1) + N\,n(n-1) \\ y(n) &= c^{T}x(n-1) + D\,v(n-1) \\ v(n) &= f\big(y(n-1)\big)\,V_{ref} \end{aligned} \qquad (3)$$

The relation between the matrix elements and the capacitors is defined for an arbitrary S-C loop filter in /5/. This model is improved by additional noise and by linear and non-linear settling parameters. For convenience the simulation can be performed with the chopping effects turned on or off.

The optimisation algorithm gets its input data from the synthesis procedure. All the capacitor ratios are known, as is the architecture of the modulator. At the beginning the program assumes equal capacitors for all integrator stages. They are calculated from the S/N and oversampling-ratio requirements. The same thermal noise requirement for the opamps is taken as a basis. The time-domain Monte-Carlo simulation is then performed for each switched capacitor and each opamp in the modulator according to /5/. The contribution of each element is calculated in the band from 0 Hz to $f_N$ and saved; then the capacitor sizes and the opamps' thermal noise parameters are adjusted according to these contributions in such a way that each noise source adds approximately the same contribution. The result is a table of unit capacitor sizes, the sum of all the capacitors for each integrator and the thermal noise requirements for the opamps. For our example the result is given in figure 9. We assumed that chopping of the first amplifier is performed, so the 1/f noise is translated around $f_s/2$ and later removed by the decimation filter. At the end a complete simulation of the modulator with optimised elements is performed. The result is presented in figure 7 (b).

4. CONCLUSIONS

A methodology for power-consumption optimisation of an S-C high-order, high-resolution modulator is presented. The main reasons for power consumption are explained and a method to reduce their effects is proposed. An algorithm coded in MATLAB is developed and the results of an optimisation for a practical example of a 5th-order single-bit Σ-Δ modulator are given. The required power consumptions of two modulators with equal architecture and different capacitors and opamps are compared. The power consumption of the optimised structure is reduced by more than 5 times compared to the non-optimised one, while the capacitor array area is reduced by almost 10 times. The advantages of the proposed algorithm are the speed and the effectiveness of the optimisation procedure and the possibility of performing the optimisation and the noise simulation in a reasonable time. It is only possible to use a simplified mathematical model for the circuits involved, which is the main limitation of the approach.

References

/1/ D. Strle, "A system design considerations of high resolution, low power Σ-Δ A/D converter", tech. rep., University of Ljubljana, Faculty of Electrical Engineering, Tržaška 25, Ljubljana, 2000.
/2/ V. Peluso, P. Vancorenland, A.M. Marques, M. Steyaert, and W. Sansen, "A 900-mV low power Σ-Δ A/D converter with 77-dB dynamic range", IEEE JSSC, vol. 33, pp. 1887-1897, Dec. 1998.
/3/ J.C. Candy and G. Temes, "Oversampling methods for A/D and D/A conversion", in Oversampling Delta-Sigma Converters, IEEE Press, 1991.
/4/ R. Gregorian and G. Temes, "Analog MOS Integrated Circuits for Signal Processing", John Wiley and Sons, 1986.
/5/ D. Strle, "Noise modelling and simulation of high resolution Σ-Δ A/D converters", Proceedings of the 1997 International MIDEM Conference, pp. 148-154, 1997.

Dr. Drago Strle, dipl. ing.
University of Ljubljana, Faculty for electrical engineering, Tržaška 25, 1000 Ljubljana, Slovenia
e-mail: drago.strle@fe.uni-lj.si

Prispelo (Arrived): 05.02.01 Sprejeto (Accepted): 07.03.01

UDK621,3:(53+54+621+66), ISSN0352-9045 Informacije MIDEM 31(2001)1, Ljubljana

FREQUENCY WARPING AND CHAOTIC BEHAVIOUR GENERATED BY SPICE

Boštjan Peršič, Niko Basarič
University of Ljubljana, Slovenia

Keywords: physics, electrotechnics, circuit analysis, SPICE computer tools, DSP, Digital Signal Processing, frequency warping, numerical integrations, Z-transform, LAPLACE transform, chaos

Abstract: The numerical procedure of the SPICE simulator is a compromise between accuracy and speed, so for the sake of efficiency the simulator can yield errors. One source of error is frequency warping, which is discussed in this article. The simulator is presented as a digital signal processor.
General expressions describing the distortion that arises from a time-discrete treatment of a continuous signal are derived in the frequency domain by comparing the Z-transform of the simulator model to the Laplace transform of the model of the actual circuit. The theoretical derivations are illustrated by examples treating autonomous circuits. In the first example, the SPICE transient analysis is analysed and the discrepancy between the actual and the calculated responses is presented in the time domain. In addition to frequency warping, the discrete treatment can generate chaotic behaviour. This is presented in the second example, an analysis of a CL oscillator. This example demonstrates the influence of the magnitude of the numerical integration step. It is shown that if the step is not limited, the simulation does not follow the behaviour of the circuit and a response of unpredictable shape is output. The calculated response seems to be chaotic, despite the actual circuit having a closed limit cycle.

Frekvenčno izkrivljanje in kaotično obnašanje simulatorja SPICE (Frequency Warping and Chaotic Behaviour of the SPICE Simulator)

Ključne besede (keywords): physics, electrical engineering, circuit analysis, SPICE computer tools, DSP digital signal processing, frequency warping, numerical integration, Z-transform, LAPLACE transform, chaos

Povzetek (summary): The numerical procedures used by the SPICE simulator are a compromise between efficiency and accuracy. The article describes some aspects of frequency warping and of the chaotic behaviour of a CL oscillator when it is analysed with the SPICE simulator. The simulator is presented as a digital signal processor. General analytical expressions for the distortions that arise from the discrete treatment of a continuous signal were derived in the frequency domain by comparing the Z-transform of the response of the simulator model with the Laplace transform of the response of the actual circuit. The theoretical derivations are illustrated by two examples of the analysis of autonomous circuits. The first example treats a linear circuit.
The errors of the SPICE transient analysis are shown in the state space and in the time domain. The second example is a non-linear circuit, in which the influence of the size of the numerical integration step is emphasised. If the maximum permitted numerical integration step is too large, the simulator's analysis does not follow the behaviour of the actual circuit but generates unpredictable results. The calculated output signal then appears chaotic, although the circuit considered has a closed limit cycle.

1. INTRODUCTION

Implementation in microelectronics technology is impossible without the use of a wide palette of CAD tools. The basic tool, which yields circuit responses at the level of elements, is the SPICE simulator (Simulation Program with Integrated Circuit Emphasis) /1/. The simulation is performed on a digital computer, so its behaviour is similar to digital signal processing. Digital signal processing does not encompass the entire signal; it treats only a chain of discrete samples, and the samples can take only a limited number of values. This discrete approach introduces some inevitable impairment of the signal. The transformation of the magnitude from continuous to discrete values is a distortion that appears as quantization noise. The amount of the noise is proportional to the distance between two neighbouring levels. Similar distortion is caused by errors emanating from unavoidable truncations and by rounding off the intermediate results of mathematical operations /2,3/. Naturally, the noise impairs the calculated signals.

In addition, manipulation of a signal in discrete instants of time is the second source of errors /2/. The sampling theorem states that an original signal can be reconstructed from its samples if they are nearer each other than half a period of the highest spectral component.
If this condition is violated, the spectrum transposed around multiples of the sampling frequency is added to the baseband spectrum, which causes irreparable corruption of the original signal. A detailed analysis reveals that the treatment of a signal in discrete instants introduces distortion even if the samples are close enough to fulfil the requirement of the theorem. This distortion is called frequency warping. Its size depends on the ratio between the frequency of the signal and the sampling frequency. Assuming that an exact result would be obtained by incorporating an infinite number of samples, the difference between the actual result, acquired with a limited number of samples, and the exact one can be defined as the error. The quantitative estimate of frequency warping is normally done in the frequency domain by comparing the Z-transform of the discrete system to the Laplace transform of its continuous counterpart /3, 4/. The error caused by the discrete approach appears as if the observed system were excited at a different frequency, which changes the shape of the response /3/.

B. Persic, N. Basaric: Frequency Warping and Chaotic Behaviour Generated by SPICE
Informacije MIDEM 31(2001)1, str. 26-32

In some applications, analog circuit simulators evaluate the response at discrete points of time; therefore, they have all the essential properties of a digital signal processor. This feature is also present in the SPICE simulator. The result of the SPICE transient analysis can be presented as the output of a digital integrator /5, 6/, whereby the sampling time, the numerical accuracy and the integration method can be selected. Due to its discrete nature, numerical integration causes the distortion mentioned above. Contrary to standard digital signal processing, SPICE does not maintain a constant distance between samples.
In order to accelerate the analysis, the simulator keeps enlarging the integration step until the controlled error surpasses the chosen threshold value. Interrupting a lengthy step by an abrupt change in a signal can be another source of errors. The error can even transform an undoubtedly unstable response into a stable one /7/. In addition, it can cause completely irregular results /8/. Although a Colpitts oscillator can genuinely behave chaotically /9,10/, even a circuit which has a simple closed limit cycle with no bifurcation can yield a strange simulated response with no steady state. It has been confirmed that such unexpected results are consequences of the imprecise numerical integration of the simulator.

2. MODEL OF SPICE TRANSIENT ANALYSIS

SPICE transient analysis has been created to be a numerical solver of differential equations. In fact, the analysis has been adapted to cover non-linear circuits and uses implicit integration methods /1/, where the magnitudes of the signals and their derivatives are calculated simultaneously from previous values. However, since we wish to avoid an overly intricate explanation by omitting facts not essential to our topic, the circuit analysed is assumed to be linear and is described by a system of equations in normal form. The excitation vector, the state vector and its time derivative are denoted by $u(t)$, $y(t)$ and $\dot{y}(t)$ respectively. A and B are the corresponding matrices.

$$\dot{y}(t) = A\,y(t) + B\,u(t) \qquad (1)$$

The explicit form of the signal y(t) can be obtained by integrating the differential equation.

$$y(t) = \int^{t}\dot{y}(\tau)\,d\tau = \int^{t}\big(A\,y(\tau) + B\,u(\tau)\big)\,d\tau \qquad (2)$$

The simulator captures and manipulates signals at discrete time instants only. Suppose that these instants are equidistant and that they are h units of time apart. The vectors can then be replaced by their samples: $u(t) \rightarrow u(kh) = u_k$, $y(t) \rightarrow y(kh) = y_k$, $\dot{y}(t) \rightarrow \dot{y}(kh) = \dot{y}_k$. The usual numerical integration technique exploited by the simulator is the trapezoidal method.
The procedure for one integration step is presented by (3), where h denotes the time step.

$$y_{k+1} = y_k + \frac{h}{2}\,\dot{y}_k + \frac{h}{2}\,\dot{y}_{k+1} \qquad (3)$$

The derivatives of the state vector in the equation above can be replaced by the right side of equation (1) at the matching instants. After some rearrangement, (4) is obtained:

$$y_{k+1} = y_k + h\,\frac{A}{2}\,(y_k + y_{k+1}) + h\,\frac{B}{2}\,(u_k + u_{k+1}) \qquad (4)$$

A system operating according to equation (4) is sketched in Figure 1. The block $z^{-1}$ depicts a time delay lasting one integration step. The input signals of the middle summator are formed by the mean value of the present and past samples of the output signal and the excitation, multiplied by A and B respectively. The integrator is presented as an accumulator.

Fig. 1. Model of the trapezoid method

Figure 1 confirms that numerical integration can be treated similarly to digital processing of signals, and that consequently the SPICE transient analysis can be regarded as a digital signal processor. The logical conclusion is that this task of the simulator has all the main features, including the deficiencies, of a processor of this kind.

3. FREQUENCY WARPING IN THE FREQUENCY DOMAIN

Frequency warping is the phenomenon whereby a discrete system corresponds to its continuous counterpart at a different frequency. The discrepancy increases significantly as the frequency of the signal approaches the limit set by the sampling theorem. An error can be caused if this fact is neglected when analysing an oscillator /4/. Our quantitative description of the phenomenon inside the SPICE analysis is divided into steps, as follows.

3.1. Comparison of Transfer Functions

The transfer function of the continuous integrator can be obtained by performing a Laplace transform on the basic equations (1) or (2) that describe the behaviour of the system.

$$H(s) = \frac{Y(s)}{U(s)} = \frac{B}{s - A} \qquad (5)$$

Equation (3) presents the signal in discrete time instants; therefore, the natural approach is a Z-transform. The transfer function of a numerical integrator which exploits the trapezoidal method is given by (6).

$$H(z) = \frac{Y(z)}{U(z)} = \frac{B}{\dfrac{2}{h}\,\dfrac{z-1}{z+1} - A} \qquad (6)$$

A comparison of equations (6) and (5) confirms that the discrete approach alters the transfer function of a system. Consequently, the change in the transfer function results in a difference between the responses of the discrete system and its continuous (i.e. actual) counterpart. Nevertheless, it can be noticed that the structure of equation (5) is equal to the structure of (6), except that the variable s is replaced by the fraction. The transfer functions are identical if the mapping defined by equation (7) is introduced.

$$s = \frac{2}{h}\,\frac{z-1}{z+1} \;\Rightarrow\; z = \frac{2 + hs}{2 - hs} \qquad (7)$$

As the variables of the transforms have both real and imaginary parts, the last equation describes a two-dimensional mapping. The map of a significant curve is additionally highlighted in the next subsection.

3.2. Map of s-Plane Ordinate to z-Plane

The continuous complex frequency s is expressed as a sum of the real and imaginary parts ($s = \sigma + j\omega$), and the variable z by its polar co-ordinates ($z = r\,e^{j\omega h}$). To distinguish the frequencies, an upper index in parentheses is added for the discrete signal ($\omega_{discrete} \Rightarrow \omega^{(d)}$). Harmonic signals are represented in the s-plane by its ordinate (the imaginary axis). As our main concern is studying the relations between the frequencies of continuous and discrete systems, a map of this curve into the z-plane can explain the point. After replacing the variable s in equation (7) by $j\omega$ and after some manipulation, the relation presented by (8) can be found.

$$r = 1; \qquad \omega^{(d)} = \frac{2}{h}\arctan\frac{\omega h}{2} \qquad (8)$$

The map of the ordinate of the s-plane is a circle in the z-plane, as shown in Figure 2. The frequency of the discrete system corresponds to the phase in the z-plane.

Fig. 2. Map of the ordinate of the s-plane into the z-plane according to the trapezoidal method.
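The link between the trapezoidal step, the mapping (7) and the relation (8) can be checked numerically. The Python sketch below uses a scalar test system $\dot{y} = a\,y$ with $u = 0$, for which equation (4) gives a constant ratio between successive samples; for $a = j\omega$ that ratio should lie on the unit circle and rotate by $\omega^{(d)}h$ per step. The function names and the 1 kHz test tone sampled ten times per period are assumptions made for the illustration.

```python
import cmath
import math


def trapezoidal_multiplier(a, h):
    """Ratio y[k+1]/y[k] for dy/dt = a*y under the trapezoidal rule.

    Follows from eq. (4) with u = 0 and equals the map (7) evaluated at s = a.
    """
    return (2 + h * a) / (2 - h * a)


def discrete_frequency(omega, h):
    """Eq. (8): angular frequency seen by the discrete system."""
    return (2.0 / h) * math.atan(omega * h / 2.0)


# Assumed test tone: 1 kHz sampled ten times per period.
omega = 2 * math.pi * 1e3
h = 1e-4
z = trapezoidal_multiplier(1j * omega, h)
print(abs(z))                        # radius r of the mapped point
print(cmath.phase(z) / h)            # phase advance per unit time
print(discrete_frequency(omega, h))  # eq. (8) for comparison
```

The radius stays at one, which reflects the absence of amplitude distortion in the trapezoidal method, while the phase advance per step is smaller than $\omega h$, so the simulated oscillation runs slightly slower than the actual one.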
The relation between continuous and discrete frequencies, dictated by equation (8), is presented in Figure 3. The abscissa is proportional to the frequency in the continuous (actual) system, and the ordinate to the frequency of its discrete counterpart (the simulated system). The scales of both axes are normalised by the sampling frequency, which is calculated from the time step ($f_s = 1/h$).

Fig. 3. Relation between the frequencies of the continuous system and the discrete system if equal responses are required.

The curve reveals the relation between the systems. It can be inferred that the test tone frequency inside the simulator ($f^{(d)}$) has to differ from the actual frequency ($f$) if the same magnitude of response is required. If the ratio between the test and the sampling frequency is small, the value of the arctan function is almost equal to its argument and the differences between the responses are hardly observed. When the ratio approaches or even exceeds one half, the corresponding discrete frequency is significantly different. This non-linear relation emanates from the treatment of the signal in discrete time instants.

3.3. Approximation

Equation (8) is the exact description of the distortion of a signal due to the treatment of a continuous system by the simulator. Inverting it gives $\omega = \frac{2}{h}\tan\frac{\omega^{(d)}h}{2}$, so the error can be easily estimated if the non-linear tangent function is expanded into a truncated Taylor series. The result of the procedure is equation (9). A detailed analysis reveals that with ten samples per period, the first neglected term is approximately 4% of the last included term.

$$\omega \approx \omega^{(d)}\left[1 + \frac{\big(\omega^{(d)}h\big)^2}{12}\right] \qquad (9)$$

As the integration step of the SPICE transient analysis is adapted automatically to changes in the signal /11/, the number of steps in a period is normally greater than ten, especially if the signal is formed by a single spectral component.
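The quality of the approximation (9) at ten samples per period can be checked with a short calculation. The Python sketch below compares the exact inverse of (8) with the truncated series and evaluates the ratio of the first neglected Taylor term of the tangent, $2x^5/15$, to the last included one, $x^3/3$, with $x = \omega^{(d)}h/2$. The function names and the chosen step are assumptions for illustration.

```python
import math


def actual_frequency_exact(omega_d, h):
    """Inverse of eq. (8): omega = (2/h) * tan(omega_d * h / 2)."""
    return (2.0 / h) * math.tan(omega_d * h / 2.0)


def actual_frequency_approx(omega_d, h):
    """Eq. (9): truncated Taylor series of the tangent."""
    return omega_d * (1.0 + (omega_d * h) ** 2 / 12.0)


def neglected_term_ratio(omega_d, h):
    """First neglected Taylor term of tan(x), 2x^5/15, over the last kept one, x^3/3."""
    x = omega_d * h / 2.0
    return (2.0 * x ** 5 / 15.0) / (x ** 3 / 3.0)


# Ten samples per period of the simulated signal (step h chosen arbitrarily).
h = 1e-4
omega_d = 2 * math.pi / (10 * h)
exact = actual_frequency_exact(omega_d, h)
approx = actual_frequency_approx(omega_d, h)
print(neglected_term_ratio(omega_d, h))
print((exact - approx) / exact)
```

The first printed value is the ratio quoted in the text, and the second is the relative error of the frequency estimate itself, which is much smaller because the neglected term is already a correction to a correction.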
With this in mind, the approximation of the distortion seems acceptable for obtaining a fair estimate of the frequency error produced by the simulator.

4. EXAMPLE I - FREQUENCY WARPING

This section illustrates the subject through an example. It aims to display the typical effect of frequency warping and to clarify the relation (9).

4.1. The circuit

The simplest form of an autonomous circuit producing a response that incorporates only the imaginary part of the complex frequency ($s = j\omega_0$) is a CL combination with an initial state (u(0), i(0)) different from zero. The circuit is presented in Figure 4. It can be described by a homogeneous second-order linear differential equation. Its solution is an undamped oscillation.

$$\frac{d^2u(t)}{dt^2} + \frac{u(t)}{CL} = 0, \qquad u(t) = A\sin(\omega_0 t + \varphi)$$

Fig. 4. The circuit in Example 1

4.2. Numerical Integration in the Time Domain

Frequency warping in the example is demonstrated in the time domain. The trapezoidal method has no amplitude distortion /3/; therefore, every point of the signals acquired by numerical integration lies on a sine curve with the given constant amplitude. The trapezoidal algorithm (3) includes the initial and the final derivatives of the signal within each integration step. For convenience, the step can be divided into two equal parts. During the first part, the signal of the discrete integrator is assumed to change according to the slope at the initial point. This slope can be unambiguously found from the known point of the signal. During the second half of the interval, the simulator follows the tangent at the final point. Presenting the outcome without calculation is rather intricate, as the final point is unknown. For a particular case, the difficulty can be bypassed by choosing an initial point on the abscissa ($y_0 = 0$) and by setting the magnitude of the integration step to the value at which the signal reaches its amplitude during the first half of the step.
With this intermediate result, the only solution at the end of the integration step is the slope parallel to the abscissa, which holds only at the maximum value of the harmonic signal. The point after the second integration step is obtained similarly. The first move within the step is parallel to the abscissa, and the second has maximum slope: therefore the abscissa is reached again, as shown In Figure 5. Fig. 5. Frequency warping demonstrated in time domain. The three points describe half a period of the calculated signal. It can be observed in Figure 5 that the peak of the signal obtained is shifted rightwards compared to the actual signal. The delay is depicted by At. Sequential Integration steps add the same delay and the period is prolonged, which means that the frequency of the response is lowered. The difference can be approximated by (9). 5. EXAMPLE II-CHAOS As noted, transient analysis of the simulator consists of numerical integration with discrete time increments. A circuit has to be solved inside each step, which results in a considerable amount of calculations if the increments are tiny. To output a result in a reasonable 29 Informacije MIDEM 31(2001)1, str. 26-32 B. Persic, N. Basaric: Frequency Warping and Chaotic Behaviour Generated by SPICE time, the standard SPICE simulator tends to enlarge the increment towards IT, -T, J/50 or T .whichever v stop start> max1 is smaller/11/. The times mentioned are the second, third and fourth parameters in the .TRAN statement respectively. Another limitation stems from the slopes of signals. Slowly varying signals enlarge the time increment, and vice versa. If an abrupt change occurs in a signal, the integration step shrinks gradually. The gradual change and asynchronism between the integration instants and the abrupt change can be another source of errors. 5.1. The circuit The active element of the circuit in Figure 7 is a general purpose NPN transistor in a common emitter orientation, loaded by a resistor. 
Two capacitors and an inductor form the feedback. The latter provides the quiescent current into the base, so that the transistor operates near saturation. Note that the ratio of capacitances causes a considerable signal at the base; therefore, the transistor is forced to traverse a strongly non-linear region. The non-linearity yields a significant deviation of the actual frequency of oscillation (18.5 kHz) from the resonant frequency of the passive π feedback circuit (8.9 kHz).

Fig. 7. The oscillator analysed (Ucc = 5 V, Q = 2N2222, R = 1 k, C1 = 47 nF, IC = 0.6074422 V, C2 = 100 nF, IC = 1.361650 V, L = 10 mH, IC = 28.98162E-6 A)

5.2. Results of simulations

The first solution presented is referred to as exact. By setting a small enough time between outputted results ($T_{step}$) and a small enough maximum time increment of the numerical integration ($T_{max}$), and by starting the simulation with adequate initial conditions, an actual steady-state response is obtained. Observation of the actual time increment of the integration in the raw file confirmed that the chosen $T_{max}$ defines the time increment inside the entire period. Figure 8 displays the current into the base and the collector-emitter voltage in the time domain, and the current through the inductor in the frequency domain.

Fig. 8. Base current, base-emitter voltage and the spectrum of the inductor current ($T_{step} = T_{max} = 50$ ns)

Leaving $T_{max}$ unchanged and with $T_{step}$ significantly increased, the simulation is repeated. Figure 9 shows the results in the state domain formed by the voltage across the capacitor at the collector side and the inductor current. The upper part of the trajectory is split. The inset explains that the vertices of the curves are merely different points of the original trajectory. Thus, the distortion appears because the signals are presented using a small number of points.

Fig. 9. The trajectory with increased $T_{step}$ ($T_{step} = 1.5$ µs, $T_{max} = 50$ ns)

A simulation with slightly increased $T_{max}$ causes the spectral lines to be lower and wider and the noise floor to rise. In the state domain, the calculated points diverge randomly from the original trajectory, but the limit cycle can be unambiguously confirmed.

Omitting $T_{max}$ leads to an unpredictable output of the simulator (Figure 10). The spectrum has some peaks around the original frequency of oscillation. The spikes are less than 6 dB above their vicinity, so all spectral components are significant inside a limited interval. In addition, the trajectory obtained is not a closed curve; moreover, it seems chaotic.

Fig. 10. The spectrum and the trajectory ($T_{step} = 1.5$ µs, $T_{max}$ omitted).

5.3. Comments

Observing the current at the base side (Figure 8), it can be inferred that significant changes occur when the transistor passes from the cut-off to the active region. As the amplifier is idle for almost the entire period except inside this short interval, the slopes of the signals are defined mainly by the passive feedback. Thus, all signals are smooth before the spike of the base current occurs. Inspection of the raw results shows that the integration increment has its maximum allowed value at the beginning of the abrupt change. Overly large steps cause the calculated base current to alternate. If this numerically provoked alternation is not damped enough, the process can continue through the entire period. Since the moments of calculation are not synchronised with the signal, they do not fall at the same points of the response in every cycle. Consequently, the calculated amount of charge injected into the base varies randomly from cycle to cycle, so the response is aperiodic.

6. CONCLUSION

The aim of the contribution is to highlight some deficiencies of the SPICE simulator.
In addition, it explains the origin of frequency warping and of the irregular behaviour caused by treating a signal at discrete time instants. Considering equidistant time steps, a similar approach is found in digital signal processing, so the transient analysis of the SPICE simulator is presented as a digital signal processor. Expressions for the distortion caused by numerical integration are derived in the frequency domain. It is shown that the discrepancy between a continuous system and its discrete counterpart increases as the time step approaches the period of the signal. A simple electronic circuit, whose solution is a harmonic oscillation, is presented to illustrate these features. The circuit is simulated using numerical integration. Within this example, the distortion is additionally examined in the time domain.

In addition, the SPICE simulator varies the time step of the numerical integration, which introduces another error, treated in the second example. The circuit used in the example is a Colpitts oscillator. It has been confirmed that the simulator outputs periodic signals if adequate controls are set. However, the automatic adaptation of the time increment leads to an aperiodic response.

It must be stressed that the SPICE simulator is an excellent tool for circuit analysis, and it usually accomplishes its job without requiring additional action from its users. Sometimes the effects mentioned above prevail, so the outcomes contain errors. We hope that the paper clearly elucidates some unpleasant features of the SPICE simulator, and trust that consideration of these features and their consequences will facilitate the detection of false results. We advise that the simulator's outputs be inspected and, if they seem doubtful, that additional controls be applied. As the examples illustrate, the most powerful control is the limitation of the time step.

References

/1/ L.W. Nagel, A Computer Program to Simulate Semiconductor Circuits. Memorandum No. ERL-M520, Berkeley, 1975.
/2/ James V. Candy, Signal Processing: the Modern Approach. McGraw-Hill, NY, 1988.
/3/ Andreas Antoniou, Digital Filters: Analysis, Design and Applications. McGraw-Hill, NY, 1993.
/4/ B. Persic, "SPICE Transient Analysis Errors Estimated in Frequency Space", IEE Electronics Letters, Vol. 30, No. 8, pp. 617-8, April 1994.
/5/ A. Brambilla, D. D'Amore, "The Simulation Errors Introduced by the SPICE Transient Analysis," IEEE Trans. CAS-I, Vol. 40, No. 1, pp. 57-60, Jan. 1993.
/6/ K.G. Nichols, T.J. Kazmierski, M. Zwolinski, A.D. Brown, "Overview of SPICE-Like Circuit Simulation Algorithms," IEE Proc.: Circuits Devices Systems, Vol. 141, No. 4, pp. 242-250, August 1994.
/7/ P. Kinget, J. Crols, M. Ingels, E. Peluso, "Are Circuit Simulators Becoming Too Stable", IEEE Circuits and Devices Magazine, Vol. 10, No. 3, pp. 50, May 1994.
/8/ B. Persic, I. Medic, "Chaotic Results of the SPICE Simulator," Proceedings of ECCTD '97, pp. 1226-1230, Budapest 1997.
/9/ G. Sarafian, B. Kaplan, "Is the Colpitts Oscillator Chua's Circuit", IEEE Trans. CAS-I, Vol. 42, No. 6, pp. 373-6, 1995.
/10/ M.P. Kennedy, "On the Relationship Between Chaotic Colpitts Oscillator and Chua's Circuit", IEEE Trans. CAS-I, Vol. 42, No. 6, pp. 376-9, 1995.
/11/ SPICE 3F4 online help, Berkeley, 1993.

Boštjan Peršič
University of Ljubljana
Tržaška 25, 1000 Ljubljana, Slovenia
bostjan.persic@fe.uni-lj.si

Niko Basarič
University of Ljubljana
Tržaška 25, 1000 Ljubljana, Slovenia
niko.basaric@fe.uni-lj.si

Prispelo (Arrived): 23.06.00    Sprejeto (Accepted): 22.11.00

UDK621,3:(53+54+621+66), ISSN0352-9045    Informacije MIDEM 31(2001)1, Ljubljana

VITERBI ALGORITHM FOR DSP PROCESSORS
(VITERBIJEV ALGORITEM ZA DSP PROCESORJE)

Srečo Plevel, Tomaž Javornik, Igor Ozimek, Roman Trobec and Gorazd Kandus
Institut Jožef Stefan, Ljubljana, Slovenia

Keywords: digital communication systems, mobile wireless devices, coding, decoding, Viterbi algorithm, DSP, digital signal processing, DSP processors, convolutional coding, hard decoding, soft decoding, TMS320C4x Texas Instruments DSP processors, TMS320C6x Texas Instruments DSP processors

Abstract: Modern digital communication systems, especially mobile wireless devices, are playing an increasingly important role in everyday life. For reliable data transmission and efficient utilisation of the limited frequency spectrum, bandwidth-efficient modulation and coding must be employed. An important coding technique is convolutional coding, with decoding by the Viterbi algorithm. Modulation and coding procedures are relatively demanding for digital real-time processing, and until recently they were mostly implemented in specialised integrated circuits. The development of ever more powerful general-purpose signal processors has reached the point where a software implementation of these procedures is viable. This approach, known as software radio, has a number of advantages. It allows easy modification and adaptation to various communication systems and standards. The article describes the implementation of the Viterbi algorithm for two families of general-purpose digital signal processors, TMS320C4x and TMS320C6x. Various optimisations are presented, together with the resulting performance in terms of achievable bit rates and bit error rates for hard and soft Viterbi decoding.

1. Introduction

In modern digital telecommunication systems, channel coding is used to reduce the bit error probability. Channel codes insert additional bits into the data stream, which allow the receiver to correct transmission errors. Channel coders are divided into block and convolutional coders. In block coding, the bit stream is divided into individual blocks and additional (redundant) bits are appended to each of them.
Decoding (with error correction) is performed independently on each block, after which the output bits are reassembled into a continuous bit stream, with a delay caused by receiving and processing each block. In convolutional coding, encoding and decoding proceed continuously on an uninterrupted bit stream. Each symbol at the encoder output depends on the current input symbol and on the encoder state, which in turn depends on a certain number of preceding input symbols. The number of output bits exceeds the number of input bits by some factor, which provides redundancy and hence the ability to correct errors.

Various algorithms are used for decoding convolutionally coded signals, most commonly the Viterbi algorithm. A range of integrated circuits for decoding convolutionally coded signals is available on the market, but most of them are tied to a particular convolutional encoder and thus to a particular communication system. In the last few years the concept of software radio has emerged in mobile communications. Its characteristic feature is that the radio interface changes according to the services requested by the user and to the state of the radio channel between the receiver and the transmitter. Digital signal processors (DSPs) are among the main building blocks of a software radio system. We adapted the Viterbi algorithm so that it can run on digital signal processors of the Texas Instruments TMS320C4x and TMS320C6x families.

Informacije MIDEM 31(2001)1, str. 33-38    S. Plevel, T. Javornik, I. Ozimek, R. Trobec, G. Kandus: Viterbijev algoritem za DSP procesorje

2. Description of convolutional coding

Convolutional coding is illustrated with a simple example of an encoder with one input and two outputs, in which the 2-bit output symbol depends on the current input bit and on the two previously sent bits. This means that the encoder has two registers and therefore 2^2 = 4 possible internal states. The block diagram of the encoder is shown in Figure 1. Convolutional encoders are often denoted by the triple (number of output bits, number of input bits, number of registers); in our case the designation is (2,1,2). This notation does not specify the connections between the delay elements and the outputs, so it does not define the encoder uniquely. If the bits of the internal state are denoted x1 and x2, and the input bit x0, the encoder output is given by the following two equations:

y1 = x0 + x1 + x2
y2 = x0 + x2

Figure 1: Convolutional encoder (2,1,2)

Here the sign + denotes the modulo-2 sum, i.e. the exclusive-or (xor) operation. At each transition from the current to the next state the two output bits of the encoder are uniquely determined by the transition; their values are written above each transition in Figure 2. The output bit stream of the described convolutional encoder has twice as many bits as the original uncoded data bit stream at the input and therefore requires twice the transmission rate in exchange for greater reliability.

Registers X1 and X2 represent the internal state of the encoder, and register X0 is the encoder input. When a new bit arrives at the encoder input, it is stored in the leftmost register (X0 in the figure), and all the others shift one place to the right. The index of the internal state is simply the binary representation of registers X1 and X2.

The state transition diagram is shown in Figure 2. Black arrows denote state transitions for input bit 0, and white arrows those for input bit 1. The binary numbers in the circles are the states of the binary registers of the convolutional encoder's memory. The diagram shows that at each step the new input bit enters the most significant binary position (from the left), while the least significant (rightmost) bit is dropped.

Figure 2: State transition diagram of the convolutional encoder (2,1,2)

3. Viterbi decoder

The Viterbi algorithm is commonly used to decode a convolutionally coded signal. (It is also used in cases where the data are not convolutionally coded, but a non-ideal transfer characteristic of the communication channel causes the same effect, i.e. mutual influence between data bits or symbols.) Errors occur on transmission paths. If the errors on the transmission path are statistically independent and do not occur in bursts, decoding of a convolutionally coded signal with the Viterbi algorithm is optimal. Error bursts, which are common in telecommunication systems, are avoided by scrambling the transmitted bits over a longer time interval.

The decoder has done its job if it determines how the internal states of the encoder evolved while it was encoding the data for transmission. By convention, the encoder starts in state 0 (all bits of the internal memory are zero). The Viterbi algorithm builds a directed graph of all possible paths through the trellis diagram from the initial state 0. The trellis diagram is obtained by unrolling the state diagram along the time axis. Each node represents an internal state at a given time, and each edge represents a possible transition between internal states of the encoder. For each edge we compute the Hamming distance between the received symbol and the encoder output symbol for that edge. (The Hamming distance is a simple arithmetic sum of errors; in the binary case it is just the number of erroneous bits.) The minimum path in the graph represents the most likely sequence of encoder states. The length of each path equals exactly the number of erroneous bits.

The Viterbi algorithm uses the principle of dynamic programming, which states that for each node in the graph only the best path to it needs to be remembered. The other paths into the node are discarded. For each node, the Viterbi algorithm computes the cumulative metric of all paths leading into the node. In each node it keeps only the best path (the path with the smallest metric). At the end it finds, among all nodes, the node with the minimum metric and determines the decoder output symbol from the description of the path into that node.
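The (2,1,2) encoder of Section 2 and the decoding procedure described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it keeps the whole surviving input sequence for every state instead of a finite decoding window, and the function names (`step`, `encode`, `viterbi_decode`) are hypothetical.

```python
from typing import List, Tuple

N_STATES = 4  # two registers x1, x2 give 2**2 internal states


def step(state: int, x0: int) -> Tuple[int, Tuple[int, int]]:
    """One encoder transition; returns (next_state, (y1, y2))."""
    x1, x2 = (state >> 1) & 1, state & 1   # state index is the binary word x1 x2
    y1 = x0 ^ x1 ^ x2                      # y1 = x0 + x1 + x2 (mod 2)
    y2 = x0 ^ x2                           # y2 = x0 + x2      (mod 2)
    return (x0 << 1) | x1, (y1, y2)        # new bit enters on the left, x2 drops out


def encode(bits: List[int]) -> List[Tuple[int, int]]:
    state, out = 0, []                     # the encoder starts in state 0
    for x0 in bits:
        state, symbol = step(state, x0)
        out.append(symbol)
    return out


def viterbi_decode(symbols: List[Tuple[int, int]]) -> List[int]:
    inf = float("inf")
    metric = [0.0] + [inf] * (N_STATES - 1)        # only state 0 is a valid start
    paths: List[List[int]] = [[] for _ in range(N_STATES)]
    for r1, r2 in symbols:
        new_metric = [inf] * N_STATES
        new_paths: List[List[int]] = [[] for _ in range(N_STATES)]
        for state in range(N_STATES):
            if metric[state] == inf:
                continue                           # state not reachable yet
            for x0 in (0, 1):
                nxt, (y1, y2) = step(state, x0)
                cand = metric[state] + (r1 ^ y1) + (r2 ^ y2)   # add Hamming distance
                if cand < new_metric[nxt]:         # keep only the best path into nxt
                    new_metric[nxt] = cand
                    new_paths[nxt] = paths[state] + [x0]
        metric, paths = new_metric, new_paths
    best = min(range(N_STATES), key=lambda s: metric[s])
    return paths[best]


message = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1, 0, 0, 0, 0]
received = encode(message)
r1, r2 = received[4]
received[4] = (r1 ^ 1, r2)                         # one bit error on the channel
decoded = viterbi_decode(received)
print(decoded == message)
```

In this example a single flipped bit in the received stream is corrected, because every competing path through the trellis accumulates a larger Hamming distance than the transmitted one.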
Although the convolutional encoder has a finite length (number of registers), a decoder performing the reciprocal function would, for theoretically optimal decoding, have to compute at every instant the metric of the entire path from the start of transmission. In practice this is of course not possible, since the decoder must have a finite length. The error with respect to the optimal solution becomes negligible if the decoder length (the number of nodes on the path considered in each computation) is equal to or greater than roughly four times the number of encoder registers.

3.1 Use of tables to speed up the Viterbi algorithm

The Viterbi algorithm is sped up by using tables. The number and kind of tables depend on the amount of memory available and on the size of the convolutional encoder. As a first step we decided to implement the Viterbi algorithm in C on a single processor. Reference /1/ presents an approach based on precomputed tables that the algorithm uses for faster computation. The optimisation targets execution speed, while the amount of memory used is not important. The size of the tables grows exponentially with the number of encoder registers, which is, however, relatively small (typically less than 10). The following tables are used:

Table of all possible transitions between states, MOrder (Mesh Order): The first important table is the table of all possible transitions between states for each input symbol. A table entry gives the states from which a given state can be reached. With this table we immediately obtain all the states from which a transition into a given state is possible, which is of key importance for a fast search for the most probable path.

Table of all precomputed Hamming distances, Distan: This is the most important table. It stores all possible Hamming distances for each path and each input symbol. The table is three-dimensional. The first dimension selects the input symbol, the second selects which of the paths is meant, and the third selects the final state. Reading from this table replaces the relatively slow on-the-fly computation of distances, which speeds up the decoding algorithm several times.

Tables of the current minimum distances to the nodes, OldMetric and NewMetric: Two further tables are needed, which hold the current metric (minimum Hamming distance) to each internal state.

Table of optimal transitions between nodes, NewSurviv: This table stores, for each node, the last 32 decisions on which path survived. At each step the bits in the table are shifted one place to the left; if the second path survives, the word is filled from the right with 1, otherwise with 0. After 32 steps the word thus holds 32 bits that record, in order, which path survived.

The operation of the decoder, illustrated by the diagram in Figure 3, is as follows. We read the input symbol, call it Symb. Hamming distances are computed over all internal states. At each step two paths enter each node; one of them survives and the other is discarded. When the state being processed is state, MOrder[state][0] and MOrder[state][1] give the two states from which a path in the state transition diagram leads into the final state state. Call them s0 and s1. To the old metric OldMetric[s0] we add the Hamming distance for the last branch, Distan[Symb][0][state]. The same is repeated for the second path (OldMetric[s1], to which we add Distan[Symb][1][state]). The surviving path, and with it the new metric of node state, is obtained by writing the smaller sum into NewMetric[state]. The bits in NewSurviv[state] are shifted one place to the left each time; the leftmost bit is dropped, and the new value, which tells which path survived in the last step, is written in from the right.

The output is determined every 16 received symbols. The algorithm first finds the state with the minimum metric and then uses the NewSurviv table to compute the most probably transmitted data. Only the 16 data bits that were sent 32 symbol intervals earlier are output.
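The table-driven step described above might look as follows for the small (2,1,2) encoder of Section 2. This is a sketch under stated assumptions rather than the authors' C code: the table names follow the text, but the way the tables are built, the helper names (`branch_output`, `acs_step`, `decode_block`, `encode`) and the start metric `BIG` are illustrative. For brevity the sketch decodes one block of at most 32 symbols and traces back over the whole block, instead of emitting 16 bits every 16 symbols.

```python
N_STATES = 4            # (2,1,2) encoder of Section 2
BIG = 10**6             # start metric for states other than 0 (assumption)
WORD_MASK = 0xFFFFFFFF  # survivor history kept in one 32-bit word


def branch_output(pred: int, x0: int) -> int:
    """Encoder output y1 y2, packed into two bits, for input x0 leaving state pred."""
    x1, x2 = (pred >> 1) & 1, pred & 1
    return ((x0 ^ x1 ^ x2) << 1) | (x0 ^ x2)


# MORDER[state][path]: the two states from which `state` can be reached.
MORDER = [[2 * (s & 1), 2 * (s & 1) + 1] for s in range(N_STATES)]

# DISTAN[symb][path][state]: precomputed Hamming distance of each branch.
# The input bit of a branch into `state` is the leftmost state bit, state >> 1.
DISTAN = [[[bin(symb ^ branch_output(MORDER[s][p], s >> 1)).count("1")
            for s in range(N_STATES)]
           for p in range(2)]
          for symb in range(4)]


def acs_step(symb, old_metric, surviv):
    """Add-compare-select for one received symbol; updates surviv in place."""
    new_metric = [0] * N_STATES
    for state in range(N_STATES):
        s0, s1 = MORDER[state]
        m0 = old_metric[s0] + DISTAN[symb][0][state]
        m1 = old_metric[s1] + DISTAN[symb][1][state]
        bit = 1 if m1 < m0 else 0                  # 1 means the second path survived
        new_metric[state] = m1 if bit else m0
        surviv[state] = ((surviv[state] << 1) | bit) & WORD_MASK
    return new_metric


def decode_block(symbols):
    """Decode up to 32 symbols, then trace back through the survivor words."""
    assert len(symbols) <= 32
    metric = [0] + [BIG] * (N_STATES - 1)
    surviv = [0] * N_STATES
    for symb in symbols:
        metric = acs_step(symb, metric, surviv)
    state = min(range(N_STATES), key=lambda s: metric[s])
    bits = []
    for k in range(len(symbols)):                  # k steps back in time
        bits.append(state >> 1)                    # input bit that led into state
        state = MORDER[state][(surviv[state] >> k) & 1]
    return bits[::-1]


def encode(bits):
    """Reference encoder used only to produce test symbols."""
    state, out = 0, []
    for x0 in bits:
        out.append(branch_output(state, x0))
        state = (x0 << 1) | (state >> 1)
    return out
```

A call such as `decode_block(encode(bits))` returns the original bits. Because every distance comes from a table lookup, the inner loop contains only two additions, one comparison and one shift per state, which is the property the optimisations in the following sections build on.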
The decoder thus always outputs the decoded history from 16 to 32 bits back, which fits the 32-bit word of the processor. The entire required history of surviving paths for each state is stored in one word of the NewSurviv table. The decoder must first move 16 state transitions back into the history without outputting symbols, and then another 16 while outputting.

The code is written generally for arbitrary convolutional encoders with one input and two outputs. As the number of encoder registers increases, the number of internal states grows exponentially, and with it the table sizes and the execution time.

3.2. DSP algorithm adapted for the TMS320C44

The speed of the Viterbi algorithm was tested on a Texas Instruments TMS320C44 processor. As a test case we chose a convolutional encoder with one input, two outputs and five delay cells (2,1,5), which is used for channel coding in the transmission of speech or data in GSM telecommunication systems. The output rate of the vocoder used in GSM mobile telephones is 13 kbit/s (a block of 260 bits; the block length is 20 ms). The 260-bit block is divided into 3 parts, of which only the first two sub-blocks, with a length of 182 bits, are convolutionally coded. Three parity bits are added to the first sub-block. Because of the properties of the convolutional encoder, four zeros must also be appended to the end of the first two sub-blocks, so that the data rate at the input of the convolutional encoder is 9.4 kbit/s for one speech channel. For data transmission at 9.6 kbit/s the same convolutional encoder is used, but in this case the input data rate to the convolutional encoder is 12.2 kbit/s.

With a DSP clock of 50 MHz and the algorithm written in C, the processor needed 0.305 ms to output 16 bits of information, which corresponds to a data rate of 52.4 kbit/s.
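The rates quoted above follow from simple arithmetic, which the short sketch below reproduces. The figures are taken from the text; the helper name `rate_kbit_s` and the derived cycles-per-bit number are illustrative additions, not values reported by the authors.

```python
def rate_kbit_s(bits: int, seconds: float) -> float:
    """Data rate in kbit/s for `bits` delivered in `seconds`."""
    return bits / seconds / 1e3


# Encoder input for one GSM speech block: 182 protected bits,
# 3 parity bits and 4 tail zeros every 20 ms.
speech_input = rate_kbit_s(182 + 3 + 4, 20e-3)

# Decoder throughput: 16 output bits every 0.305 ms.
decoder_rate = rate_kbit_s(16, 0.305e-3)

# Clock cycles available per decoded bit at 50 MHz (derived, not from the text).
cycles_per_bit = 50e6 * 0.305e-3 / 16

print(f"speech encoder input: {speech_input:.2f} kbit/s")
print(f"decoder throughput  : {decoder_rate:.1f} kbit/s")
print(f"cycles per bit      : {cycles_per_bit:.0f}")
```

The computed values, about 9.45 kbit/s and about 52.5 kbit/s, agree with the quoted 9.4 kbit/s and 52.4 kbit/s to within rounding. The unoptimised C version therefore spends on the order of 950 clock cycles on each decoded bit, which gives a sense of the headroom that the optimisations in the following subsections recover.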
3.2.1 Use of fast internal memory

In the next step we moved all tables into the fast internal memory, which TMS320C44 processors provide at addresses 0x002FF800 to 0x002FFFFF. The size of the address space is 8 kB, or 2048 locations. Reading and writing in internal memory takes only one machine cycle, whereas in ordinary memory it takes two. In this case we obtain a rate of 72.9 kbit/s.

3.2.2 Use of post-increment addressing

Accessing the tables through pointers is the next improvement. We used a special addressing mode (post-incremented addressing), with which no time is lost on addressing during a linear pass through a table, because the register (which acts as the pointer into the table) is automatically incremented by one when the value it points to is read. RISC processors need only one cycle for such an instruction. Each of the lines in the example below adds the old state metric and the metric of the given branch and, in the same instruction, increments both pointers by 1, so that, thanks to a special ordering of the table elements, the next step already accesses the next state and path:

    ADDI *AR2++,*AR7++,R10
    ADDI *AR2++,*AR7++,R9

In two processor clock periods we obtain the lengths of both possible paths into the final state. The data rate increased to 78.3 kbit/s.

3.2.3 Implicit use of the MOrder values

A further speed-up is achieved by using the values in the MOrder table implicitly. For the selected convolutional encoder we observe that the values in the MOrder table for the first half of the states, taking the first and then the second input branch into each state, increase linearly with a step of 1. The same holds for the second half of the states of the convolutional encoder.

Tab. 1: The MOrder table

Path\state   0   1   2   3   4   5   6   7
0            0   2   4   6   8  10  12  14
1            1   3   5   7   9  11  13  15

Path\state   8   9  10  11  12  13  14  15
0            0   2   4   6   8  10  12  14
1            1   3   5   7   9  11  13  15

We exploited this property of the table by not reading the pointer value from the MOrder table but incrementing it linearly instead. With this improvement the algorithm ran 10% faster, giving a data rate of 86.7 kbit/s.

3.2.4 Exploiting the order of dimensions in the Distan table

In the Distan table the first dimension is the input symbol, so that all values for the same input symbol are stored together in memory (32 values = 16 states × 2 paths). In one step the algorithm computes all path distances for the same input symbol, which it receives as a parameter. At the start the algorithm sets the pointer offset to the right value, after which the pointer is automatically incremented by 1 on every access. Thanks to this addressing mode no additional time is spent. We achieved a throughput of 93 kbit/s.

3.3 Soft Viterbi decoder

The decoders described so far are hard decoders, because they process only the values 0 and 1. In all communication systems, including digital ones, data are carried over communication channels by signals, which are analogue quantities. By converting the signal into logical values (0, 1) we lose part of the information that could improve decoding. If zero and one are instead represented by a value that tells how close the received sample is to a logical one or zero, and these values are processed in the decoder, we speak of a soft Viterbi decoder. With 8-bit A/D conversion the decoder receives a value between 0 and 255 for each transmitted symbol. The value 0 means a clean zero, and the value 255 a clean one. A (2, 1, n) decoder must accept values for two received bits (i.e. 16 bits in total) for each decoded bit.
Computing the distances is more time-consuming, since using tables in the same way as in the previous decoder would require very large tables. The Distan table would have as many as 16 × 2 × 2^16 = 2M addresses, since there are now 2^16 possible input symbols. If, for example, only values from 0 to 15 were used (4-bit A/D conversion), the size would be 16 × 2 × 256 = 8K addresses.

In the soft Viterbi decoder the Hamming distance is replaced by the square of the Euclidean distance (the difference between the received symbol and the correct symbol) for the given branch. The minimum sum along a path indicates the most probable transmitted symbol sequence, so this is maximum-likelihood Viterbi decoding (MLVD, /7/, Section 5-1-4).

3.3.1 Fast decoding with 16 levels

We wrote a program for fast decoding for a 16-level decoder (N = 16) using two new tables, Dist1[SIMBOL][STANJE][POT] and Dist2[SIMBOL][STANJE][POT] (indexed by symbol, state and path). The tables play the same role as the Distan table in hard decoding, except that the first table now refers to the first received symbol and the second to the second. Each has an input-symbol dimension of 16, so each has a size of 16 × 16 × 2 = 512 locations. Together they occupy 1024 locations, which is already half of the fast memory. The other tables are in part of the second half of the fast memory, and some memory must remain free, since the stack resides there by default. The function for computing the distance is now slightly changed, since the distances for both received symbols must be added to the old metric. With all the described speed optimisations and with the described tables accessed through pointers in fast memory, we achieved a soft-decoder rate of 73 kbit/s, which is more than a twofold speed-up due to the use of tables and only 20% slower than the hard decoder.

3.4 Comparison of noise immunity of soft and hard decoding

As the noise decreases, the ratio between the effectiveness of soft and hard decoding increases strongly.
Mehko dekodiranje je veliko učinkovitejše kot trdo dekodiranje. Viterbi decoding Slika 4: BER diagram za različne Viterbijeve dekodirnike Slika 4 prikazuje odvisnost pogostosti napak (BER - Bit Error Rate) od razmerja moči signala in šuma (SNR). Štiri krivulje prikazujejo: trdi dekodirnik, mehki dekodirnik s štirimi nivoji, mehki dekodirnik z osmimi nivoji ter mehki dekodirnik s šestnajstimi nivoji. Na sliki se lepo vidi, da je mehki dekodirnik veliko boljši od trdega. Z večanjem razmerja signal/šum pa pride razlika še bolj do izraza. Vsa merjenja so zaradi omejenega časa približna. Mehki dekodirnik je za več razredov boljši od trdega dekodirnika, razmerje se s kvaliteto prenosnega kanala še povečuje. Vendar pa večanje natančnosti na več kot 16 nivojev (en prenesen bit predstavljen s šestnajst vrednostmi) ne prinese vidne izboljšave h kvaliteti dekodiranja. Tako med rezultati dekodirnika s 16 nivoji in dekodirnika z 256 nivoji pri Gaussovem šumu nismo zasledili razlike. Več nivojev pa pomeni mnogo večje tabele, kar lahko posledično na večini sistemov pomeni tudi počasnejše izvajanje zaradi omejene velikosti najhitrejšega pomnilnika (npr. notranji pomnilnik pri DSP, predpomnilnik pri PC). 37 Informacije MIDEM 31(2001)1, str. 33-38 S. Plevel, T. Javornlk, I. Ozimek, R. Trobec, G. Kandus: Viterbijev algoritem za DSP procesorje Pri mehkem dekodiranju dobimo malo boljše rezultate, če simbolov 0 in 1 pri A/D pretvorbi ne postavimo čisto na meje amplitudnega območja pretvornika. Tako pri 16 nivojskem mehkem dekodirnlku dobimo najboljše rezultate, če vrednost 0 predstavimo z vrednostjo 2, vrednost 1 pa z vrednostjo 13. Na ta način ostane šum znotraj amplitudnega območja pretvornika in ohrani svoje lastnosti, t.j. Gaussovo porazdelitev. V nasprotnem primeru bi bil šum na eni ali drugi strani porezan, ne bi bil več Gaussov, in zato tudi dekodiranje ne bi bilo več optimalno. 4. 
Conclusion
The speed of hard decoding on a 50 MHz processor is sufficient for a base station of the GSM system, which must operate at 8 times the speed of mobile GSM terminals. Soft decoding requires a processor that is about 20% faster. The described algorithm uses no floating-point operations, so faster and cheaper integer DSP processors can be used to run the Viterbi algorithm in real time. The described algorithms make it possible to build a programmable radio interface that adapts to the user's requirements and to the conditions of radio signal transmission between receiver and transmitter.

5. References
/1/ S. M. Said, K. R. Dimond, "Realtime implementation of the Viterbi decoding algorithm on a high-performance microprocessor", Microprocessors and Microsystems, vol. 10, no. 1, January/February 1986
/2/ N. Pavešič, Informacija in kodi, Fakulteta za elektrotehniko, Ljubljana, 1997
/3/ Domen Šuligoj, Roman Trobec, Borut Robič, "Vzporedna izvedba Viterbijevega algoritma", in press, Elektrotehniški vestnik, Ljubljana
/4/ G. Feygin, P. G. Gulak, P. Chow, "A Multiprocessor Architecture for Viterbi Decoders with Linear Speedup", IEEE Transactions on Signal Processing, vol. 41, no. 9, September 1993
/5/ I. N. Bronstein, K. A. Semendjajew, G. Musiol, H. Mühlig, Matematični priročnik, Tehniška založba Slovenije, 2nd revised edition, Ljubljana, 1997
/6/ Texas Instruments, TMS320C4x User's Guide, Digital Signal Processing Products, 1993
/7/ John G.

... Drago Kolar arose as early as February last year, after our esteemed colleague unexpectedly left us. On 14 February a year had passed since then, so on that day we at the Ceramics Department paid tribute to his memory. The packed lecture hall, from now on the Kolar lecture hall, which was opened by Prof. Dr. Vito Turk and Prof. Dr. Marija Kosec, testified eloquently to how alive the memory of Prof. Kolar is in all of us who had the opportunity to know him and to work with him and alongside him.
The invitation was accepted by many staff members of the Jožef Stefan Institute and by the professor's colleagues from faculties, institutes, companies and factories; his daughter Jana also came. The event was enriched by the talented young violinist Živa Ciglenečki, a secondary music school student, who played several pieces with which she is successfully making her name at international youth competitions. The first lecture in the Kolar lecture hall, titled Structures in the TiO2-Ga2O3-La2O3 system, was prepared by Prof. Dr. Ljubo Golič, Kolar's long-time faculty and institute colleague. We, the staff of the Ceramics Department, thank everyone who honoured the memory of Prof. Kolar with us, everyone who called to say they could not come, and everyone who, like us, keeps a small place within for this great man.

The contribution was prepared by Sanja Fidler and was published in the internal newsletter NOVICE IJS No. 87, March 2001.

The speech at the opening of the Kolar lecture hall was given by the head of the Ceramics Department, Prof. Dr. Marija Kosec.