107 Damir Horga*1 UDK 811.163.42'342.623 University of Zagreb DOI: 10.4312/linguistica.57.1.107-122 THE INFLUENCE OF THE ASSIMILATION OPERATOR, SPEECH RATE AND LINGUISTIC BOUNDARY ON THE PRODUCTION OF /Z/ IN CROATIAN 1 INTRODUCTION 1.1 Speech Production Process In defining the fundamental feature of speech as communication, the speaker represents an informational processor that simultaneously produces and receives information, us- ing the system of signs and a medium by means of which they are communicated. The most widely used and effective system of signs is a natural language, and the most natural medium is speech. Mechanisms of speech production are defined by different models that follow the process of reshaping information using different codes or a materially shaped signal on the way from the source to the receiver of information. Jeager (2005) assumes that the speaker must have, on the one hand, built programmes of speech production that make up the representational level in which not only the speaker’s knowledge of the world is stored, but also the knowledge of language-speech mechanisms, thus enabling the speaker to shape information in speech. On the other hand, the speaker must master the processor level by which the representational components from the long-term memory are evoked, selected and activated, and these are then installed in the actual utterance. Hierarchical or modular models of speech production (Levelt 1989; Horga 2008; Horga/Liker 2016) assume the existence of conceptual, linguistic and articulatory levels, and a whole range of feedback loops that control speech production. On the conceptual level the speaker plans the informational content of the utterance. The final result of the activity of the conceptualizer is information shaped by an algorithm characteristic for that level as a preverbal plan that is not linguistically shaped, but is available to the speaker’s language code and enables the information to be linguistically shaped. The formulator shapes preverbal information using the linguistic code of a specific natural language, building the grammatical surface structure which enables the phonological processor to join phonological segmental and prosodic units to the surface structure. The final result of language coding is a phonetic plan. A phonetic plan is an input for the articulator whose assignment is, by means of neural commands for speech organs, to turn this plan into speech movements and a speech sound as the final result of speech production. The con- ceptualizer, formulator and articulator in speech production correspond to the auditory- receptive, linguistic and interpretation levels of speech perception. * dhorga@ffzg.hr Linguistica_2017_FINAL.indd 107 12.3.2018 13:08:23 108 In each of these levels the information is coded with a special system of signs and the whole communication chain has to meet the basic requirement to ensure sameness/uni- formity in order to achieve the final communication goal, namely that the listener receives the exact information the speaker has sent. Given that the speaker/listener is not an ideal mechanism and has a range of articulatory and acoustical limitations, the whole process of speech communication is not ideal, with interruptions in the transmission of informa- tion occurring in different parts of the communication chain. One of the ways to ensure the efficiency of spoken communication is coding it with a high degree of redundancy. Unlike the hierarchical models of speech production, other models are based on theo- ries of spreading activation in which speech production is shown as an entwined sys- tem of units of various speech production levels depending on their degree of activation (Stemberger 1985; Dell 1986; Erdeljac 2009). The network is pictured as a range of levels in which the units of each level are presented as knots connected to other units of the same level with associative connections, and the unit that is activated to the maximum spreads its activation to the next level. This demands timely activation of the needed unit, inhibition of used and other at the moment unnecessary units, as well as the appropriate activation transmission to the next unit. Errors in speech production are interpreted by inadequate activation of needed units, and inadequate inhibition of realised units. 1.2 Articulatory Joints Given that speech in its horizontal timeline can be divided into speech units (Škarić 2007; Horga 2005) that have their own correlates on the linguistic level, a question can be asked as to how particular units are joined together, i.e. how articulatory joints are organised. An articulatory joint is defined as a spoken realisation of two sounds where two speech units of the same hierarchical level meet, with a linguistic boundary passing between them (Horga 2005, Horga/Liker 2006). For example, in the sentence Doputovao je iż Zagreba /z#z/ represents a proclitical articulatory joint that connects the proclitic iż and the full word Zagreb, and between these two /z/ phonemes passes a proclitical linguistic boundary /#/. Theoretically, we can talk about the following articulatory joints: discourse (connection between two discourses: …To ti je priča o Snjeguljici.#Sada ći ti ispričati priču o Crvenkapici…), paragraphic (connection be- tween two spoken paragraphs: …U Splitu je proveo dva mjeseca.#Nakon povratka u Zagreb nastavio je studij. Upisao je…), sentence (connection between two sentences: Čekao sam ga na kolodvoru. Vlak je kasnio.), clausal (connection between two claus- es: Kad sam došao na kolodvor,# saznao sam#da vlak kasni), word phrase (connec- tion between two word phrases: Doputovao je#brzim vlakom.), lexemic (connection between two spoken words: …brzim#vlakom…), clitical (connection between a full word and a clitic that can be either a proclitical: iz#Splita, or enclitical: doputovat#će) joint, morphological (connection between two morphemes), which means a compound in the production of compounds (prefixal: iz#govor, suffixal: mlad#ost, and ending: doputova#la), syllabic (connection between two syllables: Za#greb) and sound-related (connection between two sounds in a syllable: Z#a#g#r#e#b). The boundary between the first and second members of an articulatory joint can be of different degrees of Linguistica_2017_FINAL.indd 108 12.3.2018 13:08:23 109 exposure. The clearest boundary between two members of a joint is a pause. The dura- tion of a pause is shortened from those joints that connect larger units towards those that connect shorter units. and that is why the pause will probably not appear in sound, syllable and clitical joints and will be regular in discourse, sentence and clausal joints, while in word phrase and lexemic joints the use of a pause could be freer (Horga/Liker 2006). Even from the standpoints of speech production and receiving speech, the im- portant question is whether the boundaries between particular members of articulatory joints are encoded into spoken sound. or are erased because of coarticulatory influ- ences. Therefore, the speaker is trying to produce the most economical articulatory movement in connecting articulatory gestures characteristic for the neighbouring seg- ments, but at the same time aiming to preserve their linguistic distinctiveness so that the listener can understand what is being said (Lindblom 1979). 1.3 Coarticulation Both in speech production and speech perception the important question is the dichoto- my between the physical articulatory-acoustic level of spoken signal and its representa- tional level. The following fundamental communicational question can thus be asked: in what way is the sameness of the information sent by the source, and the one the listener perceives, realised in spite of such reshaping? (Horga/Požgaj Hadži 2012) On a linguistic representational level, the information is presented by the system of abstract, invariant and discrete units, i.e. phonemes. The reshaping of this abstract representation into a spoken acoustic signal is made on a performative-articulatory level, with move- ment of speech organs that conduct changeable, continuous and partially simultaneous motor patterns that result in an equally variable, continuous and non-discrete acoustic signal. The variability of an acoustic spoken signal and the inability to achieve its seg- mentation into clearly separated representational discrete units is a consequence of, on the one hand, the specificity of human articulatory abilities, and on the other the univer- sal speech phenomenon of coarticulation defined as a systematic and reciprocal activity of neighbouring spoken segments during their articulation, as they are connecting into larger speech sequences. This means that coarticulation assumes that every spoken segment contains the influence of neighbouring segments, but also that each segment is affecting its neighbours. The articulatory apparatus as a multi-componential mecha- nism enables the freedom of particular articulatory organs to make anticipatory move- ments for the next segment, and thus operate on the actual segment in a coarticulatory way. It is also possible for the articulator, active in pronunciation of the neighbouring sounds, to make a movement that is a compromise between the requirements of differ- ent neighbouring sounds. In this way, coarticulatory phenomena can be characterized as temporal or spatial. Due to the coarticulatory influences of neighbouring sounds, different degrees of ad- aptations and segmental assimilations can be realised. Coarticulation processes can be explained in different ways (Farnetani/Recasens 2013). Glide hypothesis explains coar- ticulation as simply the mechanical inertia of speech organs that need time to shift from the pronunciation position of one sound into that needed for the next one. According to Linguistica_2017_FINAL.indd 109 12.3.2018 13:08:23 110 Lindblom’s theory of “adaptive variability” and the theory of “hyper and hypo speech,” speech variability occurs because of the constant adjustment of speech production to the requirements of the specific communicational situation. In some communicational situations it is necessary to produce a spoken signal which, from the point of view of the listener, has a maximum degree of contrast, meaning that “hyper speech” mechanisms are activated. On the other hand, in other situations a lesser degree of contrast between segments is allowed, and “hypo speech” mechanisms prevail in articulating the utter- ance. These are the reasons why pronunciation varies from hyper-correct to inaccurate and careless. Coarticulation mechanisms enable fluent speech production because, just like any other motor behaviour, they satisfy the principles of economy of motor effort. Therefore, the speaker is trying to produce the most economical articulatory movement in connecting articulatory gestures characteristic for the neighbouring segments (Lind- blom 1979). According to the “distinctive features spreading” theory, coarticulation is not conditioned only by the mechanical limitations of the pronunciation apparatus, but makes up a constituent part of the phonological level in which the input commands for realisation of variable spoken segments have already been defined. Therefore, the coar- ticulatory influence of segments is also assumed and planned on the representational level. From the standpoints of speech production and receiving speech, the important question is whether the linguistic boundaries among particular joints are coded into a speech sound or are erased because of coarticulatory influences. All these mechanisms on the levels of production and perceiving speech function ef- fectively if the coder of the speaker and decoder of the listener are working effectively. Due to a lack of such abilities, speakers of a foreign language will be recognised as na- tive speakers of another language because of their foreign accents. The speaker’s skills to reorganise their motor articulatory programmes with regard to the pronunciation of a foreign language include the abilities to follow the rules of transforming the repre- sentational linguistic level into the performative articulatory level, and acquiring coar- ticulatory rules that can be language-specific are needed to reduce their foreign accent. 2 AIMS OF RESEARCH AND METHODOLOGY 2.1 Aims of Research The aim of this research was to analyse some characteristics of coarticulation on the example of the sound /z/, behind which, as coarticulatory operators, follow the sounds /s/, /z/, /š/ and /ž/ in a connected text read at a natural and then a fast speech rate. The participants were instructed to read the text as quickly as possible but to keep intelligi- bility at an acceptable level. Apart from sound operators and speech rate, the influence of different types of boundaries between the first and second members of an articulatory joint on coarticulation was also examined. 2.2 Methodology and Procedure Twenty female students of the Faculty of Humanities and Social Sciences of Zagreb University, native speakers of Croatian with a regular speech status, were involved in Linguistica_2017_FINAL.indd 110 12.3.2018 13:08:23 111 the research. Only female students were chosen to make a homogeneous sample with regard to general acoustic characteristics. The spoken material was a connected text (1 page) in which the sound /z/ was the first member of the articulatory joint, while other members were sounds /s/, /z/, /š/ and /ž/, and the boundary between parts of the joint were sentence, clausal, lexemic or proclitical. A total of 32 articulatory joints were analysed for each participant (two joints for each lexical boundary in the natural and the fast speech rate), or a total of 640 articulatory joints for the 20 participants. The participants read the text at the natural speech rate (5.14 syl/s) first and then at the fast (6.71 syl/s) speech rate, with the latter being 30.5% faster than the former. They had three minutes for reading preparation for each speech rate. The text reading was recorded under laboratory conditions that enabled a quality acoustic analysis. The analysis of six acoustic variables in all occurrences of the articulatory joints, namely of the duration, spectrum centre of gravity, standard deviation of the centre of gravity, spectral skewness, spectral kurtosis and harmonic to noise ratio (Kent/Read 2002; Jones/Nolan 2007; Jones/McDougall 2009), was done using the Praat software (Boersma/Weenink 2015). The acoustic variables were measured after segmenting and annotation of the components of the articulatory joints (Figure 1). The spectrum noise centre of gravity is a measure of the biggest concentration of energy in the spec- trum, and it approximately matches the central frequency of the analysed segment. Spectral skewness of sound is a standard deviation of the centre of gravity and the illustration of the spectral skewness of sound around the central frequency. Spectral kurtosis is a statistical deviation of the spectrum centre of gravity, and it shows the location most of the energy skewed around the centre of gravity: if most of the energy is below the centre of gravity, then the value is closer to zero or is negative, and if most of the energy is above the centre of gravity then the value is higher and posi- tive. The prominence of the main amplitude is a statistical measure for the relative intensity of the most prominent part of the spectrum in relation to the neighbouring parts of the spectrum. Finally, the harmonic to noise ratio is a measure of the sonority of sounds, and the higher the coefficient the more intensive the harmonic component in the sound is. Figure 1: Annotation of the articulatory joint /zš/ realised as /z#š/, /zš/ and as /š/ in the sentence: Izašla je iz šatora. 0 8000 F re q u e n c y ( H z ) i z #  i z  i  Linguistica_2017_FINAL.indd 111 12.3.2018 13:08:23 112 3 RESULTS AND DISCUSSION The issues of the influences of the sounds /s, š, z, ž/ as the assimilation operator, types of lexical boundaries inside the articulatory joint and speech rate on acoustic characteristics of the sound /z/ analysed in this paper fall into the general research of phonotactics, coarticulation, adaptation and assimilation that occur in connected speech. By analysing assimilation mechanisms as parametric, Škarić/Kišićek (2006) confirm in their research that “the speaker’s conflicts that they investigate (i.e. the pronunciation of unpronounceable phonemes DH) are solved in a compromised man- ner, so that sometimes an assimilation request prevails, striving towards smooth and economical pronunciation, and sometimes the request for clear performance of pho- nemes, striving to articulate a hint by which the speaker would mark phonemic rep- resentation, and that would, along with redundancy, help the listener to reconstruct it.” This conclusion is in accordance with Lindblom’s views on communicational conditioning of coarticulatory processes, and research has shown that the activity of assimilation operators can be different in particular speaking situations, i.e. that the represented phonemic features could be realized or that the assimilation mechanisms can efface them. To understand connected speech the listener must build a hierarchy of language units from sounds, syllables, words, word phrases or sentences. This process of percep- tive segmentation also involves the revelation of the linguistic boundaries in articula- tory joints. Analysis of the influence of linguistic boundaries on the pronunciation of articulatory joints of same phonemic content, but with different positions of lexical boundary (Horga 1996), shows that it has the biggest influence on the duration of con- sonants (stops were analysed), because in the utterance V#CV their duration is signifi- cantly longer, and on average is 83.5 ms, than in the utterance VC#V, where it is shorter and 69.5 ms on average. Therefore, the duration of the consonants has been shown to be a significant hint of the position of the linguistic boundary, while the duration and in- tensity of the vowel and explosion of stops were not statistically significant parameters in determining this. Horga and Liker (2006) examined the influence of lexical bounda- ries on the pronunciation of members of articulatory joints using electropalatography. The results showed that a possible indicator of lexical boundaries in lexemic articula- tory joints for the consonants /t/ and /k/ was the index of duration of tongue-palate contact. The lexical boundary was also marked by the duration of a pause. The weight indexes of tongue-palate contact were found to be the indicator of lexical boundaries, while the indexes of alveolar and velar contact and the centre of gravity of the contact were not found to be important in determining the position of these. This conclusion is in accordance with Linblom’s views on the communicational conditionality of coar- ticulation processes. The current research has shown that the influence of assimilation operators can be different in particular speaking situations, i.e. that supposed phonemic features could be realised, or that assimilation mechanisms to diminish or even enface them can be at work. These views of coarticulation are in accordance with the features of articulatory phonology in which phonological units are defined as planned move- ments and dynamic phonetic gestures with an intrinsic temporal dimension, in which Linguistica_2017_FINAL.indd 112 12.3.2018 13:08:23 113 the overlapping of particular gestures is allowed (Farnetani/Rascasens 2013). This can explain the even bigger overlapping of articulatory gestures in the faster speech rate, and the larger coarticulatory influence which was also confirmed in this research, based on the observed acoustic variables. 3.1 Realisation of Particular Segments of the Articulatory Joint The articulatory joint can be realised with three segments (first member, pause, sec- ond member), with two segments (first member, second member) and as the one segment (first and second member of a joint connected and realised as one segment). Therefore, in the first case the linguistic boundary between members of the joint is clearly expressed with a pause, while in second case it is acoustically possible to di- vide first and second segments and the linguistic boundary is found in that division. Finally, in the third case the linguistic boundary is erased because two segments are realised as one under the influence of the assimilation operator. The growth of the activity of the assimilation operator can be expected to have the same order, i.e. that it is the lowest when a pause is realised and biggest when the articulatory joint is re- alised as one segment. The results on the number of articulatory joints regarding their content are presented in Table 1 and Figure 2, and they illustrate that the influence of the assimilation operator is bigger in the faster than the natural speech rate, and that it grows from the sentence, then from clausal and lexemic to proclitical joint, in which it is the highest (χ2 = 266, p=0.00). Table 1 and Figure 2: The number of articulatory joints realised with a pause (s1-p-s2), as two segments (s1s2) and as one segment (s1), regarding the type of the linguistic boundary: sentence (sent), clausal (clau), lexemic (lexe) or proclitical (proc) and regarding the speech rate: natural or fast. sent clau lexe proc sent clau lexe proc s1-p-s2 154 121 22 2 115 62 11 1 s1s2 2 19 40 29 13 20 18 8 s1 4 20 97 129 31 78 131 150 154 121 22 115 62 11 2 19 40 29 13 20 18 84 20 97 129 31 78 131 150 0 20 40 60 80 100 120 140 160 Natural Fast Linguistica_2017_FINAL.indd 113 12.3.2018 13:08:23 114 The average duration of a pause in the articulatory joint can be analysed in the same direction as the influence on the degree of assimilation, when it is realised, as can be seen in the Table 2 and Figure 3 (t=7.50, p=0.00). Namely, it can be said that the longer the pause, the lower the influence of the assimilation operator. It is seen that the average duration of a pause is longer for the natural rather than the fast speech rate, and that the duration of a pause is shortened from the sentence, to the clausal, lexemic and procliti- cal, without any significant difference between the last two. Table 2 and Figure 3: Average pause duration (in ms) if it represents sentence (sent), clausal (clau), lexemic (lexe) or proclitical (proc), and regarding the speech rate: natural or fast. The influence of the assimilation operator, i.e. the second member of the articu- latory joint on the first member, is shown in Table 3 and Figure 4, which illustrates that, generally, the degree of assimilation is higher at the fast speech rate (χ2 186.12; p=0.00), but that at both rates the assimilation is greater, i.e. the pronunciation of both members of the articulatory joint as one sound for sounds /s/ and /z/ is more common than for sounds /š/ and /ž/. Therefore, it is possible to conclude that the similarity or commonality of the place of articulation will contribute to the influence of the assimila- tion operator on the degree of assimilation. sent clau lexe proc sent clau lexe proc pause 0.507 0.246 0.145 0.159 0.303 0.200 0.093 0.090 0.507 0.246 0.145 0.159 0.303 0.200 0.093 0.090 0.000 0.100 0.200 0.300 0.400 0.500 0.600 Natural Fast Linguistica_2017_FINAL.indd 114 12.3.2018 13:08:23 115 Table 3 and Figure 4: The number of articulatory joints realised with a pause (s1-p-s2), as two segments (s1s2) and as one segment (s1), depending on the second member of the articulatory joint: /s/, /š/, /z/ or /ž/, and regarding the speech rate: natural or fast. 3.2 Spectrum Centre of Gravity Table 4 and Figure 5 show that the spectrum centre of gravity regardless of the type of linguistic boundary in the natural speech rate is shifted towards higher frequen- cies in comparison to that seen with the fast speech rate (t=6.65; p=0.00). This can be explained with the realisation of the complete articulatory movement and with the realisation of the necessary articulatory conditions for making the intensive and high frequency friction that is characteristic of fricatives. This can also be assigned to the greater influence of the higher frequency affricates /s/ and /z/ as assimilation operators (Table 1, Figure 2). When it comes to the influence of the linguistic boundary between members of a joint, that shift of the spectrum centre of gravity towards higher frequen- cies is greater for sentence and clausal boundaries than for lexemic and proclitical ones. Sentence and clausal boundaries are most commonly realised with a pause, so the first member of the joint, /z/, is found in the final sentence or clause position that enables that sound to become voiceless and with it the shift of the spectrum centre of gravity towards higher frequencies. By measuring acoustic parameters, Bakran (1996) concludes that in the Croatian language vowels and consonants within a syllable be- fore a pause are significantly longer, regardless of the duration of the pause, and states that this lengthening at the end of the word is on average an extra 12% for vowels and 6–12% for consonants. Therefore, in the current research the more significant shift of the spectrum centre of gravity in front of sentence and clausal boundary in articulatory joints can be attributed to the lengthening of the /z/ sound in front of a pause, which is more common in this type of the articulatory joint than in lexemic and proclitical joints. s š z ž B s š z ž s1-p-s2 77 57 70 95 0 43 47 44 55 s1s2 12 49 7 22 0 9 22 2 26 s1 71 54 82 43 0 107 90 114 79 77 57 70 0 43 47 12 49 7 22 0 9 22 2 26 71 54 82 43 0 107 90 114 79 0 20 40 60 80 100 120 140 160 Natural Fast Linguistica_2017_FINAL.indd 115 12.3.2018 13:08:23 116 It is worth mentioning the factor of making the sound /z/ voiceless in sentence and clausal joints in front of a pause, which contributes to the heightening of the spectrum centre of gravity for two reasons: on the one hand, the noise component of the sound becomes higher in frequency and, on the other, with the absence of voicing that is lower in frequency, the part of the spectrum that would usually shift the spectrum centre of gravity towards areas with lower frequencies is absent. Table 4 and Figure 5: Spectrum centre of gravity (in Hz) regarding the type of linguistic bound- ary: sentence (sent), clausal (clau), lexemic (lexe) or proclitical (proc), and regarding the speech rate: natural or fast. 3.3 Spectral Skewness When it comes to spectral skewness regarding speech rate (Table 5 and Picture 6), it can be said that the difference is significant only for /z/ in the sentence joint, because it is bigger at the faster speech rate than at the natural one, and it can be explained by a bigger articulation variability in the articulation of the sound /z/ allowed by a pause, which is very frequent in the sentence joint. This also explains the overall reduction of spectral skewness due to stronger connections between joint members and the bigger assimilation influence of the operator. In general, it can be said that the spectral skewness did not show statistically significant difference in the two ob- served speech rates (t=0.20; p=0.84), which can be assigned to the weaknesses of the measure itself. Namely, this measure was evaluated as ineffective in research on the acoustic characteristics of fricatives in the speech of people wearing dental braces (Horga et al. 2013). sent clau lexe proc sent clau lexe proc COG 6664 5103 4254 4045 5654 3937 3633 3556 6664 5103 4254 4045 5654 3937 3633 3556 0 1000 2000 3000 4000 5000 6000 7000 Natural Fast Linguistica_2017_FINAL.indd 116 12.3.2018 13:08:23 117 Table 5 and Figure 6. Spectral skewness (in Hz) regarding the type of linguistic boundary: sen- tence (sent), clausal (clau), lexemic (lexe) or proclitical (proc), and regarding the speech rate: natural or fast. 3.4 Spectral Kurtosis The values of spectral kurtosis (Table 6 and Figure 7) are generally higher at the fast speech rate than the natural one (t=5.61; p=0.00). Given that the spectrum centre of gravity at the natural speech rate is shifted towards higher frequencies, there is more space left at the lower frequencies for spectral kurtosis towards that lower frequency sent clau lexe proc sent clau lexe proc S.D. 3264 3417 3054 2756 3501 3208 2919 2664 3264 3417 3054 2756 3501 3208 2919 2664 0 500 1000 1500 2000 2500 3000 3500 4000 Natural Fast sent clau lexe proc sent clau lexe proc skew -0,31 0,71 1,82 1,51 0,15 1,61 1,98 1,77 -0,31 0,71 1,82 1,51 0,15 1,61 1,98 1,77 -0,50 0,00 0,50 1,00 1,50 2,00 2,50 Natural Fast Table 6 and Figure7: Spectral kurtosis regarding the type of linguistic boundary: sentence (sent), clausal (clau), lexemic (lexe) or proclitical (proc), and regarding the speech rate: natural or fast. Linguistica_2017_FINAL.indd 117 12.3.2018 13:08:23 118 area. The different behaviour of spectral kurtosis in sentence and clausal joints, unlike that seen with lexemic and proclitical ones, is again prominent, especially in the natural speech rate. Namely, in sentence and clausal joints the kurtosis is placed towards lower frequencies, and in lexemic and proclitical ones towards the higher frequencies. To a certain extent, clausal joints at the fast speech rate are an exception. 3.5 Prominence of Amplitude The prominence of amplitude (Table 7 and Figure 8) shows simultaneous movement of values as well as spectral kurtosis (t=2.11; p=0.04). The amplitude is bigger at the fast speech rate than at the natural speech rate, and it is bigger for lexemic and proclitical joints than for sentence and clausal ones, which shows a higher coarticulatory influence with the fast speech rate. Table 7 and Figure 8: The prominence of amplitude regarding the type of linguistic boundary: sentence (sent), clausal (clau), lexemic (lexe) or proclitical (proc), and regarding the speech rate: natural or fast. 3.6 Harmonic to Noise Ratio The harmonic to noise ratio (Table 8 and Figure 9) also shows that the coarticulation influence is higher at the fast speech rate because the ratio of harmonic sound in the sound /z/ is under the influence of the voiced assimilation operators /z/ and /ž/ at the fast speech rate, which is higher than at the natural speech rate, while under the influence of voiceless /s/ and /š/ it is somewhat lower (t=5.97; p=0.00). Horga et al. (2013) applies the abovementioned set of acoustic variables to inves- tigate the influence of dental prosthetics on the articulation of fricatives. The results of the paper can be summarised in three points: firstly, speech with braces was clos- er to the speech of the eugnatic participants; secondly, in speech without braces the sent clau lexe proc sent clau lexe proc kurtosis 2.84 6.54 18.30 11.05 3.80 16.32 18.77 13.36 2.84 6.54 18.30 11.05 3.80 16.32 18.77 13.36 0.00 2.00 4.00 6.00 8.00 10.00 12.00 14.00 16.00 18.00 20.00 Natural Fast Linguistica_2017_FINAL.indd 118 12.3.2018 13:08:23 119 participants had fewer possibilities for compensational articulatory mechanisms; and thirdly, speech with prosthetics differed from the speech of eugnate speakers because prosthetics, as foreign bodies in the mouth, influence pronunciation. This research is mentioned because it showed that we can observe the characteristics of pronunciation of fricatives with these types of acoustic variables. It is also interesting that the spec- trum centre of gravity was shown to be the most useful variable in this earlier research. Table 8 and Figure 9. Harmonic to noise ratio in sound /z/ at the natural and fast speech rates depends on whether the assimilation operators are voiced sounds /z/ or /ž/ or voiceless sounds /s/ and /š/. 4 CONCLUSION The results of this research can be summarised as follows: • the acoustic variables of spectrum centre of gravity, spectral skewness and kur- tosis, as well as the prominence of amplitude, show that a faster speech rate has a higher assimilation influence on the members in the articulatory joint; • the sentence and clausal lexical boundaries between members of articulatory joints preserve the inherent phonemic features of the sound /z/, while lexemic and proclitical boundaries allow a higher influence of the assimilation operator; • the sounds /s/ and /z/ as assimilation operators, due to their similarity to the sound /z/ as the first member of the articulatory joint, have a greater assimila- tion influence than the sounds /š/ and /ž/, which differ from the sound /z/ when it comes to the place of articulation; • the assimilation influence of the sounds /z/ and /ž/ on /z/ is greater than that of the voiceless sounds /s/ and /š/ because of the same voicing, resulting in the higher harmonic to noise ratio of /z/; • the set of acoustic variables used in this work are proven to be good measures for use in research on assimilation influences. voiced 6.54 8.34 voiceless 2.82 2.66 6.54 8.34 2.82 2.66 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 9.00 Natural Fast Linguistica_2017_FINAL.indd 119 12.3.2018 13:08:23 120 The next phase of this research could be an in-depth analysis of observed articula- tory joints, and a comparison with the results of an acoustic analysis in order to get a more complete insight into the assimilation processes. Furthermore, it is possible to imagine a whole range of research that would analyse the assimilation relations of other sounds in the articulatory joint. Certainly, research into the physiological parameters of the activity of particular articulators, and a comparison with the results of acoustic and perceptive methods, would give a more thorough insight into the coarticulation phenomenon. References BAKRAN, Juraj (1996) Zvučna slika hrvatskoga govora. Zagreb: Ibis grafika. BOERSMA, Paul/David WEENINK (2012) Praat: doing phonetics by computer, http://www.praat.org/ DELL, Garry S. (1986) “A spreading activation theory of retrieval in sentence produc- tion.” Psychological Review 93, 283–321. ERDELJAC, Vlasta (2009) Mentalni leksikon. Modeli i činjenice. Zagreb: Ibis. FARNETANI, Edda/Daniel RECASENS (2013) “Coarticulation and Connected Speech Processes.” In: W. J. Hardcastle/J. Laver/F. E. Gibbon (eds), The Handbook of Phonetic Sciences. Malden: Wiley-Blackwell, 316–352. HORGA, Damir (1996) “Utjecaj položaja leksičke granice na slogovnu strukturu u hrvatskom govoru.” Acta Universitatis Nicolai Copernici, Studia Slavica, 1/311, 55–78. HORGA, Damir (2005) “Izgovorni zglobovi u spontanom govoru.” In: I. Pranjković (ed.) Od fonetike do etike. Zbornik o sedamdesetoj godišnjici prof. dr. sc. Josipa Silića. Zagreb: Disput, 63–74. HORGA, Damir/Marko LIKER (2006) “Utjecaj leksičke granice na izgovor glasnika u artikulacijskom zglobu promatran elektropalatografski.” Govor 23/2, 121–139. HORGA, Damir (2008) “Repetitions in interrupted speech production.” Beszédkutatás 1, 157–171. HORGA, Damir/Vesna POŽGAJ HADŽI (2012) “(Dis)fluentnost i proizvodnja go- vora.” Slavistička revija – časopis za jezikoslovje in literarne vede 60/4, 621–638. HORGA, Damir/Tomislav BADEL/Marko LIKER/Ana VIDOVIĆ (2013) “Utjecaj zubne proteze na izgovor frikativa.” Revija za rehabilitacijska istraživanja 49, 37–49. HORGA, Damir/Marko LIKER (2016) Artikulacijska fonetika. Anatomija i fiziologija izgovora. Zagreb: Ibis. JAEGER, J. Jery (2005) Kids’ Slips. What Young Children’s Slips of the Tongue Re- veal About Language Development. Mahwah, NJ: Lawrence Erlbaum Associates Publishers. JONES, Mark J./Francis J. NOLAN (2007) “An acoustic study of North Welsh voice- less fricatives.” In: J. Trouvain/W. J. Barry (eds), Proceedings of the ICPhS 16 in Saarbrücken. Saarbrücken: Universität des Saarlandes, 873–876. Linguistica_2017_FINAL.indd 120 12.3.2018 13:08:23 121 JONES, Mark J./Kirsty MCDOUGALL (2009) “The acoustic character of fricated /t/ in Australian English: A comparison with /s/ and /ʃ/.” Journal of the International Phonetic Association 39/3, 265–289. KENT, Ray/Charles READ (22002) The Acoustic Analysis of Speech. New York: Del- mar Cengage Learning. LEVELT, Willem J. M. (1995) Speaking. From intention to articulation. Cambridge, MA: The MIT Press. LINDBLOM, Bjorn/James LUBKER/Thomas GAY (1979) “Formant frequencies of some fixed-mandible vowels and a model of special motor programming by predic- tive stimulation.” Journal of Phonetics 7, 147–161. STEMBERG, J. Robert (1985) “An interactive activation model of language produc- tion.” In: Andrew W. Ellis (ed.), Progress in the Psychology of Language. Vol 1. London: Psychology Press, 143–186. ŠKARIĆ, Ivo (2007) “Fonetika hrvatskoga književnoga jezika.” In: A. Kovačec (ed.), Glasovi i oblici hrvatskoga knjževnoga jezika. Zagreb: Nakladni zavod Globus, 15–157. ŠKARIĆ, Ivo/Gabrijela KIŠIČEK (2006) “Izgovaranje neizgovorljivih fonema.” Go- vor XXIII/1, 1–18. ŠKARIĆ, Ivo/Damir HORGA (2008) “Hrvatski jezik.” In: Sawicka, Irena (ed.), Kom- paracja systemów i funkcjonowania wspólczcnych jęezyków słcwiańskich, 2: Fone- tyka/Fonologia. Opole: Uniwersytet Opolski, Instytut Filologii Polskiej, 531–555. Abstract THE INFLUENCE OF THE ASSIMILATION OPERATOR, SPEECH RATE AND LINGUISTIC BOUNDARY ON THE PRODUCTION OF /z/ IN CROATIAN It is widely accepted that invariant and discrete phonological units at the linguis- tic level are transformed into variable and continuous movements of speech organs, which in turn results in equally continuous acoustical results. The variability of pho- nemic units depends on neighbouring phonetic units, but also on the various linguistic, communicational and pragmatic contexts of a particular speech act. The influence of phonetic units upon each other results in adaptations, coarticulations and assimilations. By means of assimilation at least one distinctive feature of a phoneme is changed, so the observed phoneme becomes similar to its neighbouring sound – the assimilation operator. This paper is aimed at analysing the influence of speech rate on assimilation processes in the voiced fricative /z/, when it is preceded by sounds /s, z, ʃ, ʒ / in four different types of articulatory joint: sentence, clausal, lexemic and proclitical. The ar- ticulatory joint refers to the production of two phonemes separated by different types of linguistic boundaries. Twenty female native speakers of Croatian with no history of speech or hearing impairments read a text at both natural and fast speech rates. The acoustical recording was performed in a sound-treated room. The Praat software was used to analyse six variables in all occurrences of the sound /z/: duration, spectrum Linguistica_2017_FINAL.indd 121 12.3.2018 13:08:23 122 centre of gravity, standard deviation of the centre of gravity, spectral skewness, spec- tral kurtosis, and harmonic to noise ratio. The results showed that various linguistic boundaries, speech rates and sounds as assimilation operators influence the degree of assimilation of the phoneme /z/, as measured by the acoustic variables. Keywords: speech segment, coarticulation, assimilation, speech rate, articulatory joint Povzetek VPLIV ASIMILACIJSKEGA OPERATORJA, HITROSTI GOVORA IN JEZIKOVNE MEJE PRI PRODUKCIJI /z/ V HRVAŠČINI Splošno sprejeto dejstvo je, da se na jezikovnem nivoju nespremenljive in diskretne fonološke enote pretvorijo v spremenljivo in neprekinjeno premikanje govornih or- ganov, kar se odraža na akustičnem nivoju. Spremenljivost fonemskih enot je odvis- na od sosednjih fonetičnih enot kakor tudi od različnih jezikovnih, sporazumevalnih in pragmatičnih kontekstov, v katerih se govorno dejanje odvija. Medsebojni vpliv fonetičnih enot se kaže v prilagoditvah, koartikulacijah in asimilacijah. V procesu asimilacije se spremeni najmanj ena razločevalna lastnost fonema, ki tako postane po- doben sosednjemu glasu, to je asimilacijskemu operatorju. Cilj članka je analizirati vpliv hitrosti govora na asimilacijo zvenečega pripornika /z/, kadar se pred njim na- hajajo glasovi /s, z, ʃ, ʒ/v štirih različnih vrstah artikulacijskih stičišč: povedno stičišče, stavčno stičišče, leksikalno stičišče in proklitično stičišče. Artikulacijsko stičišče se nanaša na izgovor dveh fonemov, ki ju ločijo različne oblike jezikovnih ločnic. Dvajset domačih govork hrvaškega jezika, ki niso nikdar imele nobenih govornih ali slušnih ok- var, je prebralo besedilo v običajnem in hitrem tempu. Snemanje je potekalo v studijskih razmerah. Za analizo šestih spremenljivk glasu /z/ (trajanje, težišče spektra, standardni odklon težišča, asimetrija spektra, sploščenost spektra in razmerje med harmoničnostjo in šumom) je bil uporabljen program Praat. Rezultati so pokazali, da različne jezikovne ločnice, hitrost govora in različni glasovi v vlogi asimilacijskih operatorjev vplivajo na stopnjo asimilacije fonema /z/ z vidika merjenih akustičnih spremenljivk. Ključne besede: govorni segment, koartikulacija, asimilacija, hitrost govora, artiku- lacijsko stičišče Linguistica_2017_FINAL.indd 122 12.3.2018 13:08:24