|Year : 2018 | Volume
| Issue : 2 | Page : 56-61
Comparing fundamental frequency within and across speakers of Hindi and English
Towino Paramby1, Katherine Verdolini Abbott2, Gregory Turner3, Carlotta Kimble3, Robert DeJonge3
1 Department of Communication Sciences and Disorders, University of Central Arkansas, Conway; Department of University Rehab, University of Arkansas for Medical Sciences, Little Rock, AR, USA
2 Department of Communication Sciences and Disorders, University of Delaware, Newark, DE, USA
3 Program In Communication Disorder in the School of Human Services, University of Central Missouri, Warrensburg, Missouri, USA
|Date of Web Publication||27-Dec-2018|
Department of Communication Sciences and Disorders, University of Central Arkansas, 201 Donaghey Avenue, Conway, AR 72035
Source of Support: None, Conflict of Interest: None
Introduction: The objective of this study was to compare fundamental frequency (F0) during vowel phonation, reading, and monolog in Indian Hindi speakers as compared to native English speakers. Methods: Thirty normal, age-matched native English and 30 Indian Hindi speakers participated in the study. Participants' F0values were extracted from samples obtained during sustained/a/, reading, and monolog. Analyses centered on F0differences during (a) sustained/a/, comparing native English and Hindi speakers; (b) reading and monolog production, comparing native English and Hindi speakers speaking English and Hindi, respectively; (c) reading and monologs, comparing native English versus Hindi speakers speaking English; and (d) Hindi participants speaking English versus the same participants speaking Hindi. Results: Analyses did not reveal significant differences between F0during sustained/a/for native English and Hindi speakers. All other comparisons, which involved reading and monolog, revealed significantly higher F0in Hindi speakers. Relationships between language learning variables and mean F0were statistically insignificant. Conclusions: The finding of equivalent F0for sustained vowel phonation across groups, in comparison to between- and within-group differences for reading and monolog tasks, suggests that F0differences that were found were largely linguistically determined. Proposals are provided for evaluating additional anatomical and cultural factors determining F0.
Keywords: English, fundamental frequency, Hindi, multilinguistic
|How to cite this article:|
Paramby T, Abbott KV, Turner G, Kimble C, DeJonge R. Comparing fundamental frequency within and across speakers of Hindi and English. J Indian Speech Language Hearing Assoc 2018;32:56-61
|How to cite this URL:|
Paramby T, Abbott KV, Turner G, Kimble C, DeJonge R. Comparing fundamental frequency within and across speakers of Hindi and English. J Indian Speech Language Hearing Assoc [serial online] 2018 [cited 2021 Sep 22];32:56-61. Available from: https://www.jisha.org/text.asp?2018/32/2/56/248016
| Introduction|| |
A speaker's geographic background is usually associated with a unique language and perhaps a dialect within a language. Some reports have indicated that vocal characteristics may also vary with geography which may be relevant in the clinical evaluation of an individual's voice. In fact, voice disorders have been defined as conditions in which one or more aspects of voice, such as loudness, pitch, quality, or resonance, are outside of the normal range for the age, gender, or geographic background of the speaker.
The focus of this study was to examine differences in fundamental frequency (F0) values across Indian Hindi speakers and native English speakers. F0 reflects the rate at which the vocal folds vibrate during phonation, which may vary across languages, like that of Jordanian Arabic-speaking children who spoke at higher F0 values during sustained vowel/a/than their Western, English-speaking counterparts.
It is speculated whether F0 differences occur due to physical or linguistic and cultural factors or both. Tom pointed to anatomical and physiological differences as a likely basis for F0 variations across some populations. In contrast, Shriberg and Kent suggested that linguistic features such as prosody may influence F0. Recent research has shown that F0 may vary not only across languages in native speakers but also may shift in the same individual speaking different languages. One study of highly proficient German-English speakers revealed significantly higher mean F0 when participants spoke in English compared to speaking in German. Conceptually, similar findings reported that differences in F0 can also occur across dialects within a language.
Clearly, F0 shifts in these studies are indicative of linguistic or cultural as opposed to physical causal factors. However, physical factors such as laryngeal size cannot be ruled out. In any event, adequate clinical assessment of voice requires that the speaker's linguistic background is considered when comparing a patient's F0 value to norms.
Unfortunately, norms for fundamental frequency are lacking for many populations, especially in multilingual populations like in India. Few normative studies have been published on F0 or other vocal parameters in Hindi speakers; therefore, normative data from studies of Western countries are generally used clinically, perhaps ill-advisedly. On the surface, some findings indicate that the use of Western normative data for F0 in Indian Hindi speakers may be reasonable, including studies that failed to detect differences in F0 values for sustained vowels for Hindi compared to American English speakers. However, F0 may vary significantly with actual language production as opposed to simple, sustained phonation. If running speech is used to extract habitual F0, there would be value in knowing whether F0 values differ across native speakers of different languages, including speakers producing their native language and postnatively acquired languages. Findings could provide information in the ultimate generation of F0 norms for international speakers, particularly Hindi speakers, including those living outside of India with over one million alone in the US. Findings could also be useful for theoretical speculation about the role of physical versus linguistic and cultural factors in F0 and other vocal variations across populations.
Comparison of F0 during isolated vowel phonation was completed across Hindi/English speakers and monolingual American English speakers to extend research of previous studies. To extend the understanding of the voice of the Hindi/English bilingual speaker, comparison of reading and monolog tasks was also undertaken for the two groups. Within-group comparisons of F0 in the bilingual group were completed to control for physical difference of the vocal mechanism inherent to across-group comparisons, allowing examination of the influence of the two languages on mean F0. Finally, variables used to explore relationships for pronunciation/accent in second-language learners were applied to evaluate relationships between these variables and F0, including length of time in the US, use of English in daily living, and confidence in speaking English. The addition of running speech tasks, within- and cross-group comparisons, and the exploration of language learner variables should contribute to identifying potential variations in F0 across the two languages.
| Methods|| |
This study was approved by the Institutional Review Board at the University of Central Missouri, Warrensburg, Missouri. All participants signed an informed consent before participation. Thirty native Indian Hindi male speakers (age: 20–25 years) and 30 age-matched native English male speakers, all living in the US, were recruited for participation. A “snowball” recruitment method was used to attract subjects. On the day of participation, all speakers exhibited normal voice, speech, and language in both Hindi (Indian speakers) and English (all speakers), as informally judged by a speech-language pathologist proficient in both languages. Participants reported no history of articulation, voice, language, or hearing difficulties. Participants reported good general physical health and denied any history of medical conditions that might adversely influence voice. Participants were nonsmokers and had no previous formal voice training. Each participant passed a bilateral pure-tone hearing screening.
All native Indian Hindi speakers considered English their second language and reported learning both spoken and written English as part of their formal education in India though formal education varied from person to person. All participants were naive to the purposes of the study. [Table 1] provides information on participants' languages spoken and time in the US.
Equipment and data acquisition
Voice samples were recorded in a single-walled, sound-treated chamber (Acoustic System Model RE-143MC) with an average ambient noise of 40 dB. The KayPENTAX Computerized Speech Lab (CSL) Model 4500, (KayPENTAX, Lincoln Park, NJ) was used to collect voice samples. A head-worn cardioid condenser microphone (CROWN CM-311A) was positioned 45° off-axis from the corner of the subject's mouth on the right side of the body.
Five voice samples were collected from native Indian Hindi-speaking participants; three samples were collected from native English-speaking participants. Subjects were instructed to speak at a comfortable pitch and loudness for all tasks. All subjects (i) produced one sustained/a/for 5 s, (ii) read the complete version of the rainbow passage out loud, in English, and (iii) produced a 1–2-min monolog speech sample of subjects' choice about one of four topics: a summer vacation, a favorite movie, a place visited recently, or any aspect of their native country they wished to discuss in English. In addition, native Indian Hindi speakers were asked (iv) to read a Hindi passage comparable in length and difficulty to the English rainbow passage, prepared by a proficient speaker of Hindi and English, and (v) to produce a 1–2-min monolog in Hindi on one of the four topics noted above. The order of task presentation was randomized across speakers.
The real-time pitch program was adopted for analysis of F0 for each sample. Entire sample of sustained/a/was analyzed. In analysis of running speech samples, the midpoint of the sample was identified, and a segment 30 s before and after the midpoint was extracted to acquire a 60-s sample.
The t-statistic was adopted to statistically analyze both between-group comparisons and within-group comparisons for the Hindi speakers (alpha = 0.05). A Bonferroni adjust P = 0.007 (0.05/7) was applied for the seven between- and within-group comparisons for the mean F0. Relationships among dependent variables and descriptive variables associated with the Hindi-speaking group (i.e., number of years living in the USA, confidence in speaking English, and percent of time speaking English in Hindi before coming to the USA) were evaluated visually and through correlation analysis (Pearson product moment correlation coefficient). An adjusted P = 0.006 (0.05/9) was adopted for the correlation analysis.
| Results|| |
Mean F0 findings
An independent-samples t- test compared F0 between sustained/a/phonation in Hindi speakers and native English speakers. No evidence was found suggesting F0 differences during sustained phonation across groups. Results are shown in [Table 2].
Two independent-samples t-tests were conducted to compare average F0 during running speech across the same two speaker groups, when participants spoke in their native language. F0 was compared between Hindi speakers and English speakers reading in their native language. F0 was also compared between Hindi speakers and English speakers producing a monolog in their native language. Indian Hindi speakers' F0 during running speech in Hindi was higher compared to native English speakers' running speech in English for both tasks.
Similarly, two independent-samples t-tests were conducted to compare F0 during running speech across native Indian Hindi and native English speakers, when both groups spoke in English. The first t-test compared F0 between Indian Hindi speakers and native English speakers reading the same English passage. A second t-test compared F0 between Indian Hindi speakers and native English speakers producing the same monolog in English. For both tasks, Hindi speakers' F0 in English was higher than native English speakers' F0.
Two paired-samples t-tests compared F0 during running speech within native Indian Hindi speakers, when speaking in Hindi versus English. The first test compared F0 for Hindi speakers reading in Hindi versus English. A second t-test compared F0 for Hindi speakers producing monologs in English versus Hindi. F0 in Hindi was significantly higher than F0 in English [Table 3].
Running speech tasks and language learning variables
[Table 4] describes the relationships across three language learning variables and the mean F0 for the three speaking tasks completed by Hindi/English speakers. No significant relationships were noted.
|Table 4: Correlations between language learning variables and the mean F0 in English for the three speaking tasks for the Hindi/English bilingual speakers*|
Click here to view
Intra-measurement reliability estimates evaluated consistency of F0 values. Six randomly selected participants from each group were selected to measure F0a second time for all tasks, for a total of 48 speech samples measured. The Pearson correlation coefficient (r) obtained between the first and second frequency measurements made by the investigator was 0.99, and mean absolute intra-measurement error was 5.42 Hz. The correlation was significant at 0.01 levels (two-tailed). High intra-measurement correlations and small absolute error indicate that F0 values measured by the primary investigator were consistent and reliable. Inter-rater reliability of measurement was not assessed.
Data from 12 subjects were randomly selected for retesting to determine within-subject reliability of measures. These participants completed the experimental protocol a second time the next day at the exact same time of day. Samples were measured using the same measurement procedures as previously noted. F0 values for the first data acquisition were compared to the second acquisition. The Pearson r correlation coefficient obtained between the first and second frequency measurements made by the investigator was 0.99, and mean absolute intra-measurement error was 10.88 Hz. This correlation was significant at 0.01 levels (two-tailed). High within-subject correlations and small absolute error indicate that frequency values produced by the participants were consistent and reliable.
| Discussion|| |
F0 did not differ across native Indian Hindi versus native English speakers for sustained vowel production. However, consistent differences were detected across speakers in all running speech tasks. For running speech tasks, F0 was consistently higher for Hindi speakers compared to native English speakers, whether they spoke in Hindi or English [Table 2]. Overall, Hindi speakers' F0s remained roughly three semitones higher than F0s for native English speakers, which may be perceptually notable and influence psychological responses of listeners. This cross-language difference existed even when Indian Hindi speakers spoke in Hindi compared to their own English at about one semitone [Table 3]. Finally, preliminary exploration of relationships between language learning variables and mean F0 for both sustained vowel and English running speech conditions for Indian Hindi speakers was not significant.
Intrinsic F0 for the sustained/a/was virtually identical for Indian Hindi speakers and Anglo-European English American speakers, supporting previous findings of Andrianopoulos et al. Both Hindi and English contain an/a/in phonological inventory of each language. The/a/in Hindi is produced with a slightly advanced tongue position compared to English. Tongue height, however, is similar to English, a factor influencing intrinsic F0 for vowels in words. It appears that similar physiological mechanisms involved in alternating elasticity, tension, and mass of the vocal folds to alter F0 are present in the vowel production of the two speaking groups. Alternate findings exist, however, for tone languages where the F0 tends to be higher., Altenberg and Ferrand suggest not using sustained vowels when measuring F0 for clinical normative data of bilingual speakers unless the clinician can document the language mode of the vowel being phonated.
Significant across-language differences in the mean F0 existed for the connected speech tasks but not for the sustained vowel task, evident in both within- and between-group comparisons. Speaking in English or Hindi, Indian Hindi speakers exhibited a higher F0 compared to the speech of Anglo-European males. Across-language differences exist between Hindi and English, which are further supported by similar F0 for sustained vowel comparisons. The sustained vowel task likely does not engage the suprasegmental or tonal aspects of the language and will not capture across-language differences.
By itself, higher F0 in running speech among Hindi speakers may be the result of anatomical differences between Indian Hindi speakers and Anglo-European English speakers. When comparisons were made between Hindi and English running speech tasks produced solely by Hindi speakers, a significantly higher F0 existed when the Hindi language was used. For this within-speaker comparison, anatomical differences cannot account for cross-language difference in mean F0. The one semitone increase in F0 while speaking in Hindi could be the result of differences in prosody between the two languages, not studied here.
In terms of cultural factors, habitual speaking intensity of Indian speakers is greater than their European peers due to their habitual need to speak in conditions where high levels of environmental noise exist. A positive relationship exists between intensity of the voice and F0. On average, a 2–4 Hz increase in F0 occurs with every centimeter increase in subglottal pressure. Greater intensity and reduced vital capacity may be associated with increased adduction in the vocal folds to supplement maintenance of subglottal pressure. Data on speaking intensity and/or subglottal pressure should be obtained as a covariate when performing cross-group language comparisons to verify speaking intensity of Hindi speakers.
Finally, a lack of significant relationships between second-language acquisition and level of exposure, practice in daily life, and confidence with language may be the result of several factors. Most Indian Hindi speakers had experience speaking additional languages, and the influence of these languages on learning F0 in American English is uncertain. Further, only eight Indian Hindi speakers indicated that Hindi was their most comfortable language. The more comfortable one is with a second language, the more likely native features of the language will be transmitted to a second language.
The length of exposure for participants living in the US was shorter than many studies. A relationship may not exist between this variable and F0, or the range of exposure time was not sufficient. Anecdotally, authors in the present study have interacted with Indian Hindi speakers who exhibited typical pitch values while speaking American English, but exposure was longer than in the present study.
The mean percent of time speaking English for Hindi/English speakers during an average day was 56% with a range of 6%–98%, suggesting that time speaking English during the day is not significantly related to F0. Research on phonological acquisition of second-language learners suggests that the quality and/or type of speaking activities participated in may influence the acquisition of second-language features.
Finally, participants indicated how confident they were in speaking English using a 1–5 interval scale, 5 being most confident, to provide an indirect view of English language proficiency. Confidence in speaking English had little to do with the mean F0. Over 70% of the speakers chose a value of 2, indicating limited confidence with the English language. Limited range of confidence level rating reduces the validity of the relationship.
The findings provide that direction for tasks clinicians may use in attempts to obtain valid and reliable measures of F0 in multilingual speakers. Arguably, F0 in sustained phonation might reflect the physical status of the vocal folds, but if a clinician adopts sustained vowel F0 as the sole metric of vocal fold vibration, he or she might miss findings indicated by running speech tasks and may not capture the difference in F0 between languages. The type of task used to sample F0 is central to understanding across-language differences.
Beyond acoustic measures of F0, most clinical evaluations include a perceptual component. With pitch, a clinician will make perceptual judgments generally based on internalized norms for age and gender. Clinicians may need to adjust their points of reference for pitch and possibly other perceptual parameters based on linguistic and cultural group. For the bilingual speaker, precautions may be required for acoustic and perceptual evaluations of voice.
Results from the present study provide a promising direction for future research. One area for future research is the acquisition of normative F0 data among speakers of various Indian languages and dialects. Moreover, data should be established for both monolingual and multilingual Indian speakers, which will improve the objective evaluation of F0 for Indian languages. The influence of other acoustic (and perceptual) parameters in bilingual and multilingual populations beyond F0 should be explored. Research should consider various proficiency levels of bilingual and multilingual speakers to estimate the influence of language proficiency on running speech F0. The contributions of intensity of the voice on F0 should be considered for the Indian Hindi-speaking population. Finally, future studies should also include female native Hindi speakers – who were not readily available in the subject pool – given reports of cross-task variability in F0 for female speakers in general.,
| Conclusions|| |
The central aim of this study was to explore the influence of language among native Hindi speakers and native English speakers, using F0. The results for sustained phonation on/a/did not reveal significant differences across speaker groups. However, in running speech, F0 for native Hindi speakers was consistently higher than for native English speakers, whether the Hindi speakers spoke in Hindi or English. Findings are most readily attributable to linguistic factors. Further exploration is warranted on the potential contribution of anatomical factors in F0 differences in running speech, although the relevance of these factors was not strongly supported by the present data.
Financial support and sponsorship
Conflicts of interest
There are no conflicts of interest.
| References|| |
American Speech-Language-Hearing Associaiton Clinical Management of Communicatively Handicapped Minorty Language Populations (Position Statement); 1985. Available from: http://www.asha.org/policy
. [Last accessed on 2017 Oct 20].
Boone DR, McFarlane SC, Von Berg SL. The Voice and Voice Therapy. Boston, MA: Pearson Education 2005.
Natour YS, Wingate JM. Fundamental frequency characteristics of jordanian arabic speakers. J Voice 2009;23:560-6.
Tom K. Fundamental frequency characteristics of Mexican-American speakers of English and Spanish. In: Annual meeting of the American Speech-Language-Hearing Association; 2004.
Shriberg L, Kent R. Clinical Phonetics. 3rd
ed. Boston, MA: Allyn and Bacon; 2003. p. 99.
Scharff-Rethfeldt W, Miller N. Speaking fundament frequency differences in highly proficient German-English bilinguals. Gehör 2008;32:123-8.
Hanley TD. An analysis of vocal frequency and duration characteristics of selected samples of speech from three American dialect regions. Speech Monogr 1951;18:78-93.
Altenberg EP, Ferrand CT. Fundamental frequency in monolingual English, bilingual English/Russian, and bilingual English/Cantonese young adult women. J Voice 2006;20:89-96.
Andrianopoulos AV, Darrow KN, Chen J. Multimodal standardization of voice among four multicultural populations: Fundamental frequency and spectral characteristics. J Voice 2001;15:194-219.
Guimarães I, Abberton E. Fundamental frequency in speakers of Portuguese for different voice samples. J Voice 2005;19:592-606.
American Speech-Language-Hearing Association. Guidlines for Audiologic Screening; 1997. Available from: http://www.asha.org/policy
. [Last accessed on 2017 Oct 20].
Fairbanks G. Voice and Articulation Drill Book. 2nd
ed. New York: Harper & Row; 1960.
Puts DA, Caulin SJ, Verdolini K. Dominance and the evolution of sexual dimorphism in human voice pitch. Evol Hum Behav 2006;27:283-96.
Ohala M. Handbook of International Phonetics Association: A Guide to the Use of the International Phonetic Alphabet: Hindi. Cambridge: International Phonetic Association; 1999.
Peterson GE, Barney HL. Control methods used in a study of the vowels. J Acous Soc Am 1952;24:175-84.
Keating P, Kuo G. Comparison of speaking fundamental frequency in English and Mandarin. J Acoust Soc Am 2012;132:1050-60.
Hakkesteegt MM, Brocaar MP, Wieringa MH, Feenstra L. Influence of age and gender on the dysphonia severity index. A study of normative values. Folia Phoniatr Logopedia, 2006;60:86-90.
Zraick RI, Skaggs SD, Montague JC. The effect of task on determination of habitual pitch. J Voice 2000;14:484-9.
[Table 1], [Table 2], [Table 3], [Table 4]