• Users Online: 366
  • Home
  • Print this page
  • Email this page
Home About us Editorial board Search Ahead of print Current issue Archives Submit article Instructions Subscribe Contacts Login 
ORIGINAL ARTICLE
Year : 2015  |  Volume : 29  |  Issue : 2  |  Page : 28-34

Identifying speaker from disguised speech using aural perception and Mel-frequency cepstral coefficient


1 Department of ENT, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India
2 Department of Speech and Hearing, School of Allied Health Sciences, Manipal University, Manipal, Karnataka, India

Correspondence Address:
J Praveena
Department of ENT, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry
India
Login to access the Email id

Source of Support: None, Conflict of Interest: None


DOI: 10.4103/0974-2131.185974

Rights and Permissions

Objective: The present study intended to compare the accuracy of speaker identification using aural perception and semiautomatic method (Mel –Frequency Cepstral Coefficient; MFCC), when the speech is in disguise condition by using the handkerchief during recording and to check the percentage of correct identification in the semiautomatic method when the vowel and consonant segments were used for analysis. Methods: Thirty speaker's single sentence speech sample was recorded in undisguised and disguised conditions were randomly paired into the sets of one undisguised followed by five disguised samples for the task of speaker identification. In aural perceptual method the five judges listened to the samples and made a decision on the match. In MFCC method, from /ðə/ segment, ten coefficient values were extracted. The coefficient values were manually averaged and the pair that obtained the lowest value of Euclidean distance was determined to be the sample of the same speaker. The Kappa agreement was used to find the agreement between the two methods in speaker identification and the percentage of correct identification was calculated for the vowel and consonant segment analysis. Results: The results revealed the kappa value to be negative (k < 0) indicating no agreement between the two methods. The percentage of correct identification using aural perception ranged from 56.7% - 80% and for MFCC under whole word, consonant segment and vowel segment analysis were 46.7%, 26.7% and 53.33% respectively. Conclusion: The aural perception method had a greater percentage of correct identification than MFCC though it was not statistically significant for speaker identification from disguised speech.


[FULL TEXT] [PDF]*
Print this article     Email this article
 Next article
 Previous article
 Table of Contents

 Similar in PUBMED
   Search Pubmed for
   Search in Google Scholar for
 Related articles
 Citation Manager
 Access Statistics
 Reader Comments
 Email Alert *
 Add to My List *
 * Requires registration (Free)
 

 Article Access Statistics
    Viewed1743    
    Printed179    
    Emailed0    
    PDF Downloaded258    
    Comments [Add]    

Recommend this journal