Please use this identifier to cite or link to this item:
Title: Comparison of MFCC, LPCC and PLP Features for The Determination of a Speaker's Gender
Authors: Yucesoy, Ergun
Nabiyev, Vasif V.
Ordu Üniversitesi
Keywords: Gender identification, Gaussian Mixture Model (GMM), MFCC, LPCC, PLP
Issue Date: 2014
Publisher: IEEE-NEW YORK
Citation: Yücesoy, E., Nabiyev, VV. (2014). Comparison of MFCC, LPCC and PLP Features for The Determination of a Speaker's Gender. , 321-324
Abstract: Gender information is a distinctive and the most important property in a speech. Determination of this information from a speech signal is a substantial subject. Gender information used for various purposes in many applications, provides the less error rate by defining the gender-dependent speech/speaker models. In this study, a system determining the gender of a speaker with no dependency from a text is proposed. In the proposed system, speech records represented by Mel Frequency Cepstral Coefficients (MFCC), Linear Prediction Cepstral Coefficients (LPCC) and Perceptual Linear Prediction Coefficients (PLP) features are classified according to the genders by using GMM model. In the study, the effect of feature type and its dimension and the number of GMM components on the success are comparatively investigated. In the experiments, the best result is obtained as 99.37% with 16-coefficient MFCC and 8-component GMM.
Description: WoS Categories: Engineering, Electrical & Electronic; Telecommunications
Web of Science Index: Conference Proceedings Citation Index - Science (CPCI-S)
Research Areas: Engineering; Telecommunications
Conference Title: 22nd IEEE Signal Processing and Communications Applications Conference (SIU)
ISBN: 978-1-4799-4874-1
ISSN: 2165-0608
Appears in Collections:Makale Koleksiyonu

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.