Two-Level Classification in Determining the Age and Gender Group of a Speaker

Yucesoy, Ergun

dc.contributor.author	Yucesoy, Ergun
dc.date.accessioned	2023-01-06T10:45:23Z
dc.date.available	2023-01-06T10:45:23Z
dc.date.issued	2021
dc.identifier.citation	Yucesoy, E. (2021). Two-Level Classification in Determining the Age and Gender Group of a Speaker. International Arab Journal of Information Technology, 18(5), 663-670.Doi:10.34028/iajit/18/5/5	en_US
dc.identifier.isbn	1683-3198
dc.identifier.uri	http://dx.doi.org/10.34028/iajit/18/5/5
dc.identifier.uri	https://www.webofscience.com/wos/woscc/full-record/WOS:000707933200005
dc.identifier.uri	http://earsiv.odu.edu.tr:8080/xmlui/handle/11489/3378
dc.description	WoS Categories : Computer Science, Artificial Intelligence; Computer Science, Information Systems; Engineering, Electrical & Electronic Web of Science Index : Science Citation Index Expanded (SCI-EXPANDED) Research Areas : Computer Science; Engineering	en_US
dc.description.abstract	In this study, the classification of the speakers according to age and gender was discussed. Age and gender classes were first examined separately, and then by combining these classes a classification with a total of 7 classes was made. Speech signals represented by Mel-Frequency Cepstral Coefficients (MFCC) and delta parameters were converted into Gaussian Mixture Model (GMM) mean supervectors and classified with a Support Vector Machine (SVM). While the GMM mean supervectors were formed according to the Maximum-A-Posteriori (MAP) adaptive GMM-Universal Background Model (UBM) configuration, the number of components was changed from 16 to 512, and the optimum number of components was decided. Gender classification accuracy of the system developed using aGender dataset was measured as 99.02% for two classes and 92.58% for three classes and age group classification accuracy was measured as 67.03% for female and 63.79% for male. In the classification of age and gender classes together in one step, an accuracy of 61.46% was obtained. In the study, a two-level approach was proposed for classifying age and gender classes together. According to this approach, the speakers were first divided into three classes as child, male and female, then males and females were classified according to their age groups and thus a 7-class classification was realized. This two-level approach was increased the accuracy of the classification in all other cases except when 32-component GMMs were used. While the highest improvement of 2.45% was achieved with 64 component GMMs, an improvement of 0.79 was achieved with 256 component GMMs.	en_US
dc.language.iso	eng	en_US
dc.publisher	ZARKA PRIVATE UNIV ZARQA	en_US
dc.relation.isversionof	10.34028/iajit/18/5/5	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	SUPPORT VECTOR MACHINES; AUTOMATIC SPEAKER; IDENTIFICATION	en_US
dc.subject	GMM; mean supervector; speaker age and gender classification; SVM; two level classification	en_US
dc.title	Two-Level Classification in Determining the Age and Gender Group of a Speaker	en_US
dc.type	article	en_US
dc.relation.journal	INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY	en_US
dc.contributor.department	Ordu Üniversitesi	en_US
dc.identifier.volume	18	en_US
dc.identifier.issue	5	en_US
dc.identifier.startpage	663	en_US
dc.identifier.endpage	670	en_US

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Makale Koleksiyonu

Show simple item record

Search DSpace

Advanced Search

Browse

All of DSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

Two-Level Classification in Determining the Age and Gender Group of a Speaker

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account