Text-independent speaker identification system based on the histogram of DCT-cepstrum coefficients

S. Al-Rawahy*, A. Hossen, U. Heute

*المؤلف المقابل لهذا العمل

نتاج البحث: المساهمة في مجلةArticleمراجعة النظراء

8 اقتباسات (Scopus)

ملخص

There are several known feature sets for text-independent speaker-identification systems, most of which depend on spectral information. Among these feature sets as a most successful one, there is the set of the Mel-Frequency Cepstrum Coefficients (MFCC). This paper introduces a new feature set, namely, the Histogram of the DCT-Cepstrum Coefficients, inspired by the common use of the MFCC, but simpler and faster in computation. A text-independent speaker-identification system based on the DCT-Cepstrum Histogram and Gaussian Mixture Model (GMM) is implemented. The new feature was tested using speech files from the ELSDSR database and TIMIT corpus. The new feature set managed to achieve high efficiency rates with speaker identification accuracy of 100% on 23 speakers from the ELSDSR database, and 99% on 630 speakers from the TIMIT corpus.

اللغة الأصليةEnglish
الصفحات (من إلى)141-161
عدد الصفحات21
دوريةInternational Journal of Knowledge-Based and Intelligent Engineering Systems
مستوى الصوت16
رقم الإصدار3
المعرِّفات الرقمية للأشياء
حالة النشرPublished - 2012

ASJC Scopus subject areas

  • ???subjectarea.asjc.1700.1712???
  • ???subjectarea.asjc.2200.2207???
  • ???subjectarea.asjc.1700.1702???

بصمة

أدرس بدقة موضوعات البحث “Text-independent speaker identification system based on the histogram of DCT-cepstrum coefficients'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا