A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM

Authors

  • Kumari Piu Gorai School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India
  • Thomas Abraham School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India

DOI:

https://doi.org/10.22159/ajpcr.2017.v10s1.19596

Keywords:

Speaker recognition, Mel-frequency cepstral coefficients, Gaussian mixture model, Support vector machine, Robust speaker recognition system

Abstract

A human being has lot of unique features and one of them is voice. Speaker recognition is the use of a system to distinguish and identify a person from his/
her vocal sound. A speaker recognition system (SRS) can be used as one of the authentication technique, in addition to the conventional authentication methods. This paper represents the overview of voice signal characteristics and speaker recognition techniques. It also discusses the advantages and problem of current SRS. The only biometric system that allows users to authenticate remotely is voice-based SRS, we are in the need of a robust SRS.

Downloads

Download data is not yet available.

References

Campbell JP. Speaker recognition: A tutorial. Proceedings of the IEEE. Vol. 85. No. 9. September; 1997.

Doddington GR. Speaker recognition—Identifying people by their voices. Proc IEEE 1985;73;1651-64.

Kenny P, Boulianne G, Ouellet P, Dumouchel P. Speaker and session variability in GMM-based speaker verification. IEEE Trans Audio Speech Lang Process 2007;15(4):1448-60.

Kinnunen T, Li H. An overview of text-independent speaker recognition: From features to supervectors. Speech Commun 2010;52:12-40.

Togneri R, Pullella D. An overview of speaker identification: Accuracy and robustness issues. IEEE Circuits and Systems Magazine Second Quarter. 2011. p. 23-60.

Liu G, Lei Y, Hansen JH. Robust feature front-end for speaker identification. In: Proceeding ICASSP. Kyoto, Japan: March; 2012. p. 4233-6.

Campbell W, Campbell J, Reynolds D, Singer E, Carrasquillo PT. Support vector machines for speaker and language recognition. Comput Speech Lang 2006;20(2-3):210-29.

May T, van de Par S, Kohlrausch A. Noise-robust speaker recognition combining missing data techniques and universal background modeling. IEEE Trans Audio Speech Lang Process 2012;20(1):108-21.

Solomonoff A, Campbell WM, Boardman I. Advances in channel compensation for SVM speaker recognition. In: Proceeding ICASSP. 2005. p. 629-32.

Reynolds D, Rose R. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 1995;3(1):72-83.

Alam MJ, Kenny P, Shaughnessy DO. Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique. Digit Signal Process 2014;29:147-57.

Published

01-04-2017

How to Cite

Gorai, K. P., and T. Abraham. “A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM”. Asian Journal of Pharmaceutical and Clinical Research, vol. 10, no. 13, Apr. 2017, pp. 140-2, doi:10.22159/ajpcr.2017.v10s1.19596.

Issue

Section

Original Article(s)