参考文献/References:
[1] DAVIS S B, MERMELSTEIN P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1980, 28(4): 357-366.
[2] HUANG X, ACERO A, HON H. Spoken language processing: a guide to theory, algorithm, and system development[M]. New Jersey, USA: Prentice Hall, 2001.
[3] TORFI A, DAWSON J, NASRABADI N M. Text-independent speaker verification using 3D convolutional neural networks[C]// Proceeding of ICME 2018. San Diego, USA: IEEE, 2018: 1-5.
[4] RAVANELLI M, BENGIO Y. Speaker recognition from raw waveform with SincNet[C]// Proceeding of IEEE Spoken Language Technology Workshop (SLT). Athens, IEEE, 2018: 1-8.
[5] LIN Ting, ZHANG Ye. Speaker recognition based on long-term acoustic features with analysis sparse representation[J]. IEEE Access, 2019, 7: 87439-87447.
[6] CORDEIRO H, RIBEIRO C M. Speaker characterization with MLSFs[C]// Proceeding of Speaker and Language Recognition Workshop. San Juan: IEEE, 2006: 1-4.
[7] POUR A F, ASGARI M, HASANABADI M R. Gammatonegram based speaker identification[C]// Proceeding of International Conference on Computer & Knowledge Engineering. Mashhad, Iran: IEEE, 2014: 52-55.
[8] ZHANG Yadong, SUN Fuyuan. A methodology based on wavelet packet for speaker transform recognition[C]// Proceeding of International Conference on Wavelet Analysis and Pattern Recognition. Beijing: IEEE, 2007: 767-771.
[9] KINNUNEN T, LI H. An overview of text-independent speaker recognition: from features to supervectors[J]. Speech Communication, 2010, 52(1): 12-40.
[10] BURTON D K. Text-dependent speaker verification using vector quantization source coding[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987, 35(2): 133-143.
[11] FURUI S. Cepstral analysis technique for automatic speaker verification[J]. IEEE Transactions on Acoustics Speech, and Signal Processing, 1981, 29(2): 254-272.
[12] REYNOLDS D A, ROSE R C. Robust text-independent speaker identification using Gaussian mixture speaker models[J]. IEEE Transactions on Speech and Audio Processing,1995, 3(1): 72-83.
[13] REYNOLDS D A, QUATIERI T F, DUNN R B. Speaker verification using adapted Gaussian mixture models[J]. Digital Signal Processing, 2000, 10(1/2/3): 19-41.
[14] BENZEGHIBA M F, HERV B. User-customized password speaker verification using multiple reference and background models[J]. Speech Communication, 2004, 48(9): 1200-1213.
[15] MOHAMMADI M, MOHAMMADI H R S. Weighted I-vector based text-independent speaker verification system[C]// Proceeding of the 27th Iranian Conference on Electrical Engineering (ICEE). Yazd, Iran: IEEE, 2019: 1647-1653.
[16] VARIANI E, LEI X, MCDERMOTT E, et al. Deep neural networks for small footprint text-dependent speaker verification[C]// Proceeding of IEEE International Conference on Acoustics. Florence, Italy: IEEE, 2014: 4052-4056.
[17] LI Chao, MA Xiaokong, JIANG Bing, et al. Deep speaker: an end-to-end neural speaker embedding system[EB/OL]. arXiv, 2017[2020-09-15]. https://arxiv.org/abs/1705.02304.
[18] HEIGOLD G, MORENO I, BENGIO S, et al. End-to-end text-dependent speaker verification[C]// Proceeding of ICASSP. Shanghai, China: IEEE, 2016: 5115-5119.
[19] VRIES N J, DAVEL M H, BADENHORST J, et al. A smartphone-based ASR data collection tool for under-resourced languages[J]. Speech Communication, 2014, 56: 119-131.
[20] MOKGONYANE T B, SEFARA T J, MODIPA T I, et al. Automatic speaker recognition system based on machine learning algorithms[C]// 2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA). Bloemfontein, South Africa: IEEE, 2019: 141-146.
相似文献/References:
[1]薛丽萍,尹俊勋,纪震.基于粒子群优化-模糊聚类的说话人识别[J].深圳大学学报理工版,2008,25(2):178.
XUE Li-ping,YIN Jun-xun,and JI Zhen.Speaker recognition based on particle swarm optimizition and fuzzy clustering analysis[J].Journal of Shenzhen University Science and Engineering,2008,25(增刊1):178.
[2]解焱陆,张劲松,刘明辉,等.基于分层增长语音活动检测的鲁棒性说话人识别[J].深圳大学学报理工版,2012,29(No.4(283-376)):328.[doi:10.3724/SP.J.1249.2012.04328]
XIE Yan-lu,ZHANG Jing-song,LIU Ming-hui,et al.Robust speaker recognition based on level-building voice activity detection[J].Journal of Shenzhen University Science and Engineering,2012,29(增刊1):328.[doi:10.3724/SP.J.1249.2012.04328]