[1] XU Ming, CHEN Zhi-kun, HUANG Yun-sen. A novel pitch tracking method based on FFT-ACF and estimation of pitch candidates[J]. Journal of Shenzhen University Science and Engineering, 2007, 24(4): 388-392.

A novel pitch tracking method based on FFT-ACF and estimation of pitch candidates

Journal of Shenzhen University Science and Engineering [ISSN: 1000-2618 / CN: 44-1401/N]

Volume:
24
Issue:
No. 4, 2007
Pages:
388-392
Column:
Publication date:
2007-10-30

Article Info

Title:
A novel pitch tracking method based on FFT-ACF and estimation of pitch candidates
Article ID:
1000-2618(2007)04-0388-05
Author(s):
XU Ming 1), CHEN Zhi-kun 1) 2), and HUANG Yun-sen 1)
1) Information Center, Shenzhen University, Shenzhen 518060, P. R. China
2) College of Information Engineering, Shenzhen University, Shenzhen 518060, P. R. China
Keywords:
speech signal processing; pitch tracking; FFT-ACF; post-processing algorithm; estimation of pitch candidates
CLC number:
TP 391; TN 912
Document code:
A
Abstract:

To reduce the doubling and halving errors that are common in speech pitch tracking, the FFT-ACF algorithm is used to estimate pitch candidates, and a new multi-post-processing algorithm that operates on these candidates is proposed. First, a peak-selecting method makes a preliminary selection among the pitch candidates. Second, a first-mean method divides the singing speech into segments of different pitch. Third, a second-mean method determines a suitable frequency range for each pitch segment. Finally, the precise pitch period is extracted from the speech signal. Experiments show that the proposed multi-post-processing algorithm outperforms other algorithms and performs well in a query-by-singing system.
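The candidate-estimation step named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it only shows the general idea of computing an autocorrelation function through the FFT (ACF = IFFT(|FFT(x)|^2)) and listing the strongest ACF peaks as pitch candidates. The function names, the 60-500 Hz search range, and the limit of three candidates are assumptions made for illustration; the paper's peak-selecting, first-mean, and second-mean stages are not reproduced here.

```python
import cmath


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def ifft(x):
    """Inverse FFT computed with the conjugate trick."""
    n = len(x)
    y = fft([v.conjugate() for v in x])
    return [v.conjugate() / n for v in y]


def fft_acf(frame):
    """Normalized autocorrelation of one frame, computed via the FFT."""
    n = len(frame)
    size = 1
    while size < 2 * n:
        size *= 2
    # Zero-pad to at least twice the frame length so that the circular
    # correlation produced by the FFT equals the linear one.
    padded = list(frame) + [0.0] * (size - n)
    power = [abs(v) ** 2 for v in fft(padded)]
    acf = [v.real for v in ifft(power)][:n]
    return [v / acf[0] for v in acf] if acf[0] > 0 else acf


def pitch_candidates(frame, sample_rate, f_min=60.0, f_max=500.0,
                     max_candidates=3):
    """Return candidate pitch frequencies (Hz), strongest ACF peak first.

    f_min, f_max and max_candidates are illustrative assumptions.
    """
    acf = fft_acf(frame)
    lag_min = max(int(sample_rate / f_max), 1)
    lag_max = min(int(sample_rate / f_min), len(acf) - 2)
    peaks = [
        (acf[lag], sample_rate / lag)
        for lag in range(lag_min, lag_max + 1)
        if acf[lag] > acf[lag - 1] and acf[lag] >= acf[lag + 1]
    ]
    peaks.sort(reverse=True)  # highest ACF value first
    return [freq for _, freq in peaks[:max_candidates]]
```

For a 400-sample frame of a 200 Hz sine sampled at 8000 Hz, the strongest candidate is 200 Hz, and 100 Hz also appears as a weaker candidate. Sub-harmonic candidates of this kind are what lead to the halving errors that the post-processing stages are meant to suppress.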

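Memo

Received: 2007-04-04; Revised: 2007-06-11
Funding: Shenzhen Science and Technology Plan Project (QK200601)
About the author: XU Ming (1967-), male (Han), from Huaihua, Hunan Province, senior engineer at Shenzhen University. E-mail: xuming@szu.edu.cn
Last Update: 2007-12-05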