A Survey of Time Scale Modification of Speech (语音时长规整技术的研究回溯)
Zhou Jun (周俊), Gao Yue (高悦), Tan Wei (谭薇), Chen Yanpu (陈砚圃)
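Abstract:
Time scale modification (TSM) of speech compresses or stretches a speech signal in time without changing its pitch and while preserving good sound quality. This paper first reviews the development of TSM and its main implementation methods, with emphasis on the principles of the main algorithms. Two time-domain algorithms suited to real-time processing are then implemented in simulation, and their results are compared and analyzed. Finally, the outlook for speech TSM is discussed.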
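The abstract does not say which two time-domain algorithms the authors simulated, nor what parameters they used. As an illustration only of the general idea (changing duration while keeping pitch), the following is a minimal sketch of a generic synchronized overlap-add (SOLA) style method. The function name `sola`, the frame settings, the normalized cross-correlation search, and the linear cross-fade are all assumptions made for this sketch, not details taken from the paper.

```python
import numpy as np


def sola(x, alpha, frame_len=512, analysis_hop=128, search=64):
    """Time-scale x by roughly alpha (>1 stretches, <1 compresses).

    Hypothetical SOLA-style sketch; all defaults are illustrative assumptions.
    """
    x = np.asarray(x, dtype=float)
    synthesis_hop = int(round(alpha * analysis_hop))
    if len(x) < frame_len:
        raise ValueError("signal shorter than one frame")
    # This bound guarantees every candidate position overlaps the output so far.
    if synthesis_hop < 1 or synthesis_hop + 2 * search >= frame_len:
        raise ValueError("alpha out of range for these frame settings")

    n_frames = 1 + (len(x) - frame_len) // analysis_hop
    out = x[:frame_len].copy()  # the first frame is copied unchanged

    for m in range(1, n_frames):
        # Frames are read every analysis_hop samples ...
        frame = x[m * analysis_hop : m * analysis_hop + frame_len]
        # ... and placed near every synthesis_hop samples in the output.
        nominal = m * synthesis_hop

        # Search around the nominal position for the best waveform match.
        best_start, best_score = nominal, -np.inf
        for k in range(-search, search + 1):
            start = nominal + k
            if start < 0:
                continue
            overlap = min(len(out) - start, frame_len)
            a, b = out[start:start + overlap], frame[:overlap]
            score = np.dot(a, b) / (np.sqrt(np.dot(a, a) * np.dot(b, b)) + 1e-12)
            if score > best_score:
                best_start, best_score = start, score

        # Linear cross-fade over the overlap, then append the rest of the frame.
        overlap = min(len(out) - best_start, frame_len)
        fade_in = np.linspace(0.0, 1.0, overlap, endpoint=False)
        mixed = ((1.0 - fade_in) * out[best_start:best_start + overlap]
                 + fade_in * frame[:overlap])
        out = np.concatenate([out[:best_start], mixed, frame[overlap:]])

    return out
```

For example, `sola(x, 1.5)` returns a signal about 1.5 times as long as `x`, and `sola(x, 0.5)` one about half as long. Because each frame is copied at its original sampling rate and only its position is shifted, the local waveform period, and hence the pitch, is left unchanged.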
Keywords: time scale modification; fixed synchronized overlap-add; phase vocoder; sinusoidal model