|
[1] A. Black and K. Lenzo, “Limited domain synthesis,” ICSLP, pp. 411–414, 2000.
[2] A. Hunt and A. Black, “Unit selection in a concatenative speech synthesis system using a large speech database,” ICASSP, pp. 373–376, 1996.
[3] K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, “Speech parameter generation algorithms for HMM-based speech synthesis,” Proc. ICASSP, June 2000.
[4] J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, “Speaker-independent HMM-based speech synthesis system: HTS-2007 system for the Blizzard Challenge,” Blizzard 2007, 2007.
[5] S. Imai, “Cepstral analysis synthesis on the mel-frequency scale,” ICASSP, 1983.
[6] 王小川, “語音訊號處理” [Speech Signal Processing] (in Chinese).
[7] Wavesurfer, http://www.speech.kth.se/wavesurfer/
[8] K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, “Mel-generalized cepstral analysis – a unified approach to speech spectral estimation,” Proc. ICASSP, pp. 1043–1046, 1994.
[9] T. Kobayashi and S. Imai, “Spectral analysis using generalized cepstrum,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 1087–1089, Oct. 1984.
[10] K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, “Mel-generalized cepstral analysis – a unified approach to speech spectral estimation,” Citeseer, 1994.
[11] H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Hidden semi-Markov model based speech synthesis,” Proc. ICSLP, 2004.
[12] T. Yoshimura, T. Masuko, K. Tokuda, T. Kobayashi, and T. Kitamura, “Duration modeling for HMM-based speech synthesis,” Proc. ICSLP-98, vol. 2, Tu3A4, pp. 29–32, Nov. 1998.
[13] K. Tokuda, T. Kobayashi, and S. Imai, “Speech parameter generation from HMM using dynamic features,” Proc. ICASSP, pp. 660–663, 1995.
[14] K. Shinoda and T. Watanabe, “MDL-based context-dependent subword modeling for speech recognition,” J. Acoust. Soc. Japan (E), vol. 21, pp. 79–86, Mar. 2000.
[15] C. Leggetter and P. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models,” Comput. Speech Lang., vol. 9, no. 2, pp. 171–185, 1995.
[16] J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, “Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm,” IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 1, pp. 66–83, Jan. 2009.
[17] K. Shinoda and C. Lee, “A structural Bayes approach to speaker adaptation,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276–287, Mar. 2001.
[18] Hidden Markov Model Toolkit (HTK), http://htk.eng.cam.ac.uk/
[19] Speech Signal Processing Toolkit (SPTK), http://sp-tk.sourceforge.net/
[20] HMM-based Speech Synthesis System (HTS), http://hts.sp.nitech.ac.jp/
[21] HTS engine, http://hts-engine.sourceforge.net/
[24] Hidden Markov Model Toolkit (HTK), http://htk.eng.cam.ac.uk/
|