Publications
2010
- Yusuke Ijima, Takashi Nose, Makoto Tachibana, Takao Kobayashi,
``A rapid model adaptation technique for emotional speech recognition with style estimation based on multiple-regression HMM,''
IEICE Trans. on Information and Systems, vol.E93-D, 1, pp.107-115 (2010.01)
- Takashi Nose, Takao Kobayashi,
``A technique for estimating intensity of emotional expressions and speaking styles in speech based on based on multiple-regression HSMM,''
IEICE Trans. on Information and Systems, vol.E93-D, 1, pp.116-124 (2010.01)
2009
- Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhenhua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, Steve Renals,
``A robust speaker-adaptive HMM-based text-to-speech synthesis,''
IEEE Trans. on Audio, Speech, and Language Processing, vol.17, 6, pp.1208-1230 (2009.08)
- Suphattharachai Chomphan, Takao Kobayashi,
``Tone correctness improvement in speaker-independent average-voice-based Thai speech synthesis,''
Speech Communication, vol.51, 4, pp.330-343 (2009.04)
- Takashi Nose, Makoto Tachibana, Takao Kobayashi,
``HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation,''
IEICE Trans. on Information and Systems, vol.E92-D, 3, pp.489-497 (2009.03)
- Junichi Yamagishi, Takao Kobayashi, Yuji Nakano, Katsumi Ogata, Juri Isogai,
``Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm,''
IEEE Trans. on Audio, Speech, and Language Processing, vol.17, 1, pp.66-83 (2009.01)
- Heiga Zen, Keiichiro Oura, Takashi Nose, Junichi Yamagishi, Shinji Sako, Tomoki Toda, Takashi Masuko, Alan W. Black, Keiichi Tokuda,
``Recent development of the HMM-based speech synthesis system (HTS),''
2009 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2009, MP-SS1-2, Sapporo, Japan. (2009.10)
- Yusuke Ijima, Takeshi Matsubara, Takashi Nose, Takao Kobayashi,
``Speaking style adaptation for spontaneous speech recognition using multiple-regression HMM,''
Proc. 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, pp.552-555, Brighton, U.K. (2009.09)
- Takashi Nose, Junichi Asada, Takao Kobayashi,
``HMM-based speaker characteristics emphasis using average voice model,''
Proc. 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, pp.2631-2634, Brighton, U.K. (2009.09)
- Ryo Taguchi, Naoto Iwahashi, Takashi Nose, Kotaro Funakoshi, Mikio Nakano,
``Learning lexicons from spoken utterances based on statistical model selection,''
Proc. 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, pp.2731-2734, Brighton, U.K. (2009.09)
- Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi,
``Emotional speech recognition based on style estimation and adaptationwith multiple-regression HMM,''
Proc. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, pp.4157-4160, Taipei, Taiwan (2009.04)
2008
- Suphattharachai Chomphan, Takao Kobayashi,
``Tone correctness improvement in speaker dependent HMM-based Thai speech synthesis,''
Speech Communication, vol.50, 5, pp.392-404 (2008.05)
- Junichi Yamagishi, Hisashi Kawai, Takao Kobayashi,
``Phone duration modeling using gradient tree boosting,''
Speech Communication, vol.50, 5, pp.405-415 (2008.05)
- Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi,
``An on-line adaptation technique for emotional speech recognition using style estimation with multiple-regression HMM,''
Proc. 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, pp.1297-1300, Brisbane, Australia (2008.09)
- Takashi Nose, Yoichi Kato, Makoto Tachibana, Takao Kobayashi,
``An estimation technique of style expressiveness for emotional speech using model adaptation based on multiple-regression HSMM,''
Proc. 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, pp.2759-2762, Brisbane, Australia (2008.09)
- Makoto Tachibana, Shinsuke Izawa, Takashi Nose, Takao Kobayashi,
``Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis,''
Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008, pp.4633-4636, Las Vegas, USA (2008.04)
- Suphattharachai Chomphan, Takao Kobayashi,
``Incorporation of phrase intonation to context clustering for average voice models in HMM-based Thai speech synthesis,''
Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008, pp.4637-4640, Las Vegas, USA (2008.04)
- Suphattharachai Chomphan, Takao Kobayashi,
``A context clustering technique for improvement of tone intelligibility of average-voice-based Thai speech synthesis,''
Asian Workshop on Speech Science and Technology,, IEICE Technical Report, Vol.107 No.551, SP2007-194, pp.45-50, Tokyo (2008.03)
- Kiyoto Ichikawa, Takeshi Mita, Osamu Hori, Takao Kobayashi,
``Component-based face detection method for various types of occluded faces,''
Proc. 2008 3rd International Symposium on Communications, Control, and Signal Processing, ISCCSP2008, pp.538-543, Malta (2008.03)
2007
- Takashi Nose, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi,
``A style control technique for HMM-based expressive speech synthesis,''
IEICE Trans. Information and Systems, E90-D, 9, pp.1406-1413 (2007.09)
- Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``A Hidden semi-Markov model-based speech synthesis system,''
IEICE Trans. Information and Systems, E90-D, 5, pp.825-834 (2007.05)
- Heiga Zen, Takashi Masuko, Keiichi Tokuda, Takashi Yoshimura, Takao Kobayashi, Tadashi Kitamura,
``State duration modeling for HMM-based speech synthesis,''
IEICE Trans. Information and Systems, E90-D, 3, pp.692-693 (2007.03)
- Junichi Yamagishi, Takao Kobayashi,
``Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training,''
IEICE Trans. Information and Systems, E90-D, 2, pp.533-543 (2007.02)
- Makoto Tachibana, Keigo Kawashima, Junichi Yamagishi, Takao Kobayashi,
``Performance evaluation of HMM-based style classification with a small amount of training data,''
Proc. 8th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, pp.2261--2264, Antwerp, Belgium (2007.08)
- Takashi Nose, Yoichi Kato, Takao Kobayashi,
``Style estimation of speech based on multiple regression hidden semi-Markov model,''
Proc. 8th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, pp.2285--2288, Antwerp, Belgium (2007.08)
- Suphattharachai Chomphan, Takao Kobayashi,
``Implementation and evaluation of an HMM-based Thai speech synthesis system,''
Proc. 8th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, pp.2849--2852, Antwerp, Belgium (2007.08)
- Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, Keiichi Tokuda,
``Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV,''
Proc. 6th ISCA Workshop on Speech Synthesis, SSW6-2007, pp.125--130, Bonn, Germany (2007.08)
- Suphattharachai Chomphan, Takao Kobayashi,
``Design of tree-based context clustering for an HMM-based Thai speech synthesis system,''
Proc. 6th ISCA Workshop on Speech Synthesis, SSW6-2007, pp.160--165, Bonn, Germany (2007.08)
- Takashi Nose, Yoichi Kato, Takao Kobayashi,
``A speaker adaptation technique for MRHSMM-based style control of synthetic speech,''
Proc. 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007, vol.IV, pp.833-836, Honolulu, USA (2007.04)
- Junichi Yamagishi, Takao Kobayashi, Makoto Tachibana, Katsumi Ogata, Yuji Nakano,
``Model adaptation approach to speech synthesis with diverse voices and styles,''
Proc. 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007, vol.IV, pp.1233-1236, Honolulu, USA (2007.04)
2006
- Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi,
``A style adaptation technique for speech synthesis using HSMM and suprasegmental features,''
IEICE Trans. Information and Systems, E89-D, 3, pp.1092-1099 (2006.03)
- Takashi Nose, Junichi Yamagishi, Takao Kobayashi,
``A style control technique for speech synthesis using multiple regression HSMM,''
Proc. 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, pp.1324--1327, Pittsburgh, USA (2006.09)
- Katsumi Ogata, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi,
``Acoustic model training based on linear transformation and map modification for hsmm-based speech synthesis,''
Proc. 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, pp.1328--1331, Pittsburgh, USA (2006.09)
- Yuji Nakano, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi,
``Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis,''
Proc. 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, pp.2286--2289, Pittsburgh, USA (2006.09)
- Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi,
``A technique for controlling voice quality of synthetic speech using multiple regression HSMM,''
Proc. 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, pp.2438--2441, Pittsburgh, USA (2006.09)
- Junichi Yamagishi, Katsumi Ogata, Yuji Nakano, Juri Isogai, Takao Kobayashi,
``HSMM-based model adaptation algorithms for average-voice-based speech synthesis,''
Proc. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006, vol.I, pp.77-80, Toulouse, France (2006.05)
2005
- Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi,
``Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing,''
IEICE Trans. Information and Systems, E88-D, 11, pp.2484-2491 (2005.11)
- Naotake Niwase, Junichi Yamagishi, Takao Kobayashi,
``Human walking motion synthesis with desired pace and stride length based on HSMM,''
IEICE Trans. Information and Systems, E88-D, 11, pp.2492-2499 (2005.11)
- Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi, ``Acoustic modeling of speaking styles and emotional expressions in HMM-based speech synthesis,'' IEICE Trans. Information and Systems, E88-D, 3, pp.502-509 (2005.03)
- Takashi Yamazaki, Naotake Niwase, Junichi Yamagishi, Takao Kobayashi,
``Human walking motion synthesis based on multiple regression hidden semi-Markov model,''
Second International Workshop on Language Understanding and Agents for Real World Interaction, LUAR 2005, pp.445-452, Singapore (2005.11)
- Vladimir Braquet, Takao Kobayashi,
``A Wavelet based noise reduction algorithm for speech signal corrupted by coloured noise,''
Proc. 9th European Conference on Speech Communication and Technology, INTERSPEECH 2005, pp.2073-2076, Lisbon, Portugal (2005.09)
- Juri Isogai, Junichi Yamagishi, Takao Kobayashi,
``Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis,''
Proc. 9th European Conference on Speech Communication and Technology, INTERSPEECH 2005, pp.2597-2600, Lisbon, Portugal (2005.09)
- Makoto Tachibana, Junich Yamagishi, Takashi Masuko, Takao Kobayashi,
``Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis,''
Proc. 9th European Conference on Speech Communication and Technology, INTERSPEECH 2005, pp.2805-2808, Lisbon, Portugal (2005.09)
- Junichi Yamagishi, Takao Kobayashi, ``Adaptive training for hidden semi-Markov model,'' Proc. 2005 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2005, vol.I, pp.365-368, Philadelphia, USA (2005.03)
- Dhany Arifianto, Takao Kobayashi, ``Voiced/Unvoiced determination of speech signal in noisy environment using harmonicity measure based on instantaneous frequency,'' Proc. 2005 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2005, vol.I, pp.877-880, Philadelphia, USA (2005.03)
2004
- Dhany Arifianto, Tomohiro Tanaka, Takashi Masuko, Takao Kobayashi, ``Robust F0 Estimation of speech signal using Harmonicity measure based on instantaneous frequency,'' IEICE Trans. Information and Systems, E87-D, 12, pp.2812-2820 (2004)
- Junichi Yamagishi, Takashi Masuko, Takao Kobayashi, ``MLLR adaptation for hidden semi-Markov model based speech synthesis,'' Proc. the 8th International Conference on Spoken Language Processing, INTERSPEECH 2004-ICSLP, vo.II, pp.1213-1216, WeB1403p.14, Jeju Island, Korea (2004.10)
- Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura ``Hidden semi-Markov model based speech synthesis,'' Proc. the 8th International Conference on Spoken Language Processing, INTERSPEECH 2004-ICSLP, vol.II, pp1393-1396, WeC1401o.5, Jeju Island, Korea (2004.10)
- Keisuke Miyanaga, Takashi Masuko, Takao Kobayashi, ``A style control technique for HMM-based speech synthesis,'' Proc. the 8th International Conference on Spoken Language Processing, INTERSPEECH 2004-ICSLP, vol.II, pp.1437-1440, Spec4701o.4, Jeju Island, Korea (2004.10)
- Junichi Yamagishi, Makoto Tachibana, Takashi Masuko, Takao Kobayashi, ``Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis,'' Proc. the 2004 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2004, vol.I, pp.5-8, Montreal, Canada (2004.05)
- Masatsugu Okazaki, Toshifumi Kunimoto, Takao Kobayashi, ``Multi-stage spectral subtraction for enhancement of audio signals,'' Proc. the 2004 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2004, vol.II, pp.805-808, Montreal, Canada (2004.05)
- Masatsugu Okazaki, Toshifumi Kunimoto, Takao Kobayashi, ``A noise reduction technique for audio signals using Multi stage spectral subtraction,'' Proc. the 18th International Congress on Acoustics, ICA 2004, vol.IV, pp.3113-3114, Kyoto (2004.04)
- Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi, ``HMM-based speech synthesis with various speaking styles using model interpolation'' Proc. the 2nd International Conference on Speech Prosody, SP2004, pp.413-416, Nara, Japan (2004.03)
- Junichi Yamagishi, Takashi Masuko, Takao Kobayashi, ``HMM-based expressive speech synthesis ---Towards TTS with arbitrary speaking styles and emotions,'' Special Workshop in MAUI (SWIM), Lectures by Masters in Speech Processing,, Conference CD-ROM, 1.13 (4 pages), Maui, Hawaii (2004.01)
- Takahiro Hoshiya, Heiga Zen, Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura, ``An HMM-based approach to speaker-dependent 100 bit/s speech coding,'' Special Workshop in MAUI (SWIM), Lectures by Masters in Speech Processing,, Conference CD-ROM, 1.5 (6 pages), Maui, Hawaii (2004.01)
2003
- Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, ``A training method of average voice model for HMM-based speech synthesis,'' IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, E86-A, 8, pp.1956-1963 (2003)
- Toru Takahashi, Keiichi Tokuda, Takao Kobayashi, Tadashi Kitamura, ``Mixture density models based on mel-cepstral representation of Gaussian Process,'' IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, E86-A, 8, pp.1971-1978 (2003)
- Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, ``A context clustering technique for average voice models,'' IEICE Trans. Information and Systems, E86-D, 3, pp.534-542 (2003)
- Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi, ``Modeling of various speaking styles and emotions for HMM-based speech synthesis,'' Proc. the 8th Eurpean Conference on Speech Communication and Technology, EUROSPEECH '03, vol.III, pp.2461-2464, Geneva (2003.9)
- Dhany Arifianto, Takao Kobayashi, ``Performance evaluation of IFAS-based fundamental frequency estimator in noisy environments,'' Proc. the 8th Eurpean Conference on Speech Communication and Technology, EUROSPEECH '03, vol.IV, pp.2877-2880, Geneva (2003.9)
- Junichi Yamagishi, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, ``A training method for average voice model based on shared decision tree context clustering and speaker adaptive training,'' Proc. the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2003, vol.I, pp.716-719, Hong Kong (2003.4)
- Takahiro Hoshiya, Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura, Heiga Zen, ``Improving the performance of HMM-based very low bitrate speech coding,'' Proc. the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2003, vol.I, pp.800-803, Hong Kong (2003.4)
- Dhany Arifianto, Takao Kobayashi, ``IFAS-based Voiced/Unvoiced Classification of Speech Signal,'' Proc. the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2003, vol.I, pp.812-815, Hong Kong (2003.4)
Top
2002
- Keiichi Tokuda, Takashi Mausko, Noboru Miyazaki, Takao Kobayashi,
``Multi-space probability distribution HMM (Invited paper),''
IEICE Trans. Information and Systems, E85-D, 3, pp.455-464 (2002)
- Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi,
``A context clustering technique for average voice model in HMM-based speech synthesis,''
Proc. 7th International Conference on Spoken Language Processing, ICSLP 2002, vol.1, pp.133-136, Denver, USA (2002.09)
- Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``Eigenvoices for HMM-based speech synthesis,''
Proc. 7th International Conference on Spoken Language Processing, ICSLP 2002, vol.2, pp.1269-1272, Denver, USA (2002.09)
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama,
``Open-source software for developing anthropomorphic spoken dialog agents,''
Proc. International Workshop on Lifelike Animated Agents, Tools, Affective Functions, and Applications, pp.64-69, Tokyo, Japan (2002.8)
- Tomohiro Tanaka, Takao Kobayashi, Dhany Arifianto, Takashi Masuko,
``Fundamental frequency estimation based on instantaneous frequency amplitude spectrum,''
Proc. the 2002 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2002, vol.I, pp.329-332, Orlando, USA (2002.5)
2001
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi,
``Text-to-speech synthesis with arbitrary speaker's voice from average voice,''
Proc. 7th European Conference on Speech Communication and Technology, EUROSPEECH 2001, vol.1, pp.345-348, Aalborg, Denmark (2001.9)
- Takayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda,
``A robust speaker verification system against imposture using an HMM-based speech synthesis,''
Proc. 7th European Conference on Speech Communication and Technology, EUROSPEECH 2001, vol.2, pp.759-762, Aalborg, Denmark (2001.9)
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``Mixed excitation for HMM-based speech synthesis,''
Proc. 7th European Conference on Speech Communication and Technology, EUROSPEECH 2001, vol.3, pp.2263-2266, Aalborg, Denmark (2001.9)
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi,
``Adaptation of pitch and spectrum for HMM-Based speech synthesis using MLLR,''
Proc. the 2001 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2001, vol.II, pp.805-808, Salt Lake City, USA (2001.5)
- Chiyomi Miyajima, Yousuke Hattori, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``Speaker identification using Gaussian mixture models based on multi-space probability distribution,''
Proc. the 2001 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2001, vol.I, pp.433-436, Salt Lake City, USA (2001.5)
2000
- Tohru Takahashi, Keiichi Tokuda, Takao Kobayashi, Tadashi Kitamura,
``Vector Quantization of Mel-cepstral Coefficients Based on a Statistical Measure,''
Proc. 2000 IEEE International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2000, pp.692-695, Honolulu, USA (2000.11)
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi,
``Imposture Using Synthetic Speech against Speaker Verification Based on Spectrum and Pitch,''
Proc. 6th International Conference on Spoken Language Processing, ICSLP 2000, pp.II-302-305, Beijing, China (2000.10)
(PDF 122KB)
- Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``HMM-Based Text-to-Audio-Visual Speech Synthesis,''
Proc. 6th International Conference on Spoken Language Processing, ICSLP 2000, pp.III-25-28, Beijing, China (2000.10)
- Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Takao Kobayashi,
``Normalized Training for HMM-Based Visual Speech Recognition,''
Proc. IEEE International Conference on Image Processing, ICIP 2000, WA07.07, Vancouver, Canada (2000.9)
- Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``Speech Parameter Generation Algorithms for HMM-Based Speech Synthesis,''
Proc. 2000 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, pp.III-1315-1318, Istanbul, Turkey (2000.6)
1999
- Masatsune Tamura, Shigekazu Kondo, Takashi Masuko, Takao Kobayashi,
``Text-to-Audio-Visual Speech Synthesis Based on Parameter Generation from HMM,''
Proc. 6th European Conference on Speech Communication and Technology, EUROSPEECH' 99, Budapest, Hungary, pp.959-962 (1999.9)
(PDF 116KB)
- Takashi Masuko, Takafumi Hitotsumatsu, Keiichi Tokuda, Takao Kobayashi,
``On the Security of HMM-Based Speaker Verification Systems against Imposture Using Synthetic Speech,''
Proc. 6th European Conference on Speech Communication and Technology, EUROSPEECH' 99, Budapest, Hungary, pp.1223-1226 (1999.9)
(PDF 60KB)
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
``Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis,''
Proc. 6th European Conference on Speech Communication and Technology, EUROSPEECH' 99, Budapest, Hungary, pp.2347-2350 (1999.9)
- Keiichi Tokuda, Takashi Masuko, Noboru Miyazaki, Takao Kobayashi,
``Hidden Markov Models Based on Multi-Space Probability Distribution for Pitch Pattern Modeling,''
Proc. 1999 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '99, pp.229-232, Phoenix, USA (1999.3)
1998
- Masatsune Tamura, Takashi Masuko, Takao Kobayashi, and Keiichi Tokuda,
``Visual Speech Ssynthesis Based on Parameter Generation from HMM: Speech-Driven and Text-and-Speech-Driven Approaches,''
Proc. International Conference on Auditory-Visual Speech Processing, AVSP '98, pp.219-224, Terrigal, Australia (1998.12)
- Takayoshi Yoshimura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, and Tadashi Kitamura,
``Duration Modeling for HMM-based Speech Synthesis,''
Proc. 5th International Conference on Spoken Language Processing, ICSLP '98, pp.29-32, Sydney, Australia (1998.12)
- Takashi Masuko, Keiichi Tokuda and Takao Kobayashi,
``A Very Low Bit Rate Speech Coder Using HMM with Speaker Adaptation,''
Proc. 5th International Conference on Spoken Language Processing, ICSLP '98, pp.507-510, Sydney, Australia (1998.12)
(PDF 107KB)
- Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, and Takao Kobayashi,
``A 16kbit/s Wideband CELP Coder Using Mel-Generalized Cepstral Analysis and Its Subjective Evaluation,''
Proc. 5th International Conference on Spoken Language Processing, ICSLP '98, pp.2583-2586, Sydney, Australia (1998.12)
(PDF 79KB)
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, and Takao Kobayashi,
``Speaker Adaptation for HMM-based Speech Synthesis System Using MLLR,''
Proc. 3rd ESCA/ COSCOSDA International Workshop on Speech Synthesis, pp.273-276, Blue Mountains, Australia (1998.11)
(PDF 233KB)
- Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, and Takao Kobayashi,
``A Wideband CELP Speech Coder at 16 kbit/s Based on Mel-Generalized Cepstral Analysis,''
Proc. 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98, pp.161-164, Seattle, USA (1998.5)
- Keiichi Tokuda, Takashi Masuko, Jun Hiroi, Takao Kobayashi, and Tadashi Kitamura,
``A Very Low Bit Rate Speech Coder Using HMM-Based Speech Recognition/Synthesis,''
Proc. 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98, pp.609-612, Seattle, USA (1998.5)
- Takashi Masuko, Takao Kobayashi, Masatsune Tamura, Jun Masubuchi, and Keiichi Tokuda,
``Text-to-Visual Speech Synthesis Based on Parameter Generation from HMM,''
Proc. 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98, pp.3745-3748, Seattle, USA (1998.5)
1997
- Takao Kobayashi, Takashi Masuko and Keiichi Tokuda,
``HMM Compensation for Noisy Speech Recognition Based on Cepstral Parameter Generation,''
Proc. 5th European Conference on Speech Communication and Technology, EUROSPEECH' 97, pp.1583-1586, Rhodes, Greece (1997.9)
(PDF 340KB)
- Takayoshi Yoshimura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, and Tadashi Kitamura,
``Speaker Iinterpolation in HMM-Based Speech Synthesis System,''
Proc. 5th European Conference on Speech Communication and Technology, EUROSPEECH' 97, Rhodes, Greece, pp.2523-2526 (1997.9)
- Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko and Takao Kobayashi,
``Spectral Quantization Using Statistics of Static and Dynamic Features,''
1997 IEEE Workshop on Speech Coding for Telecommunications Proc., Pocono Manor, USA, pp.19-20 (1997.9)
- Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko and Takao Kobayashi,
``Vector Quantization of Speech Spectral Parameters Using Statistics of Dynamic Features,''
Proc. International Conference on Speech Processing, ICSP '97, Seoul, Korea, pp.247-252 (1997.8)
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Efficient Encoding of Mel-Generalized Cepstrum for CELP Coders,''
Proc. 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP' 97, Munich, Germany, pp.1355-1358 (1997.4)
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Voice Characteristics Conversion for HMM-based Speech Synthesis System,''
Proc. 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP' 97, Munich, Germany, pp.1611-1614 (1997.4)
- Toshihiko Abe, Takao Kobayashi and Satoshi Imai,
``The IF Spectrogram: A New Spectral Representation,''
Proc. International Symposium on Simulation, Visualization and Auralization for Acoustics Research and Education, ASVA '97, Tokyo, Japan, pp.423-430 (1997.4)
(PDF 600KB)
1996
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Spectral Representation of Speech Using Mel-Generalized Cepstral Coefficients,''
J. Acoust. Soc. America, 100, 4, Pt.2, pp.2756-2756 (1196) / Proc. ASA and ASJ 3rd Joint Meeting, Honolulu, USA, pp.963-968 (1996.12)
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, and Satoshi Imai,
``HMM-Based Speech Synthesis with Various Voice Characteristics,''
J. Acoust. Soc. America, 100, 4, Pt.2, pp.2760-2760 (1966) / Proc. ASA and ASJ 3rd Joint Meeting, Honolulu, USA, pp.1043-1046 (1996.12)
- Keiichi Tokuda, Takao Kobayashi, Takashi Masuko, and Satoshi Imai,
``Quantization of Vector Sequences Using Statistics of Neighboring Input Vectors,''
J. Acoust. Soc. America, 100, 4, Pt.2, pp.2762-2763 (1966) / Proc. ASA and ASJ 3rd Joint Meeting, Honolulu, USA, pp.1067-1072 (1996.12)
- Takao Kobayashi, Takashi Masuko, Keiichi Tokuda, and Satoshi Imai,
``Noisy Speech Recognition Using HMM-Based Cepstral Parameter Generation and Compensation,''
J. Acoust. Soc. America, 100, 4, Pt.2, pp.2790-2790 (1966) / Proc. ASA and ASJ 3rd Joint Meeting, Honolulu, USA, pp.1117-1122 (1996.12)
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``CELP Coding System Based on Mel-Generalized Cepstral Analysis,''
Proc. 4th International Conference on Spoken Language Processing, ICSLP '96, Philadelphia, USA, pp.318-321 (1996.10)
(PDF 634KB)
- Toshihiko Abe, Takao Kobayashi, and Satoshi Imai,
``Robust Pitch Estimation with Harmonic Enhancement in Noisy Environment Based on Instantaneous Frequency,''
Proc. 4th International Conference on Spoken Language Processsing, ICSLP '96, Philadelphia, USA, pp.1277-1280 (1996.10)
(PDF 585KB)
- Takashi Masuko, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Speech Synthesis Using HMMs with Dynamic Features,''
Proc. 1996 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '96, Atlanta, USA, 1, pp.389-392 (1996.5)
-1995
- Keiichi Tokuda, Takao Kobayashi, Satoshi Imai,
``Adaptive Cepstral Analysis of Speech,''
IEEE Trans. Speech and Audio Processing, 3, 6, pp.481-489 (1995.12)
- Toshihiko Abe, Takao Kobayashi, Satoshi Imai,
``Harmonic estimation based on instantaneous frequency and its application to pitch determination of ppeech,''
IEICE Trans. Information and Systems, E78-D, 9, pp.1188-1194 (1995.09)
- Keiichi Tokuda, Takashi Masuko, Tetsuya Yamada, Takao Kobayashi and Satoshi Imai,
``An Algorithm for Speech Parameter Generation from Continuous Mixture HMMs with Dynamic Features,''
Proc. 4th European Conference on Speech Communication and Technology, EUROSPEECH '95, Madrid, Spain, pp.757-760 (1995.9)
(PDF 300KB)
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``CELP Coding Based on Mel-cepstral Analysis,''
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '95, Detroit, USA, 1, pp.33-36 (1995.5)
- Keiichi Tokuda, Takao Kobayashi and Satoshi Imai,
``Speech Parameter Generation from HMM Using Dynamic Features,''
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '95, Detroit, USA, 1, pp.660-663 (1995.5)
- Toshihiko Abe, Takao Kobayashi and Satoshi Imai,
``Harmonics Tracking and Pitch Extraction Based on Instantaneous Frequency,''
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '95, Detroit, USA, 1, pp.756-759 (1995.5)
- Keiichi Tokuda, Takao Kobayashi, Takashi Masuko, Satoshi Imai,
``Mel-Generalized Cepstral Analysis --- A Unified Approach to Speech Spectral Estimation,''
Proc. International Conference on Spoken Language Processing, ICSLP-94, pp.1043-1046, Yokohama, Japan (1994.09)
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai,
``Speech Coding Based on Adaptive Mel-Cepstral Analysis for Noisy Channels,''
Proc. International Conference on Spoken Language Processing, ICSLP-94, pp.2087-2090, Yokohama (1994.09)
- Keiichi Tokuda, H. Matsumura, Takao Kobayashi, Satoshi Imai,
``Speech Coding Based on Adaptive Mel-Cepstral Analysis,''
Proc. IEEE International Conference on Acoustics, Speech & Signal Processing, ICASSP-94, I, pp.197-200, Adelaide, Australia (1994.04)
- Toshio Kanno, Takao Kobayashi, S. Imai,
``Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement,''
IEICE Trans. Fundamentals, E76-A, 8, pp.1300-1307 (1993.08)
- Takao Kobayashi, Toshio Kanno, Satoshi Imai,
``Generalized Cepstral Modeling of Speech Degraded by Additive Noise,''
Proc. 3rd European Conference on Speech Communication and Technology, EUROSPEECH-93, pp.609-612, Berlin, Germany (1993.09)
- Takao Kobayashi, Kazuyoshi Fukushi, Keiichi Tokuda, Satoshi Imai,
``2-D LMA Filters --- Design of Stable Two-Dimensional Digital Filters with Arbitrary Magnitude Function,''
IEICE Trans. Fundamentals, E75-A, 2, pp.240-246 (1992.02)
- Toshiaki Fukada, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai,
``An Adaptive Algorithm for Mel-Cepstral Analysis of Speech,''
Proc. IEEE International Conference Acoustics, Speech, and Signal Processing, ICASSP-92, I, pp.137-140, San Francisco, USA (1992.03)
- Takao Kobayashi, Kazuyoshi Fukushi, Keiichi Tokuda, Satoshi Imai,
``Design of Stable Two-Dimensional IIR Digital Filters with Arbitrary Magnitude Function,''
Proc. IEEE International Conference Acoustics, Speech, Signal Processing, ICASSP-92, V, pp.93-96, San Francisco, USA (1992.03)
- Takao Kobayashi, Satoshi Imai,
``Design of IIR Digital Filters with Arbitrary Log Magnitude Function by WLS Techniques,''
IEEE Trans. Acoustics, Speech and Signal Processing, 38, 2, pp.247-252 (1990.02)
- Keiichi Tokuda, Takao Kobayashi, Satoshi Imai,
``Generalized Cepstral Analysis of Speech --- Unified Approach to LPC and Cepstral Method,''
Proc. 1990 International Conference on Spoken Language Processing, ICSLP-90, pp.37-40, Kobe, Japan (1990.09)
- Keiichi Tokuda, Takao Kobayashi, Shoji Shiomoto, Satoshi Imai,
``Adaptive Filtering Based on Cepstral Representation,''
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-90, pp.377-380, Albuquerque, USA (1990.03)
- Takao Kobayashi, Satoshi Imai,
``Chebyshev Approximation for IIR Digital Filters Using an Iterative WLS Technique,''
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-90, pp.1321-1324, Albuquerque, USA (1990.03)
- Takao Kobayashi, Satoshi Imai,
``Design of ARMA Digital Filters with Arbitrary Log Magnitude Function by WLS Techniques,''
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-88, pp.1447-1450, NewYork, USA (1988.04)
- Takao Kobayashi, Satoshi Imai,
Takao Kobayashi and Satoshi Imai,
``Spectral Analysis Using Generalized Cepstrum,''
IEEE Trans. Acoustics, Speech and Signal Processing, ASSP-32, 5, pp.1087-1089 (1984.10)
Recent Topics