default search action
Hideki Kawahara
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c105]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei Yatabe:
Proposal of Protocols for Speech Materials Acquisition and Presentation Assisted By Tools Based on Structured Test Signals. O-COCOSDA 2024: 1-6 - [i16]Hideki Kawahara, Masanori Morise:
Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education. CoRR abs/2404.13418 (2024) - [i15]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei Yatabe:
Proposal of protocols for speech materials acquisition and presentation assisted by tools based on structured test signals. CoRR abs/2409.20516 (2024) - 2023
- [j12]Toshie Matsui, Toshio Irino, Ryo Uemura, Kodai Yamamoto, Hideki Kawahara, Roy D. Patterson:
Corrigendum to Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift, Speech Communication 136 (2022) 23-41. Speech Commun. 147: 116-117 (2023) - [c104]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Tatsuya Kitamura:
Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials. APSIPA ASC 2023: 173-180 - [c103]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Tatsuya Kitamura:
Acoustic measurement framework for audio systems based on structured periodic test signals. GCCE 2023: 227-228 - [i14]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Tatsuya Kitamura:
Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials. CoRR abs/2309.02767 (2023) - 2022
- [j11]Toshie Matsui, Toshio Irino, Ryo Uemura, Kodai Yamamoto, Hideki Kawahara, Roy D. Patterson:
Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift. Speech Commun. 136: 23-41 (2022) - [c102]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise:
An objective test tool for pitch extractors' response attributes. INTERSPEECH 2022: 659-663 - [c101]Tatsuya Kitamura, Naoki Kunimoto, Hideki Kawahara, Shigeaki Amano:
Perceptual Evaluation of Penetrating Voices through a Semantic Differential Method. INTERSPEECH 2022: 3063-3067 - [i13]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise:
An objective test tool for pitch extractors' response attributes. CoRR abs/2204.00902 (2022) - [i12]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise:
Measuring pitch extractors' response to frequency-modulated multi-component signals. CoRR abs/2204.00911 (2022) - 2021
- [c100]Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino:
Implementation of Interactive Tools for Investigating Fundamental Frequency Response of Voiced Sounds to Auditory Stimulation. APSIPA ASC 2021: 897-903 - [c99]Hideki Kawahara, Kohei Yatabe:
Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation. ICASSP 2021: 306-310 - [c98]Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino:
Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation. Interspeech 2021: 3206-3210 - [c97]Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Masanori Morise, Hideki Banno, Toshio Irino:
Interactive and Real-Time Acoustic Measurement Tools for Speech Data Acquisition and Presentation: Application of an Extended Member of Time Stretched Pulses. Interspeech 2021: 4853-4854 - [i11]Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino:
Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation. CoRR abs/2104.01444 (2021) - [i10]Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino:
Implementation of interactive tools for investigating fundamental frequency response of voiced sounds to auditory stimulation. CoRR abs/2109.11594 (2021) - [i9]Hideki Kawahara, Kohei Yatabe:
Safeguarding test signals for acoustic measurement using arbitrary sounds. CoRR abs/2112.11373 (2021) - 2020
- [c96]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Masanori Morise, Hideki Banno:
Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise. APSIPA 2020: 174-183 - [i8]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Masanori Morise, Hideki Banno:
Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise. CoRR abs/2008.02439 (2020) - [i7]Hideki Kawahara, Kohei Yatabe:
Cascaded all-pass filters with randomized center frequencies and phase polarity for acoustic and speech measurement and data augmentation. CoRR abs/2010.13185 (2020)
2010 – 2019
- 2019
- [c95]Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, Kaori Hagiwara:
Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope. APSIPA 2019: 907-910 - [c94]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, Toshio Irino:
Frequency domain variant of Velvet noise and its application to acoustic measurements. APSIPA 2019: 1523-1532 - [c93]Hiroko Terasawa, Kenta Wakasa, Hideki Kawahara, Ken-Ichi Sakakibara:
Investigating the Physiological and Acoustic Contrasts Between Choral and Operatic Singing. INTERSPEECH 2019: 2025-2029 - [i6]Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, Kaori Hagiwara:
Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope. CoRR abs/1909.03650 (2019) - [i5]Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, Toshio Irino:
Frequency domain variant of Velvet noise and its application to acoustic measurements. CoRR abs/1909.04301 (2019) - 2018
- [c92]Hideki Kawahara, Masanori Morise, Kanru Hua:
Revisiting spectral envelope recovery from speech sounds generated by periodic excitation. APSIPA 2018: 1674-1683 - [c91]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino:
Frequency Domain Variants of Velvet Noise and Their Application to Speech Processing and Synthesis. INTERSPEECH 2018: 2027-2031 - [i4]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino:
Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices. CoRR abs/1806.06812 (2018) - 2017
- [c90]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda:
Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives. APSIPA 2017: 1556-1564 - [c89]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda:
A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation. INTERSPEECH 2017: 424-428 - [c88]Toshie Matsui, Toshio Irino, Kodai Yamamoto, Hideki Kawahara, Roy D. Patterson:
The Effect of Spectral Tilt on Size Discrimination of Voiced Speech Sounds. INTERSPEECH 2017: 601-605 - [c87]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino:
A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis. INTERSPEECH 2017: 1358-1362 - [i3]Hideki Kawahara, Ken-Ichi Sakakibara, Hideki Banno, Masanori Morise, Tomoki Toda, Toshio Irino:
A new cosine series antialiasing function and its application to aliasing-free glottal source models for speech and singing synthesis. CoRR abs/1702.06724 (2017) - [i2]Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda:
A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation. CoRR abs/1706.02964 (2017) - 2016
- [c86]Hideki Kawahara:
SparkNG: Interactive MATLAB Tools for Introduction to Speech Production, Perception and Processing Fundamentals and Application of the Aliasing-Free L-F Model Component. INTERSPEECH 2016: 1180-1181 - [c85]Masanori Morise, Hideki Kawahara:
TUSK: A Framework for Overviewing the Performance of F0 Estimators. INTERSPEECH 2016: 1790-1794 - [c84]Hideki Kawahara:
Aliasing-free L-F model and its application to an interactive MATLAB tool and test signal generation for speech analysis procedures. SSW 2016: 123 - [c83]Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen:
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis. SSW 2016: 221-228 - [i1]Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen:
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis. CoRR abs/1605.07809 (2016) - 2015
- [c82]Hideki Kawahara, Ken-Ichi Sakakibara, Hideki Banno, Masanori Morise, Tomoki Toda, Toshio Irino:
Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation. APSIPA 2015: 520-529 - [c81]Kodai Yamamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara, Roy D. Patterson:
How the slope of the speech spectrum affects the perception of speaker size. INTERSPEECH 2015: 1556-1560 - 2014
- [c80]Hideki Kawahara, Masanori Morise, Tomoki Toda, Hideki Banno, Ryuichi Nisimura, Toshio Irino:
Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals. APSIPA 2014: 1-10 - [c79]Misaki Nagae, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara, Roy D. Patterson:
Hearing impairment simulator based on compressive gammachirp filter. APSIPA 2014: 1-4 - [c78]Ryuichi Nisimura, Kazuki Hashimoto, Hideki Kawahara, Toshio Irino:
Proposal for an Interactive 3D Sound Playback Interface Controlled by User behavior. HCI (26) 2014: 446-450 - [c77]Minori Matsuyama, Ryuichi Nisimura, Hideki Kawahara, Junnosuke Yamada, Toshio Irino:
Development of a Mobile Application for Crowdsourcing the Data Collection of Environmental Sounds. HCI (12) 2014: 514-524 - [c76]Hideki Kawahara, Tatsuya Kitamura, Hironori Takemoto, Ryuichi Nisimura, Toshio Irino:
Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information. INTERSPEECH 2014: 870-874 - [c75]Hideki Kawahara, Masanori Morise, Tomoki Toda, Hideki Banno, Ryuichi Nisimura, Toshio Irino:
Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation. INTERSPEECH 2014: 2243-2247 - 2013
- [c74]Toshio Irino, Erika Okamoto, Ryuichi Nisimura, Hideki Kawahara:
Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank. APSIPA 2013: 1-4 - [c73]Hideki Kawahara, Masanori Morise, Hideki Banno, Verena G. Skuk:
Temporally variable multi-aspect N-way morphing based on interference-free speech representations. APSIPA 2013: 1-10 - [c72]Hideki Kawahara, Masanori Morise, Ryuichi Nisimura, Toshio Irino:
Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution. ICASSP 2013: 6797-6801 - [c71]Hideki Kawahara, Masanori Morise, Tomoki Toda, Ryuichi Nisimura, Toshio Irino:
Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds. INTERSPEECH 2013: 34-38 - [c70]Masanori Morise, Hideki Kawahara, Kenji Ozawa:
Periodicity extraction for voiced sounds with multiple periodicity. INTERSPEECH 2013: 1921-1925 - [c69]Yuri Nishigaki, Ken-Ichi Sakakibara, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara:
Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study. INTERSPEECH 2013: 2905-2909 - 2012
- [j10]Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy D. Patterson:
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination. Speech Commun. 54(9): 998-1013 (2012) - [c68]Hideki Kawahara, Masanori Morise, Ryuichi Nisimura, Toshio Irino:
An interference-free representation of group delay for periodic signals. APSIPA 2012: 1-4 - [c67]Taiki Nishi, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara:
Modulation transfer function design for a flexible cross synthesis VOCODER based on F0 adaptive spectral envelope recovery. APSIPA 2012: 1-7 - [c66]Ryuichi Nisimura, Shoko Miyamori, Erika Okamoto, Hideki Kawahara, Toshio Irino:
Detecting child speaker based on auditory feature vectors for VTL estimation. APSIPA 2012: 1-5 - [c65]Hideki Kawahara, Masanori Morise:
Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice. ICASSP 2012: 5389-5392 - [c64]Josh H. McDermott, Daniel P. W. Ellis, Hideki Kawahara:
Inharmonic speech: a tool for the study of speech perception and separation. SAPA@INTERSPEECH 2012: 114-117 - [c63]Zhengqi Wen, Hideki Kawahara, Jianhua Tao:
Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis. INTERSPEECH 2012: 374-377 - [c62]Hideki Kawahara, Masanori Morise, Ryuichi Nisimura, Toshio Irino:
Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation. INTERSPEECH 2012: 386-389 - 2011
- [c61]Ryuichi Nisimura, Shoko Miyamori, Lisa Kurihara, Hideki Kawahara, Toshio Irino:
Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System. HCI (4) 2011: 607-616 - [c60]Hideki Kawahara, Toshio Irino, Masanori Morise:
An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction. ICASSP 2011: 5420-5423 - [c59]Erika Okamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara:
Auditory Filterbank Improves Voice Morphing. INTERSPEECH 2011: 2517-2520 - 2010
- [c58]Hideki Kawahara, Ryuichi Nisimura, Toshio Irino, Masanori Morise, Toru Takahashi, Hideki Banno:
High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown. ICASSP 2010: 4818-4821 - [c57]Ayanori Arakawa, Yoshinori Uchimura, Hideki Banno, Fumitada Itakura, Hideki Kawahara:
High quality voice manipulation method based on the vocal tract area function obtained from sub-band LSP of straight spectrum. ICASSP 2010: 4834-4837 - [c56]Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura, Toshio Irino:
Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems. INTERSPEECH 2010: 38-41 - [c55]Hideki Kawahara:
Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing. SSW 2010: 32-37
2000 – 2009
- 2009
- [c54]Ryuichi Nisimura, Jumpei Miyake, Hideki Kawahara, Toshio Irino:
Development of Speech Input Method for Interactive VoiceWeb Systems. HCI (2) 2009: 710-719 - [c53]Hideki Kawahara, Ryuichi Nisimura, Toshio Irino, Masanori Morise, Toru Takahashi, Hideki Banno:
Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown. ICASSP 2009: 3905-3908 - [c52]Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura, Toshio Irino:
Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion. INTERSPEECH 2009: 2647-2650 - [c51]Masanori Morise, Masato Onishi, Hideki Kawahara, Haruhiro Katayose:
v.morish'09: A Morphing-Based Singing Design Interface for Vocal Melodies. ICEC 2009: 185-190 - [c50]Hideki Kawahara:
Speech morphing based on biologically relevant signal representations. MAVEBA 2009: 83-86 - [c49]Hanae Itagaki, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara:
A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices. MAVEBA 2009: 115-118 - 2008
- [c48]Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nisimura, Toshio Irino, Hideki Banno:
Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation. ICASSP 2008: 3933-3936 - [c47]Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nisimura, Toshio Irino:
Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds. INTERSPEECH 2008: 650-653 - [c46]Yoshinori Uchimura, Hideki Banno, Fumitada Itakura, Hideki Kawahara:
Study on manipulation method of voice quality based on the vocal tract area function. INTERSPEECH 2008: 1084-1087 - [c45]Masato Onishi, Toru Takahashi, Toshio Irino, Hideki Kawahara:
Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing. SLT 2008: 25-28 - [c44]Ryuichi Nisimura, Jumpei Miyake, Hideki Kawahara, Toshio Irino:
Speech-to-text input method for web system using JavaScript. SLT 2008: 209-212 - 2007
- [c43]Hideki Kawahara, Masanori Morise, Toru Takahashi, Toshio Irino, Hideki Banno, Osamu Fujimura:
Group delay for acoustic event representation and its application for speech aperiodicity analysis. EUSIPCO 2007: 2219-2223 - [c42]Toshio Irino, Yoshie Aoki, Yoshie Hayashi, Hideki Kawahara, Roy D. Patterson:
Discrimination and recognition of scaled word sounds. INTERSPEECH 2007: 378-381 - 2006
- [j9]Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements. IEEE Trans. Speech Audio Process. 14(6): 2212-2221 (2006) - [c41]Toru Takahashi, Hideki Banno, Toshio Irino, Hideki Kawahara:
Speech style conversion based on the statistics of vowel spectrograms and nonlinear frequency mapping. EUSIPCO 2006: 1-5 - [c40]Masanori Morise, Toshio Irino, Hideki Kawahara:
Logarithmic temporal processing applied to accurate empirical transfer function measurements in vocal sound propagation. EUSIPCO 2006: 1-5 - [c39]Ryuichi Nisimura, Souji Omae, Hideki Kawahara, Toshio Irino:
Analyzing dialogue data for real-world emotional speech classification. INTERSPEECH 2006 - [c38]Toru Takahashi, Masashi Nishi, Toshio Irino, Hideki Kawahara:
Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples. INTERSPEECH 2006 - 2005
- [c37]Hideki Kawahara, Alain de Cheveigné, Hideki Banno, Toru Takahashi, Toshio Irino:
Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT. INTERSPEECH 2005: 537-540 - [c36]Toshio Irino, Satoru Satou, Shunsuke Nomura, Hideki Banno, Hideki Kawahara:
Speech intelligibility derived from time-frequency and source smearing. INTERSPEECH 2005: 1737-1740 - [c35]Toru Takahashi, Takeshi Fujii, Masashi Nishi, Hideki Banno, Toshio Irino, Hideki Kawahara:
Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database. INTERSPEECH 2005: 1853-1856 - [p2]Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Speech Segregation Using an Event-synchronous Auditory Image and STRAIGHT. Speech Separation by Humans and Machines 2005: 155-165 - [p1]Hideki Kawahara, Toshio Irino:
Underlying Principles of a High-quality Speech Manipulation System STRAIGHT and Its Application to Speech Segregation. Speech Separation by Humans and Machines 2005: 167-180 - 2004
- [c34]Masanori Morise, Hideki Kawahara:
Loudspeaker equalization based on multi-location observation with reliable time-frequency region selection and its evaluation using sound propagation measurement. EUSIPCO 2004: 1995-1998 - [c33]Hideki Kawahara, Hideki Banno, Toshio Irino, Parham Zolfaghari:
Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT. ICASSP (1) 2004: 13-16 - [c32]Hideki Kawahara, Yumi Hirachi, Masanori Morise, Hideki Banno:
Procedure "senza vibrato": a key component for morphing singing. INTERSPEECH 2004: 89-92 - [c31]Hideki Kawahara, Hideki Banno, Toshio Irino, Jiang Jin:
Intelligibility of degraded speech from smeared STRAIGHT spectrum. INTERSPEECH 2004: 473-476 - [c30]Nishiura Denda, Takanobu Nishiura, Hideki Kawahara, Toshio Irino:
A design of audio-visual talker tracking system based on CSP analysis and frame difference in real noisy environments. MMSP 2004: 63-66 - [c29]Hideki Kawahara, Hideki Banno, Masanori Morise:
Acappella synthesis demonstrations using RWC music database. NIME 2004: 130-131 - 2003
- [c28]Hideki Kawahara, Hisami Matsui:
Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation. ICASSP (1) 2003: 256-259 - [c27]Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Speech segregation using event synchronous auditory vocoder. ICASSP (5) 2003: 525-528 - [c26]Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Speech segregation based on fundamental event information using an auditory vocoder. INTERSPEECH 2003: 553-556 - [c25]Hisami Matsui, Hideki Kawahara:
Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system. INTERSPEECH 2003: 2113-2116 - [c24]Yuki Denda, Takanobu Nishiura, Hideki Kawahara:
Speech enhancement with microphone array and fourier / wavelet spectral subtraction in real noisy environments. INTERSPEECH 2003: 2153-2156 - [c23]Parham Zolfaghari, Tomohiro Nakatani, Toshio Irino, Hideki Kawahara, Fumitada Itakura:
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis. INTERSPEECH 2003: 2441-2444 - [c22]Hiroaki Kato, Masumi Nukinay, Hideki Kawahara, Reiko Akahane-Yamada:
Influence of recording equipment on the identification of second language phoneme contrasts. INTERSPEECH 2003: 3157-3160 - 2002
- [c21]Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Auditory VOCODER: Speech resynthesis from an auditory Mellin representation. ICASSP 2002: 1921-1924 - [c20]Hideki Kawahara, Parham Zolfaghari, Alain de Cheveigné:
On F0 trajectory optimization for very high-quality speech manipulation. INTERSPEECH 2002: 2397-2400 - 2001
- [c19]Alain de Cheveigné, Hideki Kawahara:
Comparative evaluation of F0 estimation algorithms. INTERSPEECH 2001: 2451-2454 - [c18]Hideki Kawahara, Parham Zolfaghari:
Systematic F0 glitches around nasal-vowel transitions. INTERSPEECH 2001: 2459-2462 - [c17]Hideki Kawahara, Jo Estill, Osamu Fujimura:
Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT. MAVEBA 2001: 59-64 - 2000
- [c16]Parham Zolfaghari, Yoshinori Atake, Kiyohiro Shikano, Hideki Kawahara:
Investigation of analysis and synthesis parameters of straight by subjective evaluation. INTERSPEECH 2000: 498-501 - [c15]Hideki Kawahara, Yoshinori Atake, Parham Zolfaghari:
Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay. INTERSPEECH 2000: 664-667 - [c14]Parham Zolfaghari, Hideki Kawahara:
A sinusoidal model based on frequency-to-instantaneous frequency mapping. INTERSPEECH 2000: 692-695 - [c13]Yoshinori Atake, Toshio Irino, Hideki Kawahara, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano:
Robust fundamental frequency estimation using instantaneous frequencies of harmonic components. INTERSPEECH 2000: 907-910
1990 – 1999
- 1999
- [j8]Alain de Cheveigné, Hideki Kawahara:
Multiple period estimation and pitch perception model. Speech Commun. 27(3-4): 175-185 (1999) - [j7]Hideki Kawahara, Ikuyo Masuda-Katsuse, Alain de Cheveigné:
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Commun. 27(3-4): 187-207 (1999) - [j6]Ikuyo Masuda-Katsuse, Hideki Kawahara:
Dynamic sound stream formation based on continuity of spectral change. Speech Commun. 27(3-4): 235-259 (1999) - [c12]Haruhiro Katayose, Hideki Kawahara:
Applying STRAIGHT toward Music Systems - Accurate F0 Estimation and Application for Data-driven Synthesis. ICMC 1999 - [c11]Hideki Kawahara, Haruhiro Katayose, Alain de Cheveigné, Roy D. Patterson:
Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity. EUROSPEECH 1999: 2781-2784 - 1998
- [j5]Hiroko Kato, Hideki Kawahara:
An application of the Bayesian time series model and statistical system analysis for F0 control. Speech Commun. 24(4): 325-339 (1998) - [c10]Hideki Banno, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano, Hideki Kawahara:
Efficient representation of short-time phase based on group delay. ICASSP 1998: 861-864 - [c9]Yasuji Sawada, Hideki Kawahara:
Brain Creators: Japanese Initiative to Create Computational Models of Brain Functions. ICONIP 1998: 1193-1184 - [c8]Reiko Akahane-Yamada, Erik McDermott, Takahiro Adachi, Hideki Kawahara, John S. Pruitt:
Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores. ICSLP 1998 - [c7]Hideki Kawahara, Alain de Cheveigné, Roy D. Patterson:
An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite. ICSLP 1998 - 1997
- [c6]Hideki Kawahara:
Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited. ICASSP 1997: 1303-1306 - 1996
- [c5]Hideki Kawahara, Hiroko Kato, J. C. Williams:
Effects of auditory feedback on F0 trajectory generation. ICSLP 1996: 287-290 - [c4]Kiyoaki Aikawa, Hideki Kawahara, Minoru Tsuzaki:
A neural matrix model for active tracking of frequency-modulated tones. ICSLP 1996: 578-581 - 1994
- [c3]Hideki Kawahara:
Effects of natural auditory feedback on fundamental frequency control. ICSLP 1994: 1399-1402 - 1993
- [j4]Toshio Irino, Hideki Kawahara:
Signal reconstruction from modified auditory wavelet transform. IEEE Trans. Signal Process. 41(12): 3549-3554 (1993) - [c2]Kiyoaki Aikawa, Harald Singer, Hideki Kawahara, Yoh'ichi Tohkura:
A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition. ICASSP (2) 1993: 668-671 - 1992
- [c1]Toshio Irino, Hideki Kawahara:
Signal reconstruction from modified wavelet transform-An application to auditory signal processing. ICASSP 1992: 85-88 - 1990
- [j3]Toshio Irino, Hideki Kawahara:
A Method for Designing Neural Networks Using Nonlinear Multivariate Analysis: Application to Speaker-Independent Vowel Recognition. Neural Comput. 2(3): 386-397 (1990) - [j2]Toshio Irino, Hideki Kawahara:
A method for designing neural networks using nonlinear multivariate analysis - application to speaker-independent vowel recognition. Syst. Comput. Jpn. 21(9): 80-88 (1990)
1980 – 1989
- 1988
- [j1]Toshio Irino, Hideki Kawahara:
Vowel-feature extraction from cochlear vibration using neural networks. Neural Networks 1(Supplement-1): 300-301 (1988)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-20 22:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint