Hideki Kawahara

Orcid: 0000-0001-9360-5700

According to our database1, Hideki Kawahara authored at least 121 papers between 1988 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Corrigendum to Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift, Speech Communication 136 (2022) 23-41.
Speech Commun., February, 2023

Acoustic measurement framework for audio systems based on structured periodic test signals.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift.
Speech Commun., 2022

Measuring pitch extractors' response to frequency-modulated multi-component signals.
CoRR, 2022

Perceptual Evaluation of Penetrating Voices through a Semantic Differential Method.
Proceedings of the Interspeech 2022, 2022

An objective test tool for pitch extractors' response attributes.
Proceedings of the Interspeech 2022, 2022

2021
Safeguarding test signals for acoustic measurement using arbitrary sounds.
CoRR, 2021

Interactive and Real-Time Acoustic Measurement Tools for Speech Data Acquisition and Presentation: Application of an Extended Member of Time Stretched Pulses.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Implementation of Interactive Tools for Investigating Fundamental Frequency Response of Voiced Sounds to Auditory Stimulation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Investigating the Physiological and Acoustic Contrasts Between Choral and Operatic Singing.
Proceedings of the Interspeech 2019, 2019

Frequency domain variant of Velvet noise and its application to acoustic measurements.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices.
CoRR, 2018

Frequency Domain Variants of Velvet Noise and Their Application to Speech Processing and Synthesis.
Proceedings of the Interspeech 2018, 2018

Revisiting spectral envelope recovery from speech sounds generated by periodic excitation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation.
CoRR, 2017

The Effect of Spectral Tilt on Size Discrimination of Voiced Speech Sounds.
Proceedings of the Interspeech 2017, 2017

A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis.
Proceedings of the Interspeech 2017, 2017

A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and f<sub>o</sub> Estimation.
Proceedings of the Interspeech 2017, 2017

Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Aliasing-free L-F model and its application to an interactive MATLAB tool and test signal generation for speech analysis procedures.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

TUSK: A Framework for Overviewing the Performance of F0 Estimators.
Proceedings of the Interspeech 2016, 2016

SparkNG: Interactive MATLAB Tools for Introduction to Speech Production, Perception and Processing Fundamentals and Application of the Aliasing-Free L-F Model Component.
Proceedings of the Interspeech 2016, 2016

2015
How the slope of the speech spectrum affects the perception of speaker size.
Proceedings of the INTERSPEECH 2015, 2015

Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation.
Proceedings of the INTERSPEECH 2014, 2014

Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information.
Proceedings of the INTERSPEECH 2014, 2014

Proposal for an Interactive 3D Sound Playback Interface Controlled by User behavior.
Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

Development of a Mobile Application for Crowdsourcing the Data Collection of Environmental Sounds.
Proceedings of the Human Interface and the Management of Information. Information and Knowledge Design and Evaluation, 2014

Hearing impairment simulator based on compressive gammachirp filter.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study.
Proceedings of the INTERSPEECH 2013, 2013

Periodicity extraction for voiced sounds with multiple periodicity.
Proceedings of the INTERSPEECH 2013, 2013

Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds.
Proceedings of the INTERSPEECH 2013, 2013

Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution.
Proceedings of the IEEE International Conference on Acoustics, 2013

Temporally variable multi-aspect N-way morphing based on interference-free speech representations.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination.
Speech Commun., 2012

Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Inharmonic speech: a tool for the study of speech perception and separation.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation.
Proceedings of the INTERSPEECH 2012, 2012

Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Detecting child speaker based on auditory feature vectors for VTL estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Modulation transfer function design for a flexible cross synthesis VOCODER based on F0 adaptive spectral envelope recovery.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An interference-free representation of group delay for periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Auditory Filterbank Improves Voice Morphing.
Proceedings of the INTERSPEECH 2011, 2011

An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction.
Proceedings of the IEEE International Conference on Acoustics, 2011

Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System.
Proceedings of the Human-Computer Interaction. Users and Applications, 2011

2010
Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems.
Proceedings of the INTERSPEECH 2010, 2010

High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2010

High quality voice manipulation method based on the vocal tract area function obtained from sub-band LSP of straight spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Speech morphing based on biologically relevant signal representations.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

v.morish'09: A Morphing-Based Singing Design Interface for Vocal Melodies.
Proceedings of the Entertainment Computing, 2009

Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion.
Proceedings of the INTERSPEECH 2009, 2009

Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2009

Development of Speech Input Method for Interactive VoiceWeb Systems.
Proceedings of the Human-Computer Interaction. Novel Interaction Methods and Techniques, 2009

2008
Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Speech-to-text input method for web system using JavaScript.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Study on manipulation method of voice quality based on the vocal tract area function.
Proceedings of the INTERSPEECH 2008, 2008

Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds.
Proceedings of the INTERSPEECH 2008, 2008

Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Discrimination and recognition of scaled word sounds.
Proceedings of the INTERSPEECH 2007, 2007

Group delay for acoustic event representation and its application for speech aperiodicity analysis.
Proceedings of the 15th European Signal Processing Conference, 2007

2006
Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.
IEEE Trans. Speech Audio Process., 2006

Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples.
Proceedings of the INTERSPEECH 2006, 2006

Analyzing dialogue data for real-world emotional speech classification.
Proceedings of the INTERSPEECH 2006, 2006

Logarithmic temporal processing applied to accurate empirical transfer function measurements in vocal sound propagation.
Proceedings of the 14th European Signal Processing Conference, 2006

Speech style conversion based on the statistics of vowel spectrograms and nonlinear frequency mapping.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database.
Proceedings of the INTERSPEECH 2005, 2005

Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT.
Proceedings of the INTERSPEECH 2005, 2005

Speech intelligibility derived from time-frequency and source smearing.
Proceedings of the INTERSPEECH 2005, 2005

Underlying Principles of a High-quality Speech Manipulation System STRAIGHT and Its Application to Speech Segregation.
Proceedings of the Speech Separation by Humans and Machines, 2005

Speech Segregation Using an Event-synchronous Auditory Image and STRAIGHT.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
Acappella synthesis demonstrations using RWC music database.
Proceedings of the New Interfaces for Musical Expression, 2004

A design of audio-visual talker tracking system based on CSP analysis and frame difference in real noisy environments.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Procedure "senza vibrato": a key component for morphing singing.
Proceedings of the INTERSPEECH 2004, 2004

Intelligibility of degraded speech from smeared STRAIGHT spectrum.
Proceedings of the INTERSPEECH 2004, 2004

Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Loudspeaker equalization based on multi-location observation with reliable time-frequency region selection and its evaluation using sound propagation measurement.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Influence of recording equipment on the identification of second language phoneme contrasts.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech segregation based on fundamental event information using an auditory vocoder.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech enhancement with microphone array and fourier / wavelet spectral subtraction in real noisy environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Speech segregation using event synchronous auditory vocoder.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
On F0 trajectory optimization for very high-quality speech manipulation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Auditory VOCODER: Speech resynthesis from an auditory Mellin representation.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT.
Proceedings of the Second International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2001

Systematic F0 glitches around nasal-vowel transitions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Comparative evaluation of F0 estimation algorithms.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
A sinusoidal model based on frequency-to-instantaneous frequency mapping.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Investigation of analysis and synthesis parameters of straight by subjective evaluation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust fundamental frequency estimation using instantaneous frequencies of harmonic components.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Dynamic sound stream formation based on continuity of spectral change.
Speech Commun., 1999

Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds.
Speech Commun., 1999

Multiple period estimation and pitch perception model.
Speech Commun., 1999

Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Applying STRAIGHT toward Music Systems - Accurate F0 Estimation and Application for Data-driven Synthesis.
Proceedings of the 1999 International Computer Music Conference, 1999

1998
An application of the Bayesian time series model and statistical system analysis for F0 control.
Speech Commun., 1998

An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Brain Creators: Japanese Initiative to Create Computational Models of Brain Functions.
Proceedings of the Fifth International Conference on Neural Information Processing, 1998

Efficient representation of short-time phase based on group delay.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Effects of auditory feedback on F0 trajectory generation.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A neural matrix model for active tracking of frequency-modulated tones.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1994
Effects of natural auditory feedback on fundamental frequency control.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
Signal reconstruction from modified auditory wavelet transform.
IEEE Trans. Signal Process., 1993

A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Signal reconstruction from modified wavelet transform-An application to auditory signal processing.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990
A Method for Designing Neural Networks Using Nonlinear Multivariate Analysis: Application to Speaker-Independent Vowel Recognition.
Neural Comput., 1990

1988
Vowel-feature extraction from cochlear vibration using neural networks.
Neural Networks, 1988


  Loading...