Toshio Irino

Orcid: 0000-0002-7691-4189

According to our database1, Toshio Irino authored at least 96 papers between 1988 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Corrigendum to Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift, Speech Communication 136 (2022) 23-41.
Speech Commun., February, 2023

GESI: Gammachirp Envelope Similarity Index for Predicting Intelligibility of Simulated Hearing Loss Sounds.
CoRR, 2023

Hearing Impairment Simulator Based on Auditory Excitation Pattern Playback: WHIS.
IEEE Access, 2023

Auditory Representation Effective for Estimating Vocal Tract Information.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Modelling speaker-size discrimination with voiced and unvoiced speech sounds based on the effect of spectral lift.
Speech Commun., 2022

WHIS: Hearing impairment simulator based on the gammachirp auditory filterbank.
CoRR, 2022

Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening.
CoRR, 2022

Speech intelligibility of simulated hearing loss sounds and its prediction using the Gammachirp Envelope Similarity Index (GESI).
Proceedings of the Interspeech 2022, 2022

2021
Observational and Accelerometer Analysis of Head Movement Patterns in Psychotherapeutic Dialogue.
Sensors, 2021

Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Interactive and Real-Time Acoustic Measurement Tools for Speech Data Acquisition and Presentation: Application of an Extended Member of Time Stretched Pulses.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Implementation of Interactive Tools for Investigating Fundamental Frequency Response of Voiced Sounds to Auditory Stimulation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech.
Speech Commun., 2020

Speech Clarity Improvement by Vocal Self-Training Using a Hearing Impairment Simulator and its Correlation with an Auditory Modulation Index.
Proceedings of the Interspeech 2020, 2020

Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System.
Proceedings of the Interspeech 2020, 2020

2019
Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System.
Proceedings of the Interspeech 2019, 2019

Frequency domain variant of Velvet noise and its application to acoustic measurements.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices.
CoRR, 2018

Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech.
Proceedings of the Interspeech 2018, 2018

Frequency Domain Variants of Velvet Noise and Their Application to Speech Processing and Synthesis.
Proceedings of the Interspeech 2018, 2018

2017
Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio.
Proceedings of the Interspeech 2017, 2017

The Effect of Spectral Tilt on Size Discrimination of Voiced Speech Sounds.
Proceedings of the Interspeech 2017, 2017

A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis.
Proceedings of the Interspeech 2017, 2017

An Auditory Model of Speaker Size Perception for Voiced Speech Sounds.
Proceedings of the Interspeech 2017, 2017

2016
Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank.
Proceedings of the Interspeech 2016, 2016

2015
How the slope of the speech spectrum affects the perception of speaker size.
Proceedings of the INTERSPEECH 2015, 2015

Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation.
Proceedings of the INTERSPEECH 2014, 2014

Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information.
Proceedings of the INTERSPEECH 2014, 2014

Proposal for an Interactive 3D Sound Playback Interface Controlled by User behavior.
Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

Development of a Mobile Application for Crowdsourcing the Data Collection of Environmental Sounds.
Proceedings of the Human Interface and the Management of Information. Information and Knowledge Design and Evaluation, 2014

Hearing impairment simulator based on compressive gammachirp filter.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study.
Proceedings of the INTERSPEECH 2013, 2013

Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds.
Proceedings of the INTERSPEECH 2013, 2013

Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution.
Proceedings of the IEEE International Conference on Acoustics, 2013

Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination.
Speech Commun., 2012

Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation.
Proceedings of the INTERSPEECH 2012, 2012

Detecting child speaker based on auditory feature vectors for VTL estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Modulation transfer function design for a flexible cross synthesis VOCODER based on F0 adaptive spectral envelope recovery.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An interference-free representation of group delay for periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Auditory Filterbank Improves Voice Morphing.
Proceedings of the INTERSPEECH 2011, 2011

An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction.
Proceedings of the IEEE International Conference on Acoustics, 2011

Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System.
Proceedings of the Human-Computer Interaction. Users and Applications, 2011

Manual and Accelerometer Analysis of Head Nodding Patterns in Goal-oriented Dialogues.
Proceedings of the Human-Computer Interaction. Interaction Techniques and Environments, 2011

2010
Auditory speech processing for scale-shift covariance and its evaluation in automatic speech recognition.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems.
Proceedings of the INTERSPEECH 2010, 2010

High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

Influences of vowel duration on speaker-size estimation and discrimination.
Proceedings of the INTERSPEECH 2009, 2009

Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion.
Proceedings of the INTERSPEECH 2009, 2009

Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2009

Development of Speech Input Method for Interactive VoiceWeb Systems.
Proceedings of the Human-Computer Interaction. Novel Interaction Methods and Techniques, 2009

2008
A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments.
Speech Commun., 2008

Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Speech-to-text input method for web system using JavaScript.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds.
Proceedings of the INTERSPEECH 2008, 2008

Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Discrimination and recognition of scaled word sounds.
Proceedings of the INTERSPEECH 2007, 2007

Group delay for acoustic event representation and its application for speech aperiodicity analysis.
Proceedings of the 15th European Signal Processing Conference, 2007

2006
Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.
IEEE Trans. Speech Audio Process., 2006

A Dynamic Compressive Gammachirp Auditory Filterbank.
IEEE Trans. Speech Audio Process., 2006

Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples.
Proceedings of the INTERSPEECH 2006, 2006

Analyzing dialogue data for real-world emotional speech classification.
Proceedings of the INTERSPEECH 2006, 2006

Dynamic, Compressive Gammachirp Auditory Filterbank for Perceptual Signal Processing.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Logarithmic temporal processing applied to accurate empirical transfer function measurements in vocal sound propagation.
Proceedings of the 14th European Signal Processing Conference, 2006

Speech style conversion based on the statistics of vowel spectrograms and nonlinear frequency mapping.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database.
Proceedings of the INTERSPEECH 2005, 2005

Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT.
Proceedings of the INTERSPEECH 2005, 2005

Speech intelligibility derived from time-frequency and source smearing.
Proceedings of the INTERSPEECH 2005, 2005

Underlying Principles of a High-quality Speech Manipulation System STRAIGHT and Its Application to Speech Segregation.
Proceedings of the Speech Separation by Humans and Machines, 2005

Speech Segregation Using an Event-synchronous Auditory Image and STRAIGHT.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
A design of audio-visual talker tracking system based on CSP analysis and frame difference in real noisy environments.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Intelligibility of degraded speech from smeared STRAIGHT spectrum.
Proceedings of the INTERSPEECH 2004, 2004

Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Dominance spectrum based v/UV classification and f_0 estimation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech segregation based on fundamental event information using an auditory vocoder.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech segregation using event synchronous auditory vocoder.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform.
Speech Commun., 2002

Robust fundamental frequency estimation against background noise and spectral distortion.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of a speech recognition / generation method based on HMM and straight.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Auditory VOCODER: Speech resynthesis from an auditory Mellin representation.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
Robust fundamental frequency estimation using instantaneous frequencies of harmonic components.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Stabilised wavelet mellin transform: an auditory strategy for normalising sound-source size.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Noise suppression using a time-varying, analysis/synthesis gamma chirp filterbank.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
The Gammachirp for Optimal Auditory Filtering.
Proceedings of the Fifth International Conference on Neural Information Processing, 1998

A time-varying, analysis/synthesis auditory filterbank using the gammachirp.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1996
A 'gammachirp' function as an optimal auditory filter with the Mellin transform.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994
A theory of asymmetric intensity enhancement around acoustic transients.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
Signal reconstruction from modified auditory wavelet transform.
IEEE Trans. Signal Process., 1993

1992
Signal reconstruction from modified wavelet transform-An application to auditory signal processing.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990
A Method for Designing Neural Networks Using Nonlinear Multivariate Analysis: Application to Speaker-Independent Vowel Recognition.
Neural Comput., 1990

1988
Vowel-feature extraction from cochlear vibration using neural networks.
Neural Networks, 1988


  Loading...