Longbiao Wang

Masahiro Iwahashi

Multim. Tools Appl., 2018

Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Distant-talking Speech Recognition Based on Multi-objective Learning using Phase and Magnitude-based Feature.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multiple Phase Information Combination for Replay Attacks Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Integrative Network Embedding via Deep Joint Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Convolutional Neural Network with Spectrogram and Perceptual Features for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Efficient Multi-spike Learning with Tempotron-Like LTP and PSD-Like LTD.

[BibT_eX]

[DOI]

Qiang Yu

Proceedings of the Neural Information Processing - 25th International Conference, 2018

A Feature Fusion Method Based on Extreme Learning Machine for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Gender-Aware CNN-BLSTM for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

Spoofing Speech Detection Using Modified Relative Phase Information.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2017

Noise robust voice activity detection using joint phase and magnitude based feature enhancement.

[BibT_eX]

[DOI]

J. Ambient Intell. Humaniz. Comput., 2017

Prediction of F0 Based on Articulatory Features Using DNN.

[BibT_eX]

[DOI]

Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Global Monitoring of Dynamic Functional Interactions in the Brain During Chinese Verbs Perception.

[BibT_eX]

[DOI]

Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Speech Emotion Recognition Considering Local Dynamic Features.

[BibT_eX]

[DOI]

Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Phonemic Restoration Based on the Movement Continuity of Articulation.

[BibT_eX]

[DOI]

Cenxi Zhao

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Neuronal Classifier for both Rate and Timing-Based Spike Patterns.

[BibT_eX]

[DOI]

Qiang Yu

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Exploiting the Tibetan Radicals in Recurrent Neural Network for Low-Resource Language Models.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Phase aware deep neural network for noise robust voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

2016

Guest Editorial: Immersive Audio/Visual Systems.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Distant-talking accent recognition by combining GMM and DNN.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting.

[BibT_eX]

[DOI]

CoRR, 2016

Multi-channel feature adaptation for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Exploring tonal information for Lhasa dialect acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Face recognition with local contourlet combined patterns.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Environment-dependent denoising autoencoder for distant-talking speech recognition.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2015

Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2015

Relative phase information for detecting human speech and spoofed speech.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

A spectrum smoothing method for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Speech selection and environmental adaptation for asynchronous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation.

[BibT_eX]

[DOI]

Zhaofeng Zhang

EURASIP J. Audio Speech Music. Process., 2014

PLDA in the I-Supervector Space for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Ye Jiang

Kong-Aik Lee

EURASIP J. Audio Speech Music. Process., 2014

Speaker Identification by Combining Various Vocal Tract and Vocal Source Features.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Distant-talking speech recognition using multi-channel LMS and multiple-step linear prediction.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Speech enhancement via low-rank matrix decomposition and image based masking.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Single-sided approach to discriminative PLDA training for text-independent speaker verification without using expanded i-vector.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Multi-channel speech enhancement using sparse coding on local time-frequency structures.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Log-domain polynomial filters for illumination-robust face recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Improvement of distant-talking speaker identification using bottleneck features of DNN.

[BibT_eX]

[DOI]

Takanori Yamada

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach.

[BibT_eX]

[DOI]

Zhaofeng Zhang

Proceedings of the IEEE International Conference on Acoustics, 2013

Joint sparse representation based cepstral-domain dereverberation for distant-talking speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Frequency-domain dereverberation on speech signal using surround retinex.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Sparse coding for sound event classification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Local consistency preserved coupled mappings for low-resolution face recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Speech recognition using blind source separation and dereverberation method for mixed sound of speech and music.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Speaker identification using pseudo pitch synchronized phase information in noisy environments.

[BibT_eX]

[DOI]

Yuta Kawakami

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Speaker Identification and Verification by Combining MFCC and Phase Information.

[BibT_eX]

[DOI]

Shinji Ohtsuka

IEEE Trans. Speech Audio Process., 2012

Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array.

[BibT_eX]

[DOI]

Kyohei Odani

EURASIP J. Adv. Signal Process., 2012

Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment.

[BibT_eX]

[DOI]

Kyohei Odani

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Dereverberantion based on generalized spectral subtraction for distant-talking speaker recognition.

[BibT_eX]

[DOI]

Zhaofeng Zhang

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Distant-talking speaker identification using a reverberation model with various artificial room impulse responses.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

Evaluation of Hands-Free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm.

[BibT_eX]

[DOI]

Kyohei Odani

Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010

Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Speaker identification by combining MFCC and phase information in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

High improvement of speaker identification and verification by combining MFCC and phase information.

[BibT_eX]

[DOI]

Shinji Ohtsuka

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM.

[BibT_eX]

[DOI]

Speech Commun., 2007

Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Speaker recognition by combining MFCC and phase information.

[BibT_eX]

[DOI]

Kouhei Asakawa

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust Distant Speech Recognition by Combining Position-Dependent CMN with Conventional CMN.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2006

2005

Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust distant speaker recognition based on position dependent cepstral mean normalization.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Robust distant speech recognition based on position dependent CMN.

[BibT_eX]

[DOI]