Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

Multimodal Cross- and Self-Attention Network for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Licai Sun

Bin Liu

Jianhua Tao

Zheng Lian

Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Scale and Multi-Region Facial Discriminative Representation for Automatic Depression Level Prediction.

[BibT_eX]

[DOI]

Mingyue Niu

Jianhua Tao

Bin Liu

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Simultaneous Denoising and Dereverberation Using Deep Embedding Features.

[BibT_eX]

[DOI]

CoRR, 2020

Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method.

[BibT_eX]

[DOI]

CoRR, 2020

Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features.

[BibT_eX]

[DOI]

CoRR, 2020

Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020

Hybrid Network Feature Extraction for Depression Assessment from Speech.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Comparison of Glottal Source Parameter Values in Emotional Vowels.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Learning Utterance-Level Representations with Label Smoothing for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

AMINN: Attention-Based Multi-Information Neural Network for Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

Multimodal Transformer Fusion for Continuous Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Micro-Expression Recognition Based on Multiple Aggregation Networks.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Domain adversarial learning for emotion recognition.

[BibT_eX]

[DOI]

CoRR, 2019

Towards Fine-Grained Prosody Control for Voice Conversion.

[BibT_eX]

[DOI]

CoRR, 2019

Automatic Depression Level Detection via ℓ<sub>p</sub>-Norm Pooling.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Conversational Emotion Analysis via Attention Mechanisms.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Loss and Double-edge-triggered Detector for Robust Small-footprint Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Voice Activity Detection Based on Time-Delay Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Local Second-Order Gradient Cross Pattern for Automatic Depression Detection.

[BibT_eX]

[DOI]

Mingyue Niu

Jianhua Tao

Bin Liu

Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

Efficient Modeling of Long Temporal Contexts for Continuous Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2018

Distilling Knowledge Using Parallel Data for Far-field Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2018

A Novel Unified Framework for Speech Enhancement and Bandwidth Extension Based on Jointly Trained Neural Networks.

[BibT_eX]

[DOI]

Bin Liu

Jianhua Tao

Yibin Zheng

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Stochastic Multiple Choice Learning for Acoustic Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A novel pitch extraction based on jointly trained deep BLSTM Recurrent Neural Networks with bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2016

Investigating deep neural network adaptation for generating exclamatory and interrogative speech in Mandarin.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

CTC regularized model adaptation for improving LSTM RNN based multi-accent Mandarin speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

End-to-end keywords spotting based on connectionist temporal classification for Mandarin.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

A Novel Research to Artificial Bandwidth Extension Based on Deep BLSTM Recurrent Neural Networks and Exemplar-Based Sparse Representation.

[BibT_eX]

[DOI]

Bin Liu

Jianhua Tao

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Extraction of tongue contour in real-time magnetic resonance imaging sequences.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

User behavior fusion in dialog management with multi-modal history cues.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

A novel method of artificial bandwidth extension using deep architecture.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Estimate articulatory MRI series from acoustic signal using deep architecture.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Context features based pre-selection and weight prediction in concatenation speech synthesis system.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Bin Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...