Bin Liu

Orcid: 0000-0003-1529-1552

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China


According to our database1, Bin Liu authored at least 83 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Efficient Multimodal Transformer With Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis.
IEEE Trans. Affect. Comput., 2024

SVFAP: Self-supervised Video Facial Affect Perceiver.
CoRR, 2024

2023
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Dense Modality Interaction Network for Audio-Visual Event Localization.
IEEE Trans. Multim., 2023

Multimodal Spatiotemporal Representation for Automatic Depression Level Detection.
IEEE Trans. Affect. Comput., 2023

SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition.
IEEE Trans. Affect. Comput., 2023

RMNAS: A Multimodal Neural Architecture Search Framework For Robust Multimodal Sentiment Analysis.
CoRR, 2023

Humor Detection System for MuSE 2023: Contextual Modeling, Pesudo Labelling, and Post-smoothing.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Two-Aspect Information Fusion Model For ABAW4 Multi-task Challenge.
CoRR, 2022

EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition.
CoRR, 2022

ADD 2022: the First Audio Deep Synthesis Detection Challenge.
CoRR, 2022

AHRNN: Attention-Based Hybrid Robust Neural Network for emotion recognition.
Cogn. Comput. Syst., 2022

Multimodal Temporal Attention in Sentiment Analysis.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Prediction of Depression Severity Based on Transformer Encoder and CNN Model.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

End-to-End Network Based on Transformer for Automatic Detection of Covid-19.
Proceedings of the IEEE International Conference on Acoustics, 2022

Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Review of micro-expression spotting and recognition in video sequences.
Virtual Real. Intell. Hardw., 2021

Learning long-term temporal contexts using skip RNN for continuous emotion recognition.
Virtual Real. Intell. Hardw., 2021

CTNet: Conversational Transformer Network for Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

$F_0$-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

A time-frequency channel attention and vectorization network for automatic depression level prediction.
Neurocomputing, 2021

DECN: Dialogical emotion correction network for conversational emotion recognition.
Neurocomputing, 2021

Multimodal Emotion Recognition and Sentiment Analysis via Attention Enhanced Recurrent Model.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021

Multimodal Sentiment Analysis based on Recurrent Neural Network and Multimodal Attention.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021

Towards Fine-Grained Prosody Control for Voice Conversion.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

TDCA-Net: Time-Domain Channel Attention Network for Depression Detection.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Facial Micro-Expression Recognition Based on Multi-Scale Temporal and Spatial Features.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

Multimodal Cross- and Self-Attention Network for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Scale and Multi-Region Facial Discriminative Representation for Automatic Depression Level Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Simultaneous Denoising and Dereverberation Using Deep Embedding Features.
CoRR, 2020

Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method.
CoRR, 2020

Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features.
CoRR, 2020

Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism.
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020

Hybrid Network Feature Extraction for Depression Assessment from Speech.
Proceedings of the Interspeech 2020, 2020

Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks.
Proceedings of the Interspeech 2020, 2020

Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition.
Proceedings of the Interspeech 2020, 2020

Comparison of Glottal Source Parameter Values in Emotional Vowels.
Proceedings of the Interspeech 2020, 2020

Learning Utterance-Level Representations with Label Smoothing for Speech Emotion Recognition.
Proceedings of the Interspeech 2020, 2020

Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations.
Proceedings of the Interspeech 2020, 2020

Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations.
Proceedings of the Interspeech 2020, 2020

AMINN: Attention-Based Multi-Information Neural Network for Emotion Recognition.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

Multimodal Transformer Fusion for Continuous Emotion Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Micro-Expression Recognition Based on Multiple Aggregation Networks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Domain adversarial learning for emotion recognition.
CoRR, 2019

Towards Fine-Grained Prosody Control for Voice Conversion.
CoRR, 2019

Automatic Depression Level Detection via ℓ<sub>p</sub>-Norm Pooling.
Proceedings of the Interspeech 2019, 2019

Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition.
Proceedings of the Interspeech 2019, 2019

Conversational Emotion Analysis via Attention Mechanisms.
Proceedings of the Interspeech 2019, 2019

Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features.
Proceedings of the Interspeech 2019, 2019

Loss and Double-edge-triggered Detector for Robust Small-footprint Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Voice Activity Detection Based on Time-Delay Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Local Second-Order Gradient Cross Pattern for Automatic Depression Detection.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

Efficient Modeling of Long Temporal Contexts for Continuous Emotion Recognition.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin.
J. Signal Process. Syst., 2018

CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition.
J. Signal Process. Syst., 2018

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.
CoRR, 2018

Distilling Knowledge Using Parallel Data for Far-field Speech Recognition.
CoRR, 2018

A Novel Unified Framework for Speech Enhancement and Bandwidth Extension Based on Jointly Trained Neural Networks.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Stochastic Multiple Choice Learning for Acoustic Modeling.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction.
Proceedings of the Interspeech 2017, 2017

A novel pitch extraction based on jointly trained deep BLSTM Recurrent Neural Networks with bottleneck features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.
J. Signal Process. Syst., 2016

Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

End-to-end keywords spotting based on connectionist temporal classification for Mandarin.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

A Novel Research to Artificial Bandwidth Extension Based on Deep BLSTM Recurrent Neural Networks and Exemplar-Based Sparse Representation.
Proceedings of the Interspeech 2016, 2016

Extraction of tongue contour in real-time magnetic resonance imaging sequences.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
User behavior fusion in dialog management with multi-modal history cues.
Multim. Tools Appl., 2015

A novel method of artificial bandwidth extension using deep architecture.
Proceedings of the INTERSPEECH 2015, 2015

Estimate articulatory MRI series from acoustic signal using deep architecture.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Context features based pre-selection and weight prediction in concatenation speech synthesis system.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014


  Loading...