Zheng Lian

Orcid: 0000-0001-9477-0599

According to our database1, Zheng Lian authored at least 92 papers between 2017 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Hardness-Aware Dynamic Curriculum Learning for Robust Multimodal Emotion Recognition with Missing Modalities.
CoRR, August, 2025

Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions.
CoRR, July, 2025

EMER-Ranker: Learning to Rank Emotion Descriptions in the Absence of Ground Truth.
CoRR, July, 2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.
CoRR, May, 2025

𝒜LLM4ADD: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection.
CoRR, May, 2025

MER 2025: When Affective Computing Meets Large Language Models.
CoRR, April, 2025

P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation.
CoRR, April, 2025

Feature-Based Dual Visual Feature Extraction Model for Compound Multimodal Emotion Recognition.
CoRR, March, 2025

AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models.
CoRR, January, 2025

A Reconstruction Method for Weak Magnetic Pipeline Inspection Signals Based on Adaptive Multiscale Signal Reconstruction.
IEEE Trans. Instrum. Meas., 2025

SVFAP: Self-Supervised Video Facial Affect Perceiver.
IEEE Trans. Affect. Comput., 2025

Adversarial Training and Gradient Optimization for Partially Deepfake Audio Localization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MEIJU - The 1st Multimodal Emotion and Intent Joint Understanding Challenge.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Listen, Watch, and Learn to Feel: Retrieval-Augmented Emotion Reasoning for Compound Emotion Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Efficient Multimodal Transformer With Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis.
IEEE Trans. Affect. Comput., 2024

Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities.
IEEE Trans. Affect. Comput., 2024

HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition.
Inf. Fusion, 2024

GPT-4V with emotion: A zero-shot benchmark for Generalized Emotion Recognition.
Inf. Fusion, 2024

Open-vocabulary Multimodal Emotion Recognition: Dataset, Metric, and Benchmark.
CoRR, 2024

AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition.
CoRR, 2024

Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset.
CoRR, 2024

Multimodal Fusion with Pre-Trained Model Features in Affective Behaviour Analysis In-the-wild.
CoRR, 2024

Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning.
CoRR, 2024

MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition.
CoRR, 2024

Social Perception Prediction for MuSe 2024: Joint Learning of Multiple Perceptions.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

DPP: A Dual-Phase Processing Method for Cross-Cultural Humor Detection.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MFSN: Multi-perspective Fusion Search Network For Pre-training Knowledge in Speech Emotion Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Pseudo Labels Regularization for Imbalanced Partial-Label Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

The Signal Characteristics of Oil and Gas Pipeline Leakage Detection Based on Magneto-Mechanical Effects.
Sensors, February, 2023

Multimodal Spatiotemporal Representation for Automatic Depression Level Detection.
IEEE Trans. Affect. Comput., 2023

SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition.
IEEE Trans. Affect. Comput., 2023

RMNAS: A Multimodal Neural Architecture Search Framework For Robust Multimodal Sentiment Analysis.
CoRR, 2023

GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding.
CoRR, 2023

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR, 2023

Explainable Multimodal Emotion Reasoning.
CoRR, 2023

MFAS: Emotion Recognition through Multiple Perspectives Fusion Architecture Search Emulating Human Cognition.
CoRR, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
CoRR, 2023

DALI: Dynamically Adjusted Label Importance for Noisy Partial Label Learning.
CoRR, 2023

VRA: Variational Rectified Activation for Out-of-distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Humor Detection System for MuSE 2023: Contextual Modeling, Pesudo Labelling, and Post-smoothing.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Exclusive Modeling for MuSe-Personalisation Challenge.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MRAC'23: 1st International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ADD 2023: the Second Audio Deepfake Detection Challenge.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022
ARNet: Automatic Refinement Network for Noisy Partial Label Learning.
CoRR, 2022

Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis.
CoRR, 2022

Two-Aspect Information Fusion Model For ABAW4 Multi-task Challenge.
CoRR, 2022

EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition.
CoRR, 2022

ADD 2022: the First Audio Deep Synthesis Detection Challenge.
CoRR, 2022

Emotional Reaction Analysis based on Multi-Label Graph Convolutional Networks and Dynamic Facial Expression Recognition Transformer.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Multimodal Temporal Attention in Sentiment Analysis.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Prediction of Depression Severity Based on Transformer Encoder and CNN Model.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Investigating the Characteristic of Weak Magnetic Stress Internal Detection Signals of Long-Distance Oil and Gas Pipeline Under Demagnetization Effect.
IEEE Trans. Instrum. Meas., 2021

CTNet: Conversational Transformer Network for Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

DECN: Dialogical emotion correction network for conversational emotion recognition.
Neurocomputing, 2021

Correction to: Semi-supervised Ladder Networks for Speech Emotion Recognition.
Int. J. Autom. Comput., 2021

Multimodal Emotion Recognition and Sentiment Analysis via Attention Enhanced Recurrent Model.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021

Multimodal Sentiment Analysis based on Recurrent Neural Network and Multimodal Attention.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021

Towards Fine-Grained Prosody Control for Voice Conversion.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Multimodal Cross- and Self-Attention Network for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Expression Analysis Based on Face Regions in Real-world Conditions.
Int. J. Autom. Comput., 2020

Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism.
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020

ARVC: An Auto-Regressive Voice Conversion System Without Parallel Training Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Learning Utterance-Level Representations with Label Smoothing for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multimodal Transformer Fusion for Continuous Emotion Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

CASIA Voice Conversion System for the Voice Conversion Challenge 2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
Semi-supervised Ladder Networks for Speech Emotion Recognition.
Int. J. Autom. Comput., 2019

Expression Analysis Based on Face Regions in Read-world Conditions.
CoRR, 2019

Domain adversarial learning for emotion recognition.
CoRR, 2019

Towards Fine-Grained Prosody Control for Voice Conversion.
CoRR, 2019

Speech Emotion Recognition via Contrastive Loss under Siamese Networks.
CoRR, 2019

Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Conversational Emotion Analysis via Attention Mechanisms.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Discriminative Video Representation with Temporal Order for Micro-expression Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Investigation of Multimodal Features, Classifiers and Fusion Methods for Emotion Recognition.
CoRR, 2018

Deep Learning for Continuous Multiple Time Series Annotations.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018

Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018

End-to-End Continuous Emotion Recognition from Video Using 3D Convlstm Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Continuous Multimodal Emotion Prediction Based on Long Short Term Memory Recurrent Neural Network.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017


  Loading...