Haiyang Sun

Orcid: 0009-0004-3485-3869

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China
  • University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China


According to our database1, Haiyang Sun authored at least 28 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling.
CoRR, May, 2025

Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis.
CoRR, April, 2025

FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching.
CoRR, February, 2025

AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models.
CoRR, January, 2025

SVFAP: Self-Supervised Video Facial Affect Perceiver.
IEEE Trans. Affect. Comput., 2025

2024
GPT-4V with emotion: A zero-shot benchmark for Generalized Emotion Recognition.
Inf. Fusion, 2024

Open-vocabulary Multimodal Emotion Recognition: Dataset, Metric, and Benchmark.
CoRR, 2024

AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition.
CoRR, 2024

Multimodal Fusion with Pre-Trained Model Features in Affective Behaviour Analysis In-the-wild.
CoRR, 2024

Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning.
CoRR, 2024

MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition.
CoRR, 2024

Social Perception Prediction for MuSe 2024: Joint Learning of Multiple Perceptions.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

DPP: A Dual-Phase Processing Method for Cross-Cultural Humor Detection.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MFSN: Multi-perspective Fusion Search Network For Pre-training Knowledge in Speech Emotion Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
RMNAS: A Multimodal Neural Architecture Search Framework For Robust Multimodal Sentiment Analysis.
CoRR, 2023

GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding.
CoRR, 2023

Explainable Multimodal Emotion Reasoning.
CoRR, 2023

MFAS: Emotion Recognition through Multiple Perspectives Fusion Architecture Search Emulating Human Cognition.
CoRR, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
CoRR, 2023

Exclusive Modeling for MuSe-Personalisation Challenge.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Two-Aspect Information Fusion Model For ABAW4 Multi-task Challenge.
CoRR, 2022

EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition.
CoRR, 2022

Fully Automated End-to-End Fake Audio Detection.
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022

Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022


  Loading...