Zhixian Zhao

Orcid: 0000-0002-1136-8279

According to our database1, Zhixian Zhao authored at least 23 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models.
CoRR, April, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.
CoRR, March, 2026

EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs.
CoRR, February, 2026

Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning.
CoRR, January, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.
CoRR, January, 2026

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era.
CoRR, January, 2026

2025
Serial-Parallel Dual-Path Architecture for Speaking Style Recognition.
CoRR, October, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.
CoRR, August, 2025

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.
CoRR, March, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.
CoRR, February, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.
CoRR, January, 2025

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Graph-Based Analysis of Criminal Networks on Social Media: A Novel Approach Using Intersection Graphs for Cybercrime Mitigation.
Proceedings of the 19th International Conference on Ubiquitous Information Management and Communication, 2025

2024
Joint Learning Spatial-Temporal Attention Correlation Filters for Aerial Tracking.
IEEE Signal Process. Lett., 2024

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment.
CoRR, 2024

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

2023
Learning discriminative correlation filters via saliency-aware channel selection for robust visual object tracking.
J. Real Time Image Process., June, 2023

A Knowledge Acquisition Framework for Autonomous Decision Making in Service Robots.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2023

Learning Spatial- Temporal Context - Based Dynamic Feature Fusion Correlation Filters for Object Tracking.
Proceedings of the 18th International Conference on Intelligent Systems and Knowledge Engineering, 2023

2022
Correlation Filters Based on Multi-Expert and Game Theory for Visual Object Tracking.
IEEE Trans. Instrum. Meas., 2022

UAV Visual Tracking Algorithm Based on Feature Fusion of the Attention Mechanism.
Proceedings of the 5th International Conference on Artificial Intelligence and Pattern Recognition, 2022

Correlation Filter Based on Saliency Detection and Channel Selection for Visual Object Tracking.
Proceedings of the 5th International Conference on Artificial Intelligence and Pattern Recognition, 2022

2020
Tree in forbidden triples generating a finite set of graphs with high connectivity.
AKCE Int. J. Graphs Comb., 2020


  Loading...