Daiqing Wu

Orcid: 0009-0000-2320-387X

According to our database1, Daiqing Wu authored at least 16 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Multimodal Emotion Recognition with Large Language Models.
CoRR, May, 2026

Beyond Detection: A Structure-Aware Framework for Scene Text Tracking.
CoRR, May, 2026

Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding.
CoRR, May, 2026

Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization.
CoRR, April, 2026

Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning.
CoRR, February, 2026

Resolving sentiment discrepancy for multimodal sentiment detection via semantics completion and decomposition.
Pattern Recognit., 2026

EmoCaliber: Advancing Reliable Visual Emotion Comprehension via Confidence Verbalization and Calibration.
Pattern Recognit., 2026

2025
Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach.
CoRR, September, 2025

Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts.
CoRR, 2024

Robust Multimodal Sentiment Analysis of Image-Text Pairs by Distribution-Based Feature Recovery and Fusion.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2022
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022


  Loading...