Daiqing Wu

Orcid: 0009-0000-2320-387X

According to our database¹, Daiqing Wu authored at least 16 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Multimodal Emotion Recognition with Large Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

Beyond Detection: A Structure-Aware Framework for Scene Text Tracking.

[BibT_eX]

[DOI]

CoRR, May, 2026

Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding.

[BibT_eX]

[DOI]

CoRR, May, 2026

Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization.

[BibT_eX]

[DOI]

CoRR, April, 2026

Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning.

[BibT_eX]

[DOI]

CoRR, February, 2026

Resolving sentiment discrepancy for multimodal sentiment detection via semantics completion and decomposition.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

EmoCaliber: Advancing Reliable Visual Emotion Comprehension via Confidence Verbalization and Calibration.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

2025

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach.

[BibT_eX]

[DOI]

CoRR, September, 2025

Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts.

[BibT_eX]

[DOI]

CoRR, 2024

Robust Multimodal Sentiment Analysis of Image-Text Pairs by Distribution-Based Feature Recovery and Fusion.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2022

Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Daiqing Wu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...