Leigang Qu

Orcid: 0000-0002-5479-8439

According to our database1, Leigang Qu authored at least 29 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
OSGNet with MLLM Reranking @ Ego4D Episodic Memory Challenge 2026.
CoRR, May, 2026

AI for Auto-Research: Roadmap & User Guide.
CoRR, May, 2026

WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval.
CoRR, February, 2026

AUHead: Realistic Emotional Talking Head Generation via Action Units Control.
CoRR, February, 2026

GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?
CoRR, February, 2026

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval.
CoRR, February, 2026

2025
TTOM: Test-Time Optimization and Memorization for Compositional Video Generation.
CoRR, October, 2025

VINCIE: Unlocking In-context Image Editing from Video.
CoRR, June, 2025

Automatic Pruning via Structured Lasso with Class-wise Information.
CoRR, February, 2025

Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Visual Content Generation in the Era of Large Foundation Models.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Unified Text-to-Image Generation and Retrieval.
CoRR, 2024

NExT-GPT: Any-to-Any Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Discriminative Probing and Tuning for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Self-Supervised Correlation Learning for Cross-Modal Retrieval.
IEEE Trans. Multim., 2023

Learnable Pillar-based Re-ranking for Image-Text Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Popularity-aware Distributionally Robust Optimization for Recommendation System.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Search-oriented Micro-video Captioning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Temporal anomaly detection on IIoT-enabled manufacturing.
J. Intell. Manuf., 2021

Dynamic Modality Interaction Modeling for Image-Text Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

2020
Iterative Local-Global Collaboration Learning Towards One-Shot Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Context-Aware Multi-View Summarization Network for Image-Text Matching.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020


  Loading...