Wanlong Fang

Orcid: 0000-0003-4480-3107

According to our database1, Wanlong Fang authored at least 17 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SLAP: The Semantic Least Action Principle for Variational Video-Language Modeling.
CoRR, May, 2026

Immuno-VLM: Immunizing Large Vision-Language Models via Generative Semantic Antibodies for Open-World Trustworthiness.
CoRR, May, 2026

CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning.
CoRR, May, 2026

How Creative Are Large Language Models in Generating Molecules?
CoRR, April, 2026

To Align or Not to Align: Strategic Multimodal Representation Alignment for Optimal Performance.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Towards Unified Vision-Language Models with Incomplete Multi-Modal Inputs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Rethinking Video-Language Model from the Language Input Perspective.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Unveiling the Fragility of Vision-Language Models: Multi-Modal Adversarial Synergy via Texture-Constrained Perturbations and Cross-Modal Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Disentangling Adversarial Prompts: A Semantic-Graph Defense for Robust LLM Security.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Can LLMs Reason Over Non-Text Modalities in a Training-Free Manner? A Case Study with In-Context Representation Learning.
CoRR, September, 2025

Turing Patterns for Multimedia: Reaction-Diffusion Multi-Modal Fusion for Language-Guided Video Moment Retrieval.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval using Language.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Rethinking Weakly-Supervised Video Temporal Grounding From a Game Perspective.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Robust Temporal Activity Localization Learning with Noisy Labels.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023


  Loading...