Yuqian Fu

Orcid: 0009-0008-9031-9889

According to our database1, Yuqian Fu authored at least 53 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Meta Learning Task Representation in Multiagent Reinforcement Learning: From Global Inference to Local Inference.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering.
CoRR, August, 2025

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection.
CoRR, July, 2025

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning.
CoRR, June, 2025

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base.
CoRR, June, 2025

CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning.
CoRR, June, 2025

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy.
CoRR, June, 2025

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection.
CoRR, June, 2025

Cross-View Multi-Modal Segmentation @ Ego-Exo4D Challenges 2025.
CoRR, June, 2025

RLAE: Reinforcement Learning-Assisted Ensemble for LLMs.
CoRR, June, 2025

Manifold-aware Representation Learning for Degradation-agnostic Image Restoration.
CoRR, May, 2025

MLLMs are Deeply Affected by Modality Bias.
CoRR, May, 2025

EarthSynth: Generating Informative Earth Observation with Diffusion Models.
CoRR, May, 2025

CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes.
IEEE Robotics Autom. Lett., April, 2025

CamSAM2: Segment Anything Accurately in Camouflaged Videos.
CoRR, March, 2025

Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation.
CoRR, March, 2025

Sequential Multi-Object Grasping with One Dexterous Hand.
CoRR, March, 2025

AVA: Attentive VLM Agent for Mastering StarCraft II.
CoRR, March, 2025

MergeIT: From Selection to Merging for Efficient Instruction Tuning.
CoRR, March, 2025

LDR: Learning Discrete Representation to Improve Noise Robustness in Multiagent Tasks.
IEEE Trans. Syst. Man Cybern. Syst., January, 2025

Offline Goal-Conditioned Reinforcement Learning with Elastic-Subgoal Diffused Policy Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025


Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Understanding the World's Museums through Vision-Language Reasoning.
CoRR, 2024

Prompt as Free Lunch: Enhancing Diversity in Source-Free Cross-domain Few-shot Learning through Semantic-Guided Prompting.
CoRR, 2024

ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos.
CoRR, 2024

MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution.
CoRR, 2024

CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration.
CoRR, 2024

Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes.
CoRR, 2024

fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction.
CoRR, 2024

Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community.
CoRR, 2024

Towards a Generalist and Blind RGB-X Tracker.
CoRR, 2024

MinD-3D: Reconstruct High-Quality 3D Objects in Human Brain.
Proceedings of the Computer Vision - ECCV 2024, 2024

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Test-Time Linear Out-of-Distribution Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Aligning Credit for Multi-Agent Cooperation via Model-based Counterfactual Imagination.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Open-Vocabulary Video Relation Extraction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.
CoRR, 2023

On the Importance of Spatial Relations for Few-shot Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
IEEE Trans. Image Process., 2022

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning.
CoRR, 2022

TGDM: Target Guided Dynamic Mixup for Cross-Domain Few-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

LILAC: Learning a Leader for Cooperative Reinforcement Learning.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021
Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

E-ACJ: Accurate Junction Extraction For Event Cameras.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

2020
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.
Proceedings of the 27th ACM International Conference on Multimedia, 2019


  Loading...