En Yu

CoRR, October, 2025

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale.

[BibT_eX]

[DOI]

CoRR, August, 2025

Drift-aware Collaborative Assistance Mixture of Experts for Heterogeneous Multistream Learning.

[BibT_eX]

[DOI]

CoRR, August, 2025

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion.

[BibT_eX]

[DOI]

CoRR, July, 2025

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.

[BibT_eX]

[DOI]

CoRR, July, 2025

Generalized Incremental Learning under Concept Drift across Evolving Data Streams.

[BibT_eX]

[DOI]

Guangquan Zhang

CoRR, June, 2025

ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, May, 2025

Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Learning Robust Spectral Dynamics for Temporal Domain Generalization.

[BibT_eX]

[DOI]

CoRR, May, 2025

Perception-R1: Pioneering Perception Policy with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

InstaFace: Identity-Preserving Facial Editing with Single Image Inference.

[BibT_eX]

[DOI]

CoRR, February, 2025

Unhackable Temporal Rewarding for Scalable Video MLLMs.

[BibT_eX]

[DOI]

CoRR, February, 2025

Causal-Informed Contrastive Learning: Towards Bias-Resilient Pre-training under Concept Drift.

[BibT_eX]

[DOI]

CoRR, February, 2025

PerPO: Perceptual Preference Optimization via Discriminative Rewarding.

[BibT_eX]

[DOI]

CoRR, February, 2025

Multimodal Inverse Attention Network with Intrinsic Discriminant Feature Exploitation for Fake News Detection.

[BibT_eX]

[DOI]

CoRR, February, 2025

Multimodal Inverse Attention Network with Intrinsic Discriminant Feature Exploitation for Fake News Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Perception in Reflection.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Unhackable Temporal Reward for Scalable Video MLLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Cross-View Referring Multi-Object Tracking.

[BibT_eX]

[DOI]

Sijia Chen

Wenbing Tao

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

GroupLane: End-to-End 3D Lane Detection With Channel-Wise Grouping.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., November, 2024

Fuzzy Shared Representation Learning for Multistream Classification.

[BibT_eX]

[DOI]

Guangquan Zhang

IEEE Trans. Fuzzy Syst., October, 2024

Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World.

[BibT_eX]

[DOI]

CoRR, 2024

Small Language Model Meets with Reinforced Vision Vocabulary.

[BibT_eX]

[DOI]

CoRR, 2024

QTrack: Embracing Quality Clues for Robust 3D Multi-Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Merlin: Empowering Multimodal LLMs with Foresight Minds.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Efficient Few-Shot Classification via Contrastive Pretraining on Web Data.

[BibT_eX]

[DOI]

IEEE Trans. Artif. Intell., June, 2023

Implicit and Efficient Point Cloud Completion for 3D Single Object Tracking.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., April, 2023

RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2023

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Discrete Fusion Adversarial Hashing for cross-modal retrieval.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2022

Learn-to-adapt: Concept drift adaptation for hybrid multiple streams.

[BibT_eX]

[DOI]

Neurocomputing, 2022

Deep Discrete Cross-Modal Hashing with Multiple Supervision.

[BibT_eX]

[DOI]

Neurocomputing, 2022

MAT: Motion-aware multi-object tracking.

[BibT_eX]

[DOI]

Neurocomputing, 2022

Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2022

Delving into the Pre-training Paradigm of Monocular 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking.

[BibT_eX]

[DOI]

Zhuoling Li

Shoudong Han

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2020

Multi-class joint subspace learning for cross-modal retrieval.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2020

MAT: Motion-Aware Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2020

Refinements in Motion and Appearance for Online Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Coupled feature selection based semi-supervised modality-dependent cross-modal retrieval.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

Inf@TRECVID 2019: Instance Search Task.

[BibT_eX]

[DOI]

Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Cross-Modal Transfer Hashing Based on Coherent Projection.

[BibT_eX]

[DOI]