En Yu

Orcid: 0009-0005-3335-5486

According to our database1, En Yu authored at least 56 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2026
Enhancing outdoor vision: Binocular desnowing with dual-stream temporal transformer.
Pattern Recognit., 2026

2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale.
CoRR, August, 2025

Drift-aware Collaborative Assistance Mixture of Experts for Heterogeneous Multistream Learning.
CoRR, August, 2025

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion.
CoRR, July, 2025

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.
CoRR, July, 2025

Generalized Incremental Learning under Concept Drift across Evolving Data Streams.
CoRR, June, 2025

ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking.
CoRR, May, 2025

Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning.
CoRR, May, 2025

Learning Robust Spectral Dynamics for Temporal Domain Generalization.
CoRR, May, 2025

Perception-R1: Pioneering Perception Policy with Reinforcement Learning.
CoRR, April, 2025

Perception in Reflection.
CoRR, April, 2025

InstaFace: Identity-Preserving Facial Editing with Single Image Inference.
CoRR, February, 2025

Unhackable Temporal Rewarding for Scalable Video MLLMs.
CoRR, February, 2025

Causal-Informed Contrastive Learning: Towards Bias-Resilient Pre-training under Concept Drift.
CoRR, February, 2025

PerPO: Perceptual Preference Optimization via Discriminative Rewarding.
CoRR, February, 2025

Multimodal Inverse Attention Network with Intrinsic Discriminant Feature Exploitation for Fake News Detection.
CoRR, February, 2025

Unhackable Temporal Reward for Scalable Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Cross-View Referring Multi-Object Tracking.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
GroupLane: End-to-End 3D Lane Detection With Channel-Wise Grouping.
IEEE Robotics Autom. Lett., November, 2024

Fuzzy Shared Representation Learning for Multistream Classification.
IEEE Trans. Fuzzy Syst., October, 2024

Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World.
CoRR, 2024

Small Language Model Meets with Reinforced Vision Vocabulary.
CoRR, 2024

QTrack: Embracing Quality Clues for Robust 3D Multi-Object Tracking.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Merlin: Empowering Multimodal LLMs with Foresight Minds.
Proceedings of the Computer Vision - ECCV 2024, 2024

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Efficient Few-Shot Classification via Contrastive Pretraining on Web Data.
IEEE Trans. Artif. Intell., June, 2023

Implicit and Efficient Point Cloud Completion for 3D Single Object Tracking.
IEEE Robotics Autom. Lett., April, 2023

RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation.
IEEE Trans. Multim., 2023

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking.
CoRR, 2023

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Discrete Fusion Adversarial Hashing for cross-modal retrieval.
Knowl. Based Syst., 2022

Learn-to-adapt: Concept drift adaptation for hybrid multiple streams.
Neurocomputing, 2022

Deep Discrete Cross-Modal Hashing with Multiple Supervision.
Neurocomputing, 2022

MAT: Motion-aware multi-object tracking.
Neurocomputing, 2022

Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking.
CoRR, 2022

Delving into the Pre-training Paradigm of Monocular 3D Object Detection.
CoRR, 2022

Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2020
Multi-class joint subspace learning for cross-modal retrieval.
Pattern Recognit. Lett., 2020

MAT: Motion-Aware Multi-Object Tracking.
CoRR, 2020

Refinements in Motion and Appearance for Online Multi-Object Tracking.
CoRR, 2020

2019
Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval.
IEEE Trans. Multim., 2019

Coupled feature selection based semi-supervised modality-dependent cross-modal retrieval.
Multim. Tools Appl., 2019

Inf@TRECVID 2019: Instance Search Task.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Cross-Modal Transfer Hashing Based on Coherent Projection.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Fusion-Supervised Deep Cross-Modal Hashing.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Task-Dependent and Query-Dependent Subspace Learning for Cross-Modal Retrieval.
IEEE Access, 2018

Shandong Normal University in the VTT Tasks at TRECVID 2018.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

2017
Shandong Normal University in the VTT Tasks at TRECVID 2017.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Semi-supervised Distance Consistent Cross-modal Retrieval.
Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities, 2017

Coupled feature selection for modality-dependent cross-media retrieval.
Proceedings of the 2017 International Symposium on Intelligent Signal Processing and Communication Systems, 2017

When diary meets lifelog video.
Proceedings of the 2017 International Symposium on Intelligent Signal Processing and Communication Systems, 2017


  Loading...