Deyi Ji

Orcid: 0000-0001-7561-9789

According to our database1, Deyi Ji authored at least 46 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
The First Challenge on Mobile Real-World Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview.
CoRR, April, 2026

StreamCacheVGGT: Streaming Visual Geometry Transformers with Robust Scoring and Hybrid Cache Compression.
CoRR, April, 2026

Robust 4D Visual Geometry Transformer with Uncertainty-Aware Priors.
CoRR, April, 2026

HD-VGGT: High-Resolution Visual Geometry Transformer.
CoRR, March, 2026

StreamSense: Streaming Social Task Detection with Selective Vision-Language Model Routing.
Proceedings of the ACM Web Conference 2026, 2026

Multi-Agent VLMs Guided Self-Training with PNU Loss for Low-Resource Offensive Content Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field.
CoRR, December, 2025

SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation.
CoRR, November, 2025

SID: Multi-LLM Debate Driven by Self Signals.
CoRR, October, 2025

Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval.
CoRR, October, 2025

LLaFS++: Few-Shot Image Segmentation With Large Language Models.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Breaking the Box: Enhancing Remote Sensing Image Segmentation with Freehand Sketches.
CoRR, March, 2025

Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation.
CoRR, March, 2025

Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches.
CoRR, January, 2025

Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification.
CoRR, January, 2025

From Air to Wear: Personalized 3D Digital Fashion With AR/VR Immersive 3D Sketching.
IEEE Trans. Vis. Comput. Graph., 2025

Not Every Patch is Needed: Toward a More Efficient and Effective Backbone for Video-Based Person Re-Identification.
IEEE Trans. Image Process., 2025

Generating Negative Samples for Multi-Modal Recommendation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

CPCF: A Cross-Prompt Contrastive Framework for Referring Multimodal Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LOHRec: Leveraging Order and Hierarchy in Generative Sequential Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

RAVEN++: Pinpointing Fine-Grained Violations in Advertisement Videos with Active Reinforcement Reasoning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RAVEN: Robust Advertisement Video Violation Temporal Grounding via Reinforcement Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025

2024
Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding.
CoRR, 2024

SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More.
CoRR, 2024

xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart.
CoRR, 2024

Reasoning3D - Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models.
CoRR, 2024

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV.
CoRR, 2024

PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Discrete Latent Perspective Learning for Segmentation and Detection.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Changenet: Multi-Temporal Asymmetric Change Detection Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2024

LLaFS: When Large Language Models Meet Few-Shot Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

xLSTM-UNet can be an Effective Backbone for 2D & 3D Biomedical Image Segmentation Better than its Mamba Counterparts.
Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2024

2023
Learning Social Spatio-Temporal Relation Graph in the Wild and a Video Benchmark.
IEEE Trans. Neural Networks Learn. Syst., June, 2023

Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
IPGN: Interactiveness Proposal Graph Network for Human-Object Interaction Detection.
IEEE Trans. Image Process., 2021

Learning Statistical Texture for Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Context-Aware Graph Convolution Network for Target Re-identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Class-Wise Dynamic Graph Convolution for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2018
End to end multi-scale convolutional neural network for crowd counting.
Proceedings of the Eleventh International Conference on Machine Vision, 2018

Challenges on Large Scale Surveillance Video Analysis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018


  Loading...