Deyi Ji

Orcid: 0000-0001-7561-9789

According to our database, Deyi Ji authored at least 53 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

CamGeo: Sparse Camera-Conditioned Image-to-Video Generation with 3D Geometry Priors.

CoRR, May, 2026

Claw AI Lab: An Autonomous Multi-Agent Research Team.

CoRR, May, 2026

Video-Zero: Self-Evolution Video Understanding.

CoRR, May, 2026

4DVGGT-D: 4D Visual Geometry Transformer with Improved Dynamic Depth Estimation.

CoRR, May, 2026

Aligning LLM Uncertainty with Human Disagreement in Subjectivity Analysis.

CoRR, May, 2026

ARGUS: Policy-Adaptive Ad Governance via Evolving Reinforcement with Adversarial Umpiring.

CoRR, May, 2026

Recovering Hidden Reward in Diffusion-Based Policies.

CoRR, May, 2026

The First Challenge on Mobile Real-World Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview.

CoRR, April, 2026

StreamCacheVGGT: Streaming Visual Geometry Transformers with Robust Scoring and Hybrid Cache Compression.

CoRR, April, 2026

Robust 4D Visual Geometry Transformer with Uncertainty-Aware Priors.

CoRR, April, 2026

HD-VGGT: High-Resolution Visual Geometry Transformer.

CoRR, March, 2026

StreamSense: Streaming Social Task Detection with Selective Vision-Language Model Routing.

Proceedings of the ACM Web Conference 2026, 2026

Multi-Agent VLMs Guided Self-Training with PNU Loss for Low-Resource Offensive Content Detection.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field.

CoRR, December, 2025

SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation.

CoRR, November, 2025

SID: Multi-LLM Debate Driven by Self Signals.

CoRR, October, 2025

LLaFS++: Few-Shot Image Segmentation With Large Language Models.

IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Breaking the Box: Enhancing Remote Sensing Image Segmentation with Freehand Sketches.

CoRR, March, 2025

Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation.

CoRR, March, 2025

Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches.

CoRR, January, 2025

Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification.

CoRR, January, 2025

From Air to Wear: Personalized 3D Digital Fashion With AR/VR Immersive 3D Sketching.

IEEE Trans. Vis. Comput. Graph., 2025

Not Every Patch is Needed: Toward a More Efficient and Effective Backbone for Video-Based Person Re-Identification.

IEEE Trans. Image Process., 2025

Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Generating Negative Samples for Multi-Modal Recommendation.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

CPCF: A Cross-Prompt Contrastive Framework for Referring Multimodal Large Language Models.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LOHRec: Leveraging Order and Hierarchy in Generative Sequential Recommendation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

RAVEN++: Pinpointing Fine-Grained Violations in Advertisement Videos with Active Reinforcement Reasoning.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RAVEN: Robust Advertisement Video Violation Temporal Grounding via Reinforcement Reasoning.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025

2024

Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding.

CoRR, 2024

SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More.

CoRR, 2024

xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart.

CoRR, 2024

Reasoning3D - Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models.

CoRR, 2024

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV.

CoRR, 2024

PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Discrete Latent Perspective Learning for Segmentation and Detection.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Changenet: Multi-Temporal Asymmetric Change Detection Dataset.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

LLaFS: When Large Language Models Meet Few-Shot Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

xLSTM-UNet can be an Effective Backbone for 2D & 3D Biomedical Image Segmentation Better than its Mamba Counterparts.

Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2024

2023

Learning Social Spatio-Temporal Relation Graph in the Wild and a Video Benchmark.

IEEE Trans. Neural Networks Learn. Syst., June, 2023

Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation.

Deyi Ji, Feng Zhao, Hongtao Lu

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

IPGN: Interactiveness Proposal Graph Network for Human-Object Interaction Detection.

IEEE Trans. Image Process., 2021

Learning Statistical Texture for Semantic Segmentation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Context-Aware Graph Convolution Network for Target Re-identification.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Class-Wise Dynamic Graph Convolution for Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2020, 2020

2018

End to end multi-scale convolutional neural network for crowd counting.

Deyi Ji, Hongtao Lu, Tongzhen Zhang

Proceedings of the Eleventh International Conference on Machine Vision, 2018

Challenges on Large Scale Surveillance Video Analysis.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
