Lu Qi

Orcid: 0000-0002-2684-0062

Affiliations:
  • Wuhan University, Insta360 Joint Lab, Wuhan, China
  • University of California at Merced, EECS Department, Merced, CA, USA
  • Chinese University of Hong Kong, Computer Science and Engineering Department, Hong Kong (PhD)


According to our database1, Lu Qi authored at least 114 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Causal Prompts for Open-Vocabulary Video Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control.
CoRR, May, 2026

DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2026

STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation.
CoRR, April, 2026

Fine-Grained Multimodal Alignment for Image-Text Retrieval via Graph Learning.
Int. J. Comput. Vis., March, 2026

SaSaSaSa2VA: 2nd Place of the 5th PVUW MeViS-Text Track.
CoRR, March, 2026

Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction.
CoRR, March, 2026

Fly360: Omnidirectional Obstacle Avoidance within Drone View.
CoRR, March, 2026

MOSIV: Multi-Object System Identification from Videos.
CoRR, March, 2026

SLER-IR: Spherical Layer-wise Expert Routing for All-in-One Image Restoration.
CoRR, March, 2026

SAMTok: Representing Any Mask with Two Words.
CoRR, January, 2026

Video Prediction Transformers without Recurrence or Convolution.
Trans. Mach. Learn. Res., 2026

2025
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation.
CoRR, December, 2025

Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark.
CoRR, December, 2025

AirSim360: A Panoramic Simulation Platform within Drone View.
CoRR, December, 2025

Re-Boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

Rethinking Cross-Generator Image Forgery Detection through DINOv3.
CoRR, November, 2025

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training.
CoRR, October, 2025

D<sup>2</sup>GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction.
CoRR, October, 2025

The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA.
CoRR, September, 2025

One Flight Over the Gap: A Survey from Perspective to Panoramic Vision.
CoRR, September, 2025

Rethinking Evaluation Metrics of Open-Vocabulary Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
CoRR, August, 2025

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World.
CoRR, June, 2025

CoCo4D: Comprehensive and Complex 4D Scene Generation.
CoRR, June, 2025

Dense360: Dense Understanding from Omnidirectional Panoramas.
CoRR, June, 2025

CyberV: Cybernetics for Test-time Scaling in Video Understanding.
CoRR, June, 2025

RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy.
CoRR, June, 2025

Conditional Panoramic Image Generation via Masked Autoregressive Modeling.
CoRR, May, 2025

BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation.
CoRR, May, 2025

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild.
CoRR, April, 2025

An Empirical Study of GPT-4o Image Generation Capabilities.
CoRR, April, 2025

4th PVUW MeViS 3rd Place Report: Sa2VA.
CoRR, April, 2025

CFCI-Net: Cross-Modality Feature Calibration and Integration Network for RGB-D Semantic Segmentation.
IEEE Trans. Intell. Veh., March, 2025

Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark.
IEEE Trans. Circuits Syst. Video Technol., March, 2025

UMC: Unified Resilient Controller for Legged Robots with Joint Malfunctions.
CoRR, February, 2025

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos.
CoRR, January, 2025

Seg-VAR: Image Segmentation with Visual Autoregressive Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

RAPID Hand: Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Embodied Intelligence.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

HQGS: High-Quality Novel View Synthesis with Gaussian Splatting in Degraded Scenes.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ViLLa: Video Reasoning Segmentation with Large Language Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Learning Deblurring Texture Prior From Unpaired Data with Diffusion Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Controllable 3D Outdoor Scene Generation via Scene Graphs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Unified Dense Prediction of Video Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DreamRelation: Bridging Customization and Relation Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Point Cloud Mamba: Point Cloud Learning via State Space Model.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Explore In-Context Segmentation via Latent Diffusion Models.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model.
Proceedings of the International Conference on 3D Vision, 2025

2024
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Spatiotemporal Feature Enhancement Network for Blur Robust Underwater Object Detection.
IEEE Trans. Cogn. Dev. Syst., October, 2024

Automatically Discovering Novel Visual Categories With Adaptive Prototype Learning.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

RelationBooth: Towards Relation-Aware Customized Object Generation.
CoRR, 2024

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners.
CoRR, 2024

LLAVADI: What Matters For Multimodal Large Language Models Distillation.
CoRR, 2024

ViLLa: Video Reasoning Segmentation with Large Language Model.
CoRR, 2024

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model.
CoRR, 2024

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations.
CoRR, 2024

Training Class-Imbalanced Diffusion Model Via Overlap Optimization.
CoRR, 2024

Generalizable Entity Grounding via Assistance of Large Language Model.
CoRR, 2024

RAP-SAM: Towards Real-Time All-Purpose Segment Anything.
CoRR, 2024

SyncVIS: Synchronized Video Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Dual Associated Encoder for Face Restoration.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pyramid Diffusion for Fine 3D Large Scene Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniGS: Unified Representation for Image Generation and Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Incremental Few-Shot Object Detection with scale- and centerness-aware weight generation.
Comput. Vis. Image Underst., October, 2023

Open World Entity Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Fully Convolutional Networks for Panoptic Segmentation With Point-Based Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Effective Adapter for Face Recognition in the Wild.
CoRR, 2023

Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion.
CoRR, 2023

AIMS: All-Inclusive Multi-Level Segmentation.
CoRR, 2023

TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AIMS: All-Inclusive Multi-Level Segmentation for Anything.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

High Quality Entity Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
ICM-3D: Instantiated Category Modeling for 3D Instance Segmentation.
IEEE Robotics Autom. Lett., 2022

CANet: Co-attention network for RGB-D semantic segmentation.
Pattern Recognit., 2022

PointINS: Point-Based Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Fine-Grained Entity Segmentation.
CoRR, 2022

Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning.
CoRR, 2022

PalGAN: Image Colorization with Palette Generative Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

High Quality Segmentation for Ultra High-resolution Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MAT: Mask-Aware Transformer for Large Hole Image Inpainting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Best-Buddy GANs for Highly Detailed Image Super-resolution.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
CaSP: Class-agnostic Semi-Supervised Pretraining for Detection and Segmentation.
CoRR, 2021

Best-Buddy GANs for Highly Detailed Image Super-Resolution.
CoRR, 2021

Image Synthesis via Semantic Composition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Scale Aligned Distillation for Low-Resolution Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scale-Aware Automatic Augmentation for Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
PointINS: Point-based Instance Segmentation.
CoRR, 2020

LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

MuCAN: Multi-correspondence Aggregation Network for Video Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020, 2020

RGB-D Co-attention Network for Semantic Segmentation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Faster R-CNN for marine organisms detection and recognition using data augmentation.
Neurocomputing, 2019

Amodal Instance Segmentation With KINS Dataset.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A Novel Biologically Inspired Visual Cognition Model: Automatic Extraction of Semantics, Formation of Integrated Concepts, and Reselection Features for Ambiguity.
IEEE Trans. Cogn. Dev. Syst., 2018

Sequential Context Encoding for Duplicate Removal.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Single Shot Feature Aggregation Network for Underwater Object Detection.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Path Aggregation Network for Instance Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Faster R-CNN for Marine Organism Detection and Recognition Using Data Augmentation.
Proceedings of the International Conference on Video and Image Processing, 2017

2016
A Novel Biologically Mechanism-Based Visual Cognition Model-Automatic Extraction of Semantics, Formation of Integrated Concepts and Re-selection Features for Ambiguity.
CoRR, 2016

NFLB dropout: Improve generalization ability by dropping out the best -A biologically inspired adaptive dropout method for unsupervised learning.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016


  Loading...