Pandeng Li

Orcid: 0000-0002-0717-8659

According to our database, Pandeng Li authored at least 26 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Attention-driven frequency-based Zero-Shot Learning with phase augmentation.
Int. J. Mach. Learn. Cybern., August, 2025

UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing.
CoRR, July, 2025

Dual variational network for unsupervised cross-modal hashing.
Int. J. Mach. Learn. Cybern., June, 2025

Wan: Open and Advanced Large-Scale Video Generative Models.
CoRR, March, 2025

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach.
CoRR, March, 2025

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface.
CoRR, March, 2025

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs.
CoRR, February, 2025

Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning.
IEEE Trans. Image Process., 2025

Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image Retrieval.
IEEE Trans. Multim., 2024

Balanced Classification: A Unified Framework for Long-Tailed Object Detection.
IEEE Trans. Multim., 2024

Towards Discriminative Feature Generation for Generalized Zero-Shot Learning.
IEEE Trans. Multim., 2024

FuseTeacher: Modality-Fused Encoders are Strong Vision Supervisors.
Proceedings of the Computer Vision - ECCV 2024, 2024

AlignZeg: Mitigating Objective Misalignment for Zero-Shot Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Neighborhood-Adaptive Multi-Cluster Ranking for Deep Metric Learning.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

MomentDiff: Generative Video Moment Retrieval from Random to Real.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Frequency-based Zero-Shot Learning with Phase Augmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Dual Dynamic Proxy Hashing Network for Long-tailed Image Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-supervised Weighted Hashing with Topology Mining.
Proceedings of the 2023 7th International Conference on Control Engineering and Artificial Intelligence, 2023

2022
Online Residual Quantization Via Streaming Data Correlation Preserving.
IEEE Trans. Multim., 2022

Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval.
IEEE Trans. Image Process., 2022

Dual Part Discovery Network for Zero-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022

Neighborhood-Adaptive Structure Augmented Metric Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

