Jinpeng Wang

Orcid: 0000-0002-4352-4897

Affiliations:
  • Tsinghua Shenzhen, International Graduate School, Shenzhen, China


According to our database1, Jinpeng Wang authored at least 50 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Love Me, Love My Label: Rethinking the Role of Labels in Prompt Retrieval for Visual In-Context Learning.
CoRR, April, 2026

Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval.
CoRR, April, 2026

PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment.
CoRR, March, 2026

Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck.
CoRR, March, 2026

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents.
CoRR, March, 2026

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL.
CoRR, February, 2026

HALoRA: Low-Rank Adaptation with Hierarchical Budget Allocation for Efficient Vision-Language Alignment.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Imagine with Layout and Sketch: Enhancing Vision-Language Retrieval with Dual-Stream Multi-Modal Query Refinement.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
CoPRS: Learning Positional Prior from Chain-of-Thought for Reasoning Segmentation.
CoRR, October, 2025

Large Foundation Model for Ads Recommendation.
CoRR, August, 2025

HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.
CoRR, July, 2025

InstructEngine: Instruction-driven Text-to-Image Alignment.
CoRR, April, 2025

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

ROMA: Recommendation-Oriented Language Model Adaptation Using Multi-Modal Multi-Domain Item Sequences.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Efficient Self-Supervised Video Hashing with Selective State Spaces.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers.
Int. J. Comput. Vis., August, 2024

Pyramid hybrid pooling quantization for efficient fine-grained image retrieval.
Pattern Recognit. Lett., 2024

Towards Scalable Semantic Representation for Recommendation.
CoRR, 2024

MambaVC: Learned Visual Compression with Selective State Spaces.
CoRR, 2024

GMMFormer v2: An Uncertainty-aware Framework for Partially Relevant Video Retrieval.
CoRR, 2024

RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

ReFer: Retrieval-Enhanced Vertical Federated Recommendation for Full Set User Benefit.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression.
CoRR, 2023

Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Contrastive Masked Autoencoders for Self-Supervised Video Hashing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Multi-task Ranking with User Behaviors for Text-video Search.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Motion-Aware Graph Reasoning Hashing for Self-supervised Video Retrieval.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Hugs Are Better Than Handshakes: Unsupervised Cross-Modal Transformer Hashing with Multi-granularity Alignment.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Contrastive Quantization with Code Memory for Unsupervised Image Retrieval.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Pyramid Hybrid Pooling Quantization for Efficient Fine-Grained Image Retrieval.
CoRR, 2021

Webly Supervised Deep Attentive Quantization.
Proceedings of the IEEE International Conference on Acoustics, 2021

SwinFGHash: Fine-grained Image Retrieval via Transformer-based Hashing Network.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021


  Loading...