Jinpeng Wang

Orcid: 0000-0002-4352-4897

Affiliations:

Tsinghua Shenzhen, International Graduate School, Shenzhen, China

According to our database¹, Jinpeng Wang authored at least 56 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning.

[BibT_eX]

[DOI]

CoRR, May, 2026

CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception.

[BibT_eX]

[DOI]

CoRR, May, 2026

SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation.

[BibT_eX]

[DOI]

CoRR, May, 2026

Tailoring Teaching to Aptitude: Direction-Adaptive Self-Distillation for LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2026

Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval.

[BibT_eX]

[DOI]

CoRR, May, 2026

FEDIN: Frequency-Enhanced Deep Interest Network for Click-Through Rate Prediction.

[BibT_eX]

[DOI]

CoRR, May, 2026

Love Me, Love My Label: Rethinking the Role of Labels in Prompt Retrieval for Visual In-Context Learning.

[BibT_eX]

[DOI]

CoRR, April, 2026

Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval.

[BibT_eX]

[DOI]

CoRR, April, 2026

PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment.

[BibT_eX]

[DOI]

CoRR, March, 2026

Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck.

[BibT_eX]

[DOI]

CoRR, March, 2026

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL.

[BibT_eX]

[DOI]

CoRR, February, 2026

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

HALoRA: Low-Rank Adaptation with Hierarchical Budget Allocation for Efficient Vision-Language Alignment.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Imagine with Layout and Sketch: Enhancing Vision-Language Retrieval with Dual-Stream Multi-Modal Query Refinement.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

CoPRS: Learning Positional Prior from Chain-of-Thought for Reasoning Segmentation.

[BibT_eX]

[DOI]

CoRR, October, 2025

Large Foundation Model for Ads Recommendation.

[BibT_eX]

[DOI]

CoRR, August, 2025

HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

[BibT_eX]

[DOI]

CoRR, July, 2025

InstructEngine: Instruction-driven Text-to-Image Alignment.

[BibT_eX]

[DOI]

CoRR, April, 2025

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

ROMA: Recommendation-Oriented Language Model Adaptation Using Multi-Modal Multi-Domain Item Sequences.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Efficient Self-Supervised Video Hashing with Selective State Spaces.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., August, 2024

Pyramid hybrid pooling quantization for efficient fine-grained image retrieval.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2024

Towards Scalable Semantic Representation for Recommendation.

[BibT_eX]

[DOI]

CoRR, 2024

MambaVC: Learned Visual Compression with Selective State Spaces.

[BibT_eX]

[DOI]

CoRR, 2024

GMMFormer v2: An Uncertainty-aware Framework for Partially Relevant Video Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

ReFer: Retrieval-Enhanced Vertical Federated Recommendation for Full Set User Benefit.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression.

[BibT_eX]

[DOI]

CoRR, 2023

Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Contrastive Masked Autoencoders for Self-Supervised Video Hashing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Multi-task Ranking with User Behaviors for Text-video Search.

[BibT_eX]

[DOI]

Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Motion-Aware Graph Reasoning Hashing for Self-supervised Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Hugs Are Better Than Handshakes: Unsupervised Cross-Modal Transformer Hashing with Multi-granularity Alignment.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Contrastive Quantization with Code Memory for Unsupervised Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Pyramid Hybrid Pooling Quantization for Efficient Fine-Grained Image Retrieval.

[BibT_eX]

[DOI]

CoRR, 2021

Webly Supervised Deep Attentive Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

SwinFGHash: Fine-grained Image Retrieval via Transformer-based Hashing Network.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Jinpeng Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...