Zhanhui Kang

Orcid: 0009-0006-5151-4222

According to our database1, Zhanhui Kang authored at least 59 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ID-centric Pre-training for Recommendation.
ACM Trans. Inf. Syst., September, 2025

Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework.
CoRR, July, 2025

Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs.
CoRR, June, 2025

The Security Threat of Compressed Projectors in Large Vision-Language Models.
CoRR, June, 2025

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason.
CoRR, May, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought.
CoRR, May, 2025

Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training.
CoRR, April, 2025

Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training.
CoRR, April, 2025

Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language.
ACM Trans. Inf. Syst., March, 2025

TransMamba: Flexibly Switching between Transformer and Mamba.
CoRR, March, 2025

PatchRec: Multi-Grained Patching for Efficient LLM-based Sequential Recommendation.
CoRR, January, 2025

Autonomy-of-Experts Models.
CoRR, January, 2025

Scaling Laws for Floating Point Quantization Training.
CoRR, January, 2025

Frequency-Augmented Mixture-of-Heterogeneous-Experts Framework for Sequential Recommendation.
Proceedings of the ACM on Web Conference 2025, 2025

Multi-Grained Patch Training for Efficient LLM-based Recommendation.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Language Models "Grok" to Copy.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Continuous Speech Tokenizer in Text To Speech.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

A Pre-trained Plug-in Mixture-of-LoRAs Model for Transferable Sequential Recommendation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring Forgetting in Large Language Model Pre-Training.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant".
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models.
CoRR, 2024

More Expressive Attention with Negative Weights.
CoRR, 2024

Lossless KV Cache Compression to 2%.
CoRR, 2024

RosePO: Aligning LLM-based Recommenders with Human Values.
CoRR, 2024

Negative Sampling in Recommendation: A Survey and Future Directions.
CoRR, 2024

HMoE: Heterogeneous Mixture of Experts for Language Modeling.
CoRR, 2024

Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data.
CoRR, 2024

ID-centric Pre-training for Recommendation.
CoRR, 2024

PhD: A Prompted Visual Hallucination Evaluation Dataset.
CoRR, 2024

Towards Empathetic Conversational Recommender Systems.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024

The Elephant in the Room: Rethinking the Usage of Pre-trained Language Model in Sequential Recommendation.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024

Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

DFGNN: Dual-frequency Graph Neural Network for Sign-aware Feedback.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

SeeDRec: Sememe-based Diffusion for Sequential Recommendation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Style Controlling in Recommendation.
Proceedings of the Database Systems for Advanced Applications, 2024

LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Plug-In Diffusion Model for Sequential Recommendation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N: M Sparsity.
CoRR, 2023

TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval.
CoRR, 2023

Multi-Feature Integration for Perception-Dependent Examination-Bias Estimation.
CoRR, 2023

Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking.
CoRR, 2023

ChinaOpen: A Dataset for Open-world Multimodal Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

GradSalMix: Gradient Saliency-Based Mix for Image Data Augmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

2022
BagFormer: Better Cross-Modal Retrieval via bag-wise interaction.
CoRR, 2022

MKQ-BERT: Quantized BERT with 4-bits Weights and Activations.
CoRR, 2022

An Anchor-based Relative Position Embedding Method for Cross-Modal Tasks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Multimodal Product Identification: Submission to Watch and Buy 2021 Challenge.
Proceedings of the WAB'21: Proceedings of the 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge, 2021

TexSmart: A System for Enhanced Natural Language Understanding.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis.
CoRR, 2020

STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition.
CoRR, 2020

2015
Semantic Matching in APP Search.
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 2015


  Loading...