Xingwu Sun

Orcid: 0009-0008-3222-0901

According to our database1, Xingwu Sun authored at least 47 papers between 2018 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs.
CoRR, June, 2025

The Security Threat of Compressed Projectors in Large Vision-Language Models.
CoRR, June, 2025

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason.
CoRR, May, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought.
CoRR, May, 2025

Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training.
CoRR, April, 2025

TransMamba: Flexibly Switching between Transformer and Mamba.
CoRR, March, 2025

PatchRec: Multi-Grained Patching for Efficient LLM-based Sequential Recommendation.
CoRR, January, 2025

Autonomy-of-Experts Models.
CoRR, January, 2025

Scaling Laws for Floating Point Quantization Training.
CoRR, January, 2025

Multi-Grained Patch Training for Efficient LLM-based Recommendation.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Language Models "Grok" to Copy.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Continuous Speech Tokenizer in Text To Speech.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring Forgetting in Large Language Model Pre-Training.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant".
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models.
CoRR, 2024

More Expressive Attention with Negative Weights.
CoRR, 2024

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent.
CoRR, 2024

Lossless KV Cache Compression to 2%.
CoRR, 2024

RosePO: Aligning LLM-based Recommenders with Human Values.
CoRR, 2024

Negative Sampling in Recommendation: A Survey and Future Directions.
CoRR, 2024

HMoE: Heterogeneous Mixture of Experts for Language Modeling.
CoRR, 2024

Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data.
CoRR, 2024

PhD: A Prompted Visual Hallucination Evaluation Dataset.
CoRR, 2024

The Elephant in the Room: Rethinking the Usage of Pre-trained Language Model in Sequential Recommendation.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PIP: Detecting Adversarial Examples in Large Vision-Language Models via Attention Patterns of Irrelevant Probe Questions.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SeeDRec: Sememe-based Diffusion for Sequential Recommendation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Style Controlling in Recommendation.
Proceedings of the Database Systems for Advanced Applications, 2024

LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Inflected Forms Are Redundant in Question Generation Models.
CoRR, 2023

CMMix: Cross-Modal Mix Augmentation Between Images and Texts for Visual Grounding.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

GradSalMix: Gradient Saliency-Based Mix for Image Data Augmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

2022
An Anchor-based Relative Position Embedding Method for Cross-Modal Tasks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
TITA: A Two-stage Interaction and Topic-Aware Text Matching Model.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Enhancing Document Ranking with Task-adaptive Training and Segmented Token Recovery Mechanism.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

A Bidirectional Multi-paragraph Reading Model for Zero-shot Entity Linking.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
TABLE: A Task-Adaptive BERT-based ListwisE Ranking Model for Document Retrieval.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
A deep spatio-temporal attention-based neural network for passenger flow prediction.
Proceedings of the MobiQuitous 2019, 2019

Answer-Focused and Position-Aware Neural Network for Transfer Learning in Question Generation.
Proceedings of the Knowledge Science, Engineering and Management, 2019

2018
Answer-focused and Position-aware Neural Question Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018


  Loading...