Yujun Cai
ORCID: 0000-0002-0993-4024
According to our database, Yujun Cai authored at least 85 papers between 2018 and 2025.
Bibliography
2025
MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs.
CoRR, August, 2025
Cure or Poison? Embedding Instructions Visually Alters Hallucination in Vision-Language Models.
CoRR, August, 2025
A²R²: Advancing Img2LaTeX Conversion via Visual Reasoning with Attention-Guided Refinement.
CoRR, July, 2025
LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation.
CoRR, July, 2025
CoRR, July, 2025
Unveiling the Potential of Diffusion Large Language Model in Controllable Generation.
CoRR, July, 2025
CoRR, July, 2025
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning.
CoRR, July, 2025
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite.
CoRR, June, 2025
SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global Thinking.
CoRR, June, 2025
CoRR, May, 2025
Unveiling Impact of Frequency Components on Membership Inference Attacks for Diffusion Models.
CoRR, May, 2025
Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression.
CoRR, April, 2025
Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs.
CoRR, April, 2025
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models.
CoRR, April, 2025
CoRR, April, 2025
CoRR, March, 2025
Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps.
CoRR, March, 2025
MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks.
CoRR, March, 2025
SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint.
CoRR, March, 2025
Making Every Step Effective: Jailbreaking Large Vision-Language Models Through Hierarchical KV Equalization.
CoRR, March, 2025
Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense.
CoRR, March, 2025
CoRR, March, 2025
CoRR, March, 2025
Fact or Guesswork? Evaluating Large Language Model's Medical Knowledge with Structured One-Hop Judgment.
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
CoRR, 2024
CoRR, 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization.
CoRR, 2024
RIS-Assisted Federated Learning Algorithm Based on Device Selection and Weighted Averaging.
Proceedings of the 99th IEEE Vehicular Technology Conference, 2024
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 24th IEEE International Conference on Communication Technology, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
STMG: A Machine Learning Microgesture Recognition System for Supporting Thumb-Based VR/AR Input.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024
2023
Deep learning-based channel estimation using Gaussian mixture distribution and expectation maximum algorithm.
Phys. Commun., June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
Proceedings of the International Conference on Wireless Communications and Signal Processing, 2023
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023
2022
BMC Bioinform., 2022
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering.
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Proceedings of the WWW '21: The Web Conference 2021, 2021
Proceedings of the WWW '21: The Web Conference 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
2020
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018