Kaixun Jiang

Orcid: 0000-0002-2878-0497

According to our database1, Kaixun Jiang authored at least 41 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
VLA-Hijack: A Transferable Patch Attack against Vision-Language-Action Models via Visual Proprioception Hijacking.
CoRR, May, 2026

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation.
CoRR, May, 2026

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models.
CoRR, May, 2026

Unified Multimodal Visual Tracking with Dual Mixture-of-Experts.
CoRR, May, 2026

AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation.
CoRR, March, 2026

GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning.
CoRR, January, 2026

Delving into the adversarial robustness of semantic segmentation with decision-based black-box attacks.
Neural Networks, 2026

Seeing Is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations.
CoRR, December, 2025

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement.
CoRR, December, 2025

Improving Multimodal Sentiment Analysis via Modality Optimization and Dynamic Primary Modality Selection.
CoRR, November, 2025

Dynamic Semantic-Aware Correlation Modeling for UAV Tracking.
CoRR, October, 2025

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops.
CoRR, June, 2025

Boosting Adversarial Transferability with Spatial Adversarial Alignment.
CoRR, January, 2025

VideoPure: Diffusion-Based Adversarial Purification for Video Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2025

Exploring the adversarial robustness of face forgery detection with decision-based black-box attacks.
Knowl. Based Syst., 2025

Improving adversarial transferability with Neighborhood Gradient Information.
Appl. Soft Comput., 2025

Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

General Compression Framework for Efficient Transformer Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Boosting the transferability of adversarial attacks with global momentum initialization.
Expert Syst. Appl., 2024

General Compression Framework for Efficient Transformer Object Tracking.
CoRR, 2024

Improving Adversarial Transferability with Neighbourhood Gradient Information.
CoRR, 2024

PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving.
CoRR, 2024

Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction.
CoRR, 2024

Delving into Decision-based Black-box Attacks on Semantic Segmentation.
CoRR, 2024

DeTrack: In-model Latent Denoising Learning for Visual Object Tracking.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Exploring Decision-based Black-box Attacks on Face Forgery Detection.
CoRR, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
CoRR, 2023

Model Robustness Meets Data Privacy: Adversarial Robustness Distillation without Original Data.
CoRR, 2023

Content-based Unrestricted Adversarial Attack.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Decision-based Sparse Attacks on Video Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization.
CoRR, 2022


  Loading...