Yang Bai

Orcid: 0000-0003-1918-0462

Affiliations:
  • Chengdu University of Information Technology, School of Cyber Security, Chengdu, China


According to our database1, Yang Bai authored at least 76 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
SRBench: A Comprehensive Benchmark for Sequential Recommendation with Large Language Models.
CoRR, April, 2026

OODBench: Out-of-Distribution Benchmark for Large Vision-Language Models.
CoRR, February, 2026

Boosting adversarial transferability of vision-language pre-trained models via optimal transport.
Pattern Recognit., 2026

ConRF: Zero-shot stylization of 3D scenes with conditioned radiation fields.
Pattern Recognit., 2026

EH-Benchmark: Ophthalmic hallucination benchmark and agent-driven top-down traceable reasoning workflow.
Inf. Fusion, 2026

PLM: Point-Language Maps for Zero-shot Object Goal Navigation.
Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Uncertainty-Aware Medical Diagnostic Phrase Identification and Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

<i>IMUZero:</i> Zero-Shot Human Activity Recognition by Language-Based Cross Modality Fusion.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., December, 2025

Why does weak-OOD help? A Further Step Towards Understanding Jailbreaking VLMs.
CoRR, November, 2025

JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework.
CoRR, November, 2025

EVLF-FM: Explainable Vision Language Foundation Model for Medicine.
CoRR, September, 2025

Multimodal, Multi-Disease Medical Imaging Foundation Model (MerMED-FM).
CoRR, July, 2025

MOVE: Effective and Harmless Ownership Verification via Embedded External Features.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

An integrated language-vision foundation model for conversational diagnostics and triaging in primary eye care.
CoRR, May, 2025

Backdoor Attack and Defense on Deep Learning: A Survey.
IEEE Trans. Comput. Soc. Syst., February, 2025

Safety at Scale: A Comprehensive Survey of Large Model Safety.
CoRR, February, 2025

Video Compression Optimization and Rate Control for Cyberspace Application.
Int. J. Pattern Recognit. Artif. Intell., 2025

Using Homomorphic Proxy Re-Encryption to Enhance Security and Privacy of Federated Learning-Based Intelligent Connected Vehicles.
IET Inf. Secur., 2025

Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety.
Found. Trends Priv. Secur., 2025

Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

An Overview of Advanced Deep Graph Node Clustering.
IEEE Trans. Comput. Soc. Syst., February, 2024

Fast Propagation Is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks.
IEEE Trans. Inf. Forensics Secur., 2024

CTNeRF: Cross-time Transformer for dynamic neural radiance field from monocular video.
Pattern Recognit., 2024

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning.
CoRR, 2024

Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs.
CoRR, 2024

Adversarial Robustness for Visual Grounding of Multimodal Large Language Models.
CoRR, 2024

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models.
CoRR, 2024

Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples.
CoRR, 2024

MedRG: Medical Report Grounding with Multi-modal Large Language Model.
CoRR, 2024

FMM-Attack: A Flow-based Multi-modal Adversarial Attack on Video-based LLMs.
CoRR, 2024

Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors.
CoRR, 2024

ISPPFL: An incentive scheme based privacy-preserving federated learning for avatar in metaverse.
Comput. Networks, 2024

A Cross-Chain Mechanism for Agricultural Engineering Document Management Blockchain in the Context of Big Data.
Big Data Res., 2024

UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Enhancing Community Vision Screening: AI-Driven Retinal Photography for Early Disease Detection and Patient Trust.
Proceedings of the Ophthalmic Medical Image Analysis - 11th International Workshop, 2024

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sentence-level Prompts Benefit Composed Image Retrieval.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
BlockExplorer: Exploring Blockchain Big Data Via Parallel Processing.
IEEE Trans. Computers, August, 2023

TokenAware: Accurate and Efficient Bookkeeping Recognition for Token Smart Contracts.
ACM Trans. Softw. Eng. Methodol., January, 2023

Interpretable Graph Convolutional Network for Multi-View Semi-Supervised Learning.
IEEE Trans. Multim., 2023

Query efficient black-box adversarial attack on deep neural networks.
Pattern Recognit., 2023

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering.
CoRR, 2023

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization.
CoRR, 2023

Temporal Segment Transformer for Action Segmentation.
CoRR, 2023

BackdoorBox: A Python Toolbox for Backdoor Learning.
CoRR, 2023

Towards Few-shot Image Captioning with Cycle-based Compositional Semantic Enhancement Framework.
Proceedings of the International Joint Conference on Neural Networks, 2023

Backdoor Defense via Adaptively Splitting Poisoned Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
MOVE: Effective and Harmless Ownership Verification via Embedded External Features.
CoRR, 2022

Adaptive Frequency Learning in Two-branch Face Forgery Detection.
CoRR, 2022

Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Imitated Detectors: Stealing Knowledge of Black-box Object Detectors.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal.
Proceedings of the Computer Vision - ECCV 2022, 2022

Action Quality Assessment with Temporal Parsing Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
A Defense Framework for Privacy Risks in Remote Machine Learning Service.
Secur. Commun. Networks, 2021

Clustering Effect of (Linearized) Adversarial Robust Models.
CoRR, 2021

Clustering Effect of Adversarial Robust Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Discriminative Latent Semantic Graph for Video Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Towards Automated Fatigue Assessment using Wearable Sensing and Mixed-Effects Models.
Proceedings of the ISWC 2021: Proceedings of the 2021 ACM International Symposium on Wearable Computers, 2021

D2Defend: Dual-Domain based Defense against Adversarial Examples.
Proceedings of the International Joint Conference on Neural Networks, 2021

Improving Adversarial Robustness via Channel-wise Activation Suppressing.
Proceedings of the 9th International Conference on Learning Representations, 2021

GANMIA: GAN-based Black-box Membership Inference Attack.
Proceedings of the ICC 2021, 2021

2020
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fatigue assessment using ECG and actigraphy sensors.
Proceedings of the ISWC '20: 2020 ACM International Symposium on Wearable Computers, 2020

Self-Adaptive Feature Fool.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Query Efficiency of Black-Box Adversarial Attack.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Black-Box Attack on Neural Networks Based on Swarm Evolutionary Algorithm.
Proceedings of the Information Security and Privacy - 25th Australasian Conference, 2020

2019
Adversarial Defense Via Local Flatness Regularization.
CoRR, 2019

KnightKing: a fast distributed graph random walk engine.
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 2019

Hilbert-Based Generative Defense for Adversarial Examples.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Improved Forward-Backward Propagation to Generate Adversarial Examples.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019

2015
Test Generation for Embedded Executables via Concolic Execution in a Real Environment.
IEEE Trans. Reliab., 2015

2014
Conpy: Concolic Execution Engine for Python Applications.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2014


  Loading...