We stand with Ukraine

We stand with Ukraine

Hao Cheng

Orcid: 0000-0002-3246-6636

Affiliations:

Hong Kong University of Science and Technology (Guangzhou), Humanoid Computing Laboratory, Guangzhou, China
Xi'an Jiaotong University, China (2017 - 2020)

According to our database¹, Hao Cheng authored at least 49 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

On csauthors.net:

Bibliography

2026

MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

2025

RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Shanghang Zhang

,

,

,

CoRR, November, 2025

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

Occupancy World Model for Robots.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition.

[DOI]

,

,

,

,

,

CoRR, March, 2025

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models.

[DOI]

,

,

,

,

,

,

CoRR, March, 2025

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention.

[DOI]

,

,

,

James Diffenderfer

,

Bhavya Kailkhura

,

,

,

,

CoRR, March, 2025

HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

LiPS: Large-Scale Humanoid Robot Reinforcement Learning with Parallel-Series Structures.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

Spiking Diffusion Models.

[DOI]

,

,

,

,

,

,

IEEE Trans. Artif. Intell., January, 2025

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak.

[DOI]

,

,

,

,

,

,

,

CoRR, January, 2025

VCIP 2025 Grand Challenge on Live Broadcasting Video Quality Assessment: Methods and Results.

[DOI]

,

,

,

,

MohammadAli Hamidi

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the International Conference on Visual Communications and Image Processing, 2025

Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination via Latent Truthful-Guided Pre-Intervention.

[DOI]

,

,

,

James Diffenderfer

,

Bhavya Kailkhura

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Event Masked Autoencoder: Point-wise Action Recognition with Event-Based Cameras.

[DOI]

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Uncovering Vision Modality Threats in Image-to-Image Tasks.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End Vision Language Action Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

CoRR, 2024

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Spiking Denoising Diffusion Probabilistic Models.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Spiking Neural Network as Adaptive Event Stream Slicer.

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Energy-based Active Learning for Bringing Beam-induced Domain Gap for 3D Object Detection.

[DOI]

,

,

,

,

Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Gaining the Sparse Rewards by Exploring Lottery Tickets in Spiking Neural Networks.

[DOI]

,

,

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

DONE: Dynamic Neural Representation Via Hyperplane Neural ODE.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection.

[DOI]

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

ACT-Diffusion: Efficient Adversarial Consistency Training for One-Step Diffusion Models.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models.

[DOI]

,

,

,

,

,

,

Bhavya Kailkhura

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

ACT: Adversarial Consistency Models.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Pursing the Sparse Limitation of Spiking Deep Learning Structures.

[DOI]

,

,

,

,

,

,

,

Bhavya Kailkhura

,

,

CoRR, 2023

Gaining the Sparse Rewards by Exploring Binary Lottery Tickets in Spiking Neural Network.

[DOI]

,

,

,

,

,

,

,

,

Bhavya Kailkhura

,

,

CoRR, 2023

RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias.

[DOI]

,

,

,

Lyutianyang Zhang

,

,

,

,

,

CoRR, 2023

Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models.

[DOI]

,

,

,

,

,

,

Bhavya Kailkhura

,

CoRR, 2023

Improve Video Representation with Temporal Adversarial Augmentation.

[DOI]

,

,

,

,

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

RBFormer: Improve Adversarial Robustness of Transformers by Robust Bias.

[DOI]

,

,

,

,

,

Lyutianyang Zhang

,

,

,

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Efficient Multi-Prize Lottery Tickets: Enhanced Accuracy, Training, and Inference Speed.

[DOI]

,

,

,

,

James Diffenderfer

,

Ryan A. Goldhahn

,

Bhavya Kailkhura

CoRR, 2022

More or Less (MoL): Defending against Multiple Perturbation Attacks on Deep Neural Networks through Model Ensemble and Compression.

[DOI]

,

,

,

,

,

,

Bhavya Kailkhura

,

Ryan A. Goldhahn

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

2021

Mixture of Robust Experts (MoRE): A Flexible Defense Against Multiple Perturbations.

[DOI]

,

,

,

,

Bhavya Kailkhura

,

Ryan A. Goldhahn

CoRR, 2021

2020

Defending against Backdoor Attack on Deep Neural Networks.

[DOI]

,

,

,

,

,

CoRR, 2020

2019

Second Rethinking of Network Pruning in the Adversarial Setting.

[DOI]

,

,

,

,

Jan-Henrik Lambrechts

,

,

,

,

,

CoRR, 2019

Adversarial Robustness vs. Model Compression, or Both?

[DOI]

,

,

,

,

,

Jan-Henrik Lambrechts

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Loading...