Hao Cheng

Orcid: 0000-0002-3246-6636

Affiliations:
  • Hong Kong University of Science and Technology (Guangzhou), Humanoid Computing Laboratory, Guangzhou, China
  • Xi'an Jiaotong University, China (2017 - 2020)


According to our database1, Hao Cheng authored at least 49 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction.
CoRR, February, 2026

2025
RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.
CoRR, October, 2025

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition.
CoRR, October, 2025

VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.
CoRR, September, 2025

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots.
CoRR, July, 2025

Occupancy World Model for Robots.
CoRR, May, 2025

Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition.
CoRR, March, 2025

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models.
CoRR, March, 2025

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention.
CoRR, March, 2025

HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots.
CoRR, March, 2025

LiPS: Large-Scale Humanoid Robot Reinforcement Learning with Parallel-Series Structures.
CoRR, March, 2025

Spiking Diffusion Models.
IEEE Trans. Artif. Intell., January, 2025

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak.
CoRR, January, 2025

VCIP 2025 Grand Challenge on Live Broadcasting Video Quality Assessment: Methods and Results.
Proceedings of the International Conference on Visual Communications and Image Processing, 2025

Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025


TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination via Latent Truthful-Guided Pre-Intervention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Event Masked Autoencoder: Point-wise Action Recognition with Event-Based Cameras.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Uncovering Vision Modality Threats in Image-to-Image Tasks.
CoRR, 2024

Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End Vision Language Action Models.
CoRR, 2024

Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.
CoRR, 2024

Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning.
CoRR, 2024

Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models.
CoRR, 2024

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model.
CoRR, 2024

Spiking Denoising Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Spiking Neural Network as Adaptive Event Stream Slicer.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Energy-based Active Learning for Bringing Beam-induced Domain Gap for 3D Object Detection.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Gaining the Sparse Rewards by Exploring Lottery Tickets in Spiking Neural Networks.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

DONE: Dynamic Neural Representation Via Hyperplane Neural ODE.
Proceedings of the IEEE International Conference on Acoustics, 2024

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

ACT-Diffusion: Efficient Adversarial Consistency Training for One-Step Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
ACT: Adversarial Consistency Models.
CoRR, 2023

Pursing the Sparse Limitation of Spiking Deep Learning Structures.
CoRR, 2023

Gaining the Sparse Rewards by Exploring Binary Lottery Tickets in Spiking Neural Network.
CoRR, 2023

RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias.
CoRR, 2023

Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models.
CoRR, 2023

Improve Video Representation with Temporal Adversarial Augmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

RBFormer: Improve Adversarial Robustness of Transformers by Robust Bias.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Efficient Multi-Prize Lottery Tickets: Enhanced Accuracy, Training, and Inference Speed.
CoRR, 2022

More or Less (MoL): Defending against Multiple Perturbation Attacks on Deep Neural Networks through Model Ensemble and Compression.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

2021
Mixture of Robust Experts (MoRE): A Flexible Defense Against Multiple Perturbations.
CoRR, 2021

2020
Defending against Backdoor Attack on Deep Neural Networks.
CoRR, 2020

2019
Second Rethinking of Network Pruning in the Adversarial Setting.
CoRR, 2019

Adversarial Robustness vs. Model Compression, or Both?
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...