Hao Cheng
Orcid: 0000-0002-3246-6636Affiliations:
- Hong Kong University of Science and Technology (Guangzhou), Humanoid Computing Laboratory, Guangzhou, China
- Xi'an Jiaotong University, China (2017 - 2020)
According to our database1,
Hao Cheng authored at least 49 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
CoRR, February, 2026
2025
RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025
Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.
CoRR, October, 2025
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition.
CoRR, October, 2025
VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.
CoRR, September, 2025
Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots.
CoRR, July, 2025
Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition.
CoRR, March, 2025
Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models.
CoRR, March, 2025
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention.
CoRR, March, 2025
HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots.
CoRR, March, 2025
LiPS: Large-Scale Humanoid Robot Reinforcement Learning with Parallel-Series Structures.
CoRR, March, 2025
Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak.
CoRR, January, 2025
VCIP 2025 Grand Challenge on Live Broadcasting Video Quality Assessment: Methods and Results.
Proceedings of the International Conference on Visual Communications and Image Processing, 2025
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025
VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025
TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination via Latent Truthful-Guided Pre-Intervention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End Vision Language Action Models.
CoRR, 2024
Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models.
CoRR, 2024
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning.
CoRR, 2024
Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models.
CoRR, 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model.
CoRR, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Energy-based Active Learning for Bringing Beam-induced Domain Gap for 3D Object Detection.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
ACT-Diffusion: Efficient Adversarial Consistency Training for One-Step Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Gaining the Sparse Rewards by Exploring Binary Lottery Tickets in Spiking Neural Network.
CoRR, 2023
Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models.
CoRR, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the 34th British Machine Vision Conference 2023, 2023
2022
Efficient Multi-Prize Lottery Tickets: Enhanced Accuracy, Training, and Inference Speed.
CoRR, 2022
More or Less (MoL): Defending against Multiple Perturbation Attacks on Deep Neural Networks through Model Ensemble and Compression.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022
2021
CoRR, 2021
2020
2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019