Siteng Huang
Orcid: 0000-0002-9735-1186
According to our database1,
Siteng Huang authored at least 50 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning With Gaussian Splatting.
IEEE Robotics Autom. Lett., May, 2026
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation.
CoRR, March, 2026
Articulat3D: Reconstructing Articulated Digital Twins From Monocular Videos with Geometric and Motion Constraints.
CoRR, March, 2026
M2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension.
IEEE Trans. Circuits Syst. Video Technol., February, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models.
CoRR, December, 2025
CoRR, September, 2025
CoRR, September, 2025
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation.
CoRR, August, 2025
CoRR, May, 2025
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration.
CoRR, January, 2025
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Quart-Online: Latency-Free Multimodal Large Language Model for Quadruped Robot Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.
CoRR, 2024
Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation.
CoRR, 2024
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration.
CoRR, 2024
CoRR, 2024
M<sup>2</sup>IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension.
CoRR, 2024
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2019
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019