Siteng Huang
Orcid: 0000-0002-9735-1186
According to our database1,
Siteng Huang
authored at least 41 papers
between 2019 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, September, 2025
CoRR, September, 2025
CoRR, September, 2025
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation.
CoRR, August, 2025
CoRR, August, 2025
CoRR, May, 2025
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.
CoRR, May, 2025
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration.
CoRR, January, 2025
Quart-Online: Latency-Free Multimodal Large Language Model for Quadruped Robot Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.
CoRR, 2024
Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation.
CoRR, 2024
CoRR, 2024
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration.
CoRR, 2024
CoRR, 2024
M<sup>2</sup>IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension.
CoRR, 2024
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2019
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019