Hsiang-Wei Huang

Orcid: 0009-0009-2474-8869

According to our database1, Hsiang-Wei Huang authored at least 42 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CaMo: Camera Motion Grounded Evaluation and Training for Vision-Language Models.
CoRR, May, 2026

Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization.
CoRR, February, 2026

Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System.
CoRR, January, 2026

Reasoning Matters for 3D Visual Grounding.
CoRR, January, 2026

SAMURAI: Motion-Aware Memory for Training-Free Visual Object Tracking With SAM 2.
IEEE Trans. Image Process., 2026

2025
Warehouse Spatial Question Answering with LLM Agent.
CoRR, July, 2025

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action.
CoRR, May, 2025

Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 2025.
CoRR, March, 2025

PackDiT: Joint Human Motion and Text Generation via Mutual Prompting.
CoRR, January, 2025

ToSA: Token Merging with Spatial Awareness.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Details Matter for Indoor Open-Vocabulary 3D Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Warehouse Spatial Question Answering with LLM Agent 1st Place Solution of the 9th AI City Challenge Track 3.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

MambaMOT: State-Space Model as Motion Predictor for Multi-Object Tracking.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Depth-Aware Robust Multi-Object Tracker for Crowded Scene by Re-Prioritizing Association Order.
Proceedings of the IEEE International Conference on Advanced Visual and Signal-Based Systems, 2025


2024
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory.
CoRR, 2024

VersaT2I: Improving Text-to-Image Models with Versatile Reward.
CoRR, 2024

Exploring Learning-based Motion Models in Multi-Object Tracking.
CoRR, 2024

A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video.
CoRR, 2024

Sea You Later: Metadata-Guided Long-Term Re-Identification for UAV-Based Multi-Object Tracking.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024


Iterative Scale-Up ExpansionIoU and Deep Features Association for Multi-Object Tracking in Sports.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024

Boosting Online 3D Multi-Object Tracking through Camera-Radar Cross Check.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-modal Tracking.
Proceedings of the Pattern Recognition. ICPR 2024 International Workshops and Challenges, 2024

A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Videos.
Proceedings of the IEEE International Conference on Acoustics, 2024

ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

RT-Pose: A 4D Radar Tensor-Based 3D Human Pose Estimation and Localization Benchmark.
Proceedings of the Computer Vision - ECCV 2024, 2024

An Online Approach and Evaluation Method for Tracking People Across Cameras in Extremely Long Video Sequence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


VideoBadminton: A Video Dataset for Badminton Action Recognition.
Proceedings of the IEEE International Conference on Big Data, 2024

GTA: Global Tracklet Association for Multi-object Tracking in Sports.
Proceedings of the Computer Vision - ACCV 2024 Workshops, 2024

2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024.
CoRR, 2023

Iterative Scale-Up ExpansionIoU and Deep Features Association for Multi-Object Tracking in Sports.
CoRR, 2023

Multi-target multi-camera vehicle tracking using transformer-based camera link model and spatial-temporal information.
CoRR, 2023


Observation Centric and Central Distance Recovery for Athlete Tracking.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2023

Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results.
CoRR, 2022

Ki-67 Index Measurement in Breast Cancer Using Digital Image Analysis.
CoRR, 2022

Observation Centric and Central Distance Recovery on Sports Player Tracking.
CoRR, 2022

2020
Mobile Social Service User Identification Framework Based on Action-Characteristic Data Retention.
IEEE Access, 2020


  Loading...