Weitai Kang

Orcid: 0009-0007-6484-0665

According to our database1, Weitai Kang authored at least 13 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ExpVG: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model.
CoRR, August, 2025

GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning.
CoRR, August, 2025

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction.
CoRR, May, 2025

3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation.
CoRR, April, 2025

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage.
CoRR, 2024

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning.
CoRR, 2024

Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner.
CoRR, 2024

ACTRESS: Active Retraining for Semi-supervised Visual Grounding.
CoRR, 2024

Visual Grounding with Attention-Driven Constraint Balancing.
CoRR, 2024

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding.
Proceedings of the Computer Vision - ECCV 2024, 2024

On the Faithfulness of Vision Transformer Explanations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Token Transformation Matters: Towards Faithful Post-Hoc Explanation for Vision Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


  Loading...