Junxiao Shen

CoRR, January, 2026

Structural variation drives enhancer hijacking via 3D genome disruption in ccRCC.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2026

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding.

[BibT_eX]

[DOI]

CoRR, October, 2025

Prompt-Driven Agentic Video Editing System: Autonomous Comprehension of Long-Form, Story-Driven Media.

[BibT_eX]

[DOI]

CoRR, September, 2025

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.

[BibT_eX]

[DOI]

CoRR, March, 2025

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining.

[BibT_eX]

[DOI]

CoRR, March, 2025

AutoMR: A Universal Time Series Motion Recognition Pipeline.

[BibT_eX]

[DOI]

CoRR, February, 2025

Duo Streamers: A Streaming Gesture Recognition Framework.

[BibT_eX]

[DOI]

CoRR, February, 2025

CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications.

[BibT_eX]

[DOI]

CoRR, January, 2025

X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding.

[BibT_eX]

[DOI]

Fan Zhang

Weizhe Lin

CoRR, January, 2025

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering.

[BibT_eX]

[DOI]

Yunze Liu

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding.

[BibT_eX]

[DOI]

Fan Zhang

Weizhe Lin

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024

Gesture2Text: A Generalizable Decoder for Word-Gesture Keyboards in XR Through Trajectory Coarse Discretization and Pre-Training.

[BibT_eX]

[DOI]

Khadija Khaldi

Enmin Zhou

Hemant Bhaskar Surale

Amy Karlson

IEEE Trans. Vis. Comput. Graph., November, 2024

RingGesture: A Ring-Based Mid-Air Gesture Typing System Powered by a Deep-Learning Word Prediction Framework.

[BibT_eX]

[DOI]

Hemant Bhaskar Surale

Amy Karlson

IEEE Trans. Vis. Comput. Graph., November, 2024

Design and Evaluation of Controller-Based Raycasting Methods for Efficient Alphanumeric and Special Character Entry in Virtual Reality.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., September, 2024

Lucia: A Temporal Computing Platform for Contextual Intelligence.

[BibT_eX]

[DOI]

Weizhe Lin

CoRR, 2024

Human-inspired Perspectives: A Survey on AI Long-term Memory.

[BibT_eX]

[DOI]

CoRR, 2024

RingGesture: A Ring-Based Mid-Air Gesture Typing System Powered by a Deep-Learning Word Prediction Framework.

[BibT_eX]

[DOI]

Hemant Bhaskar Surale

CoRR, 2024

Simultaneous Gesture Classification and Localization with an Automatic Gesture Annotation Model.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Open-World Gesture Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2024

Encode-Store-Retrieve: Augmenting Human Memory through Language-Encoded Egocentric Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2024

Boosting Gesture Recognition with an Automatic Gesture Annotation Framework.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

2023

Fast and Robust Mid-Air Gesture Typing for AR Headsets using 3D Trajectory Decoding.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., November, 2023

Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques.

[BibT_eX]

[DOI]

CoRR, 2023

Encode-Store-Retrieve: Enhancing Memory Augmentation through Language-Encoded Egocentric Perception.

[BibT_eX]

[DOI]

João Marcelo Evangelista Belo

CoRR, 2023

XAIR: A Framework of Explainable AI in Augmented Reality.

[BibT_eX]

[DOI]

CoRR, 2023

XAIR: A Framework of Explainable AI in Augmented Reality.

[BibT_eX]

[DOI]

João Marcelo Evangelista Belo

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

Gesture Spotter: A Rapid Prototyping Tool for Key Gesture Spotting in Virtual and Augmented Reality Applications.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2022

KWickChat: A Multi-Turn Dialogue System for AAC Using Context-Aware Sentence Generation by Bag-of-Keywords.

[BibT_eX]

[DOI]

Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22, 2022

Personalization of a Mid-Air Gesture Keyboard using Multi-Objective Bayesian Optimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2022

Reinforcement Learning in Presence of Discrete Markovian Context Evolution.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Simulating Realistic Human Motion Trajectories of Mid-Air Gesture Typing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2021

The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition.

[BibT_eX]

[DOI]