Junxiao Shen

Orcid: 0000-0002-1552-4689

According to our database1, Junxiao Shen authored at least 38 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
OmniRetriever: Any-to-Any Audio-Video-Text Retrieval via Fusion-as-Teacher Distillation.
CoRR, May, 2026

O-MARC: Omni Memory-Augmented Compression Distillation for Efficient Video Understanding.
CoRR, May, 2026

FGSVQA: Frequency-Guided Short-form Video Quality Assessment.
CoRR, May, 2026

SpatialMem: Unified 3D Memory with Metric Anchoring and Fast Retrieval.
CoRR, January, 2026

Structural variation drives enhancer hijacking via 3D genome disruption in ccRCC.
npj Digit. Medicine, 2026

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding.
CoRR, October, 2025

Prompt-Driven Agentic Video Editing System: Autonomous Comprehension of Long-Form, Story-Driven Media.
CoRR, September, 2025

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.
CoRR, March, 2025

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining.
CoRR, March, 2025

AutoMR: A Universal Time Series Motion Recognition Pipeline.
CoRR, February, 2025

Duo Streamers: A Streaming Gesture Recognition Framework.
CoRR, February, 2025

CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications.
CoRR, January, 2025

X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding.
CoRR, January, 2025

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Gesture2Text: A Generalizable Decoder for Word-Gesture Keyboards in XR Through Trajectory Coarse Discretization and Pre-Training.
IEEE Trans. Vis. Comput. Graph., November, 2024

RingGesture: A Ring-Based Mid-Air Gesture Typing System Powered by a Deep-Learning Word Prediction Framework.
IEEE Trans. Vis. Comput. Graph., November, 2024

Design and Evaluation of Controller-Based Raycasting Methods for Efficient Alphanumeric and Special Character Entry in Virtual Reality.
IEEE Trans. Vis. Comput. Graph., September, 2024

Lucia: A Temporal Computing Platform for Contextual Intelligence.
CoRR, 2024

Human-inspired Perspectives: A Survey on AI Long-term Memory.
CoRR, 2024

RingGesture: A Ring-Based Mid-Air Gesture Typing System Powered by a Deep-Learning Word Prediction Framework.
CoRR, 2024

Simultaneous Gesture Classification and Localization with an Automatic Gesture Annotation Model.
CoRR, 2024

Towards Open-World Gesture Recognition.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2024

Encode-Store-Retrieve: Augmenting Human Memory through Language-Encoded Egocentric Perception.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2024

Boosting Gesture Recognition with an Automatic Gesture Annotation Framework.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

2023
Fast and Robust Mid-Air Gesture Typing for AR Headsets using 3D Trajectory Decoding.
IEEE Trans. Vis. Comput. Graph., November, 2023

Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques.
CoRR, 2023

Encode-Store-Retrieve: Enhancing Memory Augmentation through Language-Encoded Egocentric Perception.
CoRR, 2023

XAIR: A Framework of Explainable AI in Augmented Reality.
CoRR, 2023

XAIR: A Framework of Explainable AI in Augmented Reality.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Gesture Spotter: A Rapid Prototyping Tool for Key Gesture Spotting in Virtual and Augmented Reality Applications.
IEEE Trans. Vis. Comput. Graph., 2022

KWickChat: A Multi-Turn Dialogue System for AAC Using Context-Aware Sentence Generation by Bag-of-Keywords.
Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22, 2022

Personalization of a Mid-Air Gesture Keyboard using Multi-Objective Bayesian Optimization.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2022

Reinforcement Learning in Presence of Discrete Markovian Context Evolution.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Simulating Realistic Human Motion Trajectories of Mid-Air Gesture Typing.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2021

The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021


  Loading...