Hengxing Cai
Orcid: 0000-0001-9780-2330
According to our database1,
Hengxing Cai authored at least 30 papers
between 2017 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Draft-Refine-Optimize: Self-Evolved Learning for Natural Language to MongoDB Query Generation.
CoRR, April, 2026
AirNav: A Large-Scale Real-World UAV Vision-and-Language Navigation Dataset with Natural and Diverse Instructions.
CoRR, January, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, October, 2025
CoRR, September, 2025
CoRR, August, 2025
SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation.
CoRR, August, 2025
Doc2SAR: A Synergistic Framework for High-Fidelity Extraction of Structure-Activity Relationships from Scientific Documents.
CoRR, June, 2025
MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval.
CoRR, June, 2025
CoRR, May, 2025
Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
2023
IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
2022
CoRR, 2022
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022
Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
2021
2020
Learning Traffic as Images for Incident Detection Using Convolutional Neural Networks.
IEEE Access, 2020
Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
2018
Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
2017
Full-reference image quality assessment-based B-mode ultrasound image similarity measure.
CoRR, 2017