Hengxing Cai

Orcid: 0000-0001-9780-2330

According to our database1, Hengxing Cai authored at least 30 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Draft-Refine-Optimize: Self-Evolved Learning for Natural Language to MongoDB Query Generation.
CoRR, April, 2026

SpecXMaster Technical Report.
CoRR, March, 2026

AirNav: A Large-Scale Real-World UAV Vision-and-Language Navigation Dataset with Natural and Diverse Instructions.
CoRR, January, 2026

Interpretable Reward Model via Sparse Autoencoder.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Reasoning-Enhanced Large Language Models for Molecular Property Prediction.
CoRR, October, 2025

Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents.
CoRR, September, 2025

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs.
CoRR, August, 2025

SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation.
CoRR, August, 2025

Doc2SAR: A Synergistic Framework for High-Fidelity Extraction of Structure-Activity Relationships from Scientific Documents.
CoRR, June, 2025

MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval.
CoRR, June, 2025

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs.
CoRR, May, 2025

A Multi-Granularity Retrieval Framework for Visually-Rich Documents.
CoRR, May, 2025

Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Intelligent System for Automated Molecular Patent Infringement Assessment.
CoRR, 2024

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer.
CoRR, 2024

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.
CoRR, 2024

2023
IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning.
CoRR, 2022

VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


2021
NTIRE 2021 Challenge on Perceptual Image Quality Assessment.
CoRR, 2021

2020
Learning Traffic as Images for Incident Detection Using Convolutional Neural Networks.
IEEE Access, 2020

Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020


2018
Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

2017
Full-reference image quality assessment-based B-mode ultrasound image similarity measure.
CoRR, 2017


  Loading...