Hengxing Cai

Orcid: 0000-0001-9780-2330

According to our database¹, Hengxing Cai authored at least 30 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Draft-Refine-Optimize: Self-Evolved Learning for Natural Language to MongoDB Query Generation.

[BibT_eX]

[DOI]

CoRR, April, 2026

SpecXMaster Technical Report.

[BibT_eX]

[DOI]

CoRR, March, 2026

AirNav: A Large-Scale Real-World UAV Vision-and-Language Navigation Dataset with Natural and Diverse Instructions.

[BibT_eX]

[DOI]

CoRR, January, 2026

Interpretable Reward Model via Sparse Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Reasoning-Enhanced Large Language Models for Molecular Property Prediction.

[BibT_eX]

[DOI]

CoRR, October, 2025

Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents.

[BibT_eX]

[DOI]

CoRR, September, 2025

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs.

[BibT_eX]

[DOI]

CoRR, August, 2025

SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation.

[BibT_eX]

[DOI]

CoRR, August, 2025

Doc2SAR: A Synergistic Framework for High-Fidelity Extraction of Structure-Activity Relationships from Scientific Documents.

[BibT_eX]

[DOI]

CoRR, June, 2025

MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval.

[BibT_eX]

[DOI]

CoRR, June, 2025

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs.

[BibT_eX]

[DOI]

CoRR, May, 2025

A Multi-Granularity Retrieval Framework for Visually-Rich Documents.

[BibT_eX]

[DOI]

CoRR, May, 2025

Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Intelligent System for Automated Molecular Patent Infringement Assessment.

[BibT_eX]

[DOI]

CoRR, 2024

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

2023

IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration.

[BibT_eX]

[DOI]

Ming Feng

Kele Xu

Hengxing Cai

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021

NTIRE 2021 Challenge on Perceptual Image Quality Assessment.

[BibT_eX]

[DOI]

Seyed Mehdi Ayyoubzadeh

CoRR, 2021

2020

Learning Traffic as Images for Incident Detection Using Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Access, 2020

Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

AIM 2020: Scene Relighting and Illumination Estimation Challenge.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

2018

Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

2017

Full-reference image quality assessment-based B-mode ultrasound image similarity measure.

[BibT_eX]

[DOI]

CoRR, 2017

Hengxing Cai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...