Hengxing Cai

Orcid: 0000-0001-9780-2330

According to our database1, Hengxing Cai authored at least 24 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Interpretable Reward Model via Sparse Autoencoder.
CoRR, August, 2025

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs.
CoRR, August, 2025

SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation.
CoRR, August, 2025

Doc2SAR: A Synergistic Framework for High-Fidelity Extraction of Structure-Activity Relationships from Scientific Documents.
CoRR, June, 2025

MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval.
CoRR, June, 2025

FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models.
CoRR, May, 2025

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs.
CoRR, May, 2025

A Multi-Granularity Retrieval Framework for Visually-Rich Documents.
CoRR, May, 2025

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Intelligent System for Automated Molecular Patent Infringement Assessment.
CoRR, 2024

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer.
CoRR, 2024

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis.
CoRR, 2024

2023
IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning.
CoRR, 2022

VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


2021
NTIRE 2021 Challenge on Perceptual Image Quality Assessment.
CoRR, 2021

2020
Learning Traffic as Images for Incident Detection Using Convolutional Neural Networks.
IEEE Access, 2020

Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020


2018
Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

2017
Full-reference image quality assessment-based B-mode ultrasound image similarity measure.
CoRR, 2017


  Loading...