Hongshen Xu

Orcid: 0000-0001-9420-7130

According to our database1, Hongshen Xu authored at least 24 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
MiMo-VL Technical Report.
CoRR, June, 2025

MiMo: Unlocking the Reasoning Potential of Language Model - From Pretraining to Posttraining.
CoRR, May, 2025

Delusions of Large Language Models.
CoRR, March, 2025

Alignment for Efficient Tool Calling of Large Language Models.
CoRR, March, 2025

Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling.
CoRR, March, 2025

2024
Reducing Tool Hallucination via Reliability Alignment.
CoRR, 2024

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity.
CoRR, 2024

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback.
CoRR, 2024

ChemDFM: Dialogue Foundation Model for Chemistry.
CoRR, 2024

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Birgat Model for Multi-Intent Spoken Language Understanding with Hierarchical Semantic Frames.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language behind.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Sparsity-Accelerated Training for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
A Heterogeneous Graph to Abstract Syntax Tree Framework for Text-to-SQL.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL.
CoRR, 2023

Large Language Model Is Semi-Parametric Reinforcement Learning Agent.
CoRR, 2023

Large Language Models Are Semi-Parametric Reinforcement Learning Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring Schema Generalizability of Text-to-SQL.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Cohesiveness of Robots in Groups Affects the Perception of Social Rejection by Human Observers.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022

2021
Power Chess: Robot-to-Robot Nonverbal Emotional Expression Applied to Competitive Play.
Proceedings of the ARTECH 2021: 10th International Conference on Digital and Interactive Arts, Aveiro, Portugal, October 13, 2021


  Loading...