Haonan Li

Orcid: 0000-0001-6623-5089

Affiliations:
  • Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, UAE
  • University of Melbourne, Parkville, Australia (former)
  • Shanghai Jiao Tong University, Department of Computer Science and Engineering, China (former)


According to our database1, Haonan Li authored at least 55 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Controllable Reasoning Models Are Private Thinkers.
CoRR, February, 2026

SimuScene: Training and Benchmarking Code Generation to Simulate Physical Scenarios.
CoRR, February, 2026

Neural Theorem Proving for Verification Conditions: A Real-World Benchmark.
CoRR, January, 2026

SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026


FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Control Illusion: The Failure of Instruction Hierarchies in Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Concise Reasoning in the Lens of Lagrangian Optimization.
CoRR, October, 2025

K2-Think: A Parameter-Efficient Reasoning System.
CoRR, September, 2025

BALSAM: A Platform for Benchmarking Arabic Large Language Models.
CoRR, July, 2025

IsaMini: Redesigned Isabelle Proof Lanugage for Machine Learning.
CoRR, July, 2025

AgentFly: Extensible and Scalable Reinforcement Learning for LM Agents.
CoRR, July, 2025

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.
CoRR, June, 2025

FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning.
CoRR, June, 2025

Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi.
CoRR, April, 2025

RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises.
CoRR, February, 2025

LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch.
CoRR, January, 2025

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models.
J. Artif. Intell. Res., 2025

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

NAT: Enhancing Agent Tuning with Negative Samples.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ToolGen: Unified Tool Retrieval and Calling via Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Loki: An Open-Source Tool for Fact Verification.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models.
CoRR, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
CoRR, 2024

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents.
CoRR, 2024

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Do-Not-Answer: Evaluating Safeguards in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Demystifying Instruction Mixing for Fine-tuning Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2024

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CMMLU: Measuring massive multitask language understanding in Chinese.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Understanding the Instruction Mixture for Large Language Model Fine-tuning.
CoRR, 2023

LLM360: Towards Fully Transparent Open-Source LLMs.
CoRR, 2023

Can Large Language Model Comprehend Ancient Chinese? A Preliminary Test on ACLUE.
CoRR, 2023

Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models.
CoRR, 2023

Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs.
CoRR, 2023

Bactrian-X : A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation.
CoRR, 2023

Can Large Langauge Model Comprehend Ancient Chinese? A Preliminary Test on ACLUE.
Proceedings of the Ancient Language Processing Workshop, 2023

Location Aware Modular Biencoder for Tourism Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Neural Character-Level Syntactic Parsing for Chinese.
J. Artif. Intell. Res., 2022

MultiSpanQA: A Dataset for Multi-Span Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CULG: Commercial Universal Language Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning.
CoRR, 2021

KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Target Word Masking for Location Metonymy Resolution.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
UniMelb at SemEval-2019 Task 12: Multi-model combination for toponym resolution.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Classifying Relation via Piecewise Convolutional Neural Networks with Transfer Learning.
Proceedings of the Man-Machine Interactions 6, 2019

Place Questions and Human-Generated Answers: A Data Analysis Approach.
Proceedings of the Geospatial Technologies for Local and Regional Development, 2019

2018
Neural Character-level Dependency Parsing for Chinese.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018


  Loading...