Junyi Li

ORCID: 0009-0007-0480-5593

Affiliations:
  • City University of Hong Kong, Department of Data Science, Hong Kong
  • National University of Singapore, Department of Computer Science, Singapore
  • University of Montreal, DIRO, Montreal, Canada (PhD 2025)
  • Renmin University of China, Gaoling School of Artificial Intelligence, Beijing, China (PhD 2024)
  • Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing, China


According to our database, Junyi Li authored at least 57 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices.
CoRR, October, 2025

PAL-UI: Planning with Active Look-back for Vision-Based GUI Agents.
CoRR, October, 2025

Experience-Guided Reflective Co-Evolution of Prompts and Heuristics for Automatic Algorithm Design.
CoRR, September, 2025

Enhancing Cross-task Transfer of Large Language Models via Activation Steering.
CoRR, July, 2025

The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models.
CoRR, May, 2025

ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework.
CoRR, May, 2025

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning.
CoRR, May, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation.
CoRR, February, 2025

Holistically Guided Monte Carlo Tree Search for Intricate Information Seeking.
CoRR, February, 2025

LLM-based Search Assistant with Holistically Guided MCTS for Intricate Information Seeking.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Search-Based Interaction For Conversation Recommendation via Generative Reward Model Based Simulated User.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DynaQuest: A Dynamic Question Answering Dataset Reflecting Real-World Knowledge Updates.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Pre-Trained Language Models for Text Generation: A Survey.
ACM Comput. Surv., September, 2024

LLMBox: A Comprehensive Library for Large Language Models.
CoRR, 2024

YuLan: An Open-source Large Language Model.
CoRR, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.
CoRR, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
The Web Can Be Your Oyster for Improving Large Language Models.
CoRR, 2023

RenderDiffusion: Text Generation as Image Generation.
CoRR, 2023

A Survey of Large Language Models.
CoRR, 2023

A Survey on Long Text Modeling with Transformers.
CoRR, 2023

HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MVP: Multi-task Supervised Pre-training for Natural Language Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning to Imagine: Visually-Augmented Natural Language Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Web Can Be Your Oyster for Improving Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Zero-shot Visual Question Answering with Language Model Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
TextBox 2.0: A Text Generation Library with Pre-trained Language Models.
CoRR, 2022

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation.
CoRR, 2022

A Survey of Pretrained Language Models Based Text Generation.
CoRR, 2022

Learning to Transfer Prompts for Text Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Survey of Vision-Language Pre-Trained Models.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

TextBox 2.0: A Text Generation Library with Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Pretrained Language Models for Text Generation: A Survey.
CoRR, 2021

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training.
CoRR, 2021

TextBox: A Unified, Modularized, and Extensible Framework for Text Generation.
CoRR, 2021

Knowledge-based Review Generation by Coherence Enhanced Text Planning.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Pretrained Language Model for Text Generation: A Survey.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Generating Long and Coherent Text with Multi-Level Generative Adversarial Networks.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021

Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

TextBox: A Unified, Modularized, and Extensible Framework for Text Generation.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Knowledge-Enhanced Personalized Review Generation with Capsule Graph Neural Network.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Generating Long and Informative Reviews with Aspect-Aware Coarse-to-Fine Decoding.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

