We stand with Ukraine

We stand with Ukraine

Juntao Li

Orcid: 0000-0002-6286-7529

Affiliations:

Soochow University, Suzhou, Jiangsu, China

According to our database¹, Juntao Li authored at least 110 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Revisiting Long-context Modeling from Context Denoising Perspective.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, October, 2025

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, October, 2025

BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, September, 2025

Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Knowl. Data Eng., August, 2025

CaliDrop: KV Cache Compression with Calibration.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, July, 2025

Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, May, 2025

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Taming the Titans: A Survey of Efficient LLM Inference Serving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

Stick to Facts: Towards Fidelity-oriented Product Description Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

XIFBench: Evaluating Large Language Models on Multilingual Instruction Following.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, February, 2025

ASurvey: Spatiotemporal Consistency in Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, February, 2025

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

CoRR, January, 2025

Test-time Computing: from System-1 Thinking to System-2 Thinking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, January, 2025

OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Trans. Assoc. Comput. Linguistics, 2025

Improving Rationality in the Reasoning Process of Language Models through Self-playing Game.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LOGO - Long cOntext aliGnment via efficient preference Optimization.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Revealing and Mitigating Over-Attention in Knowledge Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Beware of Calibration Data for Pruning Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Survey of Generative Information Extraction.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Revealing and Mitigating the Local Pattern Shortcuts of Mamba.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

From Awareness to Adaptability: Enhancing Tool Utilization for Scientific Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Decoder-Only LLMs can be Masked Auto-Encoders.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

Generative Reward Modeling via Synthetic Criteria Preference Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Tool learning via Inference-time Scaling and Cycle Verifier.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Unleashing LLM Reasoning Capability via Scalable Question Synthesis from Scratch.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

\mathcalA³: Automatic Alignment Framework for Attributed Text Generation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Accurate KV Cache Quantization with Outlier Tokens Tracing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Randomness Regularization With Simple Consistency Training for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Enhancing Low-Resource NLP by Consistency Training With Data and Model Perturbations.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

MemLong: Memory-Augmented Retrieval for Long Text Modeling.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Timo: Towards Better Temporal Reasoning for Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

A Survey on Human Preference Learning for Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Demonstration Augmentation for Zero-shot In-context Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Rethinking Negative Instances for Generative Named Entity Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Towards Better Chinese Spelling Check for Search Engines: A New Dataset and Strong Baseline.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

CMD: a framework for Context-aware Model self-Detoxification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Retrieval and Reasoning on KGs: Integrate Knowledge Graphs into Large Language Models for Complex Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TabMedBERT: A Tabular Knowledge Enhanced Biomedical Pretrained Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Exploring and Mitigating Shortcut Learning for Generative Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

When and How to Grow? On Efficient Pre-training via Model Growth.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Asian Conference on Machine Learning, 2024

Efficient Domain Adaptation for Non-Autoregressive Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Demonstration Augmentation for Zero-shot In-context Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Exploring Reversal Mathematical Reasoning Ability for Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Rethinking Negative Instances for Generative Named Entity Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Are the BERT family zero-shot learners? A study on their potential and limitations.

[BibT_eX]

[DOI]

,

,

,

,

Artif. Intell., September, 2023

OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Harnessing the Power of David against Goliath: Exploring Instruction Data Generation without Using Closed-Source Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

Detoxify Language Model Step-by-Step.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Test-Time Adaptation with Perturbation Consistency Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

AMOM: Adaptive Masking over Masking for Conditional Masked Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2023

CT4Rec: Simple yet Effective Consistency Training for Sequential Recommendation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

INFORM : Information eNtropy based multi-step reasoning FOR large language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

G-SPEED: General SParse Efficient Editing MoDel.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Isotropic Representation Can Improve Zero-Shot Cross-Lingual Transfer on Multilingual Language Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Isotropy-Enhanced Conditional Masked Language Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards Better Hierarchical Text Classification with Data Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Can Diffusion Model Achieve Better Performance in Text Generation ? Bridging the Gap between Training and Inference !

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Open-ended Long Text Generation via Masked Language Modeling.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Dynamic and Efficient Inference for Text Generation via BERT Family.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Early Exit with Disentangled Representation and Equiangular Tight Frame.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

AMOM: Adaptive Masking over Masking for Conditional Masked Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

RenewNAT: Renewing Potential Translation for Non-autoregressive Transformer.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Multi-Teacher Distillation With Single Model for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Deep Learning for Dialogue Systems: Chit-Chat and Beyond.

[BibT_eX]

[DOI]

,

,

Found. Trends Inf. Retr., 2022

Image-text Retrieval: A Survey on Recent Research and Development.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Robust Question Answering against Distribution Shifts with Test-Time Adaption: An Empirical Study.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

JANUS: Joint Autoregressive and Non-autoregressive Training with Auxiliary Loss for Sequence Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SelfMix: Robust Learning against Textual Label Noise with Self-Mixup Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

,

,

,

,

,

,

ACM Trans. Inf. Syst., 2021

C<sup>2</sup>-Rec: An Effective Consistency Constraint for Sequential Recommendation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2021

Building an Efficient and Effective Retrieval-based Dialogue System via Mutual Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Dialogue History Matters! Personalized Response Selectionin Multi-turn Retrieval-based Chatbots.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2021

How does Truth Evolve into Fake News? An Empirical Study of Fake News Evolution.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Companion of The Web Conference 2021, 2021

R-Drop: Regularized Dropout for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Organize a Bag of Words into Sentences with Neural Networks: An Empirical Study.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Enhancing the Open-Domain Dialogue Evaluation in Latent Space.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Content Learning with Structure-Aware Writing: A Graph-Infused Dual Conditional Variational Autoencoder for Automatic Storytelling.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Feature Adaptation of Pre-Trained Language Models across Languages and Domains for Text Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Plan-CVAE: A Planning-Based Conditional Variational Autoencoder for Story Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Chinese Computational Linguistics - 19th China National Conference, CCL 2020, Hainan, China, October 30, 2020

Draft and Edit: Automatic Storytelling Through Multi-Pass Hierarchical Conditional Variational Autoencoder.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

A Character-Centric Neural Model for Automated Story Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Boosting Variational Generative Model via Condition Enhancing and Lexical-Editing.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Modeling Personalization in Continuous Space for Response Generation via Augmented Wasserstein Autoencoders.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Stick to the Facts: Learning towards a Fidelity-oriented E-Commerce Product Description Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Are Training Samples Correlated? Learning to Generate Dialogue Responses with Multiple References.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Insufficient Data Can Also Rock! Learning to Converse Using Smaller Data with Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Write Stories with Thematic Consistency and Wording Novelty.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Overview of the NLPCC 2018 Shared Task: Multi-turn Human-Computer Conversations.

[BibT_eX]

[DOI]

,

Proceedings of the Natural Language Processing and Chinese Computing, 2018

Generating Classical Chinese Poems via Conditional Variational Autoencoder and Adversarial Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Loading...