Xunliang Cai

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing.
CoRR, August, 2025

Large-Scale Diverse Synthesis for Mid-Training.
CoRR, August, 2025

LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points.
CoRR, August, 2025

Libra: Assessing and Improving Reward Model by Learning to Think.
CoRR, July, 2025

Sub-Scaling Laws: On the Role of Data Density and Training Strategies in LLMs.
CoRR, July, 2025

CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark.
CoRR, July, 2025

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy.
CoRR, July, 2025

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models.
CoRR, June, 2025

OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics.
CoRR, June, 2025

LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models.
CoRR, June, 2025

GradPower: Powering Gradients for Faster Language Model Pre-Training.
CoRR, May, 2025

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation.
CoRR, May, 2025

Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs.
CoRR, May, 2025

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective.
CoRR, May, 2025

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?
CoRR, May, 2025

When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning.
CoRR, May, 2025

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement.
CoRR, May, 2025

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese.
CoRR, May, 2025

Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning.
CoRR, April, 2025

NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables.
CoRR, April, 2025

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity.
CoRR, March, 2025

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective.
CoRR, February, 2025

MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures.
CoRR, February, 2025

C2T: A Classifier-Based Tree Construction Method in Speculative Decoding.
CoRR, February, 2025

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training.
CoRR, February, 2025

Who's the MVP? A Game-Theoretic Evaluation Benchmark for Modular Attribution in LLM Agents.
CoRR, February, 2025

Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data.
CoRR, January, 2025

Revisit Self-Debugging with Self-Generated Tests for Code Generation.
CoRR, January, 2025

Dynamic Fisher-weighted Model Merging via Bayesian Optimization.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AgentRefine: Enhancing Agent Generalization through Refinement Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FRAME: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

The Role of Visual Modality in Multimodal Mathematical Reasoning: Challenges and Insights.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Revisit Self-Debugging with Self-Generated Tests for Code Generation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Revisiting Scaling Laws for Language Models: The Role of Data Quality and Training Strategies.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

S^3cMath: Spontaneous Step-Level Self-Correction Makes Large Language Models Better Mathematical Reasoners.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Enhancing LLMs via High-Knowledge Data Selection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern.
CoRR, 2024

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation.
CoRR, 2024

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision.
CoRR, 2024

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling.
CoRR, 2024

Multi-Programming Language Sandbox for LLMs.
CoRR, 2024

EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference.
CoRR, 2024

LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning.
CoRR, 2024

Length Desensitization in Directed Preference Optimization.
CoRR, 2024

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data.
CoRR, 2024

S<sup>3</sup>c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners.
CoRR, 2024

ReMamba: Equip Mamba with Effective Long-Sequence Modeling.
CoRR, 2024

EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning.
CoRR, 2024

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study.
CoRR, 2024

Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration.
CoRR, 2024

Unraveling the Mystery of Scaling Laws: Part I.
CoRR, 2024

What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation.
CoRR, 2024

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning.
CoRR, 2024

FIRP: Faster LLM Inference via Future Intermediate Representation Prediction.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Pattern Shifting or Knowledge Losing? A Forgetting Perspective for Understanding the Effect of Instruction Fine-Tuning.
Proceedings of the Chinese Computational Linguistics - 23rd China National Conference, 2024

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Learning or Self-aligning? Rethinking Instruction Fine-tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Graph-Structured Speculative Decoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

What Makes Quantization for Large Language Model Hard? An Empirical Study from the Lens of Perturbation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression.
CoRR, 2023

Stochastic Feature Averaging for Learning with Long-Tailed Noisy Labels.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Progressive Event Alignment Network for Partial Relevant Video Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improving Input-label Mapping with Demonstration Replay for In-context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Dialogue Topic Segmentation via Parallel Extraction Network with Neighbor Smoothing.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Structure-Aware Semantic-Aligned Network for Universal Cross-Domain Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Instance-Level Semantic Alignment for Zero-Shot Cross-Modal Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

GTLR: Graph-Based Transformer with Language Reconstruction for Video Paragraph Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Deep Graph Mutual Learning for Cross-domain Recommendation.
Proceedings of the Database Systems for Advanced Applications, 2022

Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Distant Supervision based Machine Reading Comprehension for Extractive Summarization in Customer Service.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Density-Based Dynamic Curriculum Learning for Intent Detection.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021


  Loading...