Yukun Yan

Orcid: 0000-0003-2421-7098

According to our database1, Yukun Yan authored at least 54 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization.
CoRR, August, 2025

PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models.
CoRR, August, 2025

LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain.
CoRR, August, 2025

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization.
CoRR, June, 2025

KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs.
CoRR, June, 2025

MiniCPM4: Ultra-Efficient LLMs on End Devices.
CoRR, June, 2025

KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG.
CoRR, June, 2025

ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation.
CoRR, May, 2025

EULER: Enhancing the Reasoning Ability of Large Language Models through Error-Induced Learning.
CoRR, May, 2025

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling.
CoRR, May, 2025

Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning.
CoRR, May, 2025

UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation.
CoRR, April, 2025

Building a Coding Assistant via the Retrieval-Augmented Language Model.
ACM Trans. Inf. Syst., March, 2025

Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors.
CoRR, March, 2025

HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization.
CoRR, February, 2025

Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts.
CoRR, February, 2025

LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking Preferences.
CoRR, February, 2025

PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning.
CoRR, February, 2025

Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search.
CoRR, February, 2025

COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Enhancing Open-Domain Task-Solving Capability of LLMs via Autonomous Tool Integration from GitHub.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
KBAlign: Efficient Self Adaptation on Specific Knowledge Bases.
CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.
CoRR, 2024

Enabling Real-Time Conversations with Minimal Training Costs.
CoRR, 2024

Enhancing the Code Debugging Ability of LLMs via Communicative Agent Based Data Refinement.
CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
CoRR, 2024

PersLLM: A Personified Training Approach for Large Language Models.
CoRR, 2024

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression.
CoRR, 2024

Cleaner Pretraining Corpus Curation with Neural Web Scraping.
CoRR, 2024

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning.
CoRR, 2024

UniMem: Towards a Unified View of Long-Context Large Language Models.
CoRR, 2024

BCD-MM: Multimodal Sentiment Analysis Model With Dual-Bias-Aware Feature Learning and Attention Mechanisms.
IEEE Access, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DPC: Filtering Out Patch-Based Poisoned Samples with Differential Privacy.
Proceedings of the Computer Security - ESORICS 2024, 2024

Enhancing Free-Form Table Question Answering Models by Distilling Relevant-Cell-Based Rationales.
Proceedings of the Chinese Computational Linguistics - 23rd China National Conference, 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Personalized sampling graph collection with local differential privacy for link prediction.
World Wide Web (WWW), September, 2023

GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension.
CoRR, 2023

PFED-AGG: A Personalized Private Federated Learning Aggregation Algorithm.
Proceedings of the International Joint Conference on Neural Networks, 2023

Fair_FM: An Improved Functional Mechanism to Provide Better Privacy Protection and Fairness Guarantee.
Proceedings of the International Joint Conference on Neural Networks, 2023

Towards Defending Against Byzantine LDP Amplified Gain Attacks.
Proceedings of the Database Systems for Advanced Applications, 2023

PV-PATE: An Improved PATE for Deep Learning with Differential Privacy in Trusted Industrial Data Matrix.
Proceedings of the Web and Big Data - 7th International Joint Conference, 2023

Nested Named Entity Recognition as Building Local Hypergraphs.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling.
CoRR, 2022

ZoomNet for Topic-Oriented Fragment Recognition in Long Documents.
IEEE Access, 2022

2018
Zooming Network.
CoRR, 2018

Object-oriented Neural Programming (OONP) for Document Understanding.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Event Identification as a Decision Process with Non-linear Representation of Text.
CoRR, 2017


  Loading...