Haiyang Yu

Affiliations:
  • Alibaba Group, China
  • Zhejiang University, AZFT Joint Lab for Knowledge Engine, Hangzhou, China (former)


According to our database1, Haiyang Yu authored at least 46 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models.
CoRR, June, 2025

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns.
CoRR, May, 2025

Adaptive Thinking via Mode Policy Optimization for Social Language Agents.
CoRR, May, 2025

Transferable Post-training via Inverse Value Learning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

On the Role of Attention Heads in Large Language Model Safety.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
On the Role of Attention Heads in Large Language Model Safety.
CoRR, 2024

The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends.
CoRR, 2024

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement.
CoRR, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
CoRR, 2024

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
CoRR, 2024

DeCoF: Generated Video Detection via Frame Consistency.
CoRR, 2024

Self-Retrieval: End-to-End Information Retrieval with One Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SoFA: Shielded On-the-fly Alignment via Priority Rule Following.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Preference Ranking Optimization for Human Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
CoRR, 2023

Wider and Deeper LLM Networks are Fairer LLM Evaluators.
CoRR, 2023

API-Bank: A Benchmark for Tool-Augmented LLMs.
CoRR, 2023

Coarse-To-Fine Knowledge Selection for Document Grounded Dialogs.
Proceedings of the IEEE International Conference on Acoustics, 2023

Causal Document-Grounded Dialogue Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Question Generation with Multi-level Content Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Diversify Question Generation with Retrieval-Augmented Style Transfer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Universal Information Extraction with Meta-Pretrained Self-Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Unified Language Representation for Question Answering over Text, Tables, and Images.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Towards Generalized Open Information Extraction.
CoRR, 2022

Semi-Supervised Lifelong Language Learning.
CoRR, 2022

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue.
CoRR, 2022

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population.
CoRR, 2022

Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Doc2Bot: Accessing Heterogeneous Documents via Conversational Bots.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Interventional Aspect-Based Sentiment Analysis.
CoRR, 2021

2020
The Devil is the Classifier: Investigating Long Tail Relation Classification with Decoupling Analysis.
CoRR, 2020

Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction.
CoRR, 2020

A Survey on Complex Question Answering over Knowledge Base: Recent Advances and Challenges.
CoRR, 2020

OpenUE: An Open Toolkit of Universal Extraction from Text.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction.
Proceedings of the 28th International Conference on Computational Linguistics, 2020


  Loading...