Yong Jiang

Affiliations:
  • Alibaba Group, DAMO Academy, China
  • ShanghaiTech University, China (PhD 2019)


According to our database1, Yong Jiang authored at least 103 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Tongyi DeepResearch Technical Report.
CoRR, October, 2025

AgentFold: Long-Horizon Web Agents with Proactive Context Management.
CoRR, October, 2025

ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking.
CoRR, October, 2025

WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking.
CoRR, October, 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis.
CoRR, October, 2025

Repurposing Synthetic Data for Fine-grained Search Agent Supervision.
CoRR, October, 2025

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents.
CoRR, October, 2025

DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling.
CoRR, October, 2025

Qwen3Guard Technical Report.
CoRR, October, 2025

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics.
CoRR, October, 2025

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning.
CoRR, October, 2025

Scaling Generalist Data-Analytic Agents.
CoRR, September, 2025

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization.
CoRR, September, 2025

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research.
CoRR, September, 2025

Towards General Agentic Intelligence via Environment Scaling.
CoRR, September, 2025

Scaling Agents via Continual Pre-training.
CoRR, September, 2025

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents.
CoRR, September, 2025

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning.
CoRR, September, 2025

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent.
CoRR, August, 2025

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization.
CoRR, July, 2025

WebSailor: Navigating Super-human Reasoning for Web Agent.
CoRR, July, 2025

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models.
CoRR, June, 2025

WebDancer: Towards Autonomous Information Seeking Agency.
CoRR, May, 2025

EvolveSearch: An Iterative Self-Evolving Search Agent.
CoRR, May, 2025

MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability.
CoRR, May, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching.
CoRR, May, 2025

AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs.
CoRR, March, 2025

Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference.
CoRR, February, 2025

Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties.
CoRR, February, 2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking.
CoRR, January, 2025

Unsupervised Query Routing for Retrieval Augmented Generation.
CoRR, January, 2025

Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Benchmarking Agentic Workflow Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Agentic Knowledgeable Self-awareness.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

WebWalker: Benchmarking LLMs in Web Traversal.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
STORM: A Spatio-Temporal Factor Model Based on Dual Vector Quantized Variational Autoencoders for Financial Trading.
CoRR, 2024

Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment.
CoRR, 2024

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
CoRR, 2024

An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms.
CoRR, 2024

Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation.
CoRR, 2024

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
CoRR, 2024

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario.
CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.
CoRR, 2024

Model-Agnostic Knowledge Distillation Between Heterogeneous Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Editing Personality For Large Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Agent Planning with World Knowledge Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Retrieved In-Context Principles from Previous Mistakes.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data.
CoRR, 2023

Editing Personality for LLMs.
CoRR, 2023

Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model.
CoRR, 2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
CoRR, 2023

Zero-Shot Information Extraction via Chatting with ChatGPT.
CoRR, 2023

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

COMBO: A Complete Benchmark for Open KG Canonicalization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Do PLMs Know and Understand Ontological Knowledge?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recall, Expand, and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Graph Propagation based Data Augmentation for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech Entity Linking.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CAT-MNER: Multimodal Named Entity Recognition with Knowledge-Refined Cross-Modal Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Named Entity and Relation Extraction with Multi-Modal Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Modeling Label Correlations for Ultra-Fine Entity Typing with Neural Pairwise Conditional Random Field.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Domain-Specific NER via Retrieving Correlated Samples.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
CoRR, 2021

Enhanced Universal Dependency Parsing with Automated Concatenation of Embeddings.
CoRR, 2021

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Unified Encoding of Structures in Transition Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Word Reordering for Zero-shot Cross-lingual Structured Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automated Concatenation of Embeddings for Structured Prediction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Generalized Supervised Attention for Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Risk Minimization for Zero-shot Sequence Labeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Structural Knowledge Distillation.
CoRR, 2020

Fast and Accurate Sequence Labeling with Approximate Inference Network.
CoRR, 2020

Enhanced Universal Dependency Parsing with Second-Order Inference and Mixture of Training Data.
Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

An Investigation of Potential Function Designs for Neural CRF.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

More Embeddings, Better Sequence Labelers?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Structure-Level Knowledge Distillation For Multilingual Sequence Labeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2010
An 0(1.414<sup>n</sup>) volume molecular solution for the 0-1 knapsack problem on DNA-based supercomputing.
Proceedings of the Fifth International Conference on Bio-Inspired Computing: Theories and Applications, 2010


  Loading...