Yong Jiang

Orcid: 0000-0003-4482-1559

Affiliations:
  • Alibaba Group, DAMO Academy, China
  • ShanghaiTech University, China (PhD 2019)


According to our database1, Yong Jiang authored at least 132 papers between 2010 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents.
CoRR, March, 2026

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models.
CoRR, February, 2026

WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning.
CoRR, January, 2026

Syntax-Aware Hierarchical Attention Networks for Code Vulnerability Detection.
Comput. Mater. Continua, 2026

STORM: A Spatio-Temporal Factor Model Based on Dual Vector Quantized Variational Autoencoders for Financial Trading.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Nested Browser-Use Learning for Agentic Information Seeking.
CoRR, December, 2025

AutoForge: Automated Environment Synthesis for Agentic Reinforcement Learning.
CoRR, December, 2025

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce.
CoRR, December, 2025

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction.
CoRR, November, 2025

Tongyi DeepResearch Technical Report.
CoRR, October, 2025

AgentFold: Long-Horizon Web Agents with Proactive Context Management.
CoRR, October, 2025

ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking.
CoRR, October, 2025

WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking.
CoRR, October, 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis.
CoRR, October, 2025

Repurposing Synthetic Data for Fine-grained Search Agent Supervision.
CoRR, October, 2025

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents.
CoRR, October, 2025

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics.
CoRR, October, 2025

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning.
CoRR, October, 2025

Scaling Generalist Data-Analytic Agents.
CoRR, September, 2025

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization.
CoRR, September, 2025

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research.
CoRR, September, 2025

Towards General Agentic Intelligence via Environment Scaling.
CoRR, September, 2025

Scaling Agents via Continual Pre-training.
CoRR, September, 2025

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents.
CoRR, September, 2025

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning.
CoRR, September, 2025

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent.
CoRR, August, 2025

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization.
CoRR, July, 2025

WebSailor: Navigating Super-human Reasoning for Web Agent.
CoRR, July, 2025

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models.
CoRR, June, 2025

WebDancer: Towards Autonomous Information Seeking Agency.
CoRR, May, 2025

MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability.
CoRR, May, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching.
CoRR, May, 2025

AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs.
CoRR, March, 2025

Unsupervised Query Routing for Retrieval Augmented Generation.
CoRR, January, 2025

Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Benchmarking Agentic Workflow Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

EvolveSearch: An Iterative Self-Evolving Search Agent.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

KBM: Delineating Knowledge Boundary for Adaptive Retrieval in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Agentic Knowledgeable Self-awareness.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

WebWalker: Benchmarking LLMs in Web Traversal.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment.
CoRR, 2024

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
CoRR, 2024

An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms.
CoRR, 2024

Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation.
CoRR, 2024

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
CoRR, 2024

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario.
CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.
CoRR, 2024

Model-Agnostic Knowledge Distillation Between Heterogeneous Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Editing Personality For Large Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Agent Planning with World Knowledge Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Retrieved In-Context Principles from Previous Mistakes.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data.
CoRR, 2023

Editing Personality for LLMs.
CoRR, 2023

Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model.
CoRR, 2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
CoRR, 2023

Zero-Shot Information Extraction via Chatting with ChatGPT.
CoRR, 2023

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

COMBO: A Complete Benchmark for Open KG Canonicalization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Do PLMs Know and Understand Ontological Knowledge?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recall, Expand, and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Graph Propagation based Data Augmentation for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech Entity Linking.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CAT-MNER: Multimodal Named Entity Recognition with Knowledge-Refined Cross-Modal Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Named Entity and Relation Extraction with Multi-Modal Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Modeling Label Correlations for Ultra-Fine Entity Typing with Neural Pairwise Conditional Random Field.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Domain-Specific NER via Retrieving Correlated Samples.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
CoRR, 2021

Enhanced Universal Dependency Parsing with Automated Concatenation of Embeddings.
Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, 2021

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Unified Encoding of Structures in Transition Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Word Reordering for Zero-shot Cross-lingual Structured Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Unsupervised Natural Language Parsing (Introductory Tutorial).
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts, 2021

Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automated Concatenation of Embeddings for Structured Prediction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Generalized Supervised Attention for Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Risk Minimization for Zero-shot Sequence Labeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Neural Latent Dependency Model for Sequence Labeling.
CoRR, 2020

Structural Knowledge Distillation.
CoRR, 2020

Fast and Accurate Sequence Labeling with Approximate Inference Network.
CoRR, 2020

Enhanced Universal Dependency Parsing with Second-Order Inference and Mixture of Training Data.
Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

An Investigation of Potential Function Designs for Neural CRF.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Adversarial Attack and Defense of Structured Prediction Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

More Embeddings, Better Sequence Labelers?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Second-Order Unsupervised Neural Dependency Parsing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Survey of Unsupervised Dependency Parsing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Structure-Level Knowledge Distillation For Multilingual Sequence Labeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

An Empirical Comparison of Unsupervised Constituency Parsing Methods.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Learning and evaluation of latent dependency forest models.
Neural Comput. Appl., 2019

Lexicalized Neural Unsupervised Dependency Parsing.
Neurocomputing, 2019

Projective Latent Dependency Forest Models.
IEEE Access, 2019

A Regularization-based Framework for Bilingual Grammar Induction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Multilingual Grammar Induction with Continuous Language Identification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Enhancing Unsupervised Generative Dependency Parser with Contextual Information.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Bidirectional Transition-Based Dependency Parsing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Maximum A Posteriori Inference in Sum-Product Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Semi-supervised Structured Prediction with Neural CRF Autoencoder.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Combining Generative and Discriminative Approaches to Unsupervised Dependency Parsing via Dual Decomposition.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Dependency Grammar Induction with Neural Lexicalization and Big Training Data.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

CRF Autoencoder for Unsupervised Dependency Parsing.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Latent Dependency Forest Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Unsupervised Neural Dependency Parsing.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2010
An 0(1.414<sup>n</sup>) volume molecular solution for the 0-1 knapsack problem on DNA-based supercomputing.
Proceedings of the Fifth International Conference on Bio-Inspired Computing: Theories and Applications, 2010


  Loading...