Fei Huang

Orcid: 0000-0002-3709-5053

Affiliations:
  • Alibaba Group, DAMO Academy, Language Technologies Lab, Hangzhou, China
  • Facebook, New York, NY, USA (2014 - 2018)
  • IBM T.J. Watson Research Center, Yorktown Heights, NY, USA (2006 - 2014)
  • Carnegie Mellon University, School of Computer Science, Language Technology Institute, Pittsburgh, PA, USA (PhD 2006)


According to our database1, Fei Huang authored at least 383 papers between 2000 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing.
CoRR, July, 2025

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization.
CoRR, July, 2025

Sparse Autoencoder-guided Supervised Finetuning to Mitigate Unexpected Code-Switching in LLMs.
CoRR, July, 2025

Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese.
CoRR, July, 2025

L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training.
CoRR, July, 2025

Perception-Aware Policy Optimization for Multimodal Reasoning.
CoRR, July, 2025

WebSailor: Navigating Super-human Reasoning for Web Agent.
CoRR, July, 2025

Format-Adapter: Improving Reasoning Capability of LLMs by Adapting Suitable Format.
CoRR, June, 2025

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models.
CoRR, June, 2025

MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models.
CoRR, June, 2025

EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models.
CoRR, June, 2025

Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning.
CoRR, June, 2025

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models.
CoRR, June, 2025

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation.
CoRR, June, 2025

ConText: Driving In-context Learning for Text Removal and Segmentation.
CoRR, June, 2025

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence.
CoRR, May, 2025

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents.
CoRR, May, 2025

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns.
CoRR, May, 2025

WebDancer: Towards Autonomous Information Seeking Agency.
CoRR, May, 2025

EvolveSearch: An Iterative Self-Evolving Search Agent.
CoRR, May, 2025

Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration.
CoRR, May, 2025

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding.
CoRR, May, 2025

MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability.
CoRR, May, 2025

QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization.
CoRR, May, 2025

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning.
CoRR, May, 2025

VLM-R<sup>3</sup>: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought.
CoRR, May, 2025

SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization.
CoRR, May, 2025

Qwen3 Technical Report.
CoRR, May, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching.
CoRR, May, 2025

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning.
CoRR, May, 2025

Adaptive Thinking via Mode Policy Optimization for Social Language Agents.
CoRR, May, 2025

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts.
CoRR, April, 2025

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement.
CoRR, April, 2025

Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute.
CoRR, March, 2025

A Survey of Direct Preference Optimization.
CoRR, March, 2025

WritingBench: A Comprehensive Benchmark for Generative Writing.
CoRR, March, 2025

AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs.
CoRR, March, 2025

Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding.
CoRR, March, 2025

Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference.
CoRR, February, 2025

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration.
CoRR, February, 2025

Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization.
CoRR, February, 2025

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC.
CoRR, February, 2025

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning.
CoRR, February, 2025

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing.
CoRR, February, 2025

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks.
CoRR, January, 2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking.
CoRR, January, 2025

Unsupervised Query Routing for Retrieval Augmented Generation.
CoRR, January, 2025

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis.
CoRR, January, 2025

Diverse AI Feedback For Large Language Model Alignment.
Trans. Assoc. Comput. Linguistics, 2025

LORE++: Logical location regression network for table structure recognition with pre-training.
Pattern Recognit., 2025

TranSeis: A high precision multitask seismic waveform detector.
Comput. Geosci., 2025

Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025

Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

On the Role of Attention Heads in Large Language Model Safety.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Benchmarking Agentic Workflow Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Supervised Optimism Correction: Be Confident When LLMs Are Sure.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ExploraCoder: Advancing Code Generation for Multiple Unseen APIs via Planning and Chained Exploration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Agentic Knowledgeable Self-awareness.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SDPO: Segment-Level Direct Preference Optimization for Social Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Reverse Preference Optimization for Complex Instruction Following.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

WebWalker: Benchmarking LLMs in Web Traversal.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Debate Helps Weak-to-Strong Generalization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
A Unified Conditional Diffusion Framework for Dual Protein Targets-Based Bioactive Molecule Generation.
IEEE Trans. Artif. Intell., September, 2024

Multilingual named Entity Extraction and Translation from Text and Speech
PhD thesis, 2024

A Survey on Out-of-Distribution Detection in NLP.
Trans. Mach. Learn. Res., 2024

Sequence Labeling as Non-Autoregressive Dual-Query Set Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Unifying Structured Data as Graph for Data-to-Text Pre-Training.
Trans. Assoc. Comput. Linguistics, 2024

STORM: A Spatio-Temporal Factor Model Based on Dual Vector Quantized Variational Autoencoders for Financial Trading.
CoRR, 2024

LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues.
CoRR, 2024

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs.
CoRR, 2024

Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment.
CoRR, 2024

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent.
CoRR, 2024

Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement.
CoRR, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
CoRR, 2024

An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms.
CoRR, 2024

On the Role of Attention Heads in Large Language Model Safety.
CoRR, 2024

In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks.
CoRR, 2024

Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
CoRR, 2024

The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends.
CoRR, 2024

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
CoRR, 2024

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement.
CoRR, 2024

ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions.
CoRR, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
CoRR, 2024

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario.
CoRR, 2024

How to Understand Whole Software Repository?
CoRR, 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
CoRR, 2024

TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning.
CoRR, 2024

A Survey on Self-Evolution of Large Language Models.
CoRR, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
CoRR, 2024

ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy.
CoRR, 2024

RoleInteract: Evaluating the Social Interaction of Role-Playing Agents.
CoRR, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
CoRR, 2024

Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection.
CoRR, 2024

Improving Cross-lingual Representation for Semantic Retrieval with Code-switching.
CoRR, 2024

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
CoRR, 2024

Effective Two-Stage Knowledge Transfer for Multi-Entity Cross-Domain Recommendation.
CoRR, 2024

Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement.
CoRR, 2024

AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis.
CoRR, 2024

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception.
CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.
CoRR, 2024

Editing Personality For Large Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Self-Retrieval: End-to-End Information Retrieval with One Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Agent Planning with World Knowledge Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Fine-Tuning Language Models with Reward Learning on Policy.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DialCLIP: Empowering Clip As Multi-Modal Dialog Retriever.
Proceedings of the IEEE International Conference on Acoustics, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Retrieved In-Context Principles from Previous Mistakes.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MIBench: Evaluating Multimodal Large Language Models over Multiple Images.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving Factual Consistency of News Summarization by Contrastive Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

IPL: Leveraging Multimodal Large Language Models for Intelligent Product Listing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Visual Text Generation in the Wild.
Proceedings of the Computer Vision - ECCV 2024, 2024

Platypus: A Generalized Specialist Model for Reading Text in Various Forms.
Proceedings of the Computer Vision - ECCV 2024, 2024

Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

mPLUG-OwI2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OMNIPARSER: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

GeoGLUE: A Chinese GeoGraphic Language Understanding Evaluation Benchmark.
Proceedings of the Advanced Data Mining and Applications - 20th International Conference, 2024

Budget-Constrained Tool Learning with Planning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Iterative Forward Tuning Boosts In-Context Learning in Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

One-Shot Learning as Instruction Data Prospector for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SocialBench: Sociality Evaluation of Role-Playing Conversational Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Model Composition for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Preference Ranking Optimization for Human Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Schema dependency-enhanced curriculum pre-training for table semantic parsing.
Knowl. Based Syst., February, 2023

Achieving Human Parity on Visual Question Answering.
ACM Trans. Inf. Syst., 2023

LOGEN: Few-Shot Logical Knowledge-Conditioned Text Generation With Self-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data.
CoRR, 2023

One Shot Learning as Instruction Data Prospector for Large Language Models.
CoRR, 2023

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
CoRR, 2023

Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs.
CoRR, 2023

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection.
CoRR, 2023

Constructive Large Language Models Alignment with Diverse Feedback.
CoRR, 2023

Editing Personality for LLMs.
CoRR, 2023

VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue.
CoRR, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
CoRR, 2023

A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
CoRR, 2023

Wider and Deeper LLM Networks are Fairer LLM Evaluators.
CoRR, 2023

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility.
CoRR, 2023

PolyLM: An Open Source Polyglot Large Language Model.
CoRR, 2023

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding.
CoRR, 2023

Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model.
CoRR, 2023

DiffDTM: A conditional structure-free framework for bioactive molecules generation targeted for dual proteins.
CoRR, 2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking.
CoRR, 2023

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks.
CoRR, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains.
CoRR, 2023

Iterative Forward Tuning Boosts In-context Learning in Language Models.
CoRR, 2023

Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
CoRR, 2023

GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark.
CoRR, 2023

Vision Langauge Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
CoRR, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation.
CoRR, 2023

Transforming Visual Scene Graphs to Image Captions.
CoRR, 2023

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality.
CoRR, 2023

API-Bank: A Benchmark for Tool-Augmented LLMs.
CoRR, 2023

VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning.
CoRR, 2023

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human.
CoRR, 2023

RRHF: Rank Responses to Align Language Models with Human Feedback without tears.
CoRR, 2023

Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain.
CoRR, 2023

RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training.
CoRR, 2023

Molecular Geometry-aware Transformer for accurate 3D Atomic System modeling.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning.
CoRR, 2023

A Multi-Modal Geographic Pre-Training Method.
CoRR, 2023

Adaptively Clustering Neighbor Elements for Image Captioning.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Contrastive Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MGeo: Multi-Modal Geographic Language Model Pre-Training.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

SPA: A Graph Spectral Alignment Perspective for Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Debiased and Denoised Entity Recognition from Distant Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video.
Proceedings of the International Conference on Machine Learning, 2023

ICDAR 2023 Competition on Born Digital Video Text Question Answering.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Trajectory-Word Alignments for Video-Language Tasks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Causal Document-Grounded Dialogue Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Knowledge Rumination for Pre-trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Question Generation with Multi-level Content Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Diversify Question Generation with Retrieval-Augmented Style Transfer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improving Seq2Seq Grammatical Error Correction via Decoding Interventions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NaSGEC: a Multi-Domain Chinese Grammatical Error Correction Dataset from Native Speaker Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Transforming Visual Scene Graphs to Image Captions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Reasoning with Language Model Prompting: A Survey.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CATS: A Pragmatic Chinese Answer-to-Sequence Dataset with Large Scale and High Quality.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PEER: Pre-training ELECTRA Extended by Ranking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Long-Tailed Question Answering in an Open World.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Domain Incremental Lifelong Learning in an Open World.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Universal Information Extraction with Meta-Pretrained Self-Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Unified Language Representation for Question Answering over Text, Tables, and Images.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Adversarial Self-Attention for Language Understanding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Graphix-T5: Mixing Pre-trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Enhancing Neural Machine Translation With Dual-Side Multimodal Awareness.
IEEE Trans. Multim., 2022

Low-resource extraction with knowledge-aware pairwise prototype learning.
Knowl. Based Syst., 2022

SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers.
CoRR, 2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
CoRR, 2022

A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions.
CoRR, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
CoRR, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue System.
CoRR, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
CoRR, 2022

Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction.
CoRR, 2022

On Effectively Learning of Knowledge in Continual Pre-training.
CoRR, 2022

Image Captioning In the Transformer Age.
CoRR, 2022

Contrastive Demonstration Tuning for Pre-trained Language Models.
CoRR, 2022

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population.
CoRR, 2022

Disentangled representation for sequential treatment effect estimation.
Comput. Methods Programs Biomed., 2022

KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Relation Extraction as Open-book Examination: Retrieval-enhanced Prompt Tuning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Meta-Learning Based Knowledge Extrapolation for Knowledge Graphs in the Federated Setting.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners.
Proceedings of the Tenth International Conference on Learning Representations, 2022

AISHELL-NER: Named Entity Recognition from Chinese Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Estimating Soft Labels for Out-of-Domain Intent Detection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Generalizable and Robust Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Doc2Bot: Accessing Heterogeneous Documents via Conversational Bots.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

SPACE-2: Tree-Structured Semi-Supervised Contrastive Pre-training for Task-Oriented Dialog Understanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

S$^4$-Tuning: A Simple Cross-lingual Sub-network Tuning Method.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Parallel Instance Query Network for Named Entity Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Learning to Ask for Data-Efficient Event Argument Extraction (Student Abstract).
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Contrastive Information Extraction With Generative Transformer.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
CoRR, 2021

Achieving Human Parity on Visual Question Answering.
CoRR, 2021

Learning to Ask for Data-Efficient Event Argument Extraction.
CoRR, 2021

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark.
CoRR, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking.
CoRR, 2021

Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction.
CoRR, 2021

Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing.
CoRR, 2021

Relational Learning with Gated and Attentive Neighbor Aggregator for Few-Shot Knowledge Graph Completion.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Document-level Relation Extraction as Semantic Segmentation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Prototypical Representation Learning for Relation Extraction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Rethinking Denoised Auto-Encoding in Language Pre-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Unified Encoding of Structures in Transition Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Word Reordering for Zero-shot Cross-lingual Structured Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Improving Biomedical Pretrained Language Models with Knowledge.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automated Concatenation of Embeddings for Structured Prediction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

StructuralLM: Structural Pre-training for Form Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Risk Minimization for Zero-shot Sequence Labeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

OntoED: Low-resource Event Detection with Ontology Embedding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Contrastive Triple Extraction with Generative Transformer.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Bridging the Domain Gap: Improve Informal Language Translation via Counterfactual Domain Adaptation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Unsupervised Learning of Deterministic Dialogue Structure with Edge-Enhanced Graph Auto-Encoder.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dynamic Hybrid Relation Exploration Network for Cross-Domain Context-Dependent Semantic Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Nested Named Entity Recognition with Partially-Observed TreeCRFs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
VECO: Variable Encoder-decoder Pre-training for Cross-lingual Understanding and Generation.
CoRR, 2020

Keyphrase Extraction with Dynamic Graph Convolutional Networks and Diversified Inference.
CoRR, 2020

Structural Knowledge Distillation.
CoRR, 2020

Fast and Accurate Sequence Labeling with Approximate Inference Network.
CoRR, 2020


Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

OpenUE: An Open Toolkit of Universal Extraction from Text.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

An Investigation of Potential Function Designs for Neural CRF.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

More Embeddings, Better Sequence Labelers?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Structure-Level Knowledge Distillation For Multilingual Sequence Labeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Joint Neural Model for Information Extraction with Global Features.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Knowing What, How and Why: A Near Complete Solution for Aspect-Based Sentiment Analysis.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Noisy BiLSTM-Based Models for Disfluency Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Multi-Modal Neural Machine Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Alibaba Speech Translation Systems for IWSLT 2018.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

2016
Semi-supervised Convolutional Networks for Translation Adaptation with Tiny Amount of In-domain Data.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Bilingual Methods for Adaptive Training Data Selection for Machine Translation.
Proceedings of the 12th Conferences of the Association for Machine Translation in the Americas: MT Researchers' Track, 2016

2014
Improving MT post-editing productivity with adaptive confidence estimation for document-specific translation model.
Mach. Transl., 2014

Adaptive HTER Estimation for Document-Specific MT Post-Editing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Generalized Reordering Rules for Improved SMT.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Joint bilingual name tagging for parallel corpora.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Goodness: A Method for Measuring Machine Translation Confidence.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Feature-Rich Discriminative Phrase Rescoring for SMT.
Proceedings of the COLING 2010, 2010

2009
Confidence Measure for Word Alignment.
Proceedings of the ACL 2009, 2009

2008
When Harry Met Harri: Cross-lingual Name Spelling Normalization.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

2007
Hierarchical System Combination for Machine Translation.
Proceedings of the EMNLP-CoNLL 2007, 2007

2005
Mining translations of OOV terms from the web through cross-lingual query expansion.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Mining Key Phrase Translations from Web Corpora.
Proceedings of the HLT/EMNLP 2005, 2005

Cluster-specific Named Entity Transliteration.
Proceedings of the HLT/EMNLP 2005, 2005

Clustering and Classifying Person Names by Origin.
Proceedings of the Proceedings, 2005

2004
Improving Named Entity Translation Combining Phonetic and Semantic Similarities.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Towards named entity extraction and translation in spoken language translation.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

2003
Extracting named entity translingual equivalence with limited resources.
ACM Trans. Asian Lang. Inf. Process., 2003

The CMU statistical machine translation system.
Proceedings of Machine Translation Summit IX: Papers, 2003

Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-Feature Cost Minimization.
Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003

2002
Improved Named Entity Translation and Bilingual Named Entity Extraction.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

2000
Dialogue management for multimodal user registration.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000


  Loading...