Wei Xu

Orcid: 0000-0002-7044-3232

Affiliations:
  • Georgia Institute of Technology, School of Interactive Computing, Atlanta, GA, USA
  • Ohio State University, Department of Computer Science and Engineering, Columbus, OH, USA (former)
  • University of Pennsylvania, Computer Information and Science Department, Philadelphia, PA, USA (former)
  • New York University, NY, USA (former, PhD 2014)
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China (former)


According to our database1, Wei Xu authored at least 65 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Automatic and Human-AI Interactive Text Generation.
CoRR, 2023

Can Language Models be Instructed to Protect Personal Information?
CoRR, 2023

Having Beer after Prayer? Measuring Cultural Bias in Large Language Models.
CoRR, 2023

Multilingual Simplification of Medical Texts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Thresh: A Unified, Customizable and Deployable Platform for Fine-Grained Text Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Revisiting non-English Text Simplification: A Unified Multilingual Benchmark.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

LENS: A Learnable Evaluation Metric for Text Simplification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Improved Instruction Ordering in Recipe-Grounded Conversation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Frustratingly Easy Label Projection for Cross-lingual Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Stanceosaurus: Classifying Stance Towards Multilingual Misinformation.
CoRR, 2022

Stanceosaurus: Classifying Stance Towards Multicultural Misinformation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

arXivEdits: Understanding the Human Revision Process in Scientific Writing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Large-scale Paraphrase Acquisition and Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Extracting a Knowledge Base of COVID-19 Events from Social Media.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Controllable Text Simplification with Explicit Paraphrasing.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

WIKIBIAS: Detecting Multi-Span Subjective Biases in Language.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

BiSECT: Learning to Split and Rephrase Sentences with Bitexts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Pre-train or Annotate? Domain Adaptation with a Constrained Budget.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Neural semi-Markov CRF for Monolingual Word Alignment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
WNUT-2020 Task 1 Overview: Extracting Entities and Relations from Wet Lab Protocols.
CoRR, 2020

Extracting COVID-19 Events from Twitter.
CoRR, 2020

A Focused Study to Compare Arabic Pre-training Models on Newswire IE Tasks.
CoRR, 2020

An Empirical Study of Pre-trained Transformers for Arabic Information Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning Relation Entailment with Structured and Textual Information.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

WNUT-2020 Task 1 Overview: Extracting Entities and Relations from Wet Lab Protocols.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Code and Named Entity Recognition in StackOverflow.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Generalizing Natural Language Analysis through Span-relation Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Neural CRF Model for Sentence Alignment in Text Simplification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Discourse Level Factors for Sentence Deletion in Text Simplification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-task Pairwise Neural Ranking for Hashtag Segmentation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Character-Based Neural Networks for Sentence Pair Modeling.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

An Annotated Corpus for Machine Reading of Instructions in Wet Lab Protocols.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Network Models for Paraphrase Identification, Semantic Textual Similarity, Natural Language Inference, and Question Answering.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
A Continuously Growing Dataset of Sentential Paraphrases.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Optimizing Statistical Machine Translation for Text Simplification.
Trans. Assoc. Comput. Linguistics, 2016

A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter.
CoRR, 2016

TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Results of the WNUT16 Named Entity Recognition Shared Task.
Proceedings of the 2nd Workshop on Noisy User-generated Text, 2016

Discovering User Attribute Stylistic Differences via Paraphrasing.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Problems in Current Text Simplification Research: New Data Can Help.
Trans. Assoc. Comput. Linguistics, 2015

SemEval-2015 Task 1: Paraphrase and Semantic Similarity in Twitter (PIT).
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Cost Optimization in Crowdsourcing Translation: Low cost translations made even cheaper.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition.
Proceedings of the Workshop on Noisy User-generated Text, 2015

2014
Data-driven Approaches for Paraphrasing across Language Variations.
PhD thesis, 2014

Extracting Lexically Divergent Paraphrases from Twitter.
Trans. Assoc. Comput. Linguistics, 2014

Poetry of the Crowd: A Human Computation Algorithm to Convert Prose into Rhyming Verse.
Proceedings of the Seconf AAAI Conference on Human Computation and Crowdsourcing, 2014

Infusion of Labeled Data into Distant Supervision for Relation Extraction.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Filling Knowledge Base Gaps for Distant Supervision of Relation Extraction.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Gathering and Generating Paraphrases from Twitter with Application to Normalization.
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, 2013

2012
Paraphrasing for Style.
Proceedings of the COLING 2012, 2012

2011
New York University 2011 System for KBP Slot Filling.
Proceedings of the Fourth Text Analysis Conference, 2011

Passage Retrieval for Information Extraction using Distant Supervision.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Exploiting Syntactic and Distributional Information for Spelling Correction with Web-Scale N-gram Models.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2009
Automatic Recognition of Logical Relations for English, Chinese and Japanese in the GLARF Framework.
Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, 2009

Transducing Logical Relations from Automatic and Manual GLARF.
Proceedings of the Third Linguistic Annotation Workshop, 2009

Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task.
Proceedings of the ACL 2009, 2009

2007
Using Non-Local Features to Improve Named Entity Recognition Recall.
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation, 2007

2006
Building Document Graphs for Multiple News Articles Summarization: An Event-Based Approach.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Deriving Event Relevance from the Ontology Constructed with Formal Concept Analysis.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

Extractive Summarization using Inter- and Intra- Event Relevance.
Proceedings of the ACL 2006, 2006


  Loading...