Canwen Xu

Orcid: 0000-0002-1552-999X

According to our database, Canwen Xu authored at least 37 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

2023
Contrastive Post-training Large Language Models on Data Curriculum.
CoRR, 2023

RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems.
CoRR, 2023

Small Models are Valuable Plug-ins for Large Language Models.
CoRR, 2023

Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization.
Companion Proceedings of the ACM Web Conference 2023, 2023

LongCoder: A Long-Range Pre-trained Language Model for Code Completion.
Proceedings of the International Conference on Machine Learning, 2023

Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Spoiler Detection as Semantic Text Matching.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Survey on Dynamic Neural Networks for Natural Language Processing.
Findings of the Association for Computational Linguistics: EACL 2023, 2023

A Survey on Model Compression and Acceleration for Pretrained Language Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
CoRR, 2022

A Survey on Model Compression for Natural Language Processing.
CoRR, 2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
CoRR, 2022

Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Efficiently Tuned Parameters Are Task Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

InforMask: Unsupervised Informative Masking for Language Model Pretraining.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

BERT Learns to Teach: Knowledge Distillation with Meta Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval.
Findings of the Association for Computational Linguistics: ACL 2022, 2022

Leashing the Inner Demons: Self-Detoxification for Language Models.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Multitask Prompted Training Enables Zero-Shot Task Generalization.
CoRR, 2021

Meta Learning for Knowledge Distillation.
CoRR, 2021

Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
BERT Loses Patience: Fast and Robust Inference with Early Exit.
Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

UnihanLM: Coarse-to-Fine Chinese-Japanese Language Model Pretraining with the Unihan Database.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

BERT-of-Theseus: Compressing BERT by Progressive Module Replacing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Transformers: State-of-the-Art Natural Language Processing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders.
CoRR, 2019

Obj-GloVe: Scene-Based Contextual Object Embedding.
CoRR, 2019

DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets.
Proceedings of the World Wide Web Conference, 2019

Exploiting Multiple Embeddings for Chinese Named Entity Recognition.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

