Xiaoyu Shen

Orcid: 0000-0002-0217-2469

Affiliations:
  • Max Planck Institute for Informatics (MPII), Saarbrücken, Germany
  • Saarland University, Spoken Language Systems, Saarbrücken, Germany


According to our database1, Xiaoyu Shen authored at least 38 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Unraveling the Mystery of Scaling Laws: Part I.
CoRR, 2024

The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis.
CoRR, 2024

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
LawBench: Benchmarking Legal Knowledge of Large Language Models.
CoRR, 2023

Weaker Than You Think: A Critical Look atWeakly Supervised Learning.
CoRR, 2023

Meta Self-Refinement for Robust Learning with Weak Supervision.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Weaker Than You Think: A Critical Look at Weakly Supervised Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
MDIA: A Benchmark for Multilingual Dialogue Generation in 46 Languages.
CoRR, 2022


RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Deep latent-variable models for neural text generation.
PhD thesis, 2021

Knowledge-enhanced Session-based Recommendation with Temporal Transformer.
CoRR, 2021

The SelectGen Challenge: Finding the Best Training Samples for Few-Shot Neural Text Generation.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Neural Data-to-Text Generation with LM-based Text Augmentation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

On Training Instance Selection for Few-Shot Neural Text Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Integrating Image Captioning with Rule-based Entity Masking.
CoRR, 2020

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence.
CoRR, 2020

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

MovieChats: Chat like Humans in a Closed Domain.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Diversifying Dialogue Generation with Non-Conversational Text.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Improving Latent Alignment in Text Summarization by Generalizing the Pointer Generator.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Select and Attend: Towards Controllable Content Selection in Text Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Rewriter for Multi-Sentence Compression.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Improving Multi-turn Dialogue Modelling with Utterance ReWriter.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Simulating the Large-Scale Erosion of Genomic Privacy Over Time.
IEEE ACM Trans. Comput. Biol. Bioinform., 2018

A comprehensive study: Sentence compression with linguistic knowledge-enhanced gated neural network.
Data Knowl. Eng., 2018

Nexus Network: Connecting the Preceding and the Following in Dialogue Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Dialogue Generation With GAN.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Improving Variational Encoder-Decoders in Dialogue Generation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Towards Better Variational Encoder-Decoders in Seq2Seq Tasks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Gated Neural Network for Sentence Compression Using Linguistic Knowledge.
Proceedings of the Natural Language Processing and Information Systems, 2017

Estimation of Gap Between Current Language Models and Human Performance.
Proceedings of the Interspeech 2017, 2017

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Wake-Sleep Variational Autoencoders for Language Modeling.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

A Conditional Variational Framework for Dialog Generation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017


  Loading...