Siyao Peng

Orcid: 0000-0003-4758-4763

According to our database, Siyao Peng authored at least 34 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA.
CoRR, March, 2026

2025
EVADE: LLM-Based Explanation Generation and Validation for Error Detection in NLI.
CoRR, November, 2025

Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations.
CoRR, October, 2025

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet).
CoRR, October, 2025

LeWiDi-2025 at NLPerspectives: The Third Edition of the Learning with Disagreements Shared Task.
CoRR, October, 2025

Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics.
CoRR, June, 2025

eRST: A Signaled Graph Theory of Discourse Relations and Organization.
Comput. Linguistics, 2025

What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
MultiClimate: Multimodal Stance Detection on Climate Change Videos.
CoRR, 2024

CLIMATELI: Evaluating Entity Linking on Climate Change Data.
CoRR, 2024

MaiBaam Annotation Guidelines.
CoRR, 2024

EEVEE: An Easy Annotation Tool for Natural Language Processing.
CoRR, 2024

Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations.
CoRR, 2024

Similarity Preserving Transformer Cross-Modal Hashing for Video-Text Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

VariErr NLI: Separating Annotation Error from Human Label Variation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation.
CoRR, 2023

Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better Generalization.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

2022
Chinese Discourse Annotation Reference Manual.
CoRR, 2022

GCDT: A Chinese RST Treebank for Multigenre and Multilingual Discourse Parsing.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

2021
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English.
CoRR, 2021

DisCoDisCo at the DISRPT2021 Shared Task: A System for Discourse Segmentation, Classification, and Connective Detection.
CoRR, 2021

2020
Tencent submission for WMT20 Quality Estimation Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

A Corpus of Adpositional Supersenses for Mandarin Chinese.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

AMALGUM - A Free, Balanced, Multilayer English Web Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
GumDrop at the DISRPT2019 Shared Task: A Model Stacking Approach to Discourse Unit Segmentation and Connective Detection.
CoRR, 2019

Modeling Long-Range Context for Concurrent Dialogue Acts Recognition.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Adpositional Supersenses for Mandarin Chinese.
CoRR, 2018

All Roads Lead to UD: Converting Stanford and Penn Parses to English Universal Dependencies with Multilayer Annotations.
Proceedings of the Joint Workshop on Linguistic Annotation, 2018

