Ziyu Yao
ORCID: 0009-0007-4571-3505
Affiliations:
- George Mason University, Department of Computer Science, Fairfax, VA, USA
- Ohio State University, Columbus, OH, USA (PhD 2021)
According to our database, Ziyu Yao authored at least 58 papers between 2016 and 2025.
Links
Online presence:
- ziyuyao.org
- orcid.org
- cs.gmu.edu
Bibliography
2025
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones.
CoRR, July, 2025
Guiding AI to Fix Its Own Flaws: An Empirical Study on LLM-Driven Secure Code Generation.
CoRR, June, 2025
CoRR, June, 2025
Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models.
CoRR, May, 2025
Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction.
CoRR, April, 2025
Can LLMs Simulate Personas with Reversed Performance? A Benchmark for Counterfactual Instruction Following.
CoRR, April, 2025
AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning.
CoRR, March, 2025
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models.
CoRR, March, 2025
CoRR, February, 2025
CoRR, February, 2025
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Understanding the Effect of Algorithm Transparency of Model Explanations in Text-to-SQL Semantic Parsing.
CoRR, 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models.
CoRR, 2024
IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers.
CoRR, 2024
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education.
CoRR, 2024
Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 20th IEEE International Conference on Automation Science and Engineering, 2024
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning.
CoRR, 2023
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning.
CoRR, 2023
Instance Needs More Care: Rewriting Prompts for Instances Yields Better Zero-Shot Performance.
CoRR, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Improving Generalization in Language Model-based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-based Techniques.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
Proceedings of the 9th International Conference on Learning Representations, 2021
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021
2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2019
Proceedings of the World Wide Web Conference, 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018
2016
Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016