Jiayi Yuan

Orcid: 0009-0005-9208-6677

Affiliations:
  • Rice University, Department of Computer Science, Houston, TX, USA


According to our database1, Jiayi Yuan authored at least 24 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning.
CoRR, June, 2025

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models.
CoRR, May, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models.
CoRR, March, 2025

The Science of Evaluating Foundation Models.
CoRR, February, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models.
Trans. Mach. Learn. Res., 2025

DHP Benchmark: Are LLMs Good NLG Evaluators?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

ReasonerRank: Redefining Language Model Evaluation with Ground-Truth-Free Ranking Frameworks.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Understanding Different Design Choices in Training Large Time Series Models.
CoRR, 2024

LoRATK: LoRA Once, Backdoor Everywhere in the Share-and-Play Ecosystem.
CoRR, 2024

GNNs Also Deserve Editing, and They Need It More Than Once.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
EyeCoD: Eye Tracking System Acceleration via FlatCam-Based Algorithm and Hardware Co-Design.
IEEE Micro, 2023

LLM for Patient-Trial Matching: Privacy-Aware Data Augmentation Towards Better Performance and Generalizability.
CoRR, 2023

Towards Fair Patient-Trial Matching via Patient-Criterion Level Fairness Constraint.
CoRR, 2023

Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement.
Proceedings of the IEEE International Conference on Acoustics, 2023

NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Robust Tickets Can Transfer Better: Drawing More Transferable Subnetworks in Transfer Learning.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke.
Proceedings of the 14th ACM International Conference on Bioinformatics, 2023

2022
EyeCoD: eye tracking system acceleration via flatcam-based algorithm & accelerator co-design.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022


  Loading...