Yile Gu

Orcid: 0009-0009-8292-7232

According to our database1, Yile Gu authored at least 29 papers between 2020 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices.
CoRR, June, 2025

Semantic Scheduling for LLM Inference.
CoRR, June, 2025

TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval.
CoRR, February, 2025

Tactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMs.
CoRR, February, 2025

Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models.
CoRR, January, 2025

Scalable and Accurate Application-Level Crash-Consistency Testing via Representative Testing.
Proc. ACM Program. Lang., 2025

NanoFlow: Towards Optimal Large Language Model Serving Throughput.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
NanoFlow: Towards Optimal Large Language Model Serving Throughput.
CoRR, 2024

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models.
CoRR, 2024

Reducing Energy Bloat in Large Model Training.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-Modal Retrieval For Large Language Model Based Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Perseus: Removing Energy Bloat from Large Model Training.
CoRR, 2023

Distillation Strategies for Discriminative Speech Recognition Rescoring.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Personalization for BERT-based Discriminative Speech Recognition Rescoring.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Scaling Laws for Discriminative Speech Recognition Rescoring Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Low-Rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Generative Speech Recognition Error Correction With Large Language Models and Task-Activating Prompting.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Discriminative Speech Recognition Rescoring With Pre-Trained Language Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

RescoreBERT: Discriminative Speech Recognition Rescoring With Bert.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Adjunct-Emeritus Distillation for Semi-Supervised Language Model Adaptation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Domain-Aware Neural Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Personalization Strategies for End-to-End Speech Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Task Language Modeling for Improving Speech Recognition of Rare Words.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion.
CoRR, 2020


  Loading...