Ruisi Cai

According to our database1, Ruisi Cai authored at least 19 papers between 2022 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Star Elastic: Many-in-One Reasoning LLMs with Efficient Budget Control.
CoRR, May, 2026

∇-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space.
CoRR, March, 2026

2025
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs.
CoRR, November, 2025

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding.
CoRR, January, 2025

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Steepest Descent Density Control for Compact 3D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LoCoCo: Dropping In Convolutions for Long Context Compression.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Flextron: Many-in-One Flexible Large Language Model.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
H<sub>2</sub>O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
CoRR, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Proceedings of the International Conference on Machine Learning, 2023

Robust Mixture-of-Expert Training for Convolutional Neural Networks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Many-Task Federated Learning: A New Problem Setting and A Simple Baseline.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


  Loading...