Ruisi Cai

According to our database¹, Ruisi Cai authored at least 19 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Star Elastic: Many-in-One Reasoning LLMs with Efficient Budget Control.

CoRR, May, 2026

∇-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space.

CoRR, March, 2026

2025

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs.

Ali Taghibakhshi

Sharath Turuvekere Sreenivas

Saurav Muralidharan

Ruisi Cai

Marcin Chochowski

Ameya Sunil Mahabaleshwarkar

CoRR, November, 2025

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding.

CoRR, January, 2025

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Steepest Descent Density Control for Compact 3D Gaussian Splatting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design.

Babak Ehteshami Bejnordi

Aditya Akella

Zhangyang Wang

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LoCoCo: Dropping In Convolutions for Long Context Compression.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Flextron: Many-in-One Flexible Large Language Model.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

H₂O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

CoRR, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?

Ruisi Cai

Zhenyu Zhang

Zhangyang Wang

Proceedings of the International Conference on Machine Learning, 2023

Robust Mixture-of-Expert Training for Convolutional Neural Networks.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Many-Task Federated Learning: A New Problem Setting and A Simple Baseline.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
