Ashwath Aithal

According to our database, Ashwath Aithal authored at least 15 papers between 2020 and 2026.

Bibliography

2026
Heterogeneous Parallelism for Multimodal Large Language Model Training.
CoRR, May, 2026

X-Token: Projection-Guided Cross-Tokenizer Knowledge Distillation.
CoRR, May, 2026

Star Elastic: Many-in-One Reasoning LLMs with Efficient Budget Control.
CoRR, May, 2026

Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding.
CoRR, April, 2026

Scalable Training of Mixture-of-Experts Models with Megatron Core.
CoRR, March, 2026

2025
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs.
CoRR, November, 2025

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core.
CoRR, April, 2025

Training Video Foundation Models with NVIDIA NeMo.
CoRR, March, 2025

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024
Llama 3 Meets MoE: Efficient Upcycling.
CoRR, 2024

Upcycling Large Language Models into Mixture of Experts.
CoRR, 2024

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment.
CoRR, 2024

Nemotron-4 15B Technical Report.
CoRR, 2024

2023
Hierarchical Graph Neural Network with Cross-Attention for Cross-Device User Matching.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2023

2020
3D Object Detection on Point Clouds using Local Ground-aware and Adaptive Representation of scenes' surface.
CoRR, 2020

