Ashwinee Panda

According to our database¹, Ashwinee Panda authored at least 32 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought.

[BibT_eX]

[DOI]

CoRR, November, 2025

Alignment-Constrained Dynamic Pruning for LLMs: Identifying and Preserving Alignment-Critical Circuits.

[BibT_eX]

[DOI]

CoRR, November, 2025

Shared Parameter Subspaces and Cross-Task Linearity in Emergently Misaligned Behavior.

[BibT_eX]

[DOI]

Daniel Aarao Reis Arturi

CoRR, November, 2025

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization.

[BibT_eX]

[DOI]

CoRR, September, 2025

FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness.

[BibT_eX]

[DOI]

CoRR, September, 2025

Evaluation Awareness Scales Predictably in Open-Weights Large Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

DynaGuard: A Dynamic Guardrail Model With User-Defined Policies.

[BibT_eX]

[DOI]

CoRR, September, 2025

Dense Backpropagation Improves Training for Sparse Mixture-of-Experts.

[BibT_eX]

[DOI]

CoRR, April, 2025

Analysis of Attention in Video Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, April, 2025

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation.

[BibT_eX]

[DOI]

CoRR, April, 2025

Using Attention Sinks to Identify and Evaluate Dormant Heads in Pretrained LLMs.

[BibT_eX]

[DOI]

Pedro Sandoval Segura

CoRR, April, 2025

Gemstones: A Model Suite for Multi-Faceted Scaling Laws.

[BibT_eX]

[DOI]

CoRR, February, 2025

Continual Pre-training of MoEs: How robust is your router?

[BibT_eX]

[DOI]

Benjamin Thérien

Charles-Étienne Joseph

Trans. Mach. Learn. Res., 2025

Private Fine-tuning of Large Language Models with Zeroth-order Optimization.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Safety Alignment Should be Made More Than Just a Few Tokens Deep.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Privacy Auditing of Large Language Models.

[BibT_eX]

[DOI]

Ashwinee Panda

Xinyu Tang

Christopher A. Choquette-Choo

Milad Nasr

Prateek Mittal

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Privacy-Preserving In-Context Learning for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Teach LLMs to Phish: Stealing Private Information from Language Models.

[BibT_eX]

[DOI]

Ashwinee Panda

Christopher A. Choquette-Choo

Zhengming Zhang

Yaoqing Yang

Prateek Mittal

Proceedings of the Twelfth International Conference on Learning Representations, 2024

StructMoE: Structured Mixture of Experts Using Low Rank Experts.

[BibT_eX]

[DOI]

Kartik Balasubramaniam

Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

Visual Adversarial Examples Jailbreak Aligned Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Visual Adversarial Examples Jailbreak Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Differentially Private In-Context Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Differentially Private Image Classification by Learning Priors from Random Processes.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

DP-RAFT: A Differentially Private Recipe for Accelerated Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, 2022

Neurotoxin: Durable Backdoors in Federated Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

SparseFed: Mitigating Model Poisoning Attacks in Federated Learning with Sparsification.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2020

FetchSGD: Communication-Efficient Federated Learning with Sketching.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Ashwinee Panda

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...