Ashwinee Panda

According to our database1, Ashwinee Panda authored at least 30 papers between 2020 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Shared Parameter Subspaces and Cross-Task Linearity in Emergently Misaligned Behavior.
CoRR, November, 2025

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs.
CoRR, October, 2025

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization.
CoRR, September, 2025

FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness.
CoRR, September, 2025

Evaluation Awareness Scales Predictably in Open-Weights Large Language Models.
CoRR, September, 2025

DynaGuard: A Dynamic Guardrail Model With User-Defined Policies.
CoRR, September, 2025

Dense Backpropagation Improves Training for Sparse Mixture-of-Experts.
CoRR, April, 2025

Analysis of Attention in Video Diffusion Transformers.
CoRR, April, 2025

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation.
CoRR, April, 2025

Using Attention Sinks to Identify and Evaluate Dormant Heads in Pretrained LLMs.
CoRR, April, 2025

Gemstones: A Model Suite for Multi-Faceted Scaling Laws.
CoRR, February, 2025

Continual Pre-training of MoEs: How robust is your router?
Trans. Mach. Learn. Res., 2025

Private Fine-tuning of Large Language Models with Zeroth-order Optimization.
Trans. Mach. Learn. Res., 2025

Safety Alignment Should be Made More Than Just a Few Tokens Deep.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Privacy Auditing of Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models.
CoRR, 2024

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs.
CoRR, 2024

A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Privacy-Preserving In-Context Learning for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Teach LLMs to Phish: Stealing Private Information from Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

StructMoE: Structured Mixture of Experts Using Low Rank Experts.
Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts.
Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

Visual Adversarial Examples Jailbreak Aligned Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Visual Adversarial Examples Jailbreak Large Language Models.
CoRR, 2023

Differentially Private In-Context Learning.
CoRR, 2023

Differentially Private Image Classification by Learning Priors from Random Processes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
DP-RAFT: A Differentially Private Recipe for Accelerated Fine-Tuning.
CoRR, 2022

Neurotoxin: Durable Backdoors in Federated Learning.
Proceedings of the International Conference on Machine Learning, 2022

SparseFed: Mitigating Model Poisoning Attacks in Federated Learning with Sparsification.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2020
FetchSGD: Communication-Efficient Federated Learning with Sketching.
Proceedings of the 37th International Conference on Machine Learning, 2020


  Loading...