Eric Wong

Orcid: 0000-0002-8568-6659

Affiliations:

University of Pennsylvania, Department of Computer and Information Science, Philadelphia, PA, USA
Massachusetts Institute of Technology (MIT), CSAIL, Cambridge, MA, USA (former)
Carnegie Mellon University, Machine Learning Department, Pittsburgh, PA, USA (former, PhD 2020)

According to our database¹, Eric Wong authored at least 63 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

T-FIX: Text-Based Explanations with Features Interpretable to eXperts.

[BibT_eX]

[DOI]

Sameed Ahmed M. Khatana

Gary E. Weissman

Lyle H. Ungar

Eric Wong

CoRR, November, 2025

Once Upon an Input: Reasoning via Per-Instance Program Synthesis.

[BibT_eX]

[DOI]

CoRR, October, 2025

Stable Prediction of Adverse Events in Medical Time-Series Data.

[BibT_eX]

[DOI]

CoRR, October, 2025

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks.

[BibT_eX]

[DOI]

CoRR, October, 2025

Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs.

[BibT_eX]

[DOI]

CoRR, September, 2025

Probabilistic Soundness Guarantees in LLM Reasoning Chains.

[BibT_eX]

[DOI]

CoRR, July, 2025

Instruction Following by Boosting Attention of Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Benchmarking Misuse Mitigation Against Covert Adversaries.

[BibT_eX]

[DOI]

CoRR, June, 2025

The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Probabilistic Stability Guarantees for Feature Attributions.

[BibT_eX]

[DOI]

CoRR, April, 2025

CTSketch: Compositional Tensor Sketching for Scalable Neurosymbolic Learning.

[BibT_eX]

[DOI]

CoRR, March, 2025

NSF-SciFy: Mining the NSF Awards Database for Scientific Claims.

[BibT_eX]

[DOI]

CoRR, March, 2025

Adaptively evaluating models with task elicitation.

[BibT_eX]

[DOI]

CoRR, March, 2025

Where's the Bug? Attention Probing for Scalable Fault Localization.

[BibT_eX]

[DOI]

CoRR, February, 2025

SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Jailbreaking Black Box Large Language Models in Twenty Queries.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Secure and Trustworthy Machine Learning, 2025

Avoiding Copyright Infringement via Large Language Model Unlearning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Style Alignment in Cross-Cultural Translation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

TorchQL: A Programming Framework for Integrity Constraints in Machine Learning.

[BibT_eX]

[DOI]

Proc. ACM Program. Lang., 2024

Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning.

[BibT_eX]

[DOI]

CoRR, 2024

The FIX Benchmark: Extracting Features Interpretable to eXperts.

[BibT_eX]

[DOI]

CoRR, 2024

Avoiding Copyright Infringement via Machine Unlearning.

[BibT_eX]

[DOI]

CoRR, 2024

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.

[BibT_eX]

[DOI]

CoRR, 2024

Data-Efficient Learning with Neural Programs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models.

[BibT_eX]

[DOI]

Patrick Chao

Edoardo Debenedetti

Alexander Robey

Maksym Andriushchenko

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Compositionality in Concept Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Evaluating Groups of Features via Consistency, Contiguity, and Stability.

[BibT_eX]

[DOI]

Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Initialization Matters for Adversarial Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Sum-of-Parts Models: Faithful Attributions for Groups of Features.

[BibT_eX]

[DOI]

CoRR, 2023

MDB: Interactively Querying Datasets and Models.

[BibT_eX]

[DOI]

CoRR, 2023

Rectifying Group Irregularities in Explanations for Distribution Shift.

[BibT_eX]

[DOI]

CoRR, 2023

Do Machine Learning Models Learn Common Sense?

[BibT_eX]

[DOI]

CoRR, 2023

In-context Example Selection with Influences.

[BibT_eX]

[DOI]

Tai Nguyen

Eric Wong

CoRR, 2023

Adversarial Prompting for Black Box Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

Adversarial robustness in discontinuous spaces via alternating sampling & descent.

[BibT_eX]

[DOI]

Rahul Mysore Venkatesh

Eric Wong

Zico Kolter

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Stability Guarantees for Feature Attributions with Multiplicative Smoothing.

[BibT_eX]

[DOI]

Anton Xue

Rajeev Alur

Eric Wong

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Faithful Chain-of-Thought Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Do Machine Learning Models Learn Statistical Rules Inferred from Data?

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

TopEx: Topic-based Explanations for Model Comparison.

[BibT_eX]

[DOI]

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Comparing Styles across Languages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Data-Based Perspective on Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

When does Bias Transfer in Transfer Learning?

[BibT_eX]

[DOI]

CoRR, 2022

Missingness Bias in Model Debugging.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Certified Patch Robustness via Smoothed Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Provable, Structured, and Efficient Methods for Robustness of Deep Networks to Adversarial Examples.

[BibT_eX]

[DOI]

Eric Wong

PhD thesis, 2021

DeepSplit: Scalable Verification of Deep Neural Networks via Operator Splitting.

[BibT_eX]

[DOI]

CoRR, 2021

Leveraging Sparse Linear Layers for Debuggable Deep Networks.

[BibT_eX]

[DOI]

Eric Wong

Shibani Santurkar

Aleksander Madry

Proceedings of the 38th International Conference on Machine Learning, 2021

Learning perturbation sets for robust machine learning.

[BibT_eX]

[DOI]

Eric Wong

J. Zico Kolter

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Neural Network Virtual Sensors for Fuel Injection Quantities with Provable Performance Specifications.

[BibT_eX]

[DOI]

Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Overfitting in adversarially robust deep learning.

[BibT_eX]

[DOI]

Leslie Rice

Eric Wong

J. Zico Kolter

Proceedings of the 37th International Conference on Machine Learning, 2020

Adversarial Robustness Against the Union of Multiple Perturbation Models.

[BibT_eX]

[DOI]

Pratyush Maini

Eric Wong

J. Zico Kolter

Proceedings of the 37th International Conference on Machine Learning, 2020

Fast is better than free: Revisiting adversarial training.

[BibT_eX]

[DOI]

Eric Wong

Leslie Rice

J. Zico Kolter

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Wasserstein Adversarial Examples via Projected Sinkhorn Iterations.

[BibT_eX]

[DOI]

Eric Wong

Frank R. Schmidt

J. Zico Kolter

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Scaling provable adversarial defenses.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Provable Defenses against Adversarial Examples via the Convex Outer Adversarial Polytope.

[BibT_eX]

[DOI]

Eric Wong

J. Zico Kolter

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

A Semismooth Newton Method for Fast, Generic Convex Programming.

[BibT_eX]

[DOI]

Alnur Ali

Eric Wong

J. Zico Kolter

Proceedings of the 34th International Conference on Machine Learning, 2017

2015

An SVD and Derivative Kernel Approach to Learning from Geometric Data.

[BibT_eX]

[DOI]

Eric Wong

J. Zico Kolter

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Eric Wong

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...