Zhengxuan Wu

Orcid: 0000-0001-5581-8908

According to our database, Zhengxuan Wu authored at least 35 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions.
CoRR, 2024

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation.
CoRR, 2024

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations.
CoRR, 2024

A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments.
CoRR, 2024

2023
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca.
CoRR, 2023

ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation.
CoRR, 2023

Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations.
CoRR, 2023

Interpretability at Scale: Identifying Causal Mechanisms in Alpaca.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Causal Proxy Models for Concept-based Model Explanations.
Proceedings of the International Conference on Machine Learning, 2023

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Rigorously Assessing Natural Language Explanations of Neurons.
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Oolong: Investigating What Makes Crosslingual Transfer Hard with Controlled Studies.
CoRR, 2022

Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models.
Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Causal Distillation for Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Inducing Causal Structure for Interpretable Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

2021
Modeling Emotion in Complex Stories: The Stanford Emotional Narratives Dataset.
IEEE Trans. Affect. Comput., 2021

Attention uncovers task-relevant semantics in emotional narrative understanding.
Knowl. Based Syst., 2021

On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification.
CoRR, 2021

ReaSCAN: Compositional Reasoning in Language Grounding.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Dynabench: Rethinking Benchmarking in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Not Now, Ask Later: Users Weaken Their Behavior Change Regimen Over Time, But Expect To Re-Strengthen It Imminently.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

DynaSent: A Dynamic Benchmark for Sentiment Analysis.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis.
CoRR, 2020

Pragmatically Informative Color Generation by Grounding Contextual Modifiers.
CoRR, 2020

Structured Self-Attention Weights Encode Semantics in Sentiment Analysis.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

2019
Uncovering Political Promotion in China: A Network Analysis of Patronage Relationship in Autocracy.
CoRR, 2019

Disentangling Latent Emotions of Word Embeddings on Complex Emotional Narratives.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Conservation of Procrastination: Do Productivity Interventions Save Time Or Just Redistribute It?
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

Attending to Emotional Narratives.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018
Rotating Online Behavior Change Interventions Increases Effectiveness But Also Increases Attrition.
Proc. ACM Hum. Comput. Interact., 2018

