Wenjie Mo

Affiliations:

University of California, Davis, Department of Computer Science, Davis, CA, USA

According to our database, Wenjie Mo authored at least 9 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Towards Policy-Compliant Agents: Learning Efficient Guardrails For Policy Violation Detection.

CoRR, October, 2025

RedCoder: Automated Multi-Turn Red Teaming for Code LLMs.

CoRR, July, 2025

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Rethinking Backdoor Detection Evaluation for Language Models.

CoRR, 2024

Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges.

Proceedings of the 60th Annual Allerton Conference on Communication, 2024

2023

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.

CoRR, 2023

A Causal View of Entity Bias in (Large) Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
