We stand with Ukraine

We stand with Ukraine

Qin Liu

Orcid: 0009-0000-8303-8757

Affiliations:

University of California, Davis, CA, USA
University of Southern California, Los Angeles, CA, USA (former)
Fudan University, School of Computer Science, Shanghai, China (former)

According to our database¹, Qin Liu authored at least 32 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2025

ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, October, 2025

False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize.

[BibT_eX]

[DOI]

,

,

,

CoRR, September, 2025

RedCoder: Automated Multi-Turn Red Teaming for Code LLMs.

[BibT_eX]

[DOI]

Wenjie Jacky Mo

,

,

,

,

,

,

,

CoRR, July, 2025

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Exploring Scaling Laws for EHR Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

Tristan Naumann

,

CoRR, May, 2025

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

VLM-Guard: Safeguarding Vision-Language Models via Fulfilling Safety Alignment Gap.

[BibT_eX]

[DOI]

,

,

,

CoRR, February, 2025

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.

[BibT_eX]

[DOI]

Wenjie Jacky Mo

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Mingyu Derek Ma

,

,

,

,

Tianyi Lorena Yan

,

Wenjie Jacky Mo

,

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

Nikolaos Pappas

,

,

,

,

Lluís Màrquez

,

Miguel Ballesteros

,

Yassine Benajiba

Proceedings of the Findings of the Association for Computational Linguistics, 2025

SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Mingyu Derek Ma

,

,

,

Adithya Kulkarni

,

,

,

,

,

CoRR, 2024

Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Familiarity-aware Evidence Compression for Retrieval Augmented Generation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Securing Multi-turn Conversational Language Models Against Distributed Backdoor Triggers.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

From Shortcuts to Triggers: Backdoor Defense with Denoised PoE.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Securing Multi-turn Conversational Language Models From Distributed Backdoor Attacks.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Monotonic Paraphrasing Improves Generalization of Language Model Prompting.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Pranav Narayanan Venkit

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Raj Sanjay Shah

,

,

,

,

,

,

,

Surangika Ranathunga

,

,

,

,

,

,

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 60th Annual Allerton Conference on Communication, 2024

2023

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Characterizing the Impacts of Instances on Robustness.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Detecting Adversarial Samples through Sharpness of Loss Landscape.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Flooding-X: Improving BERT's Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Overview of Argumentative Text Understanding for AI Debater Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Changjian Jiang

,

Proceedings of the Natural Language Processing and Chinese Computing, 2021

TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Learning to Generate Representations for Novel Words: Mimic the OOV Situation in Training.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2020

Loading...