Rongwu Xu

Orcid: 0009-0000-8179-7354

According to our database1, Rongwu Xu authored at least 33 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap.
CoRR, August, 2025

AICrypto: A Comprehensive Benchmark For Evaluating Cryptography Capabilities of Large Language Models.
CoRR, July, 2025

The Singapore Consensus on Global AI Safety Research Priorities.
CoRR, June, 2025

LIFEBench: Evaluating Length Instruction Following in Large Language Models.
CoRR, May, 2025

AI Awareness.
CoRR, April, 2025

"Nuclear Deployed!": Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents.
CoRR, February, 2025

On the Role of Attention Heads in Large Language Model Safety.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Nuclear Deployed!: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Does Chain-of-Thought Reasoning Really Reduce Harmfulness from Jailbreaking?
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
LSync: A Universal Timeline-Synchronizing Solution for Live Streaming.
IEEE/ACM Trans. Netw., October, 2024

Impact Load Localization Based on Multi-Scale Feature Fusion Convolutional Neural Network.
Sensors, September, 2024

Impact Localization in Complex Cylindrical Shell Structures Based on the Time-Reversal Virtual Focusing Triangulation Method.
Sensors, August, 2024

Long<sup>2</sup>RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall.
CoRR, 2024

On the Role of Attention Heads in Large Language Model Safety.
CoRR, 2024

DebateQA: Evaluating Question Answering on Debatable Knowledge.
CoRR, 2024

Course-Correction: Safety Alignment Using Synthetic Preferences.
CoRR, 2024

MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models.
CoRR, 2024

Knowledge Conflicts for LLMs: A Survey.
CoRR, 2024

MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training.
Proceedings of the International Joint Conference on Neural Networks, 2024

Exploring Chinese Humor Generation: A Study on Two-part Allegorical Sayings.
Proceedings of the International Joint Conference on Neural Networks, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Sing it, Narrate it: Quality Musical Lyrics Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Knowledge Conflicts for LLMs: A Survey.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Course-Correction: Safety Alignment Using Synthetic Preferences.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

LONG²RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Preemptive Answer "Attacks" on Chain-of-Thought Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Research on structural sound source localization method by neural network.
EURASIP J. Adv. Signal Process., 2023

MISO: Legacy-compatible Privacy-preserving Single Sign-on using Trusted Execution Environments.
Proceedings of the 8th IEEE European Symposium on Security and Privacy, 2023

2022
LSync: A Universal Event-synchronizing Solution for Live Streaming.
Proceedings of the IEEE INFOCOM 2022, 2022

LifeRec: A Mobile App for Lifelog Recording and Ubiquitous Recommendation.
Proceedings of the CHIIR '22: ACM SIGIR Conference on Human Information Interaction and Retrieval, Regensburg, Germany, March 14, 2022


  Loading...