Ruoxi Cheng
Orcid: 0009-0001-3261-642X
  According to our database1,
  Ruoxi Cheng
  authored at least 21 papers
  between 2021 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models.
    
  
    CoRR, September, 2025
    
  
Strata-Sword: A Hierarchical Safety Evaluation towards LLMs based on Reasoning Complexity of Jailbreak Instructions.
    
  
    CoRR, September, 2025
    
  
L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training.
    
  
    CoRR, July, 2025
    
  
    CoRR, March, 2025
    
  
Gibberish is All You Need for Membership Inference Detection in Contrastive Language-Audio Pretraining.
    
  
    Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
    
  
    Proceedings of the Information and Communications Security - 27th International Conference, 2025
    
  
    Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025
    
  
    Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025
    
  
USMID: A Unimodal Speaker-Level Membership Inference Detector for Contrastive Pretraining.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts.
    
  
    Proceedings of the 31st International Conference on Computational Linguistics, 2025
    
  
  2024
PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization.
    
  
    CoRR, 2024
    
  
    CoRR, 2024
    
  
KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs.
    
  
    CoRR, 2024
    
  
RLRF: Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs.
    
  
    CoRR, 2024
    
  
    Proceedings of the 2024 4th International Conference on Big Data, 2024
    
  
Optimal Scheduling of Integrated Electric-Gas-Heat Energy System Considering Carbon Emission and User Satisfaction of Heat Supply Network.
    
  
    Proceedings of the IEEE Industry Applications Society Annual Meeting, 2024
    
  
Active and Reactive Power Coordinated Optimal Dispatch in Active Distribution Network Considering Demand Response and Distributed Energy Storage Cooperative Interaction.
    
  
    Proceedings of the IEEE Industry Applications Society Annual Meeting, 2024
    
  
  2021
AI Enabled IoRT Framework for Rodent Activity Monitoring in a False Ceiling Environment.
    
  
    Sensors, 2021