Zhiheng Xi

According to our database, Zhiheng Xi authored at least 19 papers between 2022 and 2024.

Bibliography

2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals.
CoRR, 2024

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models.
CoRR, 2024

RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions.
CoRR, 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning.
CoRR, 2024

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback.
CoRR, 2024

MouSi: Poly-Visual-Expert Vision-Language Models.
CoRR, 2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling.
CoRR, 2024

2023
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment.
CoRR, 2023

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.
CoRR, 2023

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models.
CoRR, 2023

The Rise and Potential of Large Language Model Based Agents: A Survey.
CoRR, 2023

Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey.
CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.
CoRR, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.
CoRR, 2023

RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Characterizing the Impacts of Instances on Robustness.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Connectivity Patterns are Task Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Efficient Adversarial Training with Robust Early-Bird Tickets.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

