Muling Wu
According to our database, Muling Wu authored at least 27 papers between 2023 and 2026.
Bibliography
2026
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges.
CoRR, April, 2026
Neural Networks, 2026
Sci. China Inf. Sci., 2026
2025
CoRR, August, 2025
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning.
CoRR, June, 2025
RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data.
CoRR, May, 2025
CoRR, April, 2025
Neural Networks, 2025
Proceedings of the Natural Language Processing and Chinese Computing, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Structural Reward Model: Enhancing Interpretability, Efficiency, and Scalability in Reward Modeling.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
SpikeBERT: A Language Understanding Spiking Neural Network Learned from BERT with Knowledge Distillation.
Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025
Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2025
2024
Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing.
CoRR, 2024
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Aligning Large Language Models with Human Preferences through Representation Engineering.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Watermarking PLMs on Classification Tasks by Combining Contrastive Learning with Weight Perturbation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023