Songyang Gao

Orcid: 0009-0003-4900-0458

According to our database¹, Songyang Gao authored at least 42 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Intern-S1: A Scientific Multimodal Foundation Model.

[BibT_eX]

[DOI]

CoRR, August, 2025

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward.

[BibT_eX]

[DOI]

CoRR, August, 2025

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning.

[BibT_eX]

[DOI]

CoRR, July, 2025

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner.

[BibT_eX]

[DOI]

CoRR, July, 2025

Pre-Trained Policy Discriminators are General Reward Models.

[BibT_eX]

[DOI]

CoRR, July, 2025

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future.

[BibT_eX]

[DOI]

CoRR, April, 2025

Unicorn: Text-Only Data Synthesis for Vision Language Model Training.

[BibT_eX]

[DOI]

CoRR, March, 2025

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning.

[BibT_eX]

[DOI]

CoRR, February, 2025

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Are Your LLMs Capable of Stable Reasoning?

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Are Your LLMs Capable of Stable Reasoning?

[BibT_eX]

[DOI]

CoRR, 2024

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments.

[BibT_eX]

[DOI]

CoRR, 2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.

[BibT_eX]

[DOI]

CoRR, 2024

CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Navigating the OverKill in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment.

[BibT_eX]

[DOI]

CoRR, 2023

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks.

[BibT_eX]

[DOI]

Sizhou Chen

Songyang Gao

Sen Fang

CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.

[BibT_eX]

[DOI]

CoRR, 2023

CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing.

[BibT_eX]

[DOI]

CoRR, 2023

RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective.

[BibT_eX]

[DOI]

CoRR, 2022

Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Songyang Gao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...