Ge Zhang
Orcid: 0000-0002-0064-2906Affiliations:
- ByteDance Inc.
- 01.AI (former)
- University of Waterloo, Canada (PhD)
- Beijing Academy of Artificial Intelligence, China (former)
- University of Michigan, Ann Arbor, MI, USA (former)
According to our database1,
Ge Zhang authored at least 197 papers
between 2020 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2026
CoRR, May, 2026
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining.
CoRR, March, 2026
CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs.
CoRR, March, 2026
Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization.
CoRR, February, 2026
WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints.
CoRR, February, 2026
CoRR, February, 2026
CoRR, February, 2026
TabularMath: Evaluating Computational Extrapolation in Tabular Learning via Program-Verified Synthesis.
CoRR, February, 2026
Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities.
CoRR, January, 2026
CoRR, January, 2026
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning.
CoRR, January, 2026
MMTableBench: A Multi-level Multimodal Benchmark for Reasoning and Layout Complexity in Table QA.
Proceedings of the ACM Web Conference 2026, 2026
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, December, 2025
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents.
CoRR, December, 2025
CoRR, December, 2025
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence.
CoRR, November, 2025
CoRR, November, 2025
CoRR, November, 2025
CoRR, November, 2025
RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization.
CoRR, November, 2025
CoRR, November, 2025
CoRR, October, 2025
CoRR, October, 2025
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems.
CoRR, October, 2025
CoRR, October, 2025
CoRR, September, 2025
CoRR, September, 2025
CoRR, September, 2025
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning.
CoRR, September, 2025
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling.
CoRR, August, 2025
CoRR, August, 2025
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
CoRR, August, 2025
CoRR, August, 2025
CoRR, August, 2025
CoRR, July, 2025
CoRR, July, 2025
CoRR, July, 2025
Multim. Syst., June, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs.
CoRR, April, 2025
CoRR, March, 2025
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
CoRR, February, 2025
Trans. Mach. Learn. Res., 2025
Trans. Mach. Learn. Res., 2025
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025
Proceedings of the Natural Language Processing and Chinese Computing, 2025
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Trans. Mach. Learn. Res., 2024
Trans. Mach. Learn. Res., 2024
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm.
CoRR, 2024
CoRR, 2024
GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models.
CoRR, 2024
CoRR, 2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
CoRR, 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
CoRR, 2024
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
CoRR, 2023
TPDM: Selectively Removing Positional Information for Zero-shot Translation via Token-Level Position Disentangle Module.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
2022
MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning.
CoRR, 2022
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022
2020
CoRR, 2020