Chang Zhou

Orcid: 0000-0001-9241-702X

Affiliations:
  • DAMO Academy, Alibaba Group, Hangzhou, China


According to our database1, Chang Zhou authored at least 68 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web.
CoRR, 2024

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.
CoRR, 2024

Qwen2-Audio Technical Report.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback.
CoRR, 2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment.
CoRR, 2024

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Training-Free Long-Context Scaling of Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Speculative Contrastive Decoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Synthesizing Text-to-SQL Data from Weak and Strong LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Language Models can Evaluate Themselves via Probability Discrepancy.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Controllable 3D Face Generation with Conditional Style Code Diffusion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Region or Global? A Principle for Negative Sampling in Graph-Based Recommendation.
IEEE Trans. Knowl. Data Eng., June, 2023

CogKR: Cognitive Graph for Multi-Hop Knowledge Reasoning.
IEEE Trans. Knowl. Data Eng., 2023

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models.
CoRR, 2023

OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models.
CoRR, 2023

Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization.
CoRR, 2023

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT.
CoRR, 2023

Qwen Technical Report.
CoRR, 2023

TouchStone: Evaluating Vision-Language Models by Language Models.
CoRR, 2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities.
CoRR, 2023

Scaling Relationship on Learning Mathematical Reasoning with Large Language Models.
CoRR, 2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities.
CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Prompt Tuning for Unified Multimodal Pretrained Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Transferring General Multimodal Pretrained Models to Text Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models.
CoRR, 2022

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese.
CoRR, 2022

Prompt Tuning for Generative Multimodal Pretrained Models.
CoRR, 2022

M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing.
CoRR, 2022

M6-Rec: Generative Pretrained Language Models are Open-Ended Recommender Systems.
CoRR, 2022

Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
CoRR, 2022

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
Proceedings of the International Conference on Machine Learning, 2022

2021
M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining.
CoRR, 2021

Exploring Sparse Expert Models and Beyond.
CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.
CoRR, 2021

M6: A Chinese Multimodal Pretrainer.
CoRR, 2021

UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CogView: Mastering Text-to-Image Generation via Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Are we really making much progress?: Revisiting, benchmarking and refining heterogeneous graph neural networks.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

M6: Multi-Modality-to-Multi-Modality Multitask Mega-transformer for Unified Pretraining.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Learning to Rehearse in Long Sequence Memorization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Connecting Language and Vision for Natural Language-Based Vehicle Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Contrastive Learning for Debiased Candidate Generation at Scale.
CoRR, 2020

CogLTX: Applying BERT to Long Texts.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Understanding Negative Sampling in Graph Representation Learning.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Controllable Multi-Interest Framework for Recommendation.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019
AliGraph: A Comprehensive Graph Neural Network Platform.
Proc. VLDB Endow., 2019

Cognitive Knowledge Graph Reasoning for One-shot Relational Learning.
CoRR, 2019

Cognitive Graph for Multi-Hop Reading Comprehension at Scale.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019


  Loading...