Zhi Chen

Orcid: 0000-0003-4180-8455

Affiliations:

Shanghai Jiao Tong University, Department of Computer Science and Engineering, SpeechLab, and MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai, China

According to our database¹, Zhi Chen authored at least 42 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

The Universal Landscape of Human Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

AI4Research: A Survey of Artificial Intelligence for Scientific Research.

[BibT_eX]

[DOI]

CoRR, July, 2025

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, May, 2025

Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, March, 2025

ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model.

[BibT_eX]

[DOI]

CoRR, February, 2025

A survey of multilingual large language models.

[BibT_eX]

[DOI]

Patterns, 2025

DFM: Dialogue foundation model for universal large-scale dialogue-oriented task learning.

[BibT_eX]

[DOI]

AI Open, 2025

LESA: Learnable LLM Layer Scaling-Up.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity.

[BibT_eX]

[DOI]

CoRR, 2024

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing.

[BibT_eX]

[DOI]

CoRR, 2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.

[BibT_eX]

[DOI]

CoRR, 2024

M<sup>3</sup>CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, 2024

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers.

[BibT_eX]

[DOI]

CoRR, 2024

InternLM2 Technical Report.

[BibT_eX]

[DOI]

et al.

CoRR, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

M³CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2023

CLEVA: Chinese Language Models EVAluation Platform.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Schema Generalizability of Text-to-SQL.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

DialogZoo: Large-Scale Dialog-Oriented Task Learning.

[BibT_eX]

[DOI]

CoRR, 2022

UniDU: Towards A Unified Generative Dialogue Understanding Framework.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

AdapterShare: Task Correlation Modeling with Adapter Differentiation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2021

ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking.

[BibT_eX]

[DOI]