Zhi Chen
Orcid: 0000-0003-4180-8455Affiliations:
- Shanghai Jiao Tong University, Department of Computer Science and Engineering, SpeechLab, and MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai, China
According to our database1,
Zhi Chen
authored at least 41 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, July, 2025
CoRR, May, 2025
Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning.
CoRR, March, 2025
ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model.
CoRR, February, 2025
DFM: Dialogue foundation model for universal large-scale dialogue-oriented task learning.
AI Open, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity.
CoRR, 2024
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.
CoRR, 2024
M<sup>3</sup>CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought.
CoRR, 2024
CoRR, 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue.
Trans. Assoc. Comput. Linguistics, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Proceedings of the Natural Language Processing and Chinese Computing, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Proceedings of the Natural Language Processing and Chinese Computing, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning.
CoRR, 2019
2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018