Junzhuo Li

According to our database1, Junzhuo Li authored at least 19 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation.
CoRR, April, 2026

Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design.
CoRR, March, 2026

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities.
CoRR, May, 2025

LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Exploring Engineering Undergraduate's Critical Thinking Patterns in AI-Enhanced Learning: A TCSA Method.
Proceedings of the Blended Learning. Sustainable and Flexible Smart Learning, 2025

Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
CoRR, 2024

2023
Urban Resident Travel Survey Method Based on Cellular Signaling Data.
ISPRS Int. J. Geo Inf., July, 2023

Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Tab-CQA: A Tabular Conversational Question Answering Dataset on Financial Reports.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), 2023

2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks.
CoRR, 2022

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework.
CoRR, 2022

KaFSP: Knowledge-Aware Fuzzy Semantic Parsing for Conversational Question Answering over a Large-Scale Knowledge Base.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2020
MDPeQA: A Pediatric Question Answering System Based on Multiple Data Sources.
Proceedings of the Chinese Lexical Semantics - 21st Workshop, 2020


  Loading...