Shizhe Diao
Orcid: 0000-0002-3325-9209
According to our database1,
Shizhe Diao
authored at least 34 papers
between 2017 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning.
CoRR, 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards.
CoRR, 2024
ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
2023
Trans. Mach. Learn. Res., 2023
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting.
CoRR, 2023
Speciality vs Generality: An Empirical Study on Catastrophic Forgetting in Fine-tuning Foundation Models.
CoRR, 2023
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models.
CoRR, 2023
Proceedings of the ACM Web Conference 2023, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training.
Proceedings of the International Conference on Machine Learning, 2022
2021
Efficient Neural Network Training via Forward and Backward Propagation Sparsification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
2017
Proceedings of the 13th International Conference on Semantics, Knowledge and Grids, 2017