Shizhe Diao

ORCID: 0000-0002-3325-9209

According to our database, Shizhe Diao authored at least 34 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.


Bibliography

2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning.
CoRR, 2024

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards.
CoRR, 2024

Can We Verify Step by Step for Incorrect Answer Detection?
CoRR, 2024

The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs.
CoRR, 2024

ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Black-Box Prompt Learning for Pre-trained Language Models.
Trans. Mach. Learn. Res., 2023

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions.
CoRR, 2023

Plum: Prompt Learning using Metaheuristic.
CoRR, 2023

MarineGPT: Unlocking Secrets of Ocean to the Public.
CoRR, 2023

UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting.
CoRR, 2023

Speciality vs Generality: An Empirical Study on Catastrophic Forgetting in Fine-tuning Foundation Models.
CoRR, 2023

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models.
CoRR, 2023

On the Difference of BERT-style and CLIP-style Text Encoders.
CoRR, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment.
CoRR, 2023

Active Prompting with Chain-of-Thought for Large Language Models.
CoRR, 2023

Hashtag-Guided Low-Resource Tweet Classification.
Proceedings of the ACM Web Conference 2023, 2023

Write and Paint: Generative Vision-Language Models are Unified Modal Learners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data.
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DetGPT: Detect What You Need via Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Doolittle: Benchmarks and Corpora for Academic Writing Formalization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On the Difference of BERT-style and CLIP-style Text Encoders.
Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT.
CoRR, 2022

Normalizing Flow with Variational Latent Representation.
CoRR, 2022

Prefix Language Models are Unified Modal Learners.
CoRR, 2022

VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models.
CoRR, 2022

Black-box Prompt Learning for Pre-trained Language Models.
CoRR, 2022

VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training.
Proceedings of the International Conference on Machine Learning, 2022

2021
Efficient Neural Network Training via Forward and Backward Propagation Sparsification.
Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation.
Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations.
Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2017
GubaLex: Guba-Oriented Sentiment Lexicon for Big Texts in Finance.
Proceedings of the 13th International Conference on Semantics, Knowledge and Grids, 2017
