Simiao Zuo

Orcid: 0009-0002-8014-3150

According to our database, Simiao Zuo authored at least 26 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2024
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning.
CoRR, 2024

2023
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing.
CoRR, 2023

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Less is More: Task-aware Layer-wise Distillation for Language Model Compression.
Proceedings of the International Conference on Machine Learning, 2023

SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process.
Proceedings of the International Conference on Machine Learning, 2023

Machine Learning Force Fields with Data Cost Aware Training.
Proceedings of the International Conference on Machine Learning, 2023

DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2022
Efficient Long Sequence Modeling via State Space Augmented Transformer.
CoRR, 2022

DiP-GNN: Discriminative Pre-Training of Graph Neural Networks.
CoRR, 2022

Differentially Private Estimation of Hawkes Process.
CoRR, 2022

MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Self-Training with Differentiable Teacher.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Adversarially Regularized Policy Learning Guided by Trajectory Optimization.
Proceedings of the Learning for Dynamics and Control Conference, 2022

PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance.
Proceedings of the International Conference on Machine Learning, 2022

Taming Sparsely Activated Transformer with Stochastic Experts.
Proceedings of the Tenth International Conference on Learning Representations, 2022

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach.
CoRR, 2021

Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Hypergradient Approach to Robust Regression without Correspondence.
Proceedings of the 9th International Conference on Learning Representations, 2021

Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ARCH: Efficient Adversarial Regularized Training with Caching.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Transformer Hawkes Process.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Tensor maps for synchronizing heterogeneous shape collections.
ACM Trans. Graph., 2019

2018
Image Score: How to Select Useful Samples.
CoRR, 2018

