Shaohan Huang

Orcid: 0000-0003-4324-6337

According to our database, Shaohan Huang authored at least 88 papers between 2015 and 2024.

Bibliography

2024
ResLoRA: Identity Residual Mapping in Low-Rank Adaption.
CoRR, 2024

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits.
CoRR, 2024

HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition.
CoRR, 2024

Se²: Sequential Example Selection for In-Context Learning.
CoRR, 2024

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models.
CoRR, 2024

Improving Domain Adaptation through Extended-Text Reading Comprehension.
CoRR, 2024

Text Diffusion with Reinforced Conditioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Improving Log-Based Anomaly Detection by Pre-Training Hierarchical Transformers.
IEEE Trans. Computers, September, 2023

LogEncoder: Log-Based Contrastive Representation Learning for Anomaly Detection.
IEEE Trans. Netw. Serv. Manag., June, 2023

BitNet: Scaling 1-bit Transformers for Large Language Models.
CoRR, 2023

Kosmos-G: Generating Images in Context with Multimodal Large Language Models.
CoRR, 2023

Calibrating LLM-Based Evaluator.
CoRR, 2023

Kosmos-2.5: A Multimodal Literate Model.
CoRR, 2023

Adapting Large Language Models via Reading Comprehension.
CoRR, 2023

LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection.
CoRR, 2023

Scaling Sentence Embeddings with Large Language Models.
CoRR, 2023

Retentive Network: A Successor to Transformer for Large Language Models.
CoRR, 2023

LongNet: Scaling Transformers to 1,000,000,000 Tokens.
CoRR, 2023

Kosmos-2: Grounding Multimodal Large Language Models to the World.
CoRR, 2023

Learning Music Sequence Representation from Text Supervision.
CoRR, 2023

LogQA: Question Answering in Unstructured Logs.
CoRR, 2023

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation.
CoRR, 2023

Language Is Not All You Need: Aligning Perception with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Magneto: A Foundation Transformer.
Proceedings of the International Conference on Machine Learning, 2023

Democratizing Reasoning Ability: Tailored Learning from Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Length-Extrapolatable Transformer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Pre-training Language Model as a Multi-perspective Course Learner.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MoEC: Mixture of Expert Clusters.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
REVAL: Recommend Which Variables to Log With Pretrained Model and Graph Neural Network.
IEEE Trans. Netw. Serv. Manag., December, 2022

TorchScale: Transformers at Scale.
CoRR, 2022

Foundation Transformers.
CoRR, 2022

Language Models are General-Purpose Interfaces.
CoRR, 2022

Task-Specific Expert Pruning for Sparse Mixture-of-Experts.
CoRR, 2022

On the Representation Collapse of Sparse Mixture of Experts.
CoRR, 2022

DeepNet: Scaling Transformers to 1,000 Layers.
CoRR, 2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts.
CoRR, 2022

Impacts of COVID-19 on the Return and Volatility Nexus among Cryptocurrency Market.
Complex., 2022

Adanomaly: Adaptive Anomaly Detection for System Logs with Adversarial Learning.
Proceedings of the 2022 IEEE/IFIP Network Operations and Management Symposium, 2022

Kformer: Knowledge Injection in Transformer Feed-Forward Layers.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

On the Representation Collapse of Sparse Mixture of Experts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Music Sequence Representation From Text Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2022

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Snapshot-Guided Domain Adaptation for ELECTRA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Black-box Attacks to Log-based Anomaly Detection.
Proceedings of the 18th International Conference on Network and Service Management, 2022

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Improving Non-autoregressive Generation with Mixup Training.
CoRR, 2021

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA.
CoRR, 2021

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders.
CoRR, 2021

mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs.
CoRR, 2021

Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

Consistency Regularization for Cross-Lingual Fine-Tuning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log.
IEEE Trans. Netw. Serv. Manag., 2020

A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Paddy: An Event Log Parsing Approach using Dynamic Dictionary.
Proceedings of the NOMS 2020, 2020

TableBank: Table Benchmark for Image-based Table Detection and Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

LayoutLM: Pre-training of Text and Layout for Document Image Understanding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Generating Commonsense Explanation by Extracting Bridge Concepts from Reasoning Paths.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

A Gated Few-shot Learning Model For Anomaly Detection.
Proceedings of the 2020 International Conference on Information Networking, 2020

Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DocBank: A Benchmark Dataset for Document Layout Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Unsupervised Fine-tuning for Text Clustering.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Transfer Log-based Anomaly Detection with Pseudo Labels.
Proceedings of the 16th International Conference on Network and Service Management, 2020

2019
Neural Melody Composition from Lyrics.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Dictionary-Guided Editing Networks for Paraphrase Generation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Response Generation by Context-Aware Prototype Editing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Text Morphing.
CoRR, 2018

Dictionary-Guided Editing Networks for Paraphrase Generation.
CoRR, 2018

Response Generation by Context-aware Prototype Editing.
CoRR, 2018

Neural Document Summarization by Jointly Learning to Score and Select Sentences.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
PSOM: Periodic Self-Organizing Maps for unsupervised anomaly detection in periodic time series.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017

Learning to Generate Product Reviews from Attributes.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Arena: Adaptive real-time update anomaly prediction in cloud systems.
Proceedings of the 13th International Conference on Network and Service Management, 2017

SuperAgent: A Customer Service Chatbot for E-commerce Websites.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Using recurrent neural networks toward black-box system anomaly prediction.
Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service, 2016

2015
A methodology for root-cause analysis in component based systems.
Proceedings of the 23rd IEEE International Symposium on Quality of Service, 2015

Revisit network anomaly ranking in datacenter network using re-ranking.
Proceedings of the 4th IEEE International Conference on Cloud Networking, 2015

