He Bai

Orcid: 0000-0002-8933-647X

Affiliations:
  • University of Waterloo, School of Computer Science, Canada
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (former)


According to our database1, He Bai authored at least 19 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
How Far Are We from Intelligent Visual Deductive Reasoning?
CoRR, 2024

Divide-or-Conquer? Which Part Should You Distill Your LLM?
CoRR, 2024

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling.
CoRR, 2024

2023
KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn't Know.
CoRR, 2023

Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation.
CoRR, 2023

2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.
CoRR, 2022

A<sup>3</sup>T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing.
Proceedings of the International Conference on Machine Learning, 2022

XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Better Language Model with Hypernym Class Prediction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Cross-Lingual Training with Dense Retrieval for Document Retrieval.
CoRR, 2021

Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

Segatron: Segment-Aware Transformer for Language Modeling and Understanding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures.
CoRR, 2020

SegaBERT: Pre-training of Segment-aware BERT for Language Understanding.
CoRR, 2020

Semantics of the Unwritten.
CoRR, 2020

Cross-Lingual Training of Neural Models for Document Ranking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language.
Proceedings of the 27th International Conference on Computational Linguistics, 2018


  Loading...