Zhilin Yang

ORCID: 0009-0008-0681-9603

Affiliations:
  • Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China
  • Shanghai Qi Zhi Institute, China
  • Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA (PhD 2020)


According to our database, Zhilin Yang authored at least 51 papers between 2013 and 2024.

Bibliography

2024
GPT understands, too.
AI Open, 2024

2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X.
CoRR, 2023

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Not All Tasks Are Born Equal: Understanding Zero-Shot Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Compositional Task Representations for Large Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Universal Discriminator for Zero-Shot Generalization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Prompt-Based Metric Learning for Few-Shot NER.
Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Zero-Label Prompt Selection.
CoRR, 2022

NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework.
Proceedings of the International Conference on Machine Learning, 2022

Learning to Detect Noisy Labels Using Model-Based Features.
Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

P-Tuning: Prompt Tuning Can Be Comparable to Fine-tuning Across Scales and Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

GLM: General Language Model Pretraining with Autoregressive Blank Infilling.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks.
CoRR, 2021

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
CoRR, 2021

FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning.
CoRR, 2021

FastMoE: A Fast Mixture-of-Expert Training System.
CoRR, 2021

All NLP Tasks Are Generation Tasks: A General Pretraining Framework.
CoRR, 2021

WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models.
AI Open, 2021

Controllable Generation from Pre-trained Language Models via Inverse Prompting.
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

The International Workshop on Pretraining: Algorithms, Architectures, and Applications (Pretraining @ KDD 2021).
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2019
Mixtape: Breaking the Softmax Bottleneck Efficiently.
Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding.
Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Transformer-XL: Attentive Language Models beyond a Fixed-Length Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations.
CoRR, 2018

GLoMo: Unsupervised Learning of Transferable Relational Graphs.
Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Neural Models for Reasoning over Multiple Mentions Using Coreference.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent.
Proceedings of the 6th International Conference on Learning Representations, 2018

Breaking the Softmax Bottleneck: A High-Rank RNN Language Model.
Proceedings of the 6th International Conference on Learning Representations, 2018

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Neural Cross-lingual Named Entity Recognition with Minimal Resources.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

2017
Differentiable Learning of Logical Rules for Knowledge Base Completion.
CoRR, 2017

A Probabilistic Framework for Location Inference from Social Media.
CoRR, 2017

Linguistic Knowledge as Memory for Recurrent Neural Networks.
CoRR, 2017

Differentiable Learning of Logical Rules for Knowledge Base Reasoning.
Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Good Semi-supervised Learning That Requires a Bad GAN.
Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

Words or Characters? Fine-grained Gating for Reading Comprehension.
Proceedings of the 5th International Conference on Learning Representations, 2017

Semi-Supervised QA with Generative Domain-Adaptive Nets.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Gated-Attention Readers for Text Comprehension.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Encode, Review, and Decode: Reviewer Module for Caption Generation.
CoRR, 2016

Multi-Task Cross-Lingual Sequence Tagging from Scratch.
CoRR, 2016

Review Networks for Caption Generation.
Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Revisiting Semi-Supervised Learning with Graph Embeddings.
Proceedings of the 33rd International Conference on Machine Learning, 2016

2015
Multi-Source Bayesian Embeddings for Learning Social Knowledge Graphs.
CoRR, 2015

COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency.
Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

2014
Active learning for networked data based on non-progressive diffusion model.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Active Learning for Streaming Networked Data.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

2013
SAE: social analytic engine for large networks.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

