Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Taming Sparsely Activated Transformer with Stochastic Experts.

[BibT_eX]

[DOI]

Simiao Zuo

Proceedings of the Tenth International Conference on Learning Representations, 2022

Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning.

[BibT_eX]

[DOI]

CoRR, 2021

GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification.

[BibT_eX]

[DOI]

Proceedings of the WWW '21: The Web Conference 2021, 2021

Mask Attention Networks: Rethinking and Strengthen Transformer.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

SiameseXML: Siamese Networks meet Extreme Classifiers with 100M Labels.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

HittER: Hierarchical Transformers for Knowledge Graph Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

GLGE: A New General Language Generation Evaluation Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval.

[BibT_eX]

[DOI]

Wenhao Lu

Jian Jiao

Ruofei Zhang

CoRR, 2020

ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2020

An Enhanced Knowledge Injection Model for Commonsense Generation.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval.

[BibT_eX]

[DOI]

Wenhao Lu

Jian Jiao

Ruofei Zhang

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2018

Recurrent Binary Embedding for GPU-Enabled Exhaustive Retrieval from Billion-Scale Semantic Vectors.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2016

Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Jian Jiao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...