Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Generative Representational Instruction Tuning.

[BibT_eX]

[DOI]

Niklas Muennighoff

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Bootstrap Your Own Context Length.

[BibT_eX]

[DOI]

CoRR, 2024

Multilingual E5 Text Embeddings: A Technical Report.

[BibT_eX]

[DOI]

CoRR, 2024

Fine-Tuning LLaMA for Multi-Stage Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LongEmbed: Extending Embedding Models for Long Context Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning to Retrieve In-Context Examples for Large Language Models.

[BibT_eX]

[DOI]

Liang Wang

Nan Yang

Furu Wei

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Improving Text Embeddings with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Learning to Rank in Generative Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Large Search Model: Redefining Search Stack in the Era of LLMs.

[BibT_eX]

[DOI]

SIGIR Forum, December, 2023

Generative retrieval for conversational question answering.

[BibT_eX]

[DOI]

Inf. Process. Manag., September, 2023

Inference with Reference: Lossless Acceleration of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Query2doc: Query Expansion with Large Language Models.

[BibT_eX]

[DOI]

Liang Wang

Nan Yang

Furu Wei

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multiview Identifiers Enhanced Generative Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Text Embeddings by Weakly-Supervised Contrastive Pre-training.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval.

[BibT_eX]

[DOI]

CoRR, 2022

SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Aligning Cross-lingual Sentence Representations with Dual Momentum Contrast.

[BibT_eX]

[DOI]

Liang Wang

Wei Zhao

Jingming Liu

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Investigating Label Bias in Beam Search for Open-ended Text Generation.

[BibT_eX]

[DOI]

Liang Wang

Jinlong Liu

Jingming Liu

CoRR, 2020

2019

Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Denoising based Sequence-to-Sequence Pre-training for Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension.

[BibT_eX]

[DOI]

Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Multi-Perspective Context Aggregation for Semi-supervised Cloze-style Reading Comprehension.

[BibT_eX]