Daniel M. Cer

Affiliations:
  • Google, Mountain View, CA, USA
  • University of California at Berkeley, CA, USA


According to our database1, Daniel M. Cer authored at least 48 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models.
CoRR, 2024

Gemma: Open Models Based on Gemini Research and Technology.
CoRR, 2024

2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval.
CoRR, 2023

2022
Knowledge Prompts: Injecting World Knowledge into Language Models through Soft Prompts.
CoRR, 2022

Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Language-agnostic BERT Sentence Embedding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
NT5?! Training T5 to Perform Numerical Reasoning.
CoRR, 2021

Universal Sentence Representation Learning with Conditional Masked Language Model.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
SeqGenSQL - A Robust Sequence Generation Model for Structured Query Language.
CoRR, 2020

MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models.
CoRR, 2020

Multilingual Universal Sentence Encoder for Semantic Retrieval.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
Hierarchical Document Encoder for Parallel Corpus Mining.
Proceedings of the Fourth Conference on Machine Translation, 2019

Learning Cross-Lingual Sentence Representations via a Multi-task Dual-Encoder Model.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

ReQA: An Evaluation for End-to-End Answer Retrieval Models.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Universal Sentence Encoder.
CoRR, 2018

Effective Parallel Corpus Mining using Bilingual Sentence Embeddings.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

Learning Semantic Textual Similarity from Conversations.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

Universal Sentence Encoder for English.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

2017
SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation.
CoRR, 2017

SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

2016
SemEval-2016 Task 1: Semantic Textual Similarity, Monolingual and Cross-Lingual Evaluation.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

2015
SemEval-2015 Task 2: Semantic Textual Similarity, English, Spanish and Pilot on Interpretability.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

2014
An Empirical Comparison of Features and Tuning for Phrase-based Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Phrasal: A Toolkit for New Directions in Statistical Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

SemEval-2014 Task 10: Multilingual Semantic Textual Similarity.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

2013
Feature-Rich Phrase-based Translation: Stanford University's Submission to the WMT 2013 Translation Task.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Positive Diversity Tuning for Machine Translation System Combination.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

*SEM 2013 shared task: Semantic Textual Similarity.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013

Bilingual Word Embeddings for Phrase-Based Machine Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Fast and Adaptive Online Training of Feature-Rich Translation Models.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Stanford: Probabilistic Edit Distance Metrics for STS.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

2010
The Best Lexical Metric for Phrase-Based Statistical MT System Optimization.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Phrasal: A Statistical Machine Translation Toolkit for Exploring New Model Features.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 2, 2010, Los Angeles, California, USA, 2010

Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
Measuring machine translation quality as semantic equivalence: A metric based on entailment features.
Mach. Transl., 2009

2008
Regularization and Search for Minimum Error Rate Training.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

2007
Learning Alignments and Leveraging Natural Logic.
Proceedings of the ACL-PASCAL@ACL 2007 Workshop on Textual Entailment and Paraphrasing, 2007

Robust Graph Alignment Methods for Textual Inference and Machine Reading.
Proceedings of the Machine Reading, 2007

2006
Learning to recognize features of valid textual entailments.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Neural mechanisms of binding in the hippocampus and neocortex: insights from computational models.
Proceedings of the Handbook of Binding and Memory, 2006

2005
The detection of emphatic words using acoustic and lexical features.
Proceedings of the INTERSPEECH 2005, 2005


  Loading...