Orhan Firat

According to our database1, Orhan Firat authored at least 115 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.
CoRR, 2024

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method.
CoRR, 2024

2023
PaLM: Scaling Language Modeling with Pathways.
J. Mach. Learn. Res., 2023

Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.
CoRR, 2023

Reinforced Self-Training (ReST) for Language Modeling.
CoRR, 2023

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation.
CoRR, 2023

Block-State Transformer.
CoRR, 2023

Cross-Lingual Supervision improves Large Language Models Pre-training.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

Bilex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation.
CoRR, 2023

The unreasonable effectiveness of few-shot learning for machine translation.
CoRR, 2023

The Devil Is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation.
Proceedings of the Eighth Conference on Machine Translation, 2023

Block-State Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Binarized Neural Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

The Unreasonable Effectiveness of Few-shot Learning for Machine Translation.
Proceedings of the International Conference on Machine Learning, 2023

Scaling Laws for Multilingual Neural Machine Translation.
Proceedings of the International Conference on Machine Learning, 2023

UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets.
Trans. Assoc. Comput. Linguistics, 2022

FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation.
CoRR, 2022

Building Machine Translation Systems for the Next Thousand Languages.
CoRR, 2022

Using natural language prompts for machine translation.
CoRR, 2022

Data Scaling Laws in NMT: The Effect of Noise and Architecture.
CoRR, 2022

Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.
CoRR, 2022

Do Current Multi-Task Optimization Methods in Deep Learning Even Help?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Examining Scaling and Transfer of Language Model Architectures for Machine Translation.
Proceedings of the International Conference on Machine Learning, 2022


Data Scaling Laws in NMT: The Effect of Noise and Architecture.
Proceedings of the International Conference on Machine Learning, 2022

A Loss Curvature Perspective on Training Instabilities of Deep Learning Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Scaling Laws for Neural Machine Translation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
A Loss Curvature Perspective on Training Instability in Deep Learning.
CoRR, 2021

Towards Zero-Label Language Learning.
CoRR, 2021

Evaluating Multiway Multilingual NMT in the Turkic Languages.
CoRR, 2021

Towards Universality in Multilingual Text Rewriting.
CoRR, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
CoRR, 2021

Gradient-guided Loss Masking for Neural Machine Translation.
CoRR, 2021

Evaluating Multiway Multilingual NMT in the Turkic Languages.
Proceedings of the Sixth Conference on Machine Translation, 2021

Explicit Alignment Objectives for Multilingual Bidirectional Encoders.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Share or Not? Learning to Schedule Language-Specific Capacity for Multilingual Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding.
Proceedings of the 9th International Conference on Learning Representations, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Large-Scale Study of Machine Translation in Turkic Languages.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Rapid Domain Adaptation for Machine Translation with Monolingual Data.
CoRR, 2020

Towards End-to-End In-Image Neural Machine Translation.
CoRR, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization.
CoRR, 2020

Controlling Computation versus Quality for Neural Sequence Models.
CoRR, 2020

Complete Multilingual Neural Machine Translation.
Proceedings of the Fifth Conference on Machine Translation, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Discrepancy between Density Estimation and Sequence Generation.
Proceedings of the Fourth Workshop on Structured Prediction for NLP@EMNLP 2020, 2020

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation.
CoRR, 2019

Simple, Scalable Adaptation for Neural Machine Translation.
CoRR, 2019

Adaptive Scheduling for Multi-Task Learning.
CoRR, 2019

Investigating Multilingual NMT Representations at Scale.
CoRR, 2019

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation.
CoRR, 2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.
CoRR, 2019

The Missing Ingredient in Zero-Shot Neural Machine Translation.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Findings of the First Shared Task on Machine Translation Robustness.
Proceedings of the Fourth Conference on Machine Translation, 2019

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Non-Parametric Adaptation for Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Massively Multilingual Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Investigating Multilingual NMT Representations at Scale.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Simple, Scalable Adaptation for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

On the Importance of Word Boundaries in Character-level Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

2018
Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation.
CoRR, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
CoRR, 2018

Revisiting Character-Based Neural Machine Translation with Capacity and Compression.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Training Deeper Neural Machine Translation Models with Transparent Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Connectionist multi-sequence modelling and applications to multilingual neural machine translation ; Bağlantıcı çoklu dizi modelleme ve çok dilli nöral makina çevirisi uygulamaları.
PhD thesis, 2017

On integrating a language model into neural machine translation.
Comput. Speech Lang., 2017

Multi-way, multilingual neural machine translation.
Comput. Speech Lang., 2017

Learning Joint Multilingual Sentence Representations with Neural Machine Translation.
CoRR, 2017

Does Neural Machine Translation Benefit from Larger Context?
CoRR, 2017

Nematus: a Toolkit for Neural Machine Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Neural Machine Translation for Cross-Lingual Pronoun Prediction.
Proceedings of the Third Workshop on Discourse in Machine Translation, 2017

2016
Theano: A Python framework for fast computation of mathematical expressions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2016

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism.
Proceedings of the NAACL HLT 2016, 2016

Zero-Resource Translation with Multi-Lingual Neural Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
On Using Monolingual Corpora in Neural Machine Translation.
CoRR, 2015

Montreal Neural Machine Translation Systems for WMT'15.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Bilişsel durum analizi i~in beyin Aği modeli.
Proceedings of the 2015 23nd Signal Processing and Communications Applications Conference (SIU), 2015

Learning Deep Temporal Representations for fMRI Brain Decoding.
Proceedings of the Machine Learning Meets Medical Imaging - First International Workshop, 2015

2014
Estimating brain connectivity for pattern analysis.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Modeling the Brain Connectivity for Pattern Analysis.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Representation Learning for Contextual Object and Region Detection in Remote Sensing.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Deep learning for brain decoding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Functional networks of anatomic brain regions.
Proceedings of the IEEE 13th International Conference on Cognitive Informatics and Cognitive Computing, 2014

2013
Enhancing Local Linear Models Using Functional Connectivity for Brain State Decoding.
Int. J. Cogn. Informatics Nat. Intell., 2013

Information distribution analysis in the fMRI measurements with degree of locality estimation.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Representation learning with convolutional sparse autoencoders for remote sensing.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Cognitive process representation with minimum spanning tree of local meshes.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Contextual object recoğnition with conditional random fields.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Dry dock detection in satellite images with representation learning.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Mesh learning for object classification using fMRI measurements.
Proceedings of the IEEE International Conference on Image Processing, 2013

Analyzing the information distribution in the fMRI measurements by estimating the degree of locality.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Representation of cognitive processes using the minimum spanning tree of local meshes.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

An information theoretic approach to classify cognitive states using fMRI.
Proceedings of the 13th IEEE International Conference on BioInformatics and BioEngineering, 2013

Functional Mesh Learning for pattern analysis of cognitive processes.
Proceedings of the IEEE 12th International Conference on Cognitive Informatics and Cognitive Computing, 2013

2012
Conditional Random Fields for Land Use/Land Cover Classification and Complex Region Detection.
Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2012

Application of context invariants in airport region of interest detection for multi-spectral satellite imagery.
Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

Mesh learning approach for brain data modeling.
Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012


  Loading...