Lukasz Kaiser

Henrique Pondé de Oliveira Pinto

Piotr Milos

CoRR, 2024

2022

Hierarchical Transformers Are More Efficient Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Q-Value Weighted Regression: Reinforcement Learning with Limited Data.

Proceedings of the International Joint Conference on Neural Networks, 2022

2021

Training Verifiers to Solve Math Word Problems.

CoRR, 2021

Evaluating Large Language Models Trained on Code.

CoRR, 2021

Sparse is Enough in Scaling Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Rethinking Attention with Performers.

Krzysztof Marcin Choromanski

Valerii Likhosherstov

David Benjamin Belanger

Lucy J. Colwell

Adrian Weller

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Reformer: The Efficient Transformer.

Nikita Kitaev

Anselm Levskaya

Proceedings of the 8th International Conference on Learning Representations, 2020

Model Based Reinforcement Learning for Atari.

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Parallel Scheduled Sampling.

CoRR, 2019

Sample Efficient Text Summarization Using a Single Pre-Trained Transformer.

CoRR, 2019

Model-Based Reinforcement Learning for Atari.

CoRR, 2019

Area Attention.

Proceedings of the 36th International Conference on Machine Learning, 2019

Universal Transformers.

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Image Transformer.

CoRR, 2018

Discrete Autoencoders for Sequence Models.

Samy Bengio

CoRR, 2018

Image Transformer.

Proceedings of the 35th International Conference on Machine Learning, 2018

Fast Decoding in Sequence Models Using Discrete Latent Variables.

Proceedings of the 35th International Conference on Machine Learning, 2018

Generating Wikipedia by Summarizing Long Sequences.

Proceedings of the 6th International Conference on Learning Representations, 2018

Depthwise Separable Convolutions for Neural Machine Translation.

Aidan N. Gomez

François Chollet

Proceedings of the 6th International Conference on Learning Representations, 2018

Unsupervised Cipher Cracking Using Discrete GANs.

Proceedings of the 6th International Conference on Learning Representations, 2018

Tensor2Tensor for Neural Machine Translation.

Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

One Model To Learn Them All.

CoRR, 2017

Attention is All you Need.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Regularizing Neural Networks by Penalizing Confident Output Distributions.

Proceedings of the 5th International Conference on Learning Representations, 2017

Learning to Remember Rare Events.

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.

CoRR, 2016

Multi-task Sequence to Sequence Learning.

Proceedings of the 4th International Conference on Learning Representations, 2016

Neural GPUs Learn Algorithms.

Ilya Sutskever

Proceedings of the 4th International Conference on Learning Representations, 2016

Machine Learning with Guarantees using Descriptive Complexity and SMT Solvers.

Charles Jordan

CoRR, 2016

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems.

CoRR, 2016

Can Active Memory Replace Attention?

Samy Bengio

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Adding Gradient Noise Improves Learning for Very Deep Networks.

CoRR, 2015

Graph Searching Games and Width Measures for Directed Graphs.

Saeed Akhoondian Amiri

Proceedings of the 32nd International Symposium on Theoretical Aspects of Computer Science, 2015

Grammar as a Foreign Language.

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Characterising Choiceless Polynomial Time with First-Order Interpretations.

Proceedings of the 30th Annual ACM/IEEE Symposium on Logic in Computer Science, 2015

Sentence Compression by Deletion with LSTMs.

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Unified Approach to Boundedness Properties in MSO.

Proceedings of the 24th EACSL Annual Conference on Computer Science Logic, 2015

2014

Model-Theoretic Properties of ω-Automatic Structures.

Theory Comput. Syst., 2014

Directed Width Measures and Monotonicity of Directed Graph Searching.

CoRR, 2014

MPIDepQBF: Towards Parallel QBF Solving without Knowledge Sharing.

Proceedings of the Theory and Applications of Satisfiability Testing - SAT 2014, 2014

2013

Experiments with Reduction Finding.

Charles Jordan

Proceedings of the Theory and Applications of Satisfiability Testing - SAT 2013, 2013

2012

Entanglement and the complexity of directed graphs.

Theor. Comput. Sci., 2012

Model Checking the Quantitative mu-Calculus on Linear Hybrid Systems.

Log. Methods Comput. Sci., 2012

The Field of Reals is not omega-Automatic.

Faried Abu Zaid

Proceedings of the 29th International Symposium on Theoretical Aspects of Computer Science, 2012

Solving Counter Parity Games.

Dietmar Berwanger

Simon Leßenich

Proceedings of the Mathematical Foundations of Computer Science 2012, 2012

A Counting Logic for Structure Transition Systems.

Simon Leßenich

Proceedings of the Computer Science Logic, 2012

Learning Games from Videos Guided by Descriptive Complexity.

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Expressing cardinality quantifiers in monadic second-order logic over chains.

Alexander Rabinovich

J. Symb. Log., 2011

Model Checking the Quantitative μ-Calculus on Linear Hybrid Systems.

Proceedings of the Automata, Languages and Programming - 38th International Colloquium, 2011

A Perfect-Information Construction for Coordination in Games.

Dietmar Berwanger

Bernd Puchala

Proceedings of the IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, 2011

First-Order Logic with Counting for General Game Playing.

Lukasz Stafiniak

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Logic and Games on Automatic Structures - Playing with Quantifiers and Decompositions.

Lecture Notes in Computer Science 6810, Springer, ISBN: 978-3-642-22806-3, 2011

2010

Model Checking Games for the Quantitative µ-Calculus.

Theory Comput. Syst., 2010

Information Tracking in Games on Graphs.

Dietmar Berwanger

J. Log. Lang. Inf., 2010

Expressing Cardinality Quantifiers in Monadic Second-Order Logic over Trees.

Alexander Moshe Rabinovich

Fundam. Informaticae, 2010

Degrees of Lookahead in Regular Infinite Games.

Michael Holtmann

Wolfgang Thomas

Proceedings of the Foundations of Software Science and Computational Structures, 2010

New Algorithm for Weak Monadic Second-Order Logic on Inductive Structures.

Tobias Ganzow

Proceedings of the Computer Science Logic, 24th International Workshop, 2010

2009

Synthesis for Structure Rewriting Systems.

Proceedings of the Mathematical Foundations of Computer Science 2009, 2009

Directed Graphs of Entanglement Two.

Roman Rabinovich

Proceedings of the Fundamentals of Computation Theory, 17th International Symposium, 2009

Cardinality Quantifiers in MLO over Trees.

Alexander Rabinovich

Proceedings of the Computer Science Logic, 23rd International Workshop, 2009

2008

Logic and games on automatic structures.

PhD thesis, 2008

Model Checking Games for the Quantitative mu-Calculus.

CoRR, 2008

Cardinality and counting quantifiers on omega-automatic structures.

Sasha Rubin

Proceedings of the 25th Annual Symposium on Theoretical Aspects of Computer Science, 2008

Model Checking Games for the Quantitative µ-Calculus.

Proceedings of the 25th Annual Symposium on Theoretical Aspects of Computer Science, 2008

2007

Program Search as a Path to Artificial General Intelligence.

Proceedings of the Artificial General Intelligence, 2007

2006

Game Quantification on Automatic Structures and Hierarchical Model Checking Games.

Proceedings of the Computer Science Logic, 20th International Workshop, 2006

2005

Confluence of Right Ground Term Rewriting Systems Is Decidable.
