Lukasz Kaiser

ORCID: 0000-0003-1092-6010

Affiliations:
  • Google Brain
  • Paris Diderot University, LIAFA
  • RWTH Aachen University, Department of Mathematics


According to our database, Lukasz Kaiser authored at least 69 papers between 2005 and 2024.

Bibliography

2024
tsGT: Stochastic Time Series Modeling With Transformer.
CoRR, 2024

2022
Hierarchical Transformers Are More Efficient Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Q-Value Weighted Regression: Reinforcement Learning with Limited Data.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
Training Verifiers to Solve Math Word Problems.
CoRR, 2021

Evaluating Large Language Models Trained on Code.
CoRR, 2021

Sparse is Enough in Scaling Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Rethinking Attention with Performers.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Reformer: The Efficient Transformer.
Proceedings of the 8th International Conference on Learning Representations, 2020

Model Based Reinforcement Learning for Atari.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Parallel Scheduled Sampling.
CoRR, 2019

Sample Efficient Text Summarization Using a Single Pre-Trained Transformer.
CoRR, 2019

Model-Based Reinforcement Learning for Atari.
CoRR, 2019

Area Attention.
Proceedings of the 36th International Conference on Machine Learning, 2019

Universal Transformers.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Image Transformer.
CoRR, 2018

Discrete Autoencoders for Sequence Models.
CoRR, 2018

Image Transformer.
Proceedings of the 35th International Conference on Machine Learning, 2018

Fast Decoding in Sequence Models Using Discrete Latent Variables.
Proceedings of the 35th International Conference on Machine Learning, 2018

Generating Wikipedia by Summarizing Long Sequences.
Proceedings of the 6th International Conference on Learning Representations, 2018

Depthwise Separable Convolutions for Neural Machine Translation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Unsupervised Cipher Cracking Using Discrete GANs.
Proceedings of the 6th International Conference on Learning Representations, 2018

Tensor2Tensor for Neural Machine Translation.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
One Model To Learn Them All.
CoRR, 2017

Attention is All you Need.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Regularizing Neural Networks by Penalizing Confident Output Distributions.
Proceedings of the 5th International Conference on Learning Representations, 2017

Learning to Remember Rare Events.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.
CoRR, 2016

Multi-task Sequence to Sequence Learning.
Proceedings of the 4th International Conference on Learning Representations, 2016

Neural GPUs Learn Algorithms.
Proceedings of the 4th International Conference on Learning Representations, 2016

Machine Learning with Guarantees using Descriptive Complexity and SMT Solvers.
CoRR, 2016

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems.
CoRR, 2016

Can Active Memory Replace Attention?
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
Adding Gradient Noise Improves Learning for Very Deep Networks.
CoRR, 2015

Graph Searching Games and Width Measures for Directed Graphs.
Proceedings of the 32nd International Symposium on Theoretical Aspects of Computer Science, 2015

Grammar as a Foreign Language.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Characterising Choiceless Polynomial Time with First-Order Interpretations.
Proceedings of the 30th Annual ACM/IEEE Symposium on Logic in Computer Science, 2015

Sentence Compression by Deletion with LSTMs.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Unified Approach to Boundedness Properties in MSO.
Proceedings of the 24th EACSL Annual Conference on Computer Science Logic, 2015

2014
Model-Theoretic Properties of ω-Automatic Structures.
Theory Comput. Syst., 2014

Directed Width Measures and Monotonicity of Directed Graph Searching.
CoRR, 2014

MPIDepQBF: Towards Parallel QBF Solving without Knowledge Sharing.
Proceedings of the Theory and Applications of Satisfiability Testing - SAT 2014, 2014

2013
Experiments with Reduction Finding.
Proceedings of the Theory and Applications of Satisfiability Testing - SAT 2013, 2013

2012
Entanglement and the complexity of directed graphs.
Theor. Comput. Sci., 2012

Model Checking the Quantitative mu-Calculus on Linear Hybrid Systems.
Log. Methods Comput. Sci., 2012

Degrees of Lookahead in Regular Infinite Games.
Log. Methods Comput. Sci., 2012

The Field of Reals is not omega-Automatic.
Proceedings of the 29th International Symposium on Theoretical Aspects of Computer Science, 2012

Solving Counter Parity Games.
Proceedings of the Mathematical Foundations of Computer Science 2012, 2012

A Counting Logic for Structure Transition Systems.
Proceedings of the Computer Science Logic (CSL'12), 2012

Learning Games from Videos Guided by Descriptive Complexity.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Expressing cardinality quantifiers in monadic second-order logic over chains.
J. Symb. Log., 2011

Model Checking the Quantitative μ-Calculus on Linear Hybrid Systems.
Proceedings of the Automata, Languages and Programming - 38th International Colloquium, 2011

A Perfect-Information Construction for Coordination in Games.
Proceedings of the IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, 2011

First-Order Logic with Counting for General Game Playing.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Logic and Games on Automatic Structures - Playing with Quantifiers and Decompositions.
Lecture Notes in Computer Science 6810, Springer, ISBN: 978-3-642-22806-3, 2011

2010
Model Checking Games for the Quantitative µ-Calculus.
Theory Comput. Syst., 2010

Information Tracking in Games on Graphs.
J. Log. Lang. Inf., 2010

Expressing Cardinality Quantifiers in Monadic Second-Order Logic over Trees.
Fundam. Informaticae, 2010

New Algorithm for Weak Monadic Second-Order Logic on Inductive Structures.
Proceedings of the Computer Science Logic, 24th International Workshop, 2010

2009
Synthesis for Structure Rewriting Systems.
Proceedings of the Mathematical Foundations of Computer Science 2009, 2009

Directed Graphs of Entanglement Two.
Proceedings of the Fundamentals of Computation Theory, 17th International Symposium, 2009

Cardinality Quantifiers in MLO over Trees.
Proceedings of the Computer Science Logic, 23rd international Workshop, 2009

2008
Logic and games on automatic structures.
PhD thesis, 2008

Model Checking Games for the Quantitative mu-Calculus.
CoRR, 2008

Cardinality and counting quantifiers on omega-automatic structures.
Proceedings of the STACS 2008, 2008

Model Checking Games for the Quantitative µ-Calculus.
Proceedings of the STACS 2008, 2008

2007
Program Search as a Path to Artificial General Intelligence.
Proceedings of the Artificial General Intelligence, 2007

2006
Game Quantification on Automatic Structures and Hierarchical Model Checking Games.
Proceedings of the Computer Science Logic, 20th International Workshop, 2006

2005
Confluence of Right Ground Term Rewriting Systems Is Decidable.
Proceedings of the Foundations of Software Science and Computational Structures, 2005
