Aleksandar Botev

Orcid: 0000-0001-9021-1124

According to our database1, Aleksandar Botev authored at least 18 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models.
CoRR, 2024

Applications of flow models to the generation of correlated lattice QCD ensembles.
CoRR, 2024

2023
Normalizing flows for lattice gauge theory in arbitrary space-time dimension.
CoRR, 2023

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Aspects of scaling and scalability for flow-based sampling of lattice QCD.
CoRR, 2022

Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
SyMetric: Measuring the Quality of Learnt Hamiltonian Dynamics Inferred from Vision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Which priors matter? Benchmarking models for learning latent dynamics.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

2020
The Gauss-Newton matrix for Deep Learning models and its applications.
PhD thesis, 2020

Better, Faster Fermionic Neural Networks.
CoRR, 2020

Disentangling by Subspace Diffusion.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Hamiltonian Generative Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

2018
Online Structured Laplace Approximations for Overcoming Catastrophic Forgetting.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Scalable Laplace Approximation for Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Overdispersed variational autoencoders.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Nesterov's accelerated gradient and momentum as approximations to regularised update descent.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Practical Gauss-Newton Optimisation for Deep Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Complementary Sum Sampling for Likelihood Approximation in Large Scale Classification.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017


  Loading...