Nathan Grinsztajn

Orcid: 0000-0001-6817-5972

According to our database1, Nathan Grinsztajn authored at least 21 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Averaging log-likelihoods in direct alignment.
CoRR, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024

Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization.
CoRR, 2024

Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


2023
Reinforcement learning for combinatorial optimization : leveraging uncertainty, structure and priors. (Apprentissage par renforcement pour l'optimisation combinatoire : exploiter l'incertitude, les structures et les connaissances a priori).
PhD thesis, 2023

Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A.
CoRR, 2023

Combinatorial Optimization with Policy Adaptation using Latent Space Search.
CoRR, 2023

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX.
CoRR, 2023

Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Combinatorial Optimization with Policy Adaptation using Latent Space Search.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Population-Based Reinforcement Learning for Combinatorial Optimization.
CoRR, 2022

Meta-learning from Learning Curves Challenge: Lessons learned from the First Round and Design of the Second Round.
CoRR, 2022

Meta-learning from Learning Curves: Challenge Design and Baseline Results.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences.
CoRR, 2021

Low-Rank Projections of GCNs Laplacian.
CoRR, 2021

Interferometric Graph Transform for Community Labeling.
CoRR, 2021

MetaREVEAL: RL-based Meta-learning from Learning Curves.
Proceedings of the Workshop on Interactive Adaptive Learning (IAL 2021) co-located with the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021), 2021

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Geometric deep reinforcement learning for dynamic DAG scheduling.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020


  Loading...