Sarath Chandar

Orcid: 0000-0002-9678-2830

Affiliations:
  • University of Montreal, Department of Computer Science and Operations Research, Canada


According to our database1, Sarath Chandar authored at least 82 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Mastering Memory Tasks with World Models.
CoRR, 2024

Are self-explanations from Large Language Models faithful?
CoRR, 2024

Fairness-Aware Structured Pruning in Transformers.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
An Empirical Investigation of the Role of Pre-training in Lifelong Learning.
J. Mach. Learn. Res., 2023

Post-hoc Interpretability for Neural NLP: A Survey.
ACM Comput. Surv., 2023

Language Model-In-The-Loop: Data Optimal Approach to Learn-To-Recommend Actions in Text Games.
CoRR, 2023

Faithfulness Measurable Masked Language Models.
CoRR, 2023

Lookbehind Optimizer: k steps back, 1 step forward.
CoRR, 2023

Promoting Exploration in Memory-Augmented Adam using Critical Momenta.
CoRR, 2023

Thompson sampling for improved exploration in GFlowNets.
CoRR, 2023

Should We Attend More or Less? Modulating Attention for Fairness.
CoRR, 2023

Towards Lifelong Learning for Software Analytics Models: Empirical Study on Brown Build and Risk Prediction.
CoRR, 2023

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning.
CoRR, 2023

Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Self-Influence Guided Data Reweighting for Language Model Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

EpiK-Eval: Evaluation for Language Models as Epistemic Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
PatchBlender: A Motion Prior for Video Transformers.
CoRR, 2022

Sharpness-Aware Training for Accurate Inference on Noisy DNN Accelerators.
CoRR, 2022

Segmentation of Multiple Sclerosis Lesions across Hospitals: Learn Continually or Train from Scratch?
CoRR, 2022

An Introduction to Lifelong Supervised Learning.
CoRR, 2022

Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers.
CoRR, 2022

Local Structure Matters Most in Most Languages.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods.
Proceedings of the International Conference on Machine Learning, 2022

Memory Augmented Optimizers for Deep Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Combining Reinforcement Learning and Constraint Programming for Sequence-Generation Tasks with Hard Constraints.
Proceedings of the 28th International Conference on Principles and Practice of Constraint Programming, 2022

TAG: Task-based Accumulated Gradients for Lifelong learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Improving Meta-Learning Generalization with Activation-Based Early-Stopping.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Local Structure Matters Most: Perturbation Study in NLU.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Scaling Laws for the Few-Shot Adaptation of Pre-trained Image Classifiers.
CoRR, 2021

Demystifying Neural Language Models' Insensitivity to Word-Order.
CoRR, 2021

Memory Augmented Optimizers for Deep Learning.
CoRR, 2021

Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
CoRR, 2021

Do Encoder Representations of Generative Dialogue Models have sufficient summary of the Information about the task ?
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Continuous Coordination As a Realistic Scenario for Lifelong Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

IIRC: Incremental Implicitly-Refined Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Survey of Data Augmentation Approaches for NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

MLMLM: Link Prediction with Mean Likelihood Masked Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Towered Actor Critic For Handling Multiple Action Types In Reinforcement Learning For Drug Discovery.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Toward Training Recurrent Neural Networks for Lifelong Learning.
Neural Comput., 2020

Maximum Reward Formulation In Reinforcement Learning.
CoRR, 2020

How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics.
CoRR, 2020

Slot Contrastive Networks: A Contrastive Approach for Representing Objects.
CoRR, 2020

PatchUp: A Regularization Technique for Convolutional Neural Networks.
CoRR, 2020

The Hanabi challenge: A new frontier for AI research.
Artif. Intell., 2020

The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning to Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Edge Replacement Grammars : A Formal Language Approach for Generating Graphs.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

Structure Learning for Neural Module Networks.
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Lossless Encoding of Sentences.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Non-Saturating Recurrent Units for Modelling Long-Term Dependencies.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Dynamic Neural Turing Machine with Continuous and Discrete Addressing Schemes.
Neural Comput., 2018

Environments for Lifelong Reinforcement Learning.
CoRR, 2018

On Training Recurrent Neural Networks for Lifelong Learning.
CoRR, 2018

Language Expansion In Text-Based Games.
CoRR, 2018

A Deep Reinforcement Learning Chatbot (Short Version).
CoRR, 2018

Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Deep Reinforcement Learning Chatbot.
CoRR, 2017

Memory Augmented Neural Networks with Wormhole Connections.
CoRR, 2017

GuessWhat?! Visual Object Discovery through Multi-modal Dialogue.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Correlational Neural Networks.
Neural Comput., 2016

Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes.
CoRR, 2016

Hierarchical Memory Networks.
CoRR, 2016

Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning.
Proceedings of the NAACL HLT 2016, 2016

Multilingual Multimodal Language Processing Using Neural Networks.
Proceedings of the Tutorial Abstracts, 2016

A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation.
Proceedings of the COLING 2016, 2016

Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
TSEB: More Efficient Thompson Sampling for Policy Learning.
CoRR, 2015

Reasoning about Linguistic Regularities in Word Embeddings using Matrix Manifolds.
CoRR, 2015

From multiple views to single view: a neural network approach.
Proceedings of the Second ACM IKDD Conference on Data Sciences, 2015

2014
An Autoencoder Approach to Learning Bilingual Word Representations.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2011
An Adaptive e-Learning Environment Using Distributed Spiking Neural P Systems.
Proceedings of the 2011 IEEE International Conference on Technology for Education, 2011

2010
CDPN: Communicating Dynamic Petri Net for Adaptive Multimedia Presentation.
Proceedings of the Information and Communication Technologies - International Conference, 2010

Personalized e-course composition approach using digital pheromones in improved particle swarm optimization.
Proceedings of the Sixth International Conference on Natural Computation, 2010


  Loading...