Dani Yogatama

According to our database, Dani Yogatama authored at least 58 papers between 2009 and 2024.

Bibliography

2024
Understanding In-Context Learning with a Pelican Soup Framework.
CoRR, 2024

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models.
CoRR, 2024

2023
On Retrieval Augmentation and the Limitations of Language Model Training.
CoRR, 2023

Interpretable Diffusion via Information Decomposition.
CoRR, 2023

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Emergent Abilities of Large Language Models.
Trans. Mach. Learn. Res., 2022

Relational Memory-Augmented Language Models.
Trans. Assoc. Comput. Linguistics, 2022

Questions Are All You Need to Train a Dense Passage Retriever.
CoRR, 2022

Language Models Can See: Plugging Visual Controls in Text Generation.
CoRR, 2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning.
CoRR, 2022

A Contrastive Framework for Neural Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scale Efficiently: Insights from Pretraining and Finetuning Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ABC: Attention with Bounded-memory Control.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Adaptive Semiparametric Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Balancing Average and Worst-case Accuracy in Multitask Learning.
CoRR, 2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.
CoRR, 2021

Pitfalls of Static Language Modelling.
CoRR, 2021

End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mind the Gap: Assessing Temporal Generalization in Neural Language Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


Random Feature Attention.
Proceedings of the 9th International Conference on Learning Representations, 2021

Finetuning Pretrained Transformers into RNNs.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Syntactic Structure Distillation Pretraining for Bidirectional Encoders.
Trans. Assoc. Comput. Linguistics, 2020

Modelling Latent Skills for Multitask Language Generation.
CoRR, 2020

A Mutual Information Maximization Perspective of Language Representation Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Reducing Sentiment Bias in Language Models via Counterfactual Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

A Call for More Rigor in Unsupervised Cross-lingual Learning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Cross-lingual Transferability of Monolingual Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Jointly learning sentence embeddings and syntax with unsupervised Tree-LSTMs.
Nat. Lang. Eng., 2019

Grandmaster level in StarCraft II using multi-agent reinforcement learning.
Nat., 2019

Learning and Evaluating General Linguistic Intelligence.
CoRR, 2019

Episodic Memory in Lifelong Language Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Variational Smoothing in Recurrent Neural Network Language Models.
Proceedings of the 7th International Conference on Learning Representations, 2019

Achieving Verified Robustness to Symbol Substitutions via Interval Bound Propagation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Memory Architectures in Recurrent Neural Network Language Models.
Proceedings of the 6th International Conference on Learning Representations, 2018

LSTMs Can Learn Syntax-Sensitive Dependencies Well, But Modeling Structure Makes Them Better.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Generative and Discriminative Text Classification with Recurrent Neural Networks.
CoRR, 2017

Learning to Compose Words into Sentences with Reinforcement Learning.
Proceedings of the 5th International Conference on Learning Representations, 2017

Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2015
Bayesian Optimization of Text Representations.
CoRR, 2015

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
CoRR, 2015

Learning Word Representations with Hierarchical Sparse Coding.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Bayesian Optimization of Text Representations.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Extractive Summarization by Maximizing Semantic Volume.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Embedding Methods for Fine Grained Entity Type Classification.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Sparse Overcomplete Word Vector Representations.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Dynamic Language Models for Streaming Text.
Trans. Assoc. Comput. Linguistics, 2014

Making the Most of Bag of Words: Sentence Regularization with Alternating Direction Method of Multipliers.
Proceedings of the 31st International Conference on Machine Learning, 2014

Efficient Transfer Learning Method for Automatic Hyperparameter Tuning.
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

Linguistic Structured Sparsity in Text Categorization.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
A Sparse and Adaptive Prior for Time-Dependent Model Parameters.
CoRR, 2013

A Penny for Your Tweets: Campaign Contributions and Capitol Hill Microblogs.
Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

2012
A Probabilistic Model for Canonicalizing Named Entity Mentions.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Predicting a Scientific Community's Response to an Article.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2009
Multilingual Spectral Clustering Using Document Similarity Propagation.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

