Armen Aghajanyan

According to our database, Armen Aghajanyan authored at least 28 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2023
DOMINO: A Dual-System for Multi-step Visual Language Reasoning.
CoRR, 2023

Jointly Training Large Autoregressive Multimodal Models.
CoRR, 2023

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning.
CoRR, 2023

MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

D4: Improving LLM Pretraining via Document De-Duplication and Diversification.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Retrieval-Augmented Multimodal Language Modeling.
Proceedings of the International Conference on Machine Learning, 2023

Scaling Laws for Generative Mixed-Modal Language Models.
Proceedings of the International Conference on Machine Learning, 2023

InCoder: A Generative Model for Code Infilling and Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
BARTSmiles: Generative Masked Language Models for Molecular Representations.
CoRR, 2022

CM3: A Causal Masked Multimodal Model of the Internet.
CoRR, 2022

Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training.
Findings of the Association for Computational Linguistics: NAACL 2022, 2022

HTLM: Hyper-Text Pre-Training and Prompting of Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Passage Retrieval with Zero-Shot Question Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

RetroNLU: Retrieval Augmented Task-Oriented Semantic Parsing.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

2021
Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Better Fine-Tuning by Reducing Representational Collapse.
Proceedings of the 9th International Conference on Learning Representations, 2021

VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Muppet: Massive Multi-task Representations with Pre-Finetuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Conversational Semantic Parsing.
CoRR, 2020

Pre-training via Paraphrasing.
Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Conversational Semantic Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Towards Language Agnostic Universal Representations.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2017
Convolution Aware Initialization.
CoRR, 2017

Charged Point Normalization: An Efficient Solution to the Saddle Point Problem.
Proceedings of the 5th International Conference on Learning Representations, 2017

SoftTarget Regularization: An Effective Technique to Reduce Over-Fitting in Neural Networks.
Proceedings of the 3rd IEEE International Conference on Cybernetics, 2017

2015
Gravitational Clustering.
CoRR, 2015
