Karttikeya Mangalam

Orcid: 0000-0002-2169-1395

According to our database, Karttikeya Mangalam authored at least 42 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

Ego4D: Around the World in 3,600 Hours of Egocentric Video.

Santhosh Kumar Ramakrishnan

Christoph Feichtenhofer

Kiran K. Somasundaram

Giovanni Maria Farinella

IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization.

Zhanda Zhu

Christina Giannoula

Muralidhar Andoorveedu

Proceedings of the Twentieth European Conference on Computer Systems, 2025

UPSC2M: Benchmarking Adaptive Learning from Two Million MCQ Attempts.

Kevin Shi

Karttikeya Mangalam

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, 2025

2024

Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

xT: Nested Tokenization for Larger Context in Large Images.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptive Human Trajectory Prediction via Latent Corridors.

Proceedings of the Computer Vision - ECCV 2024, 2024

Do Vision and Language Encoders Represent the World Similarly?

Mayug Maniparambil

Raiymbek Akshulakov

Yasser Abdelaziz Dahou Djilali

Mohamed El Amine Seddik

Sanath Narayan

Karttikeya Mangalam

Noel E. O'Connor

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Sequential Modeling Enables Scalable Learning for Large Vision Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dr²Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning.

Abdulmohsen Alghannam

Jitendra Malik

Bernard Ghanem

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement.

Gopala Anumanchipalli

Michael W. Mahoney

Kurt Keutzer

Amir Gholami

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Perceiving People over Long Periods: Algorithms, Architectures & Datasets

Karttikeya Mangalam

PhD thesis, 2023

PaReprop: Fast Parallelized Reversible Backpropagation.

Tyler Zhu

Karttikeya Mangalam

CoRR, 2023

Big Little Transformer Decoder.

CoRR, 2023

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding.

Karttikeya Mangalam

Raiymbek Akshulakov

Jitendra Malik

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Speculative Decoding with Big Little Decoder.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Models as Masked Autoencoders.

Christoph Feichtenhofer

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Latency Matters: Real-Time Action Forecasting Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Re²TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Does unsupervised grammar induction need pixels?

CoRR, 2022

Re²TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization.

CoRR, 2022

Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022.

CoRR, 2022

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.

Christoph Feichtenhofer

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reversible Vision Transformers.

Christoph Feichtenhofer

Jitendra Malik

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection.

Christoph Feichtenhofer

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Object-Region Video Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Ego4D: Around the World in 3,000 Hours of Egocentric Video.

Santhosh Kumar Ramakrishnan

Christoph Feichtenhofer

Kiran K. Somasundaram

Giovanni Maria Farinella

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Overcoming Mode Collapse with Adaptive Multi Adversarial Training.

Karttikeya Mangalam

Rohin Garg

CoRR, 2021

Improved Multiscale Vision Transformers for Classification and Detection.

Christoph Feichtenhofer

CoRR, 2021

Ego4D: Around the World in 3,000 Hours of Egocentric Video.

Santhosh Kumar Ramakrishnan

Christoph Feichtenhofer

Kiran K. Somasundaram

Giovanni Maria Farinella

CoRR, 2021

From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LOKI: Long Term and Key Intentions for Trajectory Prediction.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiscale Vision Transformers.

Christoph Feichtenhofer

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mitigating Mode Collapse by Sidestepping Catastrophic Forgetting.

Karttikeya Mangalam

Rohin Garg

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

It Is Not the Journey But the Destination: Endpoint Conditioned Trajectory Prediction.

Proceedings of the Computer Vision - ECCV 2020, 2020

Long-Term Human Motion Prediction with Scene Context.

Proceedings of the Computer Vision - ECCV 2020, 2020

2018

On Compressing U-net Using Knowledge Distillation.

Karttikeya Mangalam

Mathieu Salzmann

CoRR, 2018

Learning Spontaneity to Improve Emotion Recognition in Speech.

Karttikeya Mangalam

Tanaya Guha

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Future Person Localization in First-Person Videos.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Bitwise Operations of Cellular Automaton on Gray-scale Images.

Karttikeya Mangalam

K. S. Venkatesh

CoRR, 2017
