Karttikeya Mangalam

According to our database1, Karttikeya Mangalam authored at least 37 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement.
CoRR, 2024

xT: Nested Tokenization for Larger Context in Large Images.
CoRR, 2024

Do Vision and Language Encoders Represent the World Similarly?
CoRR, 2024

Dr<sup>2</sup>Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning.
CoRR, 2024

2023
Adaptive Human Trajectory Prediction via Latent Corridors.
CoRR, 2023

Sequential Modeling Enables Scalable Learning for Large Vision Models.
CoRR, 2023

PaReprop: Fast Parallelized Reversible Backpropagation.
CoRR, 2023

Big Little Transformer Decoder.
CoRR, 2023

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Speculative Decoding with Big Little Decoder.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Models as Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Latency Matters: Real-Time Action Forecasting Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Re<sup>2</sup>TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Does unsupervised grammar induction need pixels?
CoRR, 2022

Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization.
CoRR, 2022

Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022.
CoRR, 2022

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reversible Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Object-Region Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


2021
Overcoming Mode Collapse with Adaptive Multi Adversarial Training.
CoRR, 2021

Improved Multiscale Vision Transformers for Classification and Detection.
CoRR, 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LOKI: Long Term and Key Intentions for Trajectory Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiscale Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mitigating Mode Collapse by Sidestepping Catastrophic Forgetting.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

It Is Not the Journey But the Destination: Endpoint Conditioned Trajectory Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Long-Term Human Motion Prediction with Scene Context.
Proceedings of the Computer Vision - ECCV 2020, 2020

2018
On Compressing U-net Using Knowledge Distillation.
CoRR, 2018

Learning Spontaneity to Improve Emotion Recognition in Speech.
Proceedings of the Interspeech 2018, 2018

Future Person Localization in First-Person Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Bitwise Operations of Cellular Automaton on Gray-scale Images.
CoRR, 2017


  Loading...