Emad Barsoum

Orcid: 0000-0002-4097-8690

According to our database, Emad Barsoum authored at least 57 papers between 2008 and 2025.

Bibliography

2025
SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning.
CoRR, August, 2025

Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks.
CoRR, July, 2025

SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers.
CoRR, July, 2025

Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation.
CoRR, June, 2025

TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games.
CoRR, June, 2025

Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models.
CoRR, June, 2025

Unleashing Hour-Scale Video Training for Long Video-Language Understanding.
CoRR, June, 2025

TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering.
CoRR, June, 2025

MOVi: Training-free Text-conditioned Multi-Object Video Generation.
CoRR, May, 2025

Zebra-Llama: Towards Extremely Efficient Hybrid Models.
CoRR, May, 2025

PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation.
CoRR, April, 2025

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation.
CoRR, April, 2025

DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models.
CoRR, April, 2025

AMD-Hummingbird: Towards an Efficient Text-to-Video Model.
CoRR, March, 2025

X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression.
CoRR, March, 2025

Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding.
CoRR, March, 2025

Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization.
CoRR, March, 2025

Partial Convolution Meets Visual Attention.
CoRR, March, 2025

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE.
CoRR, February, 2025

Edit as You See: Image-guided Video Editing via Masked Motion Modeling.
CoRR, January, 2025

Agent Laboratory: Using LLM Agents as Research Assistants.
CoRR, January, 2025

MSWA: Refining Local Attention with Multi-Scale Window Attention.
CoRR, January, 2025

Amphista: Bi-directional Multi-head Decoding for Accelerating LLM Inference.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ReNeg: Learning Negative Embedding with Reward Guidance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Enhancing One-Shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Self-Taught Agentic Long Context Understanding.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

EGSRAL: An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
FTP: A Fine-grained Token-wise Pruner for Large Language Models via Token Routing.
CoRR, 2024

Fast Occupancy Network.
CoRR, 2024

Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs.
CoRR, 2024

Enhancing One-shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism.
CoRR, 2024

VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking.
CoRR, 2024

Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style.
CoRR, 2024

TernaryLLM: Ternarized Large Language Model.
CoRR, 2024

LADDER: An Efficient Framework for Video Frame Interpolation.
CoRR, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
CoRR, 2024

Sparse Laneformer.
CoRR, 2024

DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

QT-ViT: Improving Linear Attention in ViT with Quadratic Taylor Expansion.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Customizing Text-to-Image Generation with Inverted Interaction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Enhancing Vision Transformer: Amplifying Non-Linearity in Feedforward Network Module.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM.
Proceedings of the 35th British Machine Vision Conference, 2024

2021
Scaling Distributed Training with Adaptive Summation.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

2020
3D Human motion anticipation and classification.
CoRR, 2020

2019
Human Motion Anticipation and Recognition from RGB-D.
PhD thesis, 2019

NGEMM: Optimizing GEMM for Deep Learning via Compiler-based Techniques.
CoRR, 2019

2018
Object Localization and Motion Transfer learning with Capsules.
CoRR, 2018

HP-GAN: Probabilistic 3D Human Motion Prediction via GAN.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Automatic speech emotion recognition using recurrent neural networks with local attention.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Articulated Hand Pose Estimation Review.
CoRR, 2016

Training deep networks for facial expression recognition with crowd-sourced label distribution.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Emotion recognition in the wild from videos using images.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

2008
Towards adaptive Web scriptable user interfaces for virtual environments.
Virtual Real., 2008

