Sachin Mehta

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining.

[BibT_eX]

[DOI]

Jeffrey Li

Mohammadreza Armandpour

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Bytes Are All You Need: Transformers Operating Directly On File Bytes.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers.

[BibT_eX]

[DOI]

CoRR, 2024

KV Prediction for Improved Time to First Token.

[BibT_eX]

[DOI]

CoRR, 2024

PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery.

[BibT_eX]

[DOI]

CoRR, 2024

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference.

[BibT_eX]

[DOI]

CoRR, 2024

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data.

[BibT_eX]

[DOI]

Mohammad Hossein Sekhavat

Maxwell Horton

Fartash Faghri

CoRR, 2024

OpenELM: An Efficient Language Model Family with Open Training and Inference Framework.

[BibT_eX]

[DOI]

Mohammad Hossein Sekhavat

CoRR, 2024

Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models.

[BibT_eX]

[DOI]

Iman Mirzadeh

Keivan Alizadeh-Vahid

Proceedings of the Twelfth International Conference on Learning Representations, 2024

TiC-CLIP: Continual Training of CLIP Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding.

[BibT_eX]

[DOI]

Haoxiang Wang

Pavan Kumar Anasosalu Vasu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Separable Self-attention for Mobile Vision Transformers.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Weight subcloning: direct initialization of transformers using larger pretrained ones.

[BibT_eX]

[DOI]

CoRR, 2023

Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

Diffusion Models as Masked Audio-Video Learners.

[BibT_eX]

[DOI]

CoRR, 2023

On the Efficacy of Multi-scale Data Samplers for Vision Applications.

[BibT_eX]

[DOI]

CoRR, 2023

APE: An Open and Shared Annotated Dataset for Learning Urban Pedestrian Path Networks.

[BibT_eX]

[DOI]

CoRR, 2023

OASIS: Automated Assessment of Urban Pedestrian Paths at Scale.

[BibT_eX]

[DOI]

CoRR, 2023

Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022

DiCENet: Dimension-Wise Convolutions for Efficient Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

End-to-End diagnosis of breast biopsy images with transformers.

[BibT_eX]

[DOI]

Medical Image Anal., 2022

Segmenting Skin Biopsy Images with Coarse and Sparse Annotations using U-Net.

[BibT_eX]

[DOI]

J. Digit. Imaging, 2022

RangeAugment: Efficient Online Augmentation with Range Learning.

[BibT_eX]

[DOI]

CoRR, 2022

CVNets: High Performance Library for Computer Vision.

[BibT_eX]

[DOI]

Farzad Abdolhosseini

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Calibration Error Prediction: Ensuring High-Quality Mobile Eye-Tracking.

[BibT_eX]

[DOI]

Proceedings of the ETRA 2022: Symposium on Eye Tracking Research and Applications, Seattle, WA, USA, June 8, 2022

SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection.

[BibT_eX]

[DOI]

Yuxiang Zhang

Anat Caspi

CoRR, 2021

Corrigendum to "Machine learning techniques for mitoses classification" [Comput. Med. Imaging Graphics 87 January (2021) 101832].

[BibT_eX]

[DOI]

Comput. Medical Imaging Graph., 2021

Machine learning techniques for mitoses classification.

[BibT_eX]

[DOI]

Comput. Medical Imaging Graph., 2021

Scale-Aware Transformers for Diagnosing Melanocytic Lesions.

[BibT_eX]

[DOI]

IEEE Access, 2021

EVRNet: Efficient Video Restoration on Edge Devices.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DeLighT: Deep and Light-weight Transformer.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Analysis of Regions of Interest and Distractor Regions in Breast Biopsy Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2021

Collecting Sidewalk Network Data at Scale for Accessible Pedestrian Travel.

[BibT_eX]

[DOI]

Yuxiang Zhang

Anat Caspi

Proceedings of the ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility, 2021

2020

DeLighT: Very Deep and Light-weight Transformer.

[BibT_eX]

[DOI]

CoRR, 2020

HATNet: An End-to-End Holistic Attention Network for Diagnosis of Breast Biopsy Images.

[BibT_eX]

[DOI]

CoRR, 2020

Leveraging Unlabeled Data for Glioma Molecular Subtype and Survival Prediction.

[BibT_eX]

[DOI]

Nicholas Nuechterlein

Beibin Li

Mehmet Saygin Seyfioglu

Patrick J. Cimino

Linda G. Shapiro

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Classifying Breast Histopathology Images with a Ductal Instance-Oriented Pipeline.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling.

[BibT_eX]

[DOI]

Rik Koncel-Kedziorski

Proceedings of the 8th International Conference on Learning Representations, 2020

MedICaT: A Dataset of Medical Images, Captions, and Textual References.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019

DeFINE: DEep Factorized INput Word Embeddings for Neural Sequence Modeling.

[BibT_eX]

[DOI]

Rik Koncel-Kedziorski

CoRR, 2019

A Facial Affect Analysis System for Autism Spectrum Disorder.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Learning to Segment Breast Biopsy Whole Slide Images.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels.

[BibT_eX]

[DOI]

Amar P. Azad

Saneem A. Chemmengath

Vikas Raykar

Shivkumar Kalyanaraman

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

3D-ESPNet with Pyramidal Refinement for Volumetric Brain Tumor Image Segmentation.

[BibT_eX]

[DOI]

Nicholas Nuechterlein

Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Automated Diagnosis of Breast Cancer and Pre-invasive Lesions on Digital Whole Slide Images.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods, 2018

Pyramidal Recurrent Unit for Language Modeling.

[BibT_eX]

[DOI]

Rik Koncel-Kedziorski

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Identifying Most Walkable Direction for Navigation in an Outdoor Environment.

[BibT_eX]

[DOI]

Linda G. Shapiro

CoRR, 2017

2016

Scene-based fingerprinting method for traitor tracing.

[BibT_eX]

[DOI]

Multim. Syst., 2016

mPDF: Framework for Watermarking PDF Files using Image Watermarking Algorithms.

[BibT_eX]

[DOI]

Derrick Newton

CoRR, 2016

Region graph based method for multi-object detection and tracking using depth cameras.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

2014

3D content fingerprinting.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013

A Study of DWT and SVD Based Watermarking Algorithms for Patient Privacy in Medical Images.

[BibT_eX]

[DOI]

Ranjeet Vinayak Marawar

Proceedings of the IEEE International Conference on Healthcare Informatics, 2013

2012

Tampering resistant self recoverable watermarking method using error correction codes.

[BibT_eX]

[DOI]

Vijayaraghavan Varadharajan

Int. J. Inf. Comput. Secur., 2012

On-the-fly Watermarking of Videos for Real-time Applications.

[BibT_eX]

[DOI]

Vijayaraghavan Varadharajan

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011

Tampering Resistant Dual Watermarking Method for Copyright Protection of Still Images.

[BibT_eX]

[DOI]

Vijayaraghavan Varadharajan