Sachin Mehta

Orcid: 0000-0002-5420-4725

According to our database, Sachin Mehta authored at least 56 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Separable Self-attention for Mobile Vision Transformers.
Trans. Mach. Learn. Res., 2023

Weight subcloning: direct initialization of transformers using larger pretrained ones.
CoRR, 2023

Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models.
CoRR, 2023

TiC-CLIP: Continual Training of CLIP Models.
CoRR, 2023

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding.
CoRR, 2023

CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement.
CoRR, 2023

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models.
CoRR, 2023

Diffusion Models as Masked Audio-Video Learners.
CoRR, 2023

On the Efficacy of Multi-scale Data Samplers for Vision Applications.
CoRR, 2023

Bytes Are All You Need: Transformers Operating Directly On File Bytes.
CoRR, 2023

APE: An Open and Shared Annotated Dataset for Learning Urban Pedestrian Path Networks.
CoRR, 2023

OASIS: Automated Assessment of Urban Pedestrian Paths at Scale.
CoRR, 2023

Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
DiCENet: Dimension-Wise Convolutions for Efficient Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

End-to-End diagnosis of breast biopsy images with transformers.
Medical Image Anal., 2022

Segmenting Skin Biopsy Images with Coarse and Sparse Annotations using U-Net.
J. Digit. Imaging, 2022

RangeAugment: Efficient Online Augmentation with Range Learning.
CoRR, 2022

CVNets: High Performance Library for Computer Vision.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Calibration Error Prediction: Ensuring High-Quality Mobile Eye-Tracking.
Proceedings of the ETRA 2022: Symposium on Eye Tracking Research and Applications, Seattle, WA, USA, June 8, 2022

SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection.
CoRR, 2021

Corrigendum to "Machine learning techniques for mitoses classification" [Comput. Med. Imaging Graphics 87 January (2021) 101832].
Comput. Medical Imaging Graph., 2021

Machine learning techniques for mitoses classification.
Comput. Medical Imaging Graph., 2021

Scale-Aware Transformers for Diagnosing Melanocytic Lesions.
IEEE Access, 2021

EVRNet: Efficient Video Restoration on Edge Devices.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DeLighT: Deep and Light-weight Transformer.
Proceedings of the 9th International Conference on Learning Representations, 2021

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Analysis of Regions of Interest and Distractor Regions in Breast Biopsy Images.
Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2021

Collecting Sidewalk Network Data at Scale for Accessible Pedestrian Travel.
Proceedings of the ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility, 2021

2020
DeLighT: Very Deep and Light-weight Transformer.
CoRR, 2020

HATNet: An End-to-End Holistic Attention Network for Diagnosis of Breast Biopsy Images.
CoRR, 2020

Leveraging Unlabeled Data for Glioma Molecular Subtype and Survival Prediction.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Classifying Breast Histopathology Images with a Ductal Instance-Oriented Pipeline.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling.
Proceedings of the 8th International Conference on Learning Representations, 2020

MedICaT: A Dataset of Medical Images, Captions, and Textual References.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
DeFINE: DEep Factorized INput Word Embeddings for Neural Sequence Modeling.
CoRR, 2019

A Facial Affect Analysis System for Autism Spectrum Disorder.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning to Segment Breast Biopsy Whole Slide Images.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

3D-ESPNet with Pyramidal Refinement for Volumetric Brain Tumor Image Segmentation.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Automated Diagnosis of Breast Cancer and Pre-invasive Lesions on Digital Whole Slide Images.
Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods, 2018

Pyramidal Recurrent Unit for Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Identifying Most Walkable Direction for Navigation in an Outdoor Environment.
CoRR, 2017

2016
Scene-based fingerprinting method for traitor tracing.
Multim. Syst., 2016

mPDF: Framework for Watermarking PDF Files using Image Watermarking Algorithms.
CoRR, 2016

Region graph based method for multi-object detection and tracking using depth cameras.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

2014
3D content fingerprinting.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
A Study of DWT and SVD Based Watermarking Algorithms for Patient Privacy in Medical Images.
Proceedings of the IEEE International Conference on Healthcare Informatics, 2013

2012
Tampering resistant self recoverable watermarking method using error correction codes.
Int. J. Inf. Comput. Secur., 2012

On-the-fly Watermarking of Videos for Real-time Applications.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011
Tampering Resistant Dual Watermarking Method for Copyright Protection of Still Images.
Proceedings of the Advanced Computing, Networking and Security - International Conference, 2011

