Trishul Chilimbi

ORCID: 0000-0003-4596-6784

According to our database, Trishul Chilimbi authored at least 22 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VidLA: Video-Language Alignment at Scale.
CoRR, 2024

Robust Multi-Task Learning with Excess Risks.
CoRR, 2024

2023
Web-Scale Semantic Product Search with Large Language Models.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SST: Semantic and Structural Transformers for Hierarchy-aware Language Models in E-commerce.
Proceedings of the IEEE International Conference on Big Data, 2023

ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud.
Proc. VLDB Endow., 2022

SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing.
CoRR, 2022

Efficient and effective training of language and graph neural network models.
CoRR, 2022

DynaMaR: Dynamic Prompt with Mask Token Representation.
CoRR, 2022

DCAF-BERT: A Distilled Cachable Adaptable Factorized Model For Improved Ads CTR Prediction.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

DynaMaR: Dynamic Prompt with Mask Token Representation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

Vision-Language Pre-Training with Triple Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-modal Alignment using Representation Codebook.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MICO: Selective Search with Mutual Information Co-training.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning.
CoRR, 2021

MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling.
CoRR, 2021

2020
Tiering as a Stochastic Submodular Optimization Problem.
CoRR, 2020
