Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Learning Rich Speech Representations with Acoustic-Semantic Factorization.

[BibT_eX]

[DOI]

Minxue Niu

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Beyond Speaker Identity: Text Guided Target Speech Extraction.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

NowYouSee Me: Context-Aware Automatic Audio Description.

[BibT_eX]

[DOI]

CoRR, 2024

DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism.

[BibT_eX]

[DOI]

Sudha Krishnamurthy

Vimal Bhat

Abhinav Jain

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Text-Guided Video Masked Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos.

[BibT_eX]

[DOI]

CoRR, 2023

A Simple and Efficient method for Dubbed Audio Sync Detection using Compressive Sensing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2023

MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation.

[BibT_eX]

[DOI]

Hector J. Santos-Villalobos

Vimal Bhat

Rohith MV

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Motion-Guided Masking for Spatiotemporal Representation Learning.

[BibT_eX]

[DOI]

Hector J. Santos-Villalobos

Rohith MV

Xinyu Li

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LipNeRF: What is the right feature space to lip-sync a NeRF?

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

2021

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Vimal Bhat

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...