Zane Durante

Orcid: 0000-0001-9038-8915

According to our database1, Zane Durante authored at least 22 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
VideoWeave: A Data-Centric Approach for Efficient Video Understanding.
CoRR, January, 2026

2025
SAMDWICH: Moment-aware Video-text Alignment for Referring Video Object Segmentation.
CoRR, August, 2025

Latest Object Memory Management for Temporally Consistent Video Instance Segmentation.
CoRR, July, 2025

Towards Fine-Grained Video Question Answering.
CoRR, March, 2025

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LOMM: Latest Object Memory Management for Temporally Consistent Video Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Re-thinking Temporal Search for Long-Form Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

An Interactive Agent Foundation Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
CoRR, 2024

Position Paper: Agent AI Towards a Holistic Intelligence.
CoRR, 2024

An Interactive Agent Foundation Model.
CoRR, 2024

Agent AI: Surveying the Horizons of Multimodal Interaction.
CoRR, 2024

Differentially Private Video Activity Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

HourVideo: 1-Hour Video-Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MindAgent: Emergent Gaming Interaction.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Few-Shot Classification of Interactive Activities of Daily Living (InteractADL).
Proceedings of the 35th British Machine Vision Conference, 2024

2023
MindAgent: Emergent Gaming Interaction.
CoRR, 2023

2022
Causal indicators for assessing the truthfulness of child speech in forensic interviews.
Comput. Speech Lang., 2022

MOMA-LRG: Language-Refined Graphs for Multi-Object Multi-Actor Activity Parsing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin.
CoRR, 2021

2020
Identifying Truthful Language in Child Interviews.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018
Deep CNN Frame Interpolation with Lessons Learned from Natural Language Processing.
CoRR, 2018


  Loading...