We stand with Ukraine

We stand with Ukraine

Du Tran

Orcid: 0000-0001-9673-7194

According to our database¹, Du Tran authored at least 47 papers between 2008 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

FOCUS +: Enhancing Sustained Attention Through Comparative Evaluation of Digital and Physical Interventions.

[DOI]

,

,

,

,

,

Proceedings of the Recent Challenges in Intelligent Information and Database Systems: 18th Asian Conference, 2026

2025

Layer-Aware Video Composition via Split-then-Merge.

[DOI]

,

,

Ming-Hsuan Yang

,

,

,

CoRR, November, 2025

SEAL: Semantic Attention Learning for Long Video Representation.

[DOI]

,

,

,

Vishnu Naresh Boddeti

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision.

[DOI]

,

,

,

Manmohan Chandraker

,

Lorenzo Torresani

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

FLAVR: flow-free architecture for fast video frame interpolation.

[DOI]

,

,

Manmohan Chandraker

,

Mach. Vis. Appl., September, 2023

Learning Space-Time Semantic Correspondences.

[DOI]

,

CoRR, 2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries.

[DOI]

,

Effrosyni Mavroudi

,

,

Sainbayar Sukhbaatar

,

,

,

Lorenzo Torresani

,

CoRR, 2023

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation.

[DOI]

,

,

Manmohan Chandraker

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Relational Space-Time Query in Long-Form Videos.

[DOI]

,

,

,

,

Lorenzo Torresani

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Long-Short Temporal Contrastive Learning of Video Transformers.

[DOI]

,

Gedas Bertasius

,

,

Lorenzo Torresani

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation.

[DOI]

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Self-Supervised Learning by Cross-Modal Audio-Video Clustering.

[DOI]

,

,

,

Lorenzo Torresani

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Video Modeling With Correlation Networks.

[DOI]

,

,

Lorenzo Torresani

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

What Makes Training Multi-Modal Classification Networks Hard?

[DOI]

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FASTER Recurrent Networks for Efficient Video Classification.

[DOI]

,

,

Laura Sevilla-Lara

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Self-Supervised Learning by Cross-Modal Audio-Video Clustering.

[DOI]

,

,

Lorenzo Torresani

,

,

CoRR, 2019

FASTER Recurrent Networks for Video Classification.

[DOI]

,

Laura Sevilla-Lara

,

,

,

,

CoRR, 2019

UniDual: A Unified Model for Image and Video Understanding.

[DOI]

,

,

Lorenzo Torresani

CoRR, 2019

What Makes Training Multi-Modal Networks Hard?

[DOI]

,

,

CoRR, 2019

Large-scale weakly-supervised pre-training for video action recognition.

[DOI]

Deepti Ghadiyaram

,

,

,

,

,

CoRR, 2019

Learning Temporal Pose Estimation from Sparsely-Labeled Videos.

[DOI]

Gedas Bertasius

,

Christoph Feichtenhofer

,

,

,

Lorenzo Torresani

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Classification With Channel-Separated Convolutional Networks.

[DOI]

,

,

,

Lorenzo Torresani

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SCSampler: Sampling Salient Clips From Video for Efficient Action Recognition.

[DOI]

,

,

Lorenzo Torresani

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DistInit: Learning Video Representations Without a Single Labeled Video.

[DOI]

,

,

Lorenzo Torresani

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Leveraging the Present to Anticipate the Future in Videos.

[DOI]

,

,

,

,

Lorenzo Torresani

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition.

[DOI]

Deepti Ghadiyaram

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Learning Discriminative Motion Features Through Detection.

[DOI]

Gedas Bertasius

,

Christoph Feichtenhofer

,

,

,

Lorenzo Torresani

CoRR, 2018

Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization.

[DOI]

,

,

Lorenzo Torresani

CoRR, 2018

Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization.

[DOI]

,

,

Lorenzo Torresani

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scenes-Objects-Actions: A Multi-task, Multi-label Video Dataset.

[DOI]

,

,

,

,

,

Lorenzo Torresani

,

Proceedings of the Computer Vision - ECCV 2018, 2018

A Closer Look at Spatiotemporal Convolutions for Action Recognition.

[DOI]

,

,

Lorenzo Torresani

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Detect-and-Track: Efficient Pose Estimation in Videos.

[DOI]

,

Georgia Gkioxari

,

Lorenzo Torresani

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

ConvNet Architecture Search for Spatiotemporal Feature Learning.

[DOI]

,

,

,

,

CoRR, 2017

Transformation-Based Models of Video Sequences.

[DOI]

Joost R. van Amersfoort

,

,

Marc'Aurelio Ranzato

,

,

,

Soumith Chintala

CoRR, 2017

Deciphering Severely Degraded License Plates.

[DOI]

,

,

Lorenzo Torresani

,

Proceedings of the Media Watermarking, Security, and Forensics 2017, Burlingame, CA, USA, 29 January 2017, 2017

2016

Representations and Models for Large-Scale Video Understanding.

[DOI]

PhD thesis, 2016

EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis.

[DOI]

,

Lorenzo Torresani

Int. J. Comput. Vis., 2016

ViCom: Benchmark and Methods for Video Comprehension.

[DOI]

,

,

Lorenzo Torresani

CoRR, 2016

Deep End2End Voxel2Voxel Prediction.

[DOI]

,

Lubomir D. Bourdev

,

,

Lorenzo Torresani

,

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015

Learning Spatiotemporal Features with 3D Convolutional Networks.

[DOI]

,

Lubomir D. Bourdev

,

,

Lorenzo Torresani

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search.

[DOI]

,

,

David A. Forsyth

IEEE Trans. Pattern Anal. Mach. Intell., 2014

EXMOVES: Classifier-based Features for Scalable Action Recognition.

[DOI]

,

Lorenzo Torresani

Proceedings of the 2nd International Conference on Learning Representations, 2014

C3D: Generic Features for Video Analysis.

[DOI]

,

Lubomir D. Bourdev

,

,

Lorenzo Torresani

,

CoRR, 2014

2012

Max-Margin Structured Output Regression for Spatio-Temporal Action Localization.

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011

Optimal spatio-temporal path discovery for video event detection.

[DOI]

,

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2008

Human Activity Recognition with Metric Learning.

[DOI]

,

Alexander Sorokin

Proceedings of the Computer Vision, 2008

Loading...