Yu Wu

Orcid: 0000-0002-1680-8253

Affiliations:
  • Princeton University, NJ, USA
  • University of Technology Sydney, Center for Artificial Intelligence, Australia


According to our database, Yu Wu authored at least 53 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning.
CoRR, 2024

2023
Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Switchable Novel Object Captioner.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

DETER: Detecting Edited Regions for Deterring Generative Manipulations.
CoRR, 2023

RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments.
CoRR, 2023

Unseen Image Synthesis with Diffusion Models.
CoRR, 2023

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation.
CoRR, 2023

Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models.
CoRR, 2023

Boundary Guided Learning-Free Semantic Control with Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning to Segment Every Referring Object Point by Point.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Identifying Visible Parts via Pose Estimation for Occluded Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., 2022

Learning With Noisy Labels via Self-Reweighting From Class Centroids.
IEEE Trans. Neural Networks Learn. Syst., 2022

Saying the Unseen: Video Descriptions via Dialog Agents.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

NAP: Neural architecture search with pruning.
Neurocomputing, 2022

Vision+X: A Survey on Multimodal Learning in the Light of Data.
CoRR, 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation.
CoRR, 2022

Enabling Detailed Action Recognition Evaluation Through Video Dataset Augmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Quantized GAN for Complex Music Generation from Dance Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

Multi-query Video Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022

SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Large-scale Video Panoptic Segmentation in the Wild: A Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Learn by Jointly Optimizing Neural Architecture and Weights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Multimodal Learning and Video Analysis with Deep Neural Networks.
PhD thesis, 2021

Learning to Anticipate Egocentric Actions by Imagination.
IEEE Trans. Image Process., 2021

Holistic LSTM for Pedestrian Trajectory Prediction.
IEEE Trans. Image Process., 2021

Progressive Transfer Learning for Face Anti-Spoofing.
IEEE Trans. Image Process., 2021

Contrastive Video-Language Segmentation.
CoRR, 2021

Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation.
CoRR, 2021

ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation.
CoRR, 2021

Learning Audio-Visual Correlations From Variational Cross-Modal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Revisiting EmbodiedQA: A Simple Baseline and Beyond.
IEEE Trans. Image Process., 2020

Unsupervised Person Re-identification via Cross-Camera Similarity Exploration.
IEEE Trans. Image Process., 2020

Cascaded Revision Network for Novel Object Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Describing Unseen Videos via Multi-modal Cooperative Dialog Agents.
Proceedings of the Computer Vision - ECCV 2020, 2020

Gated Channel Transformation for Visual Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Person Re-Identification via Softened Similarity Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Symbiotic Attention with Privileged Information for Egocentric Action Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Progressive Learning for Person Re-Identification With One Example.
IEEE Trans. Image Process., 2019

Improving person re-identification by attribute and identity learning.
Pattern Recognit., 2019

Cascaded Revision Network for Novel Object Captioning.
CoRR, 2019

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019.
CoRR, 2019

Dual Attention Matching for Audio-Visual Event Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Pose-Guided Feature Alignment for Occluded Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Decoupled Novel Object Captioner.
Proceedings of the 2018 ACM Multimedia Conference, 2018

Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Improving Person Re-identification by Attribute and Identity Learning.
CoRR, 2017

