Yiming Wu

ORCID: 0000-0002-9866-669X

Affiliations:

University of Hong Kong, Department of Computer Science, Pokfulam, Hong Kong
University of Sydney, Sydney, NSW, Australia
Zhejiang University of Technology, School of Computer Science and Technology, Zhejiang, China

According to our database¹, Yiming Wu authored at least 30 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning.

CoRR, May, 2026

Device-Conditioned Neural Architecture Search for Efficient Robotic Manipulation.

CoRR, April, 2026

2025

SOEDiff: Efficient Distillation for Small Object Editing.

ACM Trans. Multim. Comput. Commun. Appl., November, 2025

Versatile Video Tokenization with Generative 2D Gaussian Splatting.

CoRR, August, 2025

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks.

CoRR, 2024

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models.

CoRR, 2024

PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis.

CoRR, 2024

SOEDiff: Efficient Distillation for Small Object Editing.

CoRR, 2024

Training-Free Unsupervised Prompt for Vision-Language Models.

CoRR, 2024

Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RE-IDVIS: Person Re-Identification System based on Interactive Visualization.

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Self-Distilled Dynamic Fusion Network for Language-Based Fashion Retrieval.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds.

Proceedings of the Computer Vision - ECCV 2024, 2024

Panoptic Scene Graph Generation with Semantics-Prototype Learning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

D³T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data.

ACM Trans. Multim. Comput. Commun. Appl., 2023

Progressive Target-Styled Feature Augmentation for Unsupervised Domain Adaptation on Point Clouds.

CoRR, 2023

Dynamic Network for Language-based Fashion Retrieval.

Hangfei Li

Yiming Wu

Fangfang Wang

Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval, 2023

2022

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.

CoRR, 2022

F³A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks.

CoRR, 2022

2021

F³A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks.

IEEE Trans. Image Process., 2021

MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

Adaptive Graph Representation Learning for Video Person Re-Identification.

Yiming Wu

Omar El Farouk Bourahla

IEEE Trans. Image Process., 2020

Context-Aware Deep Spatiotemporal Network for Hand Pose Estimation From Depth Images.

IEEE Trans. Cybern., 2020

BANet: Bidirectional Aggregation Network With Occlusion Handling for Panoptic Segmentation.

Yifeng Chen

Guangchen Lin

Songyuan Li

Omar El Farouk Bourahla

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Adaptive Graph Representation Learning for Video Person Re-identification.

Yiming Wu

Omar El Farouk Bourahla

Xi Li

Fei Wu

Qi Tian

CoRR, 2019

2018

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images.

CoRR, 2018
