Yiming Wu

Orcid: 0000-0002-9866-669X

Affiliations:
  • University of Hong Kong, Department of Computer Science, Pokfulam, Hong Kong
  • University of Sydney, Sydney, NSW, Australia
  • Zhejiang University of Technology, School of Computer Science and Technology, Zhejiang, China


According to our database, Yiming Wu authored at least 30 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning.
CoRR, May, 2026

Device-Conditioned Neural Architecture Search for Efficient Robotic Manipulation.
CoRR, April, 2026

2025
SOEDiff: Efficient Distillation for Small Object Editing.
ACM Trans. Multim. Comput. Commun. Appl., November, 2025

Versatile Video Tokenization with Generative 2D Gaussian Splatting.
CoRR, August, 2025

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks.
CoRR, 2024

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models.
CoRR, 2024

PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis.
CoRR, 2024

SOEDiff: Efficient Distillation for Small Object Editing.
CoRR, 2024

Training-Free Unsupervised Prompt for Vision-Language Models.
CoRR, 2024

Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RE-IDVIS: Person Re-Identification System based on Interactive Visualization.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Self-Distilled Dynamic Fusion Network for Language-Based Fashion Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.
Proceedings of the IEEE International Conference on Acoustics, 2024

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds.
Proceedings of the Computer Vision - ECCV 2024, 2024

Panoptic Scene Graph Generation with Semantics-Prototype Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
D³T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Progressive Target-Styled Feature Augmentation for Unsupervised Domain Adaptation on Point Clouds.
CoRR, 2023

Dynamic Network for Language-based Fashion Retrieval.
Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval, 2023

2022
MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.
CoRR, 2022

F3A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks.
CoRR, 2022

2021
F³A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks.
IEEE Trans. Image Process., 2021

MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Adaptive Graph Representation Learning for Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Context-Aware Deep Spatiotemporal Network for Hand Pose Estimation From Depth Images.
IEEE Trans. Cybern., 2020

BANet: Bidirectional Aggregation Network With Occlusion Handling for Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Adaptive Graph Representation Learning for Video Person Re-identification.
CoRR, 2019

2018
Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images.
CoRR, 2018

