Hao Wang

Orcid: 0000-0002-3086-3128

Affiliations:

Hong Kong University of Science and Technology, Guangzhou, China
Nanyang Technological University, Singapore (PhD)

According to our database¹, Hao Wang authored at least 54 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Generator-Refiner-Examiner: A Tri-Module Data Augmentation Framework for 3D Human Avatar Learning from Monocular Videos.

[BibT_eX]

[DOI]

CoRR, May, 2026

MotionGRPO: Overcoming Low Intra-Group Diversity in GRPO-Based Egocentric Motion Recovery.

[BibT_eX]

[DOI]

CoRR, May, 2026

MultiGO++: Monocular 3D Clothed Human Reconstruction via Geometry-Texture Collaboration.

[BibT_eX]

[DOI]

CoRR, March, 2026

VLM-Guided Group Preference Alignment for Diffusion-based Human Mesh Recovery.

[BibT_eX]

[DOI]

CoRR, February, 2026

FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping.

[BibT_eX]

[DOI]

CoRR, July, 2025

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization.

[BibT_eX]

[DOI]

CoRR, May, 2025

3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, April, 2025

InsTex: Indoor Scenes Stylized Texture Synthesis.

[BibT_eX]

[DOI]

CoRR, January, 2025

Exploring Decoupled Generation for Enhancing VR Interaction and Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2025

Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

An Uncertainty-aware DETR Enhancement Framework for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Multimedia Computing for Health and Medicine, 2025

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization.

[BibT_eX]

[DOI]

Yangsen Chen

Hao Wang

Proceedings of the International Joint Conference on Neural Networks, 2025

RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

SCA3D: Enhancing Cross-Modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction.

[BibT_eX]

[DOI]

Junlong Ren

Hao Wang

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

DVM: Towards Controllable LLM Agents in Social Deduction Games.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Learning Temporal Variations for 4D Point Cloud Segmentation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., December, 2024

ManiCLIP: Multi-attribute Face Manipulation from Text.

[BibT_eX]

[DOI]

Hao Wang

Guosheng Lin

Ana Garcia del Molino

Anran Wang

Jiashi Feng

Zhiqi Shen

Int. J. Comput. Vis., October, 2024

Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations.

[BibT_eX]

[DOI]

CoRR, 2024

All in a Single Image: Large Multimodal Models are In-Image Learners.

[BibT_eX]

[DOI]

CoRR, 2024

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects.

[BibT_eX]

[DOI]

CoRR, 2024

HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Learning Structural Representations for Recipe Generation and Food Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

StarNet: Style-Aware 3D Point Cloud Generation.

[BibT_eX]

[DOI]

Yunfan Zhang

Hao Wang

Guosheng Lin

Vun Chan Hua Nicholas

Zhiqi Shen

Chunyan Miao

CoRR, 2023

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes With Semantic Consistency and Attention Mechanism.

[BibT_eX]

[DOI]

Palakorn Achananuparp

Ee-Peng Lim

Steven C. H. Hoi

IEEE Trans. Multim., 2022

Cross-Modal Graph With Meta Concepts for Video Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Decomposing generation networks with structure prediction for recipe generation.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

ManiCLIP: Multi-Attribute Face Manipulation from Text.

[BibT_eX]

[DOI]

Hao Wang

Guosheng Lin

Ana Garcia del Molino

CoRR, 2022

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Spatial and Temporal Variations for 4D Point Cloud Segmentation.

[BibT_eX]

[DOI]

CoRR, 2022

Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021

Cycle-Consistent Inverse GAN for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

Decomposed Generation Networks with Structure Prediction for Recipe Generation from Food Images.

[BibT_eX]

[DOI]

CoRR, 2020

Structure-Aware Generation Network for Recipe Generation from Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging.

[BibT_eX]

[DOI]

Palakorn Achananuparp

Ee-Peng Lim

Steven C. H. Hoi

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learning Cross-Modal Embeddings With Adversarial Networks for Cooking Recipes and Food Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Hao Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...