Hao Wang

Orcid: 0000-0002-3086-3128

Affiliations:
  • Hong Kong University of Science and Technology, Guangzhou, China
  • Nanyang Technological University, Singapore (PhD)


According to our database, Hao Wang authored at least 52 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
MultiGO++: Monocular 3D Clothed Human Reconstruction via Geometry-Texture Collaboration.
CoRR, March, 2026

VLM-Guided Group Preference Alignment for Diffusion-based Human Mesh Recovery.
CoRR, February, 2026

FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping.
CoRR, July, 2025

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization.
CoRR, May, 2025

3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting.
CoRR, April, 2025

InsTex: Indoor Scenes Stylized Texture Synthesis.
CoRR, January, 2025

Exploring Decoupled Generation for Enhancing VR Interaction and Control.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2025

Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

An Uncertainty-aware DETR Enhancement Framework for Object Detection.
Proceedings of the 2nd International Workshop on Multimedia Computing for Health and Medicine, 2025

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects.
Proceedings of the International Joint Conference on Neural Networks, 2025

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image.
Proceedings of the International Joint Conference on Neural Networks, 2025

Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization.
Proceedings of the International Joint Conference on Neural Networks, 2025

RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

SCA3D: Enhancing Cross-Modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

DVM: Towards Controllable LLM Agents in Social Deduction Games.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Learning Temporal Variations for 4D Point Cloud Segmentation.
Int. J. Comput. Vis., December, 2024

ManiCLIP: Multi-attribute Face Manipulation from Text.
Int. J. Comput. Vis., October, 2024

Human Multi-View Synthesis from a Single-View Model: Transferred Body and Face Representations.
CoRR, 2024

All in a Single Image: Large Multimodal Models are In-Image Learners.
CoRR, 2024

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects.
CoRR, 2024

HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Learning Structural Representations for Recipe Generation and Food Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

StarNet: Style-Aware 3D Point Cloud Generation.
CoRR, 2023

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes With Semantic Consistency and Attention Mechanism.
IEEE Trans. Multim., 2022

Cross-Modal Graph With Meta Concepts for Video Captioning.
IEEE Trans. Image Process., 2022

Decomposing generation networks with structure prediction for recipe generation.
Pattern Recognit., 2022

ManiCLIP: Multi-Attribute Face Manipulation from Text.
CoRR, 2022

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image.
CoRR, 2022

Learning Spatial and Temporal Variations for 4D Point Cloud Segmentation.
CoRR, 2022

Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Cycle-Consistent Inverse GAN for Text-to-Image Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Decomposed Generation Networks with Structure Prediction for Recipe Generation from Food Images.
CoRR, 2020

Structure-Aware Generation Network for Recipe Generation from Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learning Cross-Modal Embeddings With Adversarial Networks for Cooking Recipes and Food Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

