Hao Tang

Orcid: 0000-0002-2077-1246

Affiliations:
  • ETH Zurich, Switzerland
  • University of Trento, Multimedia and Human Understanding Group, Italy
  • Peking University, Shenzhen Graduate School, Beijing, China (former)


According to our database1, Hao Tang authored at least 143 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Cloth Interactive Transformer for Virtual Try-On.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

ControlFace: Feature Disentangling for Controllable Face Swapping.
J. Imaging, January, 2024

Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation.
CoRR, 2024

Efficient Pruning of Large Language Model with Adaptive Estimation Fusion.
CoRR, 2024

InstructGIE: Towards Generalizable Image Editing.
CoRR, 2024

Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Bipartite Graph Diffusion Model for Human Interaction Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis.
Mach. Intell. Res., December, 2023

On-device audio-visual multi-person wake word spotting.
CAAI Trans. Intell. Technol., December, 2023

Measuring the Consistency and Diversity of 3D Face Generation.
IEEE J. Sel. Top. Signal Process., November, 2023

Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Interactive Neural Painting.
Comput. Vis. Image Underst., October, 2023

Multi-hypothesis representation learning for transformer-based 3D human pose estimation.
Pattern Recognit., September, 2023

AO2-DETR: Arbitrary-Oriented Object Detection Transformer.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis.
Int. J. Comput. Vis., March, 2023

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Bidirectional Transformer GAN for Long-term Human Motion Prediction.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Unsupervised Key Frame Extraction for Efficient Video Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation.
IEEE Trans. Multim., 2023

Cross-View Panorama Image Synthesis.
IEEE Trans. Multim., 2023

Interaction Transformer for Human Reaction Generation.
IEEE Trans. Multim., 2023

Adaptive Convolutional Subspace Reasoning Network for Few-Shot SAR Target Recognition.
IEEE Trans. Geosci. Remote. Sens., 2023

Transductive Prototypical Attention Reasoning Network for Few-Shot SAR Target Recognition.
IEEE Trans. Geosci. Remote. Sens., 2023

Local and Global GANs With Semantic-Aware Upsampling for Image Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Towards High-quality HDR Deghosting with Conditional Diffusion Models.
CoRR, 2023

Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition.
CoRR, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.
CoRR, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.
CoRR, 2023

Few-shot Medical Image Segmentation with Cycle-resemblance Attention.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Level Lottery Ticket Hypothesis for Vision Transformers.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

SpeedDETR: Speed-aware Transformers for End-to-end Object Detection.
Proceedings of the International Conference on Machine Learning, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TINYCOD: Tiny and Effective Model for Camouflaged Object Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

MLP-GAN for Brain Vessel Image Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2023

PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LSDIR: A Large Scale Dataset for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

DE-net: Dynamic Text-Guided Image Editing Adversarial Networks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Towards Real-Time Segmentation on the Edge.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes.
IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.
IEEE Trans. Image Process., 2022

Quasi-Equilibrium Feature Pyramid Network for Salient Object Detection.
IEEE Trans. Image Process., 2022

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.
IEEE Trans. Image Process., 2022

Supervised Multi-Scale Attention-Guided Ship Detection in Optical Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Facial Expression Translation Using Landmark Guided GANs.
IEEE Trans. Affect. Comput., 2022

Cross-view panorama image synthesis with progressive attention GANs.
Pattern Recognit., 2022

PB-GCN: Progressive binary graph convolutional networks for skeleton-based action recognition.
Neurocomputing, 2022

The Lottery Ticket Hypothesis for Vision Transformers.
CoRR, 2022

Training and Tuning Generative Neural Radiance Fields for Attribute-Conditional 3D-Aware Face Generation.
CoRR, 2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation.
CoRR, 2022

PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation.
CoRR, 2022

Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition.
CoRR, 2022

3D-Aware Video Generation.
CoRR, 2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation.
CoRR, 2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis.
CoRR, 2022

Real-Time Portrait Stylization on the Edge.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Cloth-Irrelevant Harmonious Attention Network for Cloth-Changing Person Re-identification.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-Identification.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Unsupervised Domain Adaptation Person Re-Identification by Camera-Aware Style Decoupling and Uncertainty Modeling.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization.
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022

3D-Aware Semantic-Guided Generative Model for Human Synthesis.
Proceedings of the Computer Vision - ECCV 2022, 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Interpretable Video Super-Resolution via Alternating Optimization.
Proceedings of the Computer Vision - ECCV 2022, 2022

FPGA-aware automatic acceleration framework for vision transformer with mixed-scheme quantization: late breaking results.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Restore 3D Face from In-the-Wild Degraded Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Physically-guided Disentangled Implicit Rendering for 3D Face Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data.
IEEE Trans. Neural Networks Learn. Syst., 2021

Layout-to-Image Translation With Double Pooling Generative Adversarial Networks.
IEEE Trans. Image Process., 2021

LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2021

Structured discriminative tensor dictionary learning for unsupervised domain adaptation.
Neurocomputing, 2021

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning.
CoRR, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.
CoRR, 2021

Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation.
CoRR, 2021

Looking Outside the Window: Wider-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.
CoRR, 2021

Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization.
CoRR, 2021

Transformer-Based Source-Free Domain Adaptation.
CoRR, 2021

Cloth Interactive Transformer for Virtual Try-On.
CoRR, 2021

Transformers Solve the Limited Receptive Field for Monocular Depth Prediction.
CoRR, 2021

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.
CoRR, 2021

Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Cross-View Exocentric to Egocentric Video Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

AniFormer: Data-driven 3D Animation with Transformer.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Unified Generative Adversarial Networks for Controllable Image-to-Image Translation.
IEEE Trans. Image Process., 2020

Relevant region prediction for crowd counting.
Neurocomputing, 2020

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis.
CoRR, 2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.
CoRR, 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
CoRR, 2020

Cross-View Image Synthesis with Deformable Convolution and Attention Mechanism.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Attention GANs for Semantic Image Synthesis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

XingGAN for Person Image Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion.
Neurocomputing, 2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.
CoRR, 2019

Improving Semantic Segmentation of Aerial Images Using Patch-based Attention.
CoRR, 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.
CoRR, 2019

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation.
CoRR, 2019

Deep Micro-Dictionary Learning and Coding Network.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.
Proceedings of the International Joint Conference on Neural Networks, 2019

Joint Learning of Self-Representation and Indicator for Multi-View Image Clustering.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expression Conditional Gan for Facial Expression-to-Expression Translation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Attribute-Guided Sketch Generation.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.
Proceedings of the Computer Vision - ACCV 2018, 2018

2016
Sequential Bag-of-Words model for human action classification.
CAAI Trans. Intell. Technol., 2016

Adaptive Region Boosting method with biased entropy for path planning in changing environment.
CAAI Trans. Intell. Technol., 2016

A Novel Feature Matching Strategy for Large Scale Image Retrieval.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Gender Classification Using Pyramid Segmentation for Unconstrained Back-facing Video Sequences.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

SDM-BSM: A fusing depth scheme for human action recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Two-Layers Local Coordinate Coding.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015


  Loading...