Hao Tang

Mohamed Daoudi

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis.

Mach. Intell. Res., December, 2023

On-device audio-visual multi-person wake word spotting.

CAAI Trans. Intell. Technol., December, 2023

Measuring the Consistency and Diversity of 3D Face Generation.

IEEE J. Sel. Top. Signal Process., November, 2023

Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion.

IEEE Trans. Circuits Syst. Video Technol., October, 2023

Interactive Neural Painting.

Comput. Vis. Image Underst., October, 2023

Multi-hypothesis representation learning for transformer-based 3D human pose estimation.

Pattern Recognit., September, 2023

AO2-DETR: Arbitrary-Oriented Object Detection Transformer.

IEEE Trans. Circuits Syst. Video Technol., May, 2023

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

Philip H. S. Torr

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks.

IEEE Trans. Neural Networks Learn. Syst., April, 2023

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis.

Int. J. Comput. Vis., March, 2023

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling.

ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Bidirectional Transformer GAN for Long-term Human Motion Prediction.

ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Unsupervised Key Frame Extraction for Efficient Video Classification.

ACM Trans. Multim. Comput. Commun. Appl., 2023

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation.

Xavier Alameda-Pineda

Elisa Ricci

IEEE Trans. Multim., 2023

Cross-View Panorama Image Synthesis.

IEEE Trans. Multim., 2023

Interaction Transformer for Human Reaction Generation.

IEEE Trans. Multim., 2023

Adaptive Convolutional Subspace Reasoning Network for Few-Shot SAR Target Recognition.

IEEE Trans. Geosci. Remote. Sens., 2023

Transductive Prototypical Attention Reasoning Network for Few-Shot SAR Target Recognition.

IEEE Trans. Geosci. Remote. Sens., 2023

Local and Global GANs With Semantic-Aware Upsampling for Image Generation.

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Towards High-quality HDR Deghosting with Conditional Diffusion Models.

CoRR, 2023

Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition.

CoRR, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.

CoRR, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.

CoRR, 2023

Few-shot Medical Image Segmentation with Cycle-resemblance Attention.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Level Lottery Ticket Hypothesis for Vision Transformers.

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

SpeedDETR: Speed-aware Transformers for End-to-end Object Detection.

Proceedings of the International Conference on Machine Learning, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TINYCOD: Tiny and Effective Model for Camouflaged Object Detection.

Proceedings of the IEEE International Conference on Acoustics, 2023

MLP-GAN for Brain Vessel Image Segmentation.

Proceedings of the IEEE International Conference on Acoustics, 2023

PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LSDIR: A Large Scale Dataset for Image Restoration.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

DE-net: Dynamic Text-Guided Image Editing Adversarial Networks.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Towards Real-Time Segmentation on the Edge.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes.

IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.

IEEE Trans. Image Process., 2022

Quasi-Equilibrium Feature Pyramid Network for Salient Object Detection.

IEEE Trans. Image Process., 2022

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.

IEEE Trans. Image Process., 2022

Supervised Multi-Scale Attention-Guided Ship Detection in Optical Remote Sensing Images.

IEEE Trans. Geosci. Remote. Sens., 2022

Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.

IEEE Trans. Geosci. Remote. Sens., 2022

Facial Expression Translation Using Landmark Guided GANs.

IEEE Trans. Affect. Comput., 2022

Cross-view panorama image synthesis with progressive attention GANs.

Pattern Recognit., 2022

PB-GCN: Progressive binary graph convolutional networks for skeleton-based action recognition.

Neurocomputing, 2022

The Lottery Ticket Hypothesis for Vision Transformers.

CoRR, 2022

Training and Tuning Generative Neural Radiance Fields for Attribute-Conditional 3D-Aware Face Generation.

CoRR, 2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation.

CoRR, 2022

PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation.

CoRR, 2022

Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition.

CoRR, 2022

3D-Aware Video Generation.

CoRR, 2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation.

CoRR, 2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis.

CoRR, 2022

Real-Time Portrait Stylization on the Edge.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Cloth-Irrelevant Harmonious Attention Network for Cloth-Changing Person Re-identification.

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-Identification.

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Unsupervised Domain Adaptation Person Re-Identification by Camera-Aware Style Decoupling and Uncertainty Modeling.

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization.

Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022

3D-Aware Semantic-Guided Generative Model for Human Synthesis.

Proceedings of the Computer Vision - ECCV 2022, 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution.

Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning.

Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Interpretable Video Super-Resolution via Alternating Optimization.

Proceedings of the Computer Vision - ECCV 2022, 2022

FPGA-aware automatic acceleration framework for vision transformer with mixed-scheme quantization: late breaking results.

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Restore 3D Face from In-the-Wild Degraded Images.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Physically-guided Disentangled Implicit Rendering for 3D Face Modeling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking.

Yidi Li

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data.

IEEE Trans. Neural Networks Learn. Syst., 2021

Layout-to-Image Translation With Double Pooling Generative Adversarial Networks.

IEEE Trans. Image Process., 2021

LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images.

Lei Ding

Lorenzo Bruzzone

IEEE Trans. Geosci. Remote. Sens., 2021

Structured discriminative tensor dictionary learning for unsupervised domain adaptation.

Neurocomputing, 2021

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning.

CoRR, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.

CoRR, 2021

Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation.

CoRR, 2021

Looking Outside the Window: Wider-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.

CoRR, 2021

Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization.

CoRR, 2021

Transformer-Based Source-Free Domain Adaptation.

CoRR, 2021

Cloth Interactive Transformer for Virtual Try-On.

CoRR, 2021

Transformers Solve the Limited Receptive Field for Monocular Depth Prediction.

CoRR, 2021

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.

CoRR, 2021

Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Cross-View Exocentric to Egocentric Video Synthesis.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation.

Bin Ren

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

AniFormer: Data-driven 3D Animation with Transformer.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation.

IEEE Trans. Image Process., 2020

Relevant region prediction for crowd counting.

Neurocomputing, 2020

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis.

CoRR, 2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.

CoRR, 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

CoRR, 2020

Cross-View Image Synthesis with Deformable Convolution and Attention Mechanism.

Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Attention GANs for Semantic Image Synthesis.

Song Bai

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

XingGAN for Person Image Generation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.

Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019

Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion.

Neurocomputing, 2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.

CoRR, 2019

Improving Semantic Segmentation of Aerial Images Using Patch-based Attention.

Lei Ding

Lorenzo Bruzzone

CoRR, 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.

CoRR, 2019

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation.

CoRR, 2019

Deep Micro-Dictionary Learning and Coding Network.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.

Proceedings of the International Joint Conference on Neural Networks, 2019

Joint Learning of Self-Representation and Indicator for Multi-View Image Clustering.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expression Conditional GAN for Facial Expression-to-Expression Translation.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Attribute-Guided Sketch Generation.

Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.

Proceedings of the Computer Vision - ACCV 2018, 2018

2016

Sequential Bag-of-Words model for human action classification.

CAAI Trans. Intell. Technol., 2016

Adaptive Region Boosting method with biased entropy for path planning in changing environment.

CAAI Trans. Intell. Technol., 2016

A Novel Feature Matching Strategy for Large Scale Image Retrieval.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Gender Classification Using Pyramid Segmentation for Unconstrained Back-facing Video Sequences.
