Hao Tang

Ting Huang

Zeyu Zhang

CoRR, January, 2026

WebCryptoAgent: Agentic Crypto Trading with Web Informatics.

[BibT_eX]

[DOI]

CoRR, January, 2026

MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing.

[BibT_eX]

[DOI]

CoRR, January, 2026

Knowledge-Enhanced Dynamic Scene Graph Attention Network for Fake News Video Detection.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

AAGFormer: A self-adaptive graph-transformer synergy with topological normalization for 3D human pose estimation.

[BibT_eX]

[DOI]

Xing Liu

Image Vis. Comput., 2026

ReactionMamba: Generating Short & Long Human Reaction Sequences.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Conference on Automatic Face and Gesture Recognition, 2026

TR-DQ: Time-Rotation Diffusion Quantization.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

GRADRobot: Geometry-Aware Rendering with Articulation and Diffusion for Robot Modeling.

[BibT_eX]

[DOI]

Proceedings of the International Conference on 3D Visio, 2026

3D CoCa: Contrastive Learners are 3D Captioners.

[BibT_eX]

[DOI]

Proceedings of the International Conference on 3D Visio, 2026

2025

Dual Attention Guidance Network for Self-Supervised Monocular Depth Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., December, 2025

TwinAligner: Visual-Dynamic Alignment Empowers Physics-aware Real2Sim2Real for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, December, 2025

DragMesh: Interactive 3D Generation Made Easy.

[BibT_eX]

[DOI]

Tianshan Zhang

Zeyu Zhang

CoRR, December, 2025

EgoLCD: Egocentric Video Generation with Long Context Diffusion.

[BibT_eX]

[DOI]

CoRR, December, 2025

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving.

[BibT_eX]

[DOI]

Zhaohui Wang

Tengbo Yu

CoRR, November, 2025

Alias-free 4D Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, November, 2025

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots.

[BibT_eX]

[DOI]

CoRR, November, 2025

EvoVLA: Self-Evolving Vision-Language-Action Model.

[BibT_eX]

[DOI]

CoRR, November, 2025

Dual-Path Transformer-Based GAN for Co-speech Gesture Synthesis.

[BibT_eX]

[DOI]

Int. J. Soc. Robotics, October, 2025

VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery.

[BibT_eX]

[DOI]

CoRR, October, 2025

AutoViT: Achieving Real-Time Vision Transformers on Mobile via Latency-aware Coarse-to-Fine Search.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., September, 2025

Fidelity-Aware Data Composition for Robust Robot Generalization.

[BibT_eX]

[DOI]

CoRR, September, 2025

UniVid: The Open-Source Unified Video Model.

[BibT_eX]

[DOI]

CoRR, September, 2025

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes.

[BibT_eX]

[DOI]

CoRR, September, 2025

Nav-R1: Reasoning and Navigation in Embodied Scenes.

[BibT_eX]

[DOI]

CoRR, September, 2025

Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis.

[BibT_eX]

[DOI]

CoRR, September, 2025

Multimodal Data Storage and Retrieval for Embodied AI: A Survey.

[BibT_eX]

[DOI]

Yihao Lu

CoRR, August, 2025

RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory.

[BibT_eX]

[DOI]

CoRR, August, 2025

ReMoMask: Retrieval-Augmented Masked Motion Generation.

[BibT_eX]

[DOI]

CoRR, August, 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding.

[BibT_eX]

[DOI]

Ting Huang

Zeyu Zhang

CoRR, July, 2025

UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing.

[BibT_eX]

[DOI]

CoRR, July, 2025

Graph-based Multi-Modal Interaction Lightweight Network for Brain Tumor Segmentation (GMLN-BTS) in Edge Iterative MRI Lesion Localization System (EdgeIMLocSys).

[BibT_eX]

[DOI]

Guohao Huo

Ruiting Dai

CoRR, July, 2025

ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models.

[BibT_eX]

[DOI]

Chihan Huang

CoRR, July, 2025

Hierarchical Distribution-Based Exemplar Replay for Incremental SAR Automatic Target Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Aerosp. Electron. Syst., June, 2025

Style Transfer: A Decade Survey.

[BibT_eX]

[DOI]

Tianshan Zhang

CoRR, June, 2025

Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration.

[BibT_eX]

[DOI]

CoRR, June, 2025

Resolving Task Objective Conflicts in Unified Multimodal Understanding and Generation via Task-Aware Mixture-of-Experts.

[BibT_eX]

[DOI]

Jiaxing Zhang

Xinyi Zeng

CoRR, June, 2025

FOLIAGE: Towards Physical Intelligence World Models Via Unbounded Surface Evolution.

[BibT_eX]

[DOI]

Xiaoyi Liu

CoRR, June, 2025

Enhanced Multi-Scale Cross-Attention for Person Image Generation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation.

[BibT_eX]

[DOI]

CoRR, May, 2025

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams.

[BibT_eX]

[DOI]

CoRR, May, 2025

SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation.

[BibT_eX]

[DOI]

Guohao Huo

Ruiting Dai

CoRR, May, 2025

CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation.

[BibT_eX]

[DOI]

Chihan Huang

CoRR, May, 2025

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image.

[BibT_eX]

[DOI]

CoRR, May, 2025

Structured Agent Distillation for Large Language Model.

[BibT_eX]

[DOI]

CoRR, May, 2025

TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., April, 2025

Multimodal Large Language Models for Medicine: A Comprehensive Survey.

[BibT_eX]

[DOI]

Jiarui Ye

CoRR, April, 2025

TTTFusion: A Test-Time Training-Based Strategy for Multimodal Medical Image Fusion in Surgical Robots.

[BibT_eX]

[DOI]

Qinhua Xie

CoRR, April, 2025

DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification.

[BibT_eX]

[DOI]

CoRR, April, 2025

Cabbage: A Differential Growth Framework for Open Surfaces.

[BibT_eX]

[DOI]

Xiaoyi Liu

CoRR, April, 2025

Multimodal Perception for Goal-oriented Navigation: A Survey.

[BibT_eX]

[DOI]

I-Tak Ieong

CoRR, April, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, April, 2025

Wakeup-Darkness: When Multimodal Meets Unsupervised Low-Light Image Enhancement.

[BibT_eX]

[DOI]

Abdulmotaleb El Saddik

ACM Trans. Multim. Comput. Commun. Appl., March, 2025

Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance.

[BibT_eX]

[DOI]

CoRR, March, 2025

Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming.

[BibT_eX]

[DOI]

Jiaxuan Zhu

CoRR, March, 2025

TR-DQ: Time-Rotation Diffusion Quantization.

[BibT_eX]

[DOI]

CoRR, March, 2025

When Continue Learning Meets Multimodal Large Language Model: A Survey.

[BibT_eX]

[DOI]

Yukang Huo

CoRR, March, 2025

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface.

[BibT_eX]

[DOI]

CoRR, March, 2025

Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation.

[BibT_eX]

[DOI]

CoRR, February, 2025

FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation.

[BibT_eX]

[DOI]

CoRR, February, 2025

RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2.

[BibT_eX]

[DOI]

CoRR, February, 2025

Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation.

[BibT_eX]

[DOI]

CoRR, February, 2025

UDiTQC: U-Net-Style Diffusion Transformer for Quantum Circuit Synthesis.

[BibT_eX]

[DOI]

Zhiwei Chen

CoRR, January, 2025

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation.

[BibT_eX]

[DOI]

CoRR, January, 2025

Boosting Adversarial Transferability with Spatial Adversarial Alignment.

[BibT_eX]

[DOI]

CoRR, January, 2025

Hierarchical Cross-Attention Network for Virtual Try-On.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

A pure MLP-Mixer-based GAN framework for guided image translation.

[BibT_eX]

[DOI]

Bin Ren

Pattern Recognit., 2025

GraphMLP: A graph MLP-like architecture for 3D human pose estimation.

[BibT_eX]

[DOI]

Pattern Recognit., 2025

BCDPose: Diffusion-based 3D Human Pose Estimation with bone-chain prior knowledge.

[BibT_eX]

[DOI]

Xing Liu

Image Vis. Comput., 2025

Generalization-preserving adaptation of vision-language models for open-vocabulary segmentation.

[BibT_eX]

[DOI]

Zhen Chen

Shiliang Zhang

Comput. Vis. Image Underst., 2025

Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AccidentBlip: Agent of Accident Warning Based on MA-Former.

[BibT_eX]

[DOI]

Proceedings of the IEEE Intelligent Vehicles Symposium, 2025

CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

FairSMOE: Mitigating Multi-Attribute Fairness Problem with Sparse Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

In-Context Meta LoRA Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution.

[BibT_eX]

[DOI]

Zihang Liu

Zhenyu Zhang

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Toward Zero-Shot Learning for Visual Dehazing of Urological Surgical Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

MaskSAM: Auto-Prompt SAM with Mask Classification for Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MambaIC: State Space Models for High-Performance Learned Image Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DiffFNO: Diffusion Fourier Neural Operator.

[BibT_eX]

[DOI]

Xiaoyi Liu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Stable-Hair: Real-World Hair Transfer via Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Physical Adversarial Attack Meets Computer Vision: A Decade Survey.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Shadclips: When Parameter-Efficient Fine-Tuning with Multimodal Meets Shadow Removal.

[BibT_eX]

[DOI]

Int. J. Pattern Recognit. Artif. Intell., December, 2024

Graph Transformer GANs With Graph Masked Modeling for Architectural Layout Generation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Toward High-Quality HDR Deghosting With Conditional Diffusion Models.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2024

Cloth Interactive Transformer for Virtual Try-On.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

ControlFace: Feature Disentangling for Controllable Face Swapping.

[BibT_eX]

[DOI]

J. Imaging, January, 2024

Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

PolSAM: Polarimetric Scattering Mechanism Informed Segment Anything Model.

[BibT_eX]

[DOI]

CoRR, 2024

Network Inversion and Its Applications.

[BibT_eX]

[DOI]

Pirzada Suhail

Amit Sethi

CoRR, 2024

Multimodal Alignment and Fusion: A Survey.

[BibT_eX]

[DOI]

Songtao Li

CoRR, 2024

Text-to-Image Synthesis: A Decade Survey.

[BibT_eX]

[DOI]

Nonghai Zhang

CoRR, 2024

AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations.

[BibT_eX]

[DOI]

CoRR, 2024

KMM: Key Frame Mask Mamba for Extended Motion Generation.

[BibT_eX]

[DOI]

CoRR, 2024

GWQ: Gradient-Aware Weight Quantization for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

M<sup>2</sup>M: Learning controllable Multi of experts and multi-scale operators are the Partial Differential Equations need.

[BibT_eX]

[DOI]

CoRR, 2024

Brain Tumor Classification on MRI in Light of Molecular Markers.

[BibT_eX]

[DOI]

CoRR, 2024

Data-Free Class Incremental Gesture Recognition via Synthetic Feature Sampling.

[BibT_eX]

[DOI]

Zhenyu Lu

CoRR, 2024

Barbie: Text to Barbie-Style 3D Avatars.

[BibT_eX]

[DOI]

CoRR, 2024

InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation.

[BibT_eX]

[DOI]

CoRR, 2024

From Redundancy to Relevance: Enhancing Explainability in Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey on Multimodal Wearable Sensor-based Human Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2024

MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Pruning of Large Language Model with Adaptive Estimation Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.

[BibT_eX]

[DOI]

CoRR, 2024

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM.

[BibT_eX]

[DOI]

CoRR, 2024

Machine learning and human-machine trust in healthcare: A systematic survey.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2024

Edge-guided representation learning for underwater object detection.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2024

Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Bipartite Graph Diffusion Model for Human Interaction Generation.

[BibT_eX]

[DOI]

Baptiste Chopin

Mohamed Daoudi

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Revisiting Adversarial Patches for Designing Camera-Agnostic Attacks against Person Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CoIn: A Lightweight and Effective Framework for Story Visualization and Continuation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Monocular Expressive 3D Human Reconstruction of Multiple People.

[BibT_eX]

[DOI]

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the 38th ACM International Conference on Supercomputing, 2024

Audio-Visual Navigation with Anti-Backtracking.

[BibT_eX]

[DOI]

Zhenghao Zhao

Yan Yan

Proceedings of the Pattern Recognition - 27th International Conference, 2024

Adaptive Cross-Architecture Mutual Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Motion Mamba: Efficient and Long Sequence Motion Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

GiT: Towards Generalist Vision Transformer Through Universal Language Interface.

[BibT_eX]

[DOI]

Muhammad Ferjad Naeem

Hongsheng Li

Bernt Schiele

Liwei Wang

Proceedings of the Computer Vision - ECCV 2024, 2024

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Dataset Growth.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

InstructGIE: Towards Generalizable Image Editing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Versatile Navigation Under Partial Observability via Value-Guided Diffusion Policy.

[BibT_eX]

[DOI]

Gengyu Zhang

Yan Yan

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On the Faithfulness of Vision Transformer Explanations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Token Transformation Matters: Towards Faithful Post-Hoc Explanation for Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Online Real-Time Memory-based Video Inpainting Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Distilling ODE Solvers of Diffusion Models into Smaller Steps.

[BibT_eX]

[DOI]

Sanghwan Kim

Fisher Yu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Robust 3D Pose Transfer with Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MS-UMLP: Medical Image Segmentation via Multi-Scale U-shape MLP-Mixer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2024, 2024

G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

STRFormer: Spatial-Temporal-ReTemporal Transformer for 3D human pose estimation.

[BibT_eX]

[DOI]

Xing Liu

Image Vis. Comput., December, 2023

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis.

[BibT_eX]

[DOI]

Mach. Intell. Res., December, 2023

On-device audio-visual multi-person wake word spotting.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., December, 2023

Measuring the Consistency and Diversity of 3D Face Generation.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., November, 2023

Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., October, 2023

Interactive Neural Painting.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., October, 2023

Multi-hypothesis representation learning for transformer-based 3D human pose estimation.

[BibT_eX]

[DOI]

Pattern Recognit., September, 2023

AO2-DETR: Arbitrary-Oriented Object Detection Transformer.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2023

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

[BibT_eX]

[DOI]

Philip H. S. Torr

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., April, 2023

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., March, 2023

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Bidirectional Transformer GAN for Long-term Human Motion Prediction.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Unsupervised Key Frame Extraction for Efficient Video Classification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation.

[BibT_eX]

[DOI]

Xavier Alameda-Pineda

Elisa Ricci

IEEE Trans. Multim., 2023

Cross-View Panorama Image Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Interaction Transformer for Human Reaction Generation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

3D-Aware Video Generation.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Adaptive Convolutional Subspace Reasoning Network for Few-Shot SAR Target Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

Transductive Prototypical Attention Reasoning Network for Few-Shot SAR Target Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

Local and Global GANs With Semantic-Aware Upsampling for Image Generation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Towards High-quality HDR Deghosting with Conditional Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images.

[BibT_eX]

[DOI]

CoRR, 2023

Reversible Graph Neural Network-based Reaction Distribution Learning for Multiple Appropriate Facial Reactions Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Few-shot Medical Image Segmentation with Cycle-resemblance Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Does Graph Distillation See Like Vision Dataset Counterpart?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LART: Neural Correspondence Learning with Latent Regularization Transformer for 3D Motion Transfer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Level Lottery Ticket Hypothesis for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

RZCR: Zero-shot Character Recognition via Radical-based Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

SpeedDETR: Speed-aware Transformers for End-to-end Object Detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TINYCOD: Tiny and Effective Model for Camouflaged Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

MLP-GAN for Brain Vessel Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Master: Meta Style Transformer for Controllable Zero-Shot and Few-Shot Artistic Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LSDIR: A Large Scale Dataset for Image Restoration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 34th British Machine Vision Conference 2023, 2023

HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

DE-net: Dynamic Text-Guided Image Editing Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Towards Real-Time Segmentation on the Edge.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Quasi-Equilibrium Feature Pyramid Network for Salient Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Supervised Multi-Scale Attention-Guided Ship Detection in Optical Remote Sensing Images.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2022

Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2022

Facial Expression Translation Using Landmark Guided GANs.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2022

Cross-view panorama image synthesis with progressive attention GANs.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

PB-GCN: Progressive binary graph convolutional networks for skeleton-based action recognition.

[BibT_eX]

[DOI]

Neurocomputing, 2022

The Lottery Ticket Hypothesis for Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2022

Physical Adversarial Attack meets Computer Vision: A Decade Survey.

[BibT_eX]

[DOI]

CoRR, 2022

Training and Tuning Generative Neural Radiance Fields for Attribute-Conditional 3D-Aware Face Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation.

[BibT_eX]

[DOI]

CoRR, 2022

REZCR: A Zero-shot Character Recognition Method via Radical Extraction.

[BibT_eX]

[DOI]

CoRR, 2022

Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

CoRR, 2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis.

[BibT_eX]

[DOI]

CoRR, 2022

RCRN: Real-world Character Image Restoration Network via Skeleton Extraction.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CharFormer: A Glyph Fusion based Attentive Framework for High-precision Character Image Denoising.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Real-Time Portrait Stylization on the Edge.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Cloth-Irrelevant Harmonious Attention Network for Cloth-Changing Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Unsupervised Domain Adaptation Person Re-Identification by Camera-Aware Style Decoupling and Uncertainty Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Graph-Based Generative Face Anonymisation with Pose Preservation.

[BibT_eX]

[DOI]

Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022

3D-Aware Semantic-Guided Generative Model for Human Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Interpretable Video Super-Resolution via Alternating Optimization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

FPGA-aware automatic acceleration framework for vision transformer with mixed-scheme quantization: late breaking results.

[BibT_eX]

[DOI]

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Restore 3D Face from In-the-Wild Degraded Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Physically-guided Disentangled Implicit Rendering for 3D Face Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking.

[BibT_eX]

[DOI]

Yidi Li

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Layout-to-Image Translation With Double Pooling Generative Adversarial Networks.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images.

[BibT_eX]

[DOI]

Lei Ding

Lorenzo Bruzzone

IEEE Trans. Geosci. Remote. Sens., 2021

Structured discriminative tensor dictionary learning for unsupervised domain adaptation.

[BibT_eX]

[DOI]

Neurocomputing, 2021

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning.

[BibT_eX]

[DOI]

CoRR, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.

[BibT_eX]

[DOI]

CoRR, 2021

Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2021

Looking Outside the Window: Wider-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images.

[BibT_eX]

[DOI]

CoRR, 2021

Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization.

[BibT_eX]

[DOI]

CoRR, 2021

Transformer-Based Source-Free Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2021

Cloth Interactive Transformer for Virtual Try-On.

[BibT_eX]

[DOI]

CoRR, 2021

Transformers Solve the Limited Receptive Field for Monocular Depth Prediction.

[BibT_eX]

[DOI]

CoRR, 2021

Adversarial Shape Learning for Building Extraction in VHR Remote Sensing Images.

[BibT_eX]

[DOI]

CoRR, 2021

Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Cross-View Exocentric to Egocentric Video Synthesis.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Highly Efficient Natural Image Matting.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation.

[BibT_eX]

[DOI]

Bin Ren

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

AniFormer: Data-driven 3D Animation with Transformer.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Relevant region prediction for crowd counting.

[BibT_eX]

[DOI]

Neurocomputing, 2020

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

[BibT_eX]

[DOI]

CoRR, 2020

Cross-View Image Synthesis with Deformable Convolution and Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Attention GANs for Semantic Image Synthesis.

[BibT_eX]

[DOI]

Song Bai

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

XingGAN for Person Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.

[BibT_eX]

[DOI]

Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019

Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion.

[BibT_eX]

[DOI]

Neurocomputing, 2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.

[BibT_eX]

[DOI]

CoRR, 2019

Improving Semantic Segmentation of Aerial Images Using Patch-based Attention.

[BibT_eX]

[DOI]

Lei Ding

Lorenzo Bruzzone

CoRR, 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2019

Deep Micro-Dictionary Learning and Coding Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2019

Joint Learning of Self-Representation and Indicator for Multi-View Image Clustering.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expression Conditional Gan for Facial Expression-to-Expression Translation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Attribute-Guided Sketch Generation.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

2016

Sequential Bag-of-Words model for human action classification.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2016

Adaptive Region Boosting method with biased entropy for path planning in changing environment.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2016

A Novel Feature Matching Strategy for Large Scale Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Gender Classification Using Pyramid Segmentation for Unconstrained Back-facing Video Sequences.

[BibT_eX]

[DOI]