Xiaobin Hu

Orcid: 0000-0002-5764-3096

According to our database1, Xiaobin Hu authored at least 76 papers between 2009 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Anomaly Detection in Medical Images Using Encoder-Attention-2Decoders Reconstruction.
IEEE Trans. Medical Imaging, August, 2025

From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts.
CoRR, August, 2025

Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling.
CoRR, August, 2025

StrandDesigner: Towards Practical Strand Generation with Sketch Guidance.
CoRR, August, 2025

Semantic Frame Interpolation.
CoRR, July, 2025

HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding.
CoRR, July, 2025

Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations.
CoRR, July, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning.
CoRR, July, 2025

OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography.
CoRR, June, 2025

Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion.
CoRR, May, 2025

Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning.
CoRR, May, 2025

VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models.
CoRR, May, 2025

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation.
CoRR, April, 2025

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer.
CoRR, March, 2025

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors.
CoRR, February, 2025

Image Inversion: A Survey from GANs to Diffusion and Beyond.
CoRR, February, 2025

RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image Segmentation.
CoRR, January, 2025

SVFR: A Unified Framework for Generalized Video Face Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

CustAny: Customizing Anything from A Single Example.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

Joint-individual fusion structure with fusion attention module for multi-modal skin cancer classification.
Pattern Recognit., 2024

Can video generation replace cinematographers? Research on the cinematic language of generated video.
CoRR, 2024

Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration.
CoRR, 2024

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation.
CoRR, 2024

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on.
CoRR, 2024

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding.
CoRR, 2024

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network.
CoRR, 2024

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection.
CoRR, 2024

QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge.
CoRR, 2024

Open-Vocabulary SAM3D: Understand Any 3D Scene.
CoRR, 2024

PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models.
CoRR, 2024

MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model.
CoRR, 2024

A Dual Memory Hybrid Neural Networks for Modeling and Prediction of Nonlinear Systems.
IEEE Access, 2024

Kinematic Coupling Planning Method for Position and Attitude of Manipulator.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024

3D Priors-Guided Diffusion for Blind Face Restoration.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

DiffuMatting: Synthesizing Arbitrary Objects with Matting-Level Annotation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Hierarchical attention vision transformer for fine-grained visual classification.
J. Vis. Commun. Image Represent., March, 2023

Generative Adversarial Networks for Video Summarization Based on Key-frame Selection.
Inf. Technol. Control., March, 2023

The Liver Tumor Segmentation Benchmark (LiTS).
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Medical Image Anal., 2023

SR-R<sup>2</sup>KAC: Improving Single Image Defocus Deblurring.
CoRR, 2023

High-Resolution Iterative Feedback Network for Camouflaged Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Deep Learning for Medical Image Analysis (Deep Learning für die medizinische Bildanalyse)
PhD thesis, 2022

Face Restoration via Plug-and-Play 3D Facial Priors.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models.
CoRR, 2022

High-resolution Iterative Feedback Network for Camouflaged Object Detection.
CoRR, 2022

A robust double-parallel extreme learning machine based on an improved M-estimation algorithm.
Adv. Eng. Informatics, 2022

AutoGAN-Synthesizer: Neural Architecture Search for Cross-Modality MRI Synthesis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Highly Accurate Dichotomous Image Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
SRGAT: Single Image Super-Resolution With Graph Attention Network.
IEEE Trans. Image Process., 2021

Multi-Texture GAN: Exploring the Multi-Scale Texture Translation for Brain MR Images.
CoRR, 2021

A generative adversarial neural network model for industrial boiler data repair.
Appl. Soft Comput., 2021

Feedback Graph Attention Convolutional Network for MR Images Enhancement by Exploring Self-Similarity Features.
Proceedings of the Medical Imaging with Deep Learning, 7-9 July 2021, Lübeck, Germany., 2021

Pyramid Architecture Search for Real-Time Image Deblurring.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Coarse-to-Fine Adversarial Networks and Zone-Based Uncertainty Analysis for NK/T-Cell Lymphoma Segmentation in CT/PET Images.
IEEE J. Biomed. Health Informatics, 2020

Learning Unsupervised Video Summarization with Semantic-Consistent Network.
Proceedings of the Neural Computing for Advanced Applications, 2020

Face Super-Resolution Guided by 3D Facial Priors.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Knowledge-Aided Convolutional Neural Network for Small Organ Segmentation.
IEEE J. Biomed. Health Informatics, 2019

Global quasi-synchronization and global anti-synchronization of delayed neural networks with discontinuous activations via non-fragile control strategy.
Neurocomputing, 2019

Toward a Brain-Inspired System: Deep Recurrent Reinforcement Learning for a Simulated Self-Driving Agent.
Frontiers Neurorobotics, 2019

Towards Brain-inspired System: Deep Recurrent Reinforcement Learning for Simulated Self-driving Agent.
CoRR, 2019

A Dynamic Rectified Linear Activation Units.
IEEE Access, 2019

Spatial-Frequency Non-local Convolutional LSTM Network for pRCC Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

2018
Hierarchical multi-class segmentation of glioma images using networks with multi-level activation function.
CoRR, 2018

Hierarchical Multi-class Segmentation of Glioma Images Using Networks with Multi-level Activation Function.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

2011
Design of multi-channel signal acquisition module for separate storage.
Proceedings of the International Conference on Electronic and Mechanical Engineering and Information Technology, 2011

2010
Application of inertia ellipse in code marker matching.
Geo spatial Inf. Sci., 2010

2009
An Efficient Antenna Selection Algorithm for MIMO Systems.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009


  Loading...