Xiaobin Hu

Orcid: 0000-0003-3472-988X

According to our database, Xiaobin Hu authored at least 86 papers between 2009 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Towards One-step Causal Video Generation via Adversarial Self-Distillation.

Jiangning Zhang

CoRR, November, 2025

OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research.

CoRR, October, 2025

ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection.

Jiangning Zhang

CoRR, October, 2025

TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement.

Jiangning Zhang

CoRR, October, 2025

IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment.

Jiangning Zhang

CoRR, October, 2025

LLM-Oriented Token-Adaptive Knowledge Distillation.

Jiangning Zhang

CoRR, October, 2025

Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models.

Jiangning Zhang

CoRR, September, 2025

MAS²: Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems.

CoRR, September, 2025

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow.

Jiangning Zhang

CoRR, September, 2025

Anomaly Detection in Medical Images Using Encoder-Attention-2Decoders Reconstruction.

IEEE Trans. Medical Imaging, August, 2025

From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts.

Jiangning Zhang

CoRR, August, 2025

Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling.

Jiangning Zhang

CoRR, August, 2025

StrandDesigner: Towards Practical Strand Generation with Sketch Guidance.

Jiangning Zhang

CoRR, August, 2025

Semantic Frame Interpolation.

Jiangning Zhang

CoRR, July, 2025

HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding.

Jiangning Zhang

CoRR, July, 2025

Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations.

Jiangning Zhang

CoRR, July, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning.

Jiangning Zhang

CoRR, July, 2025

OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography.

CoRR, June, 2025

Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion.

CoRR, May, 2025

Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning.

Jiangning Zhang

CoRR, May, 2025

VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models.

Jiangning Zhang

CoRR, May, 2025

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation.

CoRR, April, 2025

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer.

CoRR, March, 2025

Image Inversion: A Survey from GANs to Diffusion and Beyond.

Jiangning Zhang

CoRR, February, 2025

RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image Segmentation.

Jiangning Zhang

CoRR, January, 2025

CrossVTON: Mimicking the Logic Reasoning on Cross-Category Virtual Try-On Guided by Tri-Zone Priors.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

SVFR: A Unified Framework for Generalized Video Face Restoration.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

CustAny: Customizing Anything from A Single Example.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing.

Jiangning Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring.

IEEE Trans. Circuits Syst. Video Technol., December, 2024

Experimental Study on Strength and Deformation Moduli of Columnar Jointed Rock Mass - Uniaxial Compression as an Example.

Symmetry, 2024

Joint-individual fusion structure with fusion attention module for multi-modal skin cancer classification.

Bjoern H. Menze, Sebastian Krammer

Pattern Recognit., 2024

Can video generation replace cinematographers? Research on the cinematic language of generated video.

Mingliang Xiong

CoRR, 2024

Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration.

Jiangning Zhang

CoRR, 2024

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation.

Jiangning Zhang

CoRR, 2024

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on.

Jiangning Zhang

CoRR, 2024

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding.

CoRR, 2024

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network.

Jiangning Zhang

CoRR, 2024

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection.

Jiangning Zhang

CoRR, 2024

QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge.

Hongwei Bran Li, Fernando Navarro, Amirhossein Bayat, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Benedikt Wiestler, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Michelle Alonso-Basanta, Sabri Can Cetindag, Mustafa A. Elattar, Henkjan Huisman, Arlindo L. Oliveira, Jimut Bahan Pal, Raghavendra Selvan, João Lourenço Silva, Sanjay N. Talbar, Richard K. G. Do, Anton S. Becker, Amber L. Simpson, Ender Konukoglu, Bjoern H. Menze

CoRR, 2024

Open-Vocabulary SAM3D: Understand Any 3D Scene.

Jiangning Zhang

CoRR, 2024

PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models.

Jiangning Zhang

CoRR, 2024

MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model.

Jiangning Zhang

CoRR, 2024

A Dual Memory Hybrid Neural Networks for Modeling and Prediction of Nonlinear Systems.

IEEE Access, 2024

Kinematic Coupling Planning Method for Position and Attitude of Manipulator.

Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024

3D Priors-Guided Diffusion for Blind Face Restoration.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models.

Iaroslav Ponomarenko

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

DiffuMatting: Synthesizing Arbitrary Objects with Matting-Level Annotation.

Jiangning Zhang

Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Hierarchical attention vision transformer for fine-grained visual classification.

J. Vis. Commun. Image Represent., March, 2023

Generative Adversarial Networks for Video Summarization Based on Key-frame Selection.

Inf. Technol. Control., March, 2023

The Liver Tumor Segmentation Benchmark (LiTS).

Patrick Ferdinand Christ, Eugene Vorontsov, Georgios Kaissis, Gabriel Efrain Humpire Mamani, Gabriel Chartrand, Fabian Lohöfer, Julian Walter Holch, Wieland H. Sommer, Alexandre Hostettler, Naama Lev-Cohain, Michal Drozdzal, Michal Marianne Amitai, Anjany Sekuboyina, Fernando Navarro, Johannes C. Paetzold, Suprosanna Shit, Markus Rempfler, Benedikt Wiestler, Christian Hülsemeyer, Florian Ettlinger, Michela Antonelli, Grzegorz Chlebus, Bogdan Georgescu, Xavier Giró-i-Nieto, Jan Hendrik Moltz, Krishna Chaitanya Kaluva, Mahendra Khened, Tomasz K. Konopczynski, Ganapathy Krishnamurthi, John S. Lowengrub, Klaus H. Maier-Hein, Kevis-Kokitsi Maninis, Mathias Perslev, Jordi Pont-Tuset, Ignacio Sarasua, Christian Wachinger, Simon Chun-Ho Yu, Manuel Jorge Cardoso, Volker Heinemann, Christopher Pal, Bram van Ginneken, Hayit Greenspan, Bjoern H. Menze

Medical Image Anal., 2023

SR-R²KAC: Improving Single Image Defocus Deblurring.

CoRR, 2023

High-Resolution Iterative Feedback Network for Camouflaged Object Detection.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Deep Learning for Medical Image Analysis (Deep Learning für die medizinische Bildanalyse)

PhD thesis, 2022

Face Restoration via Plug-and-Play 3D Facial Priors.

Bjoern H. Menze

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models.

CoRR, 2022

High-resolution Iterative Feedback Network for Camouflaged Object Detection.

CoRR, 2022

A robust double-parallel extreme learning machine based on an improved M-estimation algorithm.

Adv. Eng. Informatics, 2022

AutoGAN-Synthesizer: Neural Architecture Search for Cross-Modality MRI Synthesis.

Bjoern H. Menze

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Highly Accurate Dichotomous Image Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

SRGAT: Single Image Super-Resolution With Graph Attention Network.

IEEE Trans. Image Process., 2021

Multi-Texture GAN: Exploring the Multi-Scale Texture Translation for Brain MR Images.

CoRR, 2021

A generative adversarial neural network model for industrial boiler data repair.

Appl. Soft Comput., 2021

Feedback Graph Attention Convolutional Network for MR Images Enhancement by Exploring Self-Similarity Features.

Amirhossein Bayat, Bjoern H. Menze

Proceedings of the Medical Imaging with Deep Learning, 7-9 July 2021, Lübeck, Germany., 2021

Pyramid Architecture Search for Real-Time Image Deblurring.

Bjoern H. Menze

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Coarse-to-Fine Adversarial Networks and Zone-Based Uncertainty Analysis for NK/T-Cell Lymphoma Segmentation in CT/PET Images.

Diana Waldmannstetter, Bjoern H. Menze

IEEE J. Biomed. Health Informatics, 2020

Learning Unsupervised Video Summarization with Semantic-Consistent Network.

Proceedings of the Neural Computing for Advanced Applications, 2020

Face Super-Resolution Guided by 3D Facial Priors.

Bjoern H. Menze

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Knowledge-Aided Convolutional Neural Network for Small Organ Segmentation.

Anjany Sekuboyina, Bjoern H. Menze

IEEE J. Biomed. Health Informatics, 2019

Global quasi-synchronization and global anti-synchronization of delayed neural networks with discontinuous activations via non-fragile control strategy.

Neurocomputing, 2019

Toward a Brain-Inspired System: Deep Recurrent Reinforcement Learning for a Simulated Self-Driving Agent.

Frontiers Neurorobotics, 2019

Towards Brain-inspired System: Deep Recurrent Reinforcement Learning for Simulated Self-driving Agent.

CoRR, 2019

A Dynamic Rectified Linear Activation Units.

IEEE Access, 2019

Spatial-Frequency Non-local Convolutional LSTM Network for pRCC Classification.

Anjany Sekuboyina, Diana Waldmannstetter, Bjoern H. Menze

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

2018

Hierarchical multi-class segmentation of glioma images using networks with multi-level activation function.

CoRR, 2018

Hierarchical Multi-class Segmentation of Glioma Images Using Networks with Multi-level Activation Function.

Bjoern H. Menze

Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

2011

Design of multi-channel signal acquisition module for separate storage.

Proceedings of the International Conference on Electronic and Mechanical Engineering and Information Technology, 2011

2010

Application of inertia ellipse in code marker matching.

Geo spatial Inf. Sci., 2010

2009

An Efficient Antenna Selection Algorithm for MIMO Systems.

Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009