Xin Yu

ORCID: 0000-0002-0269-5649

Affiliations:
  • University of Queensland, Brisbane, Australia
  • University of Technology Sydney, Australia (former)
  • Australian National University, College of Engineering and Computer Science, Canberra, Australia (former)
  • Tsinghua University, Department of Electronic Engineering, Beijing, China (former)


According to our database, Xin Yu authored at least 155 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Detecting Facial Action Units From Global-Local Fine-Grained Expressions.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

CMGNet: Collaborative multi-modal graph network for video captioning.
Comput. Vis. Image Underst., January, 2024

Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations.
IEEE Trans. Multim., 2024

DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition.
IEEE Trans. Image Process., 2024

MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers.
Comput. Animat. Virtual Worlds, 2024

Super-Resolution Multi-Contrast Unbiased Eye Atlases With Deep Probabilistic Refinement.
CoRR, 2024

When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.
Medical Image Anal., December, 2023

Semantic-Aware Contrastive Learning for Multi-Object Medical Image Segmentation.
IEEE J. Biomed. Health Informatics, September, 2023

Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection.
IEEE Trans. Image Process., 2023

Deep Idempotent Network for Efficient Single Image Blind Deblurring.
IEEE Trans. Circuits Syst. Video Technol., 2023

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection.
CoRR, 2023

Text-Guided 3D Face Synthesis - From Generation to Editing.
CoRR, 2023

Towards Open World Active Learning for 3D Object Detection.
CoRR, 2023

CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses.
CoRR, 2023

Divide and Ensemble: Progressively Learning for the Unknown.
CoRR, 2023

DeformUX-Net: Exploring a 3D Foundation Backbone for Medical Image Segmentation with Depthwise Deformable Convolution.
CoRR, 2023

Deep conditional generative models for longitudinal single-slice abdominal computed tomography harmonization.
CoRR, 2023

Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration.
CoRR, 2023

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior.
CoRR, 2023

BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge.
CoRR, 2023

Audio-visual segmentation, sound localization, semantic-aware sounding objects localization.
CoRR, 2023

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation.
CoRR, 2023

Boosting Model Inversion Attacks with Adversarial Examples.
CoRR, 2023

Multi-Contrast Computed Tomography Atlas of Healthy Pancreas.
CoRR, 2023

EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation.
CoRR, 2023

TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles.
CoRR, 2023

Low-frequency Image Deep Steganography: Manipulate the Frequency Distribution to Hide Secrets with Tenacious Robustness.
CoRR, 2023

Proactive Deepfake Defence via Identity Watermarking.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

TI²Net: Temporal Identity Inconsistency Network for Deepfake Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

A Divide-and-conquer Solution to 3D Human Motion Estimation from Raw MoCap Data.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics.
Proceedings of the 31st ACM International Conference on Multimedia, 2023


Unsupervised registration refinement for generating unbiased eye atlas.
Proceedings of the Medical Imaging 2023: Image Processing, 2023

Longitudinal variability analysis on low-dose abdominal CT with deep learning-based segmentation.
Proceedings of the Medical Imaging 2023: Image Processing, 2023

Alleviating tiling effect by random walk sliding window in high-resolution histological whole slide image synthesis.
Proceedings of the Medical Imaging with Deep Learning, 2023

Scaling up 3D Kernels with Bayesian Frequency Re-parameterization for Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Exploring Active 3D Object Detection from a Generalization Perspective.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Efficient Unsupervised Satellite Image-based Building Damage Detection.
Proceedings of the IEEE International Conference on Data Mining, 2023

DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gait Recognition with Mask-based Regularization.
Proceedings of the IEEE International Joint Conference on Biometrics, 2023

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A New Perspective of Weakly Supervised 3D Instance Segmentation via Bounding Boxes.
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

Context-Based Masking for Spontaneous Venous Pulsations Detection.
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

Toward a Unified Framework for RGB and RGB-D Visual Navigation.
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Single-Image Deraining via Recurrent Residual Multiscale Networks.
IEEE Trans. Neural Networks Learn. Syst., 2022

Learning With Noisy Labels via Self-Reweighting From Class Centroids.
IEEE Trans. Neural Networks Learn. Syst., 2022

Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails.
IEEE Trans. Image Process., 2022

Weakly Supervised RGB-D Salient Object Detection With Prediction Consistency Training and Active Scribble Boosting.
IEEE Trans. Image Process., 2022

Understanding Atomic Hand-Object Interaction With Human Intention.
IEEE Trans. Circuits Syst. Video Technol., 2022

Single image based 3D human pose estimation via uncertainty learning.
Pattern Recognit., 2022

Recursive Copy and Paste GAN: Face Hallucination From Shaded Thumbnails.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Geometry-Guided Street-View Panorama Synthesis From Satellite Imagery.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

High Frame Rate Video Reconstruction Based on an Event Camera.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Single Slice Thigh CT Muscle Group Segmentation with Domain Adaptation and Self-Training.
CoRR, 2022

Uncertainty-aware Gait Recognition via Learning from Dirichlet Distribution-based Evidence.
CoRR, 2022

Facial Action Units Detection Aided by Global-Local Expression Embedding.
CoRR, 2022

Adaptive Contrastive Learning with Dynamic Correlation for Multi-Phase Organ Segmentation.
CoRR, 2022

GaitGL: Learning Discriminative Global-Local Feature Representations for Gait Recognition.
CoRR, 2022

Pseudo-Label Guided Multi-Contrast Generalization for Non-Contrast Organ-Aware Segmentation.
CoRR, 2022

GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework.
CoRR, 2022

Characterizing Renal Structures with 3D Block Aggregate Transformers.
CoRR, 2022

Supervised deep generation of high-resolution arterial phase computed tomography kidney substructure atlas.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

Quantification of muscle, bones, and fat on single slice thigh CT.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

Accelerating 2D abdominal organ segmentation with active learning.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

Reducing Positional Variance in Cross-sectional Abdominal CT Slices with Deep Conditional Generative Models.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views.
Proceedings of the Computer Vision - ECCV 2022, 2022

Sign Spotting via Multi-modal Fusion and Testing Time Transferring.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework.
Proceedings of the Computer Vision - ACCV 2022, 2022

CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization.
Proceedings of the Computer Vision - ACCV 2022, 2022

One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Face Hallucination With Finishing Touches.
IEEE Trans. Image Process., 2021

Pyramidal Multiple Instance Detection Network With Mask Guided Self-Correction for Weakly Supervised Object Detection.
IEEE Trans. Image Process., 2021

Progressive Transfer Learning for Face Anti-Spoofing.
IEEE Trans. Image Process., 2021

Joint 3D Human Shape Recovery from A Single Image with Bilayer-Graph.
CoRR, 2021

Attention-Guided Supervised Contrastive Learning for Semantic Segmentation.
CoRR, 2021

Blind Motion Deblurring Super-Resolution: When Dynamic Spatio-Temporal Learning Meets Static Image Understanding.
CoRR, 2021

Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation.
CoRR, 2021

Iterative Optimisation with an Innovation CNN for Pose Refinement.
CoRR, 2021

Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

FDA: Feature Decomposition and Aggregation for Robust Airway Segmentation.
Proceedings of the Domain Adaptation and Representation Transfer, and Affordable Healthcare and AI for Resource Diverse Global Health, 2021

Pancreas CT Segmentation by Predictive Phenotyping.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

A General Approach to State Refinement.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

End-to-end Multi-Instance Robotic Reaching from Monocular Vision.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences.
Proceedings of the 9th International Conference on Learning Representations, 2021

VTNet: Visual Transformer Network for Object Goal Navigation.
Proceedings of the 9th International Conference on Learning Representations, 2021

PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RGB-D Saliency Detection via Cascaded Mutual Information Minimization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Self-Supervised Visibility Learning for Novel View Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Removing Raindrops and Rain Streaks in One Go.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Few-shot Weighted Style Matching for Glaucoma Detection.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Single image portrait relighting via explicit multiple reflectance channel modeling.
ACM Trans. Graph., 2020

Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks.
Int. J. Comput. Vis., 2020

Uncertainty-Aware Deep Calibrated Salient Object Detection.
CoRR, 2020

Learning Effective Representations from Global and Local Features for Cross-View Gait Recognition.
CoRR, 2020

6DoF Object Pose Estimation via Differentiable Proxy Voting Loss.
CoRR, 2020

Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Object Relation Graph and Tentative Policy for Visual Navigation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Weakly-Supervised Salient Object Detection via Scribble Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Copy and Paste GAN: Face Hallucination From Shaded Thumbnails.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Where Am I Looking At? Joint Location and Orientation Estimation by Cross-View Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Transferring Cross-Domain Knowledge for Video Sign Language Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

6DoF Object Pose Estimation via Differentiable Proxy Voting Regularizer.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

When Humans Meet Machines: Towards Efficient Segmentation Networks.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Optimal Feature Transport for Cross-View Image Geo-Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields.
IEEE Trans. Circuits Syst. Video Technol., 2019

Identity-Preserving Face Recovery from Stylized Portraits.
Int. J. Comput. Vis., 2019

Bringing Blurry Alive at High Frame-Rate with an Event Camera.
CoRR, 2019

Recovering Faces From Portraits with Auxiliary Facial Attributes.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

SOSNet: Second Order Similarity Regularization for Local Descriptor Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Bringing a Blurry Frame Alive at High Frame-Rate With an Event Camera.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Residual Multiscale Based Single Image Deraining.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Imagining the Unimaginable Faces by Deconvolutional Networks.
IEEE Trans. Image Process., 2018

PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching.
IEEE Trans. Circuits Syst. Video Technol., 2018

Identity-preserving Face Recovery from Portraits.
CoRR, 2018

Face Super-Resolution Guided by Facial Component Heatmaps.
Proceedings of the Computer Vision - ECCV 2018, 2018

Super-Resolving Very Low-Resolution Face Images With Supplementary Attributes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Face Destylization.
Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Ultra-Resolving Face Images by Discriminative Generative Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Object Tracking With Multi-View Support Vector Machines.
IEEE Trans. Multim., 2015

Multi-local-task learning with global regularization for object tracking.
Pattern Recognit., 2015

Hybrid support vector machines for robust object tracking.
Pattern Recognit., 2015

Self-expressive tracking.
Pattern Recognit., 2015

Removing blur kernel noise via a hybrid ℓp norm.
J. Electronic Imaging, 2015

2014
Efficient Patch-Wise Non-Uniform Deblurring for a Single Image.
IEEE Trans. Multim., 2014

2011
Non-rigid Object Tracking as Salient Region Segmentation and Association.
IEICE Trans. Inf. Syst., 2011

