Xiaoyun Zhang

Orcid: 0000-0001-7680-4062

Affiliations:
  • Shanghai Jiao Tong University, School of Electronic Information and Electrical Engineering, Shanghai, China (PhD)


According to our database1, Xiaoyun Zhang authored at least 100 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming With Edge Computing.
IEEE J. Sel. Areas Commun., July, 2025

Robust ID-Specific Face Restoration via Alignment Learning.
CoRR, July, 2025

Controllable Image Colorization with Instance-aware Texts and Masks.
CoRR, May, 2025

TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration.
CoRR, March, 2025

Serial Low-rank Adaptation of Vision Transformer.
CoRR, March, 2025

MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FineVQ: Fine-Grained User Generated Content Video Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025


VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration.
CoRR, 2024

HPC: Hierarchical Progressive Coding Framework for Volumetric Video.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

JOINTRF: End-To-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression.
Proceedings of the IEEE International Conference on Image Processing, 2024

Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models.
CoRR, 2023

Guiding Text-to-Image Diffusion Model Towards Grounded Generation.
CoRR, 2023

Open-vocabulary Object Segmentation with Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boost Video Frame Interpolation via Motion Adaptation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Deep Fourier Kernel Exploitation in Blind Image Super-Resolution.
Proceedings of the Digital Multimedia Communications - The 19th International Forum, 2022

Task Decoupled Framework for Reference-based Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LAR-SR: A Local Autoregressive Model for Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Face Restoration via Learnable Cross-Quality Shift.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

A Simple Plugin for Transforming Images to Arbitrary Scales.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Boundary-Aware Supervoxel-Level Iteratively Refined Interactive 3D Image Segmentation With Multi-Agent Reinforcement Learning.
IEEE Trans. Medical Imaging, 2021

Two-Stream Compare and Contrast Network for Vertebral Compression Fracture Diagnosis.
IEEE Trans. Medical Imaging, 2021

An End-to-End Learning Framework for Video Compression.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization.
CoRR, 2021

Unsupervised Segmentation Framework with Active Contour Models for Cine Cardiac MRI.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

CaT: Weakly Supervised Object Detection with Category Transfer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Deep Non-Local Kalman Network for Video Compression Artifact Reduction.
IEEE Trans. Image Process., 2020

End-to-End Optimized ROI Image Compression.
IEEE Trans. Image Process., 2020

Viewport-adaptive 360-degree video coding.
Multim. Tools Appl., 2020

In-loop perceptual model-based rate-distortion optimization for HEVC real-time encoder.
J. Real Time Image Process., 2020

Universal Model for 3D Medical Image Analysis.
CoRR, 2020

Semi-supervised Segmentation with Self-training Based on Quality Estimation and Refinement.
Proceedings of the Machine Learning in Medical Imaging - 11th International Workshop, 2020

Dual-Task Self-supervision for Cross-modality Domain Adaptation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Error Attention Interactive Segmentation of Medical Image Through Matting and Fusion.
Proceedings of the Machine Learning in Medical Imaging - 11th International Workshop, 2020

Adversarial Text Image Super-Resolution using Sinkhorn Distance.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Content Adaptive and Error Propagation Aware Deep Video Compression.
Proceedings of the Computer Vision - ECCV 2020, 2020

Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
KalmanFlow 2.0: Efficient Video Optical Flow Estimation via Context-Aware Kalman Filtering.
IEEE Trans. Image Process., 2019

Efficient Variable Rate Image Compression With Multi-Scale Decomposition Network.
IEEE Trans. Circuits Syst. Video Technol., 2019

FPGA Based Video Transcoding System with 2K-4K Super-Resolution Conversion.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

A Novel Deep Progressive Image Compression Framework.
Proceedings of the Picture Coding Symposium, 2019

Weakly Supervised Segmentation of Vertebral Bodies with Iterative Slice-Propagation.
Proceedings of the Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data, 2019

Learning-Based Bone Quality Classification Method for Spinal Metastasis.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

Children's Neuroblastoma Segmentation Using Morphological Features.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

Spatial Regularized Classification Network for Spinal Dislocation Diagnosis.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

Large Scale Near-Duplicate Image Retrieval via Patch Embedding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

DVC: An End-To-End Deep Video Compression Framework.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Depth-Aware Video Frame Interpolation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Lasso Regression Based Channel Pruning for Efficient Object Detection Model.
Proceedings of the 2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2019

2018
Novel Integration of Frame Rate Up Conversion and HEVC Coding Based on Rate-Distortion Optimization.
IEEE Trans. Image Process., 2018

High-Order Model and Dynamic Filtering for Frame Rate Up-Conversion.
IEEE Trans. Image Process., 2018

A Wavelet-based Learning for Face Hallucination with Loop Architecture.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Enhancing HEVC Compressed Videos with a Partition-Masked Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

KalmanFlow: Efficient Kalman Filtering for Video Optical Flow.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Rcdfnn: Robust Change Detection Based on Convolutional Fusion Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Kalman Filtering Network for Video Compression Artifact Reduction.
Proceedings of the Computer Vision - ECCV 2018, 2018

Object Detection Implementation and Optimization on Embedded GPU System.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

Video Stitching Based on Optical Flow.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

2017
Complexity control algorithm based on adaptive mode selection for interframe coding in high efficiency video coding.
J. Electronic Imaging, 2017

Edge guided salient object detection.
Neurocomputing, 2017

Spatiotemporal salient object detection based on distance transform and energy optimization.
Neurocomputing, 2017

Weld Defect Images Classification with VGG16-Based Neural Network.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Iterative convolutional neural network for noisy image super-resolution.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A novel frame rate up conversion using iterative non-local means interpolation.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Moving segmentation in HEVC compressed domain based on logistic regression.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

High-quality and real-time frame interpolation on heterogeneous computing system.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

2016
Neyman-Pearson-Based Early Mode Decision for HEVC Encoding.
IEEE Trans. Multim., 2016

Principal Component Analysis-Based Visual Saliency Detection.
IEEE Trans. Broadcast., 2016

Buffer structure optimized VLSI architecture for efficient hierarchical integer pixel motion estimation implementation.
J. Real Time Image Process., 2016

Frame rate up-conversion based on motion-region segmentation.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Scrolling Subtitle Processing of Frame Rate Up-Conversion Based on Global Text Motion Estimation.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

Parallel-Friendly Frame Rate Up-Conversion Algorithm Based on Patch Match.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

Fast Intra Coding in an Efficient HEVC Multi-rate Encoding System Based on x265.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

An Efficient Optimization of Real-Time AVS+ Encoder in Low Bitrate Condition.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

A New Source-Side De-interlacing Method for High Quality Video Content.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

Principal components analysis-based visual saliency detection.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Fast mode and depth decision algorithm for HEVC intra coding based on characteristics of coding bits.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

A complexity scalable mode decision algorithm for interframe coding in HEVC.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

Improved rate distortion optimized quantization for HEVC with adaptive thresholding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

Fast HEVC intra mode decision based on logistic regression classification.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

2015
Efficient SAO coding algorithm for x265 encoder.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Early SKIP mode decision based on Bayesian model for HEVC.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

2014
Frame Rate Up-Conversion Method for Video Processing Applications.
IEEE Trans. Broadcast., 2014

Effective H.264/AVC to HEVC transcoder based on prediction homogeneity.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Analysis and optimization of x265 encoder.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Implementation and improvement of Wavefront Parallel Processing for HEVC encoding on many-core platform.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Efficient parallel-friendly rate control for real-time UHD video encoder on many-core platform.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Prediction unit depth selection based on statistic distribution for HEVC intra coding.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Fast H.264/AVC to HEVC transcoding based on residual homogeneity.
Proceedings of the International Conference on Audio, 2014

2013
A new Local-Main-Gradient-Orientation HOG and contour differences based algorithm for object classification.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Effective early termination using adaptive search order for frame rate up-conversion.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Occlusion handling frame rate up-conversion.
Proceedings of the IEEE International Conference on Acoustics, 2013

Stationary subtitle processing for real-time frame rate up-conversion.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2013

2012
Principal Components Analysis-Based Edge-Directed Image Interpolation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012


  Loading...