Rong Xie

Orcid: 0000-0002-8261-5337

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database, Rong Xie authored at least 153 papers between 2007 and 2025.

Bibliography

2025
SSP-IR: Semantic and Structure Priors for Diffusion-Based Realistic Image Restoration.
IEEE Trans. Circuits Syst. Video Technol., July, 2025

Implicit-Explicit Integrated Representations for Multi-View Video Compression.
IEEE Trans. Image Process., 2025

SemanticColorizer: Improving color semantic rationality for vivid automatic image colorization.
Displays, 2025

Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

OL_LaEC: An Online Learning Data Scheduler for Multi-site Parallel Downloading.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2025

2024
ViCoFace: Learning Disentangled Latent Motion Representations for Visual-Consistent Face Reenactment.
ACM Trans. Multim. Comput. Commun. Appl., December, 2024

A Character Position-Aware Compression Framework for Screen Text Image.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Depth-Guided Robust Point Cloud Fusion NeRF for Sparse Input Views.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network.
IEEE Trans. Multim., 2024

Subjective and objective quality evaluation of UGC video after encoding and decoding.
Displays, 2024

Face De-identification: State-of-the-art Methods and Comparative Studies.
CoRR, 2024

PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation.
CoRR, 2024

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration.
CoRR, 2024

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration.
CoRR, 2024

Efficient Bitrate Ladder Construction for Per-Shot Adaptive Encoding.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Pioneer: Offline Reinforcement Learning based Bandwidth Estimation for Real-Time Communication.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

Visibility-Aware Human Mesh Recovery via Balancing Dense Correspondence and Probability Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

A New People-Object Interaction Dataset and NVS Benchmarks.
Proceedings of the IEEE International Conference on Image Processing, 2024

HDRTVFormer: Efficient SDRTV-to-HDRTV via Affine Transformation and Spatial-Aware Transformer.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Disentangled Clothed Avatar Generation from Text Descriptions.
Proceedings of the Computer Vision - ECCV 2024, 2024

Identity-Consistent Video De-identification via Diffusion Autoencoders.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

Detailed and Controllable Old Photo Restoration with Diffusion Priors.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

LaEC: Loss-aware Earliest Completion Data Scheduler for Multi-site Parallel Downloading.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

No-reference Quality Assessment of Text-to-Image Generation.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Face De-identification: Safeguarding Identities in the Digital Era
Springer, ISBN: 978-3-031-58221-9, 2024

2023
Steganography for medical record image.
Comput. Biol. Medicine, October, 2023

Multi-scale-based joint super-resolution and inverse tone-mapping with data synthesis for UHD HDR video.
Displays, September, 2023

High-Fidelity Face Reenactment Via Identity-Matched Correspondence Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Online Video Stabilization Using IMU Sensors.
IEEE Trans. Multim., 2023

Image Quality Assessment for Realistic Zoom Photos.
Sensors, 2023

Disentangled Clothed Avatar Generation from Text Descriptions.
CoRR, 2023

Learning Dense UV Completion for Human Mesh Recovery.
CoRR, 2023

High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

NeRF-SDP: Efficient Generalizable Neural Radiance Field with Scene Depth Perception.
Proceedings of the ACM Multimedia Asia 2023, 2023

Achieving Privacy-Preserving Multi-View Consistency with Advanced 3D-Aware Face De-identification.
Proceedings of the ACM Multimedia Asia 2023, 2023

360-Degree Panorama Generation from Few Unregistered NFoV Images.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

PACC: Perception Aware Congestion Control for Real-time Communication.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Old-Photo Restoration with Detail- and Structure-Enhanced Cascaded Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Divide and Conquer: a Two-Step Method for High Quality Face De-identification with Model Explainability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dual-Head Fusion Network for Image Enhancement.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Boosting Video Object Segmentation via Space-Time Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient Human Rendering with Geometric and Semantic Priors.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2023

2022
Edge-Based Video Compression Texture Synthesis Using Generative Adversarial Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

IdentityMask: Deep Motion Flow Guided Reversible Face Video De-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Ultra-Low Latency, Stable, and Scalable Video Transmission for Free-Viewpoint Video Services.
IEEE Trans. Broadcast., 2022

Multiview nonlinear discriminant structure learning for emotion recognition.
Knowl. Based Syst., 2022

IdentityDP: Differential private identification protection for face images.
Neurocomputing, 2022

L0 structure-prior assisted blur-intensity aware efficient video deblurring.
Neurocomputing, 2022

A Large-scale Sports Tracking Dataset and Progressive Re-detection Based Sports Tracking.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Perceptual Video Coding Based on Semantic-Guided Texture Detection and Synthesis.
Proceedings of the Picture Coding Symposium, 2022

Generative Compression for Face Video: A Hybrid Scheme.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Hiding Among Your Neighbors: Face Image Privacy Protection with Differential Private k-anonymity.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

2021
VMAF Oriented Perceptual Coding Based on Piecewise Metric Coupling.
IEEE Trans. Image Process., 2021

Modeling Acceleration Properties for Flexible INTRA HEVC Complexity Control.
IEEE Trans. Circuits Syst. Video Technol., 2021

An Elastic System Architecture for Edge Based Low Latency Interactive Video Applications.
IEEE Trans. Broadcast., 2021

Current Frame Priors Assisted Neural Network for Intra Prediction.
IEEE Access, 2021

Deep Motion Flow Aided Face Video De-identification.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

HEVC VMAF-oriented Perceptual Rate Distortion Optimization using CNN.
Proceedings of the Picture Coding Symposium, 2021

Deep Face Swapping via Cross-Identity Adversarial Training.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Blindly Predict Image and Video Quality in the Wild.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Personalized and Invertible Face De-identification by Disentangled Identity Information Manipulation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dense 3D Coordinate Code Prior Guidance for High-Fidelity Face Swapping and Face Reenactment.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Region-Aware Adaptive Instance Normalization for Image Harmonization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

3D-BitNet: Flow-Agnostic and Precise Network for video Bit-Depth Expansion.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Configurable Low Delay Congestion Control Scheme for Cellular Networks.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Buffer Displacement Based Online Learning Algorithm For Low Latency HTTP Adaptive Streaming.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Video Enhancement Based on Unpaired Learning.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

2020
A Wavelet-Predominant Algorithm Can Evaluate Quality of THz Security Image and Identify Its Usability.
IEEE Trans. Broadcast., 2020

Real-time UHD video super-resolution and transcoding on heterogeneous hardware.
J. Real Time Image Process., 2020

Quality of Experience Evaluation for Streaming Video Using CGNN.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Hybrid Model for Natural Face De-Identification with Adjustable Privacy.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Deep Blind Video Quality Assessment for User Generated Videos.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Deep Tracking and Segmentation Approach for Soccer Videos Visual Effects.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Learning-Based Quality Enhancement For Scalable Coded Video Over Packet Lossy Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Realistic Talking Face Synthesis With Geometry-Aware Feature Transformation.
Proceedings of the IEEE International Conference on Image Processing, 2020

Toward Fine-Grained Facial Expression Manipulation.
Proceedings of the Computer Vision - ECCV 2020, 2020

TSGAN: A Two-Stream Generative Adversarial Network for Bit-Depth Expansion.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

Native Resolution Detection for 4K-UHD Videos.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

A VMAF Directed Perceptual Rate Distortion Optimization for Video Coding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

FACT: Fused Attention for Clothing Transfer with Generative Adversarial Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
An Improved QoE Evaluation Model for HTTP Adaptive Streaming.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Deep Feature Guided Image Retargeting.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

FPGA Based Video Transcoding System with 2K-4K Super-Resolution Conversion.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Identifying and Pruning Redundant Structures for Deep Neural Networks.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

JND-based Perceptual Rate Distortion Optimization for AV1 Encoder.
Proceedings of the Picture Coding Symposium, 2019

Reinforcement Learning Based Adaptive Bitrate Algorithm for Transmitting Panoramic Videos.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

VMAF Oriented Perceptual Optimization for Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Multi-scale Generative Adversarial Learning for Facial Attribute Transfer.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2019

GAN Based Multi-Exposure Inverse Tone Mapping.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Low-precision CNN Model Quantization based on Optimal Scaling Factor Estimation.
Proceedings of the 2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2019

Viewport Prediction for Panoramic Video with Multi-CNN.
Proceedings of the 2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2019

Deep Video Inverse Tone Mapping.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

2018
An improved Real-Time Video Communication System.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Rate-mixed HEVC Tile based 360 Video Streaming System.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Masking Effects Based Rate Control Scheme for High Efficiency Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

An MCMC based Efficient Parameter Selection Model for x265 Encoder.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Improving Semantic Style Transfer Using Guided Gram Matrices.
Proceedings of the Digital TV and Multimedia Communication - 15th International Forum, 2018

Motion Adaptive Intra Refresh for Low Delay HEVC Encoding.
Proceedings of the Digital TV and Multimedia Communication - 15th International Forum, 2018

Frame Interpolation via Refined Deep Voxel Flow.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

GPU Based Motion-Compensated Frame Interpolation Acceleration for Future Video Coding.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Learning an Inverse Tone Mapping Network with a Generative Adversarial Regularizer.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

A Segment Constraint ABR Algorithm for HEVC Encoder.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

A No Reference Bitstream-Based Video Quality Assessment Model for H.265/HEVC and H.264/AVC.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

A Generic Distributed Scheduling Algorithm for Frame Rate Up Convert Video Transcoding.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

Video Frame Interpolation Using Recurrent Convolutional Layers.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Evaluation of No Reference Bitstream-based Video Quality Assessment Methods.
CoRR, 2017

Deep Binary Representation for Efficient Image Retrieval.
Adv. Multim., 2017

Two-stream deep encoder-decoder architecture for fully automatic video object segmentation.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

A generic method to improve no-reference image blur metric accuracy in video contents.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Weight-based bit allocation scheme for VR videos in HEVC.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Rate control model for high dynamic range video.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Low Latency MPEG-DASH System Over HTTP 2.0 and WebSocket.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Deep hash learning for efficient image retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Lagrangian method based Rate-Distortion Optimization revisited for dependent video coding.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

CNN based post-processing to improve HEVC.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A lightweight distributed media processing system for UHD service.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Two-stream recurrent convolutional neural networks for video saliency estimation.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Machine learning based VP9-to-HEVC video transcoding.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

2016
Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset.
IEEE Trans. Multim., 2016

Shot boundary detection using convolutional neural networks.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Improved intra angular prediction with novel interpolation filter and boundary filter.
Proceedings of the 2016 Picture Coding Symposium, 2016

Block of Interest Based AVS to HEVC Transcoding with Resolution Conversion.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

SJTU 4K video subjective quality dataset for content adaptive bit rate estimation without encoding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

GPU accelerated high-quality video/image super-resolution.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

Image super resolution by sparse linear regression and iterative back projection.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

A novel parallel-friendly rate control scheme for HEVC.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Review of ITU-T parametric models for compressed video quality estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Saliency based rate control scheme for high efficiency video coding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Temporal dependent bit allocation scheme for rate control in HEVC.
Proceedings of the 2015 IEEE Workshop on Signal Processing Systems, 2015

Fast depth decision with enlarged coding block sizes for HEVC intra coding of 4K ultra-HD video.
Proceedings of the 2015 IEEE Workshop on Signal Processing Systems, 2015

Small group people behavior analysis based on temporal recursive trajectory identification.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Which metric can predict coding gain of H.265/HEVC over H.264/AVC?
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

CNN-based shot boundary detection and video annotation.
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

Gradient based fast mode and depth decision for high efficiency intra frame video coding.
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

Learning based fast H.264 to H.265 transcoding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Temporal Recursive Pedestrian Trajectory Identification for Multi-camera Surveillance System.
Proceedings of the 15th IEEE International Conference on Computer and Information Technology, 2015

2014
You Are What You Watch and When You Watch: Inferring Household Structures From IPTV Viewing Data.
IEEE Trans. Broadcast., 2014

Multilevel DCT-based zerotree image coding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

Fast AVS to HEVC transcoding based on ROI detection using visual characteristics.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

2013
Network Flow Based Collective Behavior Analysis.
Proceedings of the Behavior and Social Computing, 2013

Integer programming based crowd behavior analysis for world expo.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2013

2012
Model-Based Robust Prediction of Cumulative Participant Curve in Large-Scale Events.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Tourists Flow Prediction by Clustering-Based GRNN.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

2010
Backward Adaptive Pre/Post-Filtered DPCM with Near-Optimal Rate-Distortion Performance.
Proceedings of the IEEE International Conference on Communications, 2010

2009
An improved block size selection method based on macroblock movement characteristic.
Multim. Tools Appl., 2009

2008
Variable block size selection for a transcoder based on MB movement information.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Bit Allocation for Fine-Granular SNR Scalability Coding with Hierarchical B Pictures.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

