Zhenzhong Chen

ORCID: 0000-0002-7882-1066

According to our database, Zhenzhong Chen authored at least 179 papers between 2003 and 2025.

Bibliography

2025
LaMD: Latent Motion Diffusion for Image-Conditional Video Generation.
Int. J. Comput. Vis., July, 2025

Disentangling User Interest and Geographical Context for POI Recommendations.
ACM Trans. Intell. Syst. Technol., June, 2025

Information Bottleneck Based Self-Distillation: Boosting Lightweight Network for Real-World Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

LongCaptioning: Unlocking the Power of Long Video Caption Generation in Large Multimodal Models.
CoRR, February, 2025

Remote Sensing Semantic Segmentation Quality Assessment Based on Vision Language Model.
IEEE Trans. Geosci. Remote. Sens., 2025

Lightweight image super-resolution with sliding Proxy Attention Network.
Signal Process., 2025

Efficient JPEG-AI Image Coding for Remote Sensing Semantic Segmentation.
IEEE Geosci. Remote. Sens. Lett., 2025

HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Learning Many-to-Many Mapping for Unpaired Real-World Image Super-Resolution and Downscaling.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Deep Causal Reasoning for Recommendations.
ACM Trans. Intell. Syst. Technol., August, 2024

Deconfounded Cross-modal Matching for Content-based Micro-video Background Music Recommendation.
ACM Trans. Intell. Syst. Technol., June, 2024

Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Hierarchical Transformer with Spatio-temporal Context Aggregation for Next Point-of-interest Recommendation.
ACM Trans. Inf. Syst., March, 2024

Variational Mixture of Stochastic Experts Auto-Encoder for Multi-Modal Recommendation.
IEEE Trans. Multim., 2024

Learned Video Compression via Heterogeneous Deformable Compensation Network.
IEEE Trans. Multim., 2024

A Benchmark for Controllable Text-Image-to-Video Generation.
IEEE Trans. Multim., 2024

Towards 3D Colored Mesh Saliency: Database and Benchmarks.
IEEE Trans. Multim., 2024

Exploring Long- and Short-Range Temporal Information for Learned Video Compression.
IEEE Trans. Image Process., 2024

JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer.
IEEE Trans. Image Process., 2024

Remote Sensing Image Coding for Machines on Semantic Segmentation via Contrastive Learning.
IEEE Trans. Geosci. Remote. Sens., 2024

DOPNet: Dense Object Prediction Network for Multiclass Object Counting and Localization in Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Learning a Single Convolutional Layer Model for Low Light Image Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2024

Hierarchical Image Feature Compression for Machines via Feature Sparsity Learning.
IEEE Signal Process. Lett., 2024

Corner-to-Center long-range context model for efficient learned image compression.
J. Vis. Commun. Image Represent., 2024

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results.
CoRR, 2024

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results.
CoRR, 2024

UnifiedSSR: A Unified Framework of Sequential Search and Recommendation.
Proceedings of the ACM on Web Conference 2024, 2024

Domain-Agnostic Crowd Counting via Uncertainty-Guided Style Diversity Augmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024


Reconstruction Distortion of Learned Image Compression with Imperceptible Perturbations.
Proceedings of the Data Compression Conference, 2024

Transferable Learned Image Compression-Resistant Adversarial Perturbations.
Proceedings of the Data Compression Conference, 2024




2023
Multi-auxiliary Augmented Collaborative Variational Auto-encoder for Tag Recommendation.
ACM Trans. Inf. Syst., October, 2023

Variational Bandwidth Auto-Encoder for Hybrid Recommender Systems.
IEEE Trans. Knowl. Data Eng., May, 2023

Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation.
IEEE Trans. Multim., 2023

Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck.
IEEE Trans. Multim., 2023

Adversarial Spectral Super-Resolution for Multispectral Imagery Using Spatial Spectral Feature Attention Module.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

Learned Scale-Arbitrary Image Downscaling for Non-Learnable Upscaling.
IEEE Signal Process. Lett., 2023

LaMD: Latent Motion Diffusion for Video Generation.
CoRR, 2023

A Lightweight No-reference Video Quality Assessment Method.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

2022
Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems.
IEEE Trans. Multim., 2022

Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360° Videos: ITU-T Rec. P.919.
IEEE Trans. Multim., 2022

Predicate Correlation Learning for Scene Graph Generation.
IEEE Trans. Image Process., 2022

Learning Discrete Representations From Reference Images for Large Scale Factor Image Super-Resolution.
IEEE Trans. Image Process., 2022

Affective Video Content Analysis via Multimodal Deep Quality Embedding Network.
IEEE Trans. Affect. Comput., 2022

Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Effects of Lossy Compression on Remote Sensing Image Classification Based on Convolutional Sparse Coding.
IEEE Geosci. Remote. Sens. Lett., 2022

Deep Siamese Network With Motion Fitting for Object Tracking in Satellite Videos.
IEEE Geosci. Remote. Sens. Lett., 2022

Decomposing style, content, and motion for videos.
J. Vis. Commun. Image Represent., 2022

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report.
CoRR, 2022

Deep Deconfounded Content-based Tag Recommendation for UGC with Causal Intervention.
CoRR, 2022

Video Quality Assessment based on Quality Aggregation Networks.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

The First Challenge on Moving Object Detection and Tracking in Satellite Videos: Methods and Results.
Proceedings of the 26th International Conference on Pattern Recognition, 2022


Make It Move: Controllable Image-to-Video Generation with Text Descriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection.
IEEE Trans. Multim., 2021

A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.
IEEE Trans. Image Process., 2021

A Game Theory Based CTU-Level Bit Allocation Scheme for HEVC Region of Interest Coding.
IEEE Trans. Image Process., 2021

Multi-Objective Optimization of Quality in VVC Rate Control for Low-Delay Video Coding.
IEEE Trans. Image Process., 2021

Toward Visual Distortion in Black-Box Attacks.
IEEE Trans. Image Process., 2021

AFNet: Adaptive Fusion Network for Remote Sensing Image Semantic Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2021

Multimodal Local-Global Attention Network for Affective Video Content Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2021

Semantic Information Oriented No-Reference Video Quality Assessment.
IEEE Signal Process. Lett., 2021

Visual Scanpath Prediction Using IOR-ROI Recurrent Mixture Density Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

No-reference Quality Assessment of Panoramic Video based on Spherical-domain Features.
Proceedings of the Picture Coding Symposium, 2021


2020
Learned Image Downscaling for Upscaling Using Content Adaptive Resampler.
IEEE Trans. Image Process., 2020

MonoFENet: Monocular 3D Object Detection With Feature Enhancement Networks.
IEEE Trans. Image Process., 2020

Deep Learning-Based Technology in Responses to the Joint Call for Proposals on Video Compression With Capability Beyond HEVC.
IEEE Trans. Circuits Syst. Video Technol., 2020

Class-Oriented Discriminative Dictionary Learning for Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2020

A Multi-Scale Position Feature Transform Network for Video Frame Interpolation.
IEEE Trans. Circuits Syst. Video Technol., 2020

Asymmetric Foveated Just-Noticeable-Difference Model for Images With Visual Field Inhomogeneities.
IEEE Trans. Circuits Syst. Video Technol., 2020

Introduction to the Special Section on Deep Learning in Video Enhancement and Evaluation: The New Frontier.
IEEE Trans. Circuits Syst. Video Technol., 2020

Object Tracking in Satellite Videos Based on Convolutional Regression Network With Appearance and Motion Features.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2020

Exploiting the local temporal information for video captioning.
J. Vis. Commun. Image Represent., 2020

Human scanpath prediction based on deep convolutional saccadic model.
Neurocomputing, 2020

Towards Visual Distortion in Black-Box Attacks.
CoRR, 2020

Learning Compact Reward for Image Captioning.
CoRR, 2020

A Multimodal Variational Encoder-Decoder Framework for Micro-video Popularity Prediction.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

A Subjective Quality Assessment Database for Mobile Video Coding.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Subjective Study of Perceptual Quality for Micro-Video Applications.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Performance Analysis of Intra Block Copy for Screen Content Coding in AVS3.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Rate Control For Versatile Video Coding.
Proceedings of the IEEE International Conference on Image Processing, 2020

2019
Improving Saliency Detection Based on Modeling Photographer's Intention.
IEEE Trans. Multim., 2019

Point Cloud Saliency Detection by Local and Global Feature Fusion.
IEEE Trans. Image Process., 2019

Visual Quality Evaluation for Semantic Segmentation: Subjective Assessment Database and Objective Assessment Measure.
IEEE Trans. Image Process., 2019

An Optimized Rate Control for Low-Delay H.265/HEVC.
IEEE Trans. Image Process., 2019

An Adaptive Spectral Decorrelation Method for Lossless MODIS Image Compression.
IEEE Trans. Geosci. Remote. Sens., 2019

Effects of Compression on Remote Sensing Image Classification Based on Fractal Analysis.
IEEE Trans. Geosci. Remote. Sens., 2019

Video Saliency Prediction Based on Spatial-Temporal Two-Stream Network.
IEEE Trans. Circuits Syst. Video Technol., 2019

Assessing Visual Quality of Omnidirectional Videos.
IEEE Trans. Circuits Syst. Video Technol., 2019

Spherical Domain Rate-Distortion Optimization for Omnidirectional Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2019

Object Tracking on Satellite Videos: A Correlation Filter-Based Tracking Method With Trajectory Correction by Kalman Filter.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2019

Joint latent factors and attributes to discover interpretable preferences in recommendation.
Inf. Sci., 2019

Time-semantic-aware Poisson tensor factorization approach for scalable hotel recommendation.
Inf. Sci., 2019

VCG: Exploiting visual contents and geographical influence for Point-of-Interest recommendation.
Neurocomputing, 2019

User-Video Co-Attention Network for Personalized Micro-video Recommendation.
Proceedings of the World Wide Web Conference, 2019

SSIM Prediction for H.265/HEVC based on Convolutional Neural Networks.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

A Novel Partition Mode for Screen Content Video in AVS3.
Proceedings of the Picture Coding Symposium, 2019

Multimodal Deep Denoise Framework for Affective Video Content Analysis.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Hierarchical Global-Local Temporal Modeling for Video Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Cross-Fiber Spatial-Temporal Co-enhanced Networks for Video Action Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Mutually Reinforced Spatio-Temporal Convolutional Tube for Human Action Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Two-Stream Refinement Network for RGB-D Saliency Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Meta Learning for Image Captioning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Subjective Panoramic Video Quality Assessment Database for Coding Applications.
IEEE Trans. Broadcast., 2018

Thermal to Visible Facial Image Translation Using Generative Adversarial Networks.
IEEE Signal Process. Lett., 2018

A saliency prediction model on 360 degree images using color dictionary based sparse representation.
Signal Process. Image Commun., 2018

Recent advances in omnidirectional video coding for virtual reality: Projection and evaluation.
Signal Process., 2018

Joint foveation-depth just-noticeable-difference model for virtual reality environment.
J. Vis. Commun. Image Represent., 2018

Disparity tuning guided stereoscopic saliency detection for eye fixation prediction.
J. Vis. Commun. Image Represent., 2018

Personalized Visual Saliency: Individuality Affects Image Perception.
IEEE Access, 2018

Spatial-Temporal Distance Metric Embedding for Time-Specific POI Recommendation.
IEEE Access, 2018

Dense Residual Convolutional Neural Network based In-Loop Filter for HEVC.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Two-Pass Rate Control for Constant Quality in High Efficiency Video Coding.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Image Captioning with Visual-Semantic LSTM.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Stackelberg Game Based Rate Allocation for HEVC Region of Interest Coding.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

RGB-D Semantic Segmentation: A Review.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Spherical Structural Similarity Index for Objective Omnidirectional Video Quality Assessment.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Towards Subjective Quality Assessment for Panoramic Video.
Proceedings of the Human Vision and Electronic Imaging 2018, Burlingame, CA, USA, 28 January 2018, 2018

CNN-Optimized Image Compression with Uncertainty based Resource Allocation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Multi-Layer Quantization Control for Quality-Constrained H.265/HEVC.
IEEE Trans. Image Process., 2017

CNN-based rate-distortion modeling for H.265/HEVC.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

A fast intra prediction algorithm for 360-degree equirectangular panoramic video.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Layout-Driven Top-Down Saliency Detection for Webpage.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Towards Foveated Just Noticeable Difference Modeling for Virtual Reality.
Proceedings of the Image Quality and System Performance XIV, Electronic Imaging 2017, 2017

Scanpath mining of eye movement trajectories for visual attention analysis.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Spherical domain rate-distortion optimization for 360-degree video coding.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Perceptually-lossless image coding based on foveated-JND and H.265/HEVC.
J. Vis. Commun. Image Represent., 2016

Revisiting visual attention identification based on eye tracking data analytics.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

2015
Visual acuity inspired saliency detection by using sparse features.
Inf. Sci., 2015

Facial Expression Cloning with Elastic and Muscle Models.
CoRR, 2015

Exploiting perceptual redundancy in images.
Proceedings of the Visual Information Processing and Communication VI, 2015

Perceptually Lossless Image Coding Based on Foveated JND.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Recent advances in perceptual H.265/HEVC video coding.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Remote Sensing Image Compression: A Review.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Object Tracking over a Multiple-Camera Network.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014
A Video Saliency Detection Model in Compressed Domain.
IEEE Trans. Circuits Syst. Video Technol., 2014

Facial expression cloning with elastic and muscle models.
J. Vis. Commun. Image Represent., 2014

Saliency based perceptual HEVC.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Performance analysis of AVS2 for remote sensing image compression.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

JND modeling: Approaches and applications.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

2013
A Heat-Map-Based Algorithm for Recognizing Group Activities in Videos.
IEEE Trans. Circuits Syst. Video Technol., 2013

Intra-and-Inter-Constraint-Based Video Enhancement Based on Piecewise Tone Mapping.
IEEE Trans. Circuits Syst. Video Technol., 2013

Introduction to the Special Issue on "Recent advances on analysis and processing for distributed video systems".
J. Vis. Commun. Image Represent., 2013

A saliency detection model based on sparse features and visual acuity.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

2012
Bottom-Up Saliency Detection Model Based on Human Visual Sensitivity and Amplitude Spectrum.
IEEE Trans. Multim., 2012

Saliency Detection in the Compressed Domain for Adaptive Image Retargeting.
IEEE Trans. Image Process., 2012

Macroblock Classification Method for Video Applications Involving Motions.
IEEE Trans. Broadcast., 2012

Video saliency detection in the compressed domain.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
A Fast Sub-Pixel Motion Estimation Algorithm for H.264/AVC Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2011

A new global-based video enhancement algorithm by fusing features of multiple region-of-interests.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Image retargeting based on the sensitivity-tuned visual significance map.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Saliency-based visualization for image search.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Saliency-based image retargeting in the compressed domain.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Unsupervised malaria parasite detection based on phase spectrum.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011

2010
Perceptual video coding: Challenges and approaches.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
A two-pass rate control algorithm for H.264/AVC high definition video coding.
Signal Process. Image Commun., 2009

2008
An efficient algorithm for H.264/AVC high definition video coding.
IEEE Trans. Consumer Electron., 2008

Constant distortion rate control for H.264/AVC high definition videos with scene change.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

2007
A Rate and Distortion Analysis of Multiscale Binary Shape Coding Based on Statistical Learning.
IEEE Trans. Multim., 2007

A Unified Approach of Bit-Rate Control for Binary and Gray-Level Shape Sequences Coding.
IEEE Trans. Circuits Syst. Video Technol., 2007

Towards Rate-Distortion Tradeoff in Real-Time Color Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2007

Recent advances in rate control for video coding.
Signal Process. Image Commun., 2007

A novel statistical learning-based rate distortion analysis approach for multiscale binary shape coding.
Proceedings of the Visual Communications and Image Processing 2007, 2007

2006
Dynamic Bit Allocation for Multiple Video Object Coding.
IEEE Trans. Multim., 2006

Distortion variation minimization in real-time video coding.
Signal Process. Image Commun., 2006

2005
Joint texture-shape optimization for MPEG-4 multiple video objects.
IEEE Trans. Circuits Syst. Video Technol., 2005

A real-time video transport system for the best-effort Internet.
Signal Process. Image Commun., 2005

2004
Linear rate-distortion models for MPEG-4 shape coding.
IEEE Trans. Circuits Syst. Video Technol., 2004

Object-based rate control for MPEG-4 video object coding.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Optimal bit allocation for MPEG-4 multiple video objects.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
Improved single-video-object rate control for MPEG-4.
IEEE Trans. Circuits Syst. Video Technol., 2003

Improved rate control for MPEG-4 video transport over wireless channel.
Signal Process. Image Commun., 2003

Improved single video object rate control for MPEG-4.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

