Zhenzhong Chen

This is a disambiguation page; it contains multiple papers by persons with the same or a similar name.

Bibliography

2025
SemPT: Semantic Prompt Tuning for Vision-Language Models.
CoRR, August, 2025

DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework.
CoRR, August, 2025

LOP: Learning Optimal Pruning for Efficient On-Demand MLLMs Scaling.
CoRR, June, 2025

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries.
CoRR, May, 2025

Training-Free Reasoning and Reflection in MLLMs.
CoRR, May, 2025

Frequency Enhancement for Image Demosaicking.
CoRR, March, 2025

Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse.
CoRR, January, 2025

Reliability-based design optimization of hinge sleeve using adaptive E-SVM.
Reliab. Eng. Syst. Saf., 2025

Joint reference frame synthesis and post filter enhancement for Versatile Video Coding.
J. Vis. Commun. Image Represent., 2025

Low-Profile All-Metal Dual-Band Dual-Polarized Shared-Aperture Phased Arrays for UAV Applications.
IEEE Internet Things J., 2025

Monotonic Neural Network based Rate-Distortion Modeling for H.266/VVC.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Continuous Space-Time Video Resampling with Invertible Motion Steganography.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Fitted Neural Lossless Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Video saliency prediction for First-Person View UAV videos: Dataset and benchmark.
Neurocomputing, 2024

Visual Context Window Extension: A New Perspective for Long Video Understanding.
CoRR, 2024

Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates.
CoRR, 2024

Realistic Extreme Image Rescaling via Generative Latent Space Learning.
CoRR, 2024

Space-Time Video Super-resolution with Neural Operator.
CoRR, 2024

Lightweight Arbitrary-Scale Super-Resolution of Remote Sensing Images via Super-Scale Feature.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Deep Reference Frame for Versatile Video Coding with Structural Re-parameterization.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Learned Image Compression with Quantization Error Compensator.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Multi-Agent Reinforcement Learning based Bit Allocation for Gaming Video Coding.
Proceedings of the Picture Coding Symposium, 2024

Swin Transformer-Based In-Loop Filter for VVC Intra Coding.
Proceedings of the Picture Coding Symposium, 2024

Mutual Guidance Distillation for Joint Demosaicking and Denoising of Raw Images.
Proceedings of the Picture Coding Symposium, 2024

Perceptual-oriented Learned Image Compression with Dynamic Kernel.
Proceedings of the Data Compression Conference, 2024

Learned Lossless Image Compression Based on Bit Plane Slicing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Multiple visual relationship forecasting and arrangement in videos.
Neurocomputing, July, 2023

3D Deformable Convolution Temporal Reasoning network for action recognition.
J. Vis. Commun. Image Represent., May, 2023

Illumination-aware window transformer for RGBT modality fusion.
J. Vis. Commun. Image Represent., February, 2023

Learning Many-to-Many Mapping for Unpaired Real-World Image Super-resolution and Downscaling.
CoRR, 2023

JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer.
CoRR, 2023

Dynamic Kernel-Based Adaptive Spatial Aggregation for Learned Image Compression.
CoRR, 2023

Improving Generalization of Image Captioning with Unsupervised Prompt Learning.
CoRR, 2023

Learning a Single Convolutional Layer Model for Low Light Image Enhancement.
CoRR, 2023

Continuous Space-Time Video Super-Resolution Utilizing Long-Range Temporal Information.
CoRR, 2023

Learned Image Compression with Enhanced Dynamic Spatial Aggregation and Asymmetric Entropy model.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

An Efficient Method for Real-Time Image Exposure Correction.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Towards Lightweight Deep Reference Frame for Versatile Video Coding.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Towards Deep Reference Frame in Versatile Video Coding NNVC.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Efficient Learned Video Compression via Bidirectional Temporal Information Exploration.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Joint image demosaicking and denoising with mutual guidance of color channels.
Signal Process., 2022

Limit state Kriging modeling for reliability-based design optimization through classification uncertainty quantification.
Reliab. Eng. Syst. Saf., 2022

A multi-stage spatio-temporal adaptive network for video super-resolution.
J. Vis. Commun. Image Represent., 2022

Towards mesh saliency in 6 degrees of freedom.
Neurocomputing, 2022

Hierarchical Transformer with Spatio-Temporal Context Aggregation for Next Point-of-Interest Recommendation.
CoRR, 2022

Exploring Long & Short Range Temporal Information for Learned Video Compression.
CoRR, 2022

Debiased Cross-modal Matching for Content-based Micro-video Background Music Recommendation.
CoRR, 2022

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit.
CoRR, 2022

Learning Knowledge Representation with Meta Knowledge Distillation for Single Image Super-Resolution.
CoRR, 2022

Pyramid Feature Alignment Network for Video Deblurring.
CoRR, 2022

Deep Causal Reasoning for Recommendations.
CoRR, 2022

Mutually-Regularized Dual Collaborative Variational Auto-encoder for Recommendation Systems.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Controllable Space-Time Video Super-Resolution via Enhanced Bidirectional Flow Warping.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

SalCrop: Spatio-temporal Saliency Based Video Cropping.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Perceptual in-Loop Filter for Image and Video Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Object-Relation Reasoning Graph for Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Coarse-to-Fine Boundary Localization method for Naturalistic Driving Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Subjective Quality Assessment of One-to-One Video-Telephony Services.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

2021
Semantic meaning modulates object importance in human fixation prediction.
J. Vis. Commun. Image Represent., 2021

Optical-Flow-Reuse-Based Bidirectional Recurrent Network for Space-Time Video Super-Resolution.
CoRR, 2021

Cross-modal Variational Auto-encoder for Content-based Micro-video Background Music Recommendation.
CoRR, 2021

Visual Relationship Forecasting in Videos.
CoRR, 2021

Collaborative Variational Bandwidth Auto-encoder for Recommender Systems.
CoRR, 2021

MAPS: Joint Multimodal Attention and POS Sequence Generation for Video Captioning.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Learn to Look Around: Deep Reinforcement Learning Agent for Video Saliency Prediction.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Reinforcement Learning based ROI Bit Allocation for Gaming Video Coding in VVC.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Two Stage Optimal Bit Allocation for HEVC Hierarchical Coding Structure.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Dual-Modality Vehicle Anomaly Detection via Bilateral Trajectory Tracing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Fast sample adaptive offset algorithm for 360-degree video coding.
Signal Process. Image Commun., 2020

Iterative reliable design space approach for efficient reliability-based design optimization.
Eng. Comput., 2020

Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution.
CoRR, 2020

Towards Mesh Saliency Detection in 6 Degrees of Freedom.
CoRR, 2020

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework.
CoRR, 2020

Video Super Resolution Using Temporal Encoding ConvLSTM and Multi-Stage Fusion.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

DOVE: Decomposition Oriented Video super-rEsolution.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Equirectangular Projection Oriented Intra Prediction for 360-Degree Video Coding.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Towards Quantized DCT Coefficients Restoration for Compressed Images.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Deep Inter Coding with Interpolated Reference Frame for Hierarchical Coding Structure.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

In-Loop Filter with Dense Residual Convolutional Neural Network for VVC.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

A Dual Heterogeneous Graph Attention Network to Improve Long-Tail Performance for Shop Search in E-Commerce.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

User Conditional Hashtag Recommendation for Micro-Videos.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

No-Reference Video Quality Assessment Based On Similarity Map Estimation.
Proceedings of the IEEE International Conference on Image Processing, 2020

Hierarchical Graph Attention Network for Visual Relationship Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Gated Heterogeneous Graph Representation Learning for Shop Search in E-commerce.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Video Frame Interpolation via Deformable Separable Convolution.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learned Image Downscaling for Upscaling using Content Adaptive Resampler.
CoRR, 2019

Obj-GloVe: Scene-Based Contextual Object Embedding.
CoRR, 2019

A Review-Driven Neural Model for Sequential Recommendation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Sequential Behavior Modeling for Next Micro-Video Recommendation with Collaborative Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Affective Video Content Analyses by Using Cross-Modal Embedding Learning Features.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
A fast CU partition and mode decision algorithm for HEVC intra coding.
Signal Process. Image Commun., 2018

RecNet: a deep neural network for personalized POI recommendation in location-based social networks.
Int. J. Geogr. Inf. Sci., 2018

Stable Combinatorial Spectrum Matching.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Scanpath Prediction for Visual Attention using IOR-ROI LSTM.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-Level Fusion Based 3D Object Detection From Monocular Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Person Re-Identification With Cascaded Pairwise Convolutions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Exploring visual attention using random walks based eye tracking protocols.
J. Vis. Commun. Image Represent., 2017

3-D BLE Indoor Localization Based on Denoising Autoencoder.
IEEE Access, 2017

A Fast Intra Mode Decision algorithm for HEVC.
Proceedings of the Visual Information Processing and Communication VIII, Burlingame, CA, USA, 29 January 2017, 2017

A Fast Sample Adaptive Offset Algorithm for H.265/HEVC.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Saliency Detection by Superpixel-Based Sparse Representation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Saliency map generation based on saccade target theory.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Individual trait oriented scanpath prediction for visual attention analysis.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2016
Towards Region-of-Attention Analysis in Eye Tracking Protocols.
Proceedings of the Visual Information Processing and Communication VII, 2016

Haze Removal of Single Remote Sensing Image by Combining Dark Channel Prior with Superpixel.
Proceedings of the Visual Information Processing and Communication VII, 2016

A fast mode decision algorithm for HEVC intra prediction.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

IIPWHU@TRECVID 2016 Surveillance Event Detection.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Adaptive background for real-time visual tracking.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015
Rate and Power Allocation for Joint Coding and Transmission in Wireless Video Chat Applications.
IEEE Trans. Multim., 2015

QoE-aware video streaming for SVC over multiuser MIMO-OFDM systems.
J. Vis. Commun. Image Represent., 2015

Quantitative analysis on lossy compression in remote sensing image classification.
Proceedings of the Visual Information Processing and Communication VI, 2015

Saliency-based artificial object detection for satellite images.
Proceedings of the Visual Information Processing and Communication VI, 2015

Quality Assessment for Remote Sensing Images: Approaches and Applications.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Multi-layer Sparse Coding Based Ship Detection for Remote Sensing Images.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Remote sensing image super-resolution: Challenges and approaches.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

Maximizing Lifetime of Data-Gathering Trees with Different Aggregation Modes in WSNs.
Proceedings of the 2015 IEEE Global Communications Conference, 2015

Visual attention identification using random walks based eye tracking protocols.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

2013
QoE-Driven Cache Management for HTTP Adaptive Bit Rate Streaming Over Wireless Networks.
IEEE Trans. Multim., 2013

Scalable Resource Allocation for SVC Video Streaming Over Multiuser MIMO-OFDM Networks.
IEEE Trans. Multim., 2013

Energy Minimization for Wireless Video Transmissions With Deadline and Reliability Constraints.
IEEE Trans. Circuits Syst. Video Technol., 2013

Hybrid Saliency Detection for Images.
IEEE Signal Process. Lett., 2013

On quality of experience of scalable video adaptation.
J. Vis. Commun. Image Represent., 2013

Complexity-scalable video coding and power-rate-distortion modeling for wireless video chat applications.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

A fast rate adaptation scheme for SVC based on the packet dependencies.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Distributed rate and power allocation for wireless video chats via pricing schemes.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Saliency assistant driver alerting.
Proceedings of the International Conference on Connected Vehicles and Expo, 2013

2012
Energy-Efficient Resource Allocation and Scheduling for Multicast of Scalable Video Over Wireless Networks.
IEEE Trans. Multim., 2012

QoE analysis for scalable video adaptation.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

QoE-aware resource allocation for scalable video transmission over multiuser MIMO-OFDM systems.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

A scalable resource allocation framework for SVC video transmissions over downlink MIMO-OFDM networks.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Energy-minimized wireless video transmissions via sharing of retransmission limits.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

QoE-driven cache management for HTTP adaptive bit rate (ABR) streaming over wireless networks.
Proceedings of the 2012 IEEE Global Communications Conference, 2012

2011
Depth No-Synthesis-Error Model for View Synthesis in 3-D Video.
IEEE Trans. Image Process., 2011

Video Quality Assessment Based on Measuring Perceptual Noise From Spatial and Temporal Perspectives.
IEEE Trans. Circuits Syst. Video Technol., 2011

Boundary Artifact Reduction in View Synthesis of 3D Video: From Perspective of Texture-Depth Alignment.
IEEE Trans. Broadcast., 2011

Binocular Just-Noticeable-Difference Model for Stereoscopic Images.
IEEE Signal Process. Lett., 2011

Channel Access Allocation for Scalable Video Transmission Over Contention-Based Wireless Networks.
IEEE Signal Process. Lett., 2011

Cross-layer optimization for SVC video delivery over the IEEE 802.11e wireless networks.
J. Vis. Commun. Image Represent., 2011

Packet scheduling with playout adaptation for scalable video delivery over wireless networks.
J. Vis. Commun. Image Represent., 2011

Cross-view post-filtering for fidelity enhancement on asymmetric coding of 3D video.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

A MAXMIN resource allocation approach for scalable video delivery over multiuser MIMO-OFDM systems.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Frame-level quantization control for perceptual quality constrained H.264/AVC video coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Joint power allocation and bit loading for enhanced SVC video downlink transmissions over SDMA/OFDMA networks.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Perceptually-Friendly H.264/AVC Video Coding Based on Foveated Just-Noticeable-Distortion Model.
IEEE Trans. Circuits Syst. Video Technol., 2010

Perception-Aware Multiple Scalable Video Streaming Over WLANs.
IEEE Signal Process. Lett., 2010

Video Enhancement from Multiple Compressed Copies in Transform Domain.
EURASIP J. Image Video Process., 2010

Suppressing texture-depth misalignment for boundary noise removal in view synthesis.
Proceedings of the Picture Coding Symposium, 2010

Perceptually optimized error resilient transcoding using attention-based intra refresh.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Efficient packet scheduling for scalable video delivery to mobile clients.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Playout adaptation based packet scheduling for scalable video delivery over wireless links.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

An efficient multicast algorithm for the scalable extension of H.264/AVC over IEEE 802.11 WLANs.
Proceedings of the International Conference on Image Processing, 2010

Joint packet prioritization and QoS mapping for SVC over WLANs.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Perception-oriented video coding based on foveated JND model.
Proceedings of the 2009 Picture Coding Symposium, 2009

Throughput Adaptation for Scalable Video Multicast in Wireless Networks.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

On the method of multicopy video enhancement in transform domain.
Proceedings of the International Conference on Image Processing, 2009

Enhanced gait recognition based on weighted dynamic feature.
Proceedings of the International Conference on Image Processing, 2009

Perceptually-friendly H.264/AVC video coding.
Proceedings of the International Conference on Image Processing, 2009

2008
Spherical basis functions and uniform distribution of points on spheres.
J. Approx. Theory, 2008

A rate and distortion analysis for H.264/AVC video coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

