Xiongkuo Min

Orcid: 0000-0001-5693-0416

According to our database1, Xiongkuo Min authored at least 349 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MI3S: A multimodal large language model assisted quality assessment framework for AI-generated talking heads.
Inf. Process. Manag., 2026

2025
ScanDTM: A Novel Dual-Temporal Modulation Scanpath Prediction Model for Omnidirectional Images.
IEEE Trans. Circuits Syst. Video Technol., August, 2025

Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild.
IEEE Trans. Circuits Syst. Video Technol., August, 2025

Joint Luminance-Chrominance Learning for Image Debanding.
IEEE Trans. Circuits Syst. Video Technol., August, 2025

Energy-Efficient VR 360 Video Streaming in the IRS-Aided Rate-Splitting Multiple Access Network.
IEEE Trans. Commun., August, 2025

VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning.
CoRR, August, 2025

Who is a Better Player: LLM against LLM.
CoRR, August, 2025

Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning.
CoRR, August, 2025

Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment.
CoRR, August, 2025

Engagement Prediction of Short Videos with Large Multimodal Models.
CoRR, August, 2025

Full-Reference and No-Reference Quality Assessment for Video Frame Interpolation.
IEEE Trans. Circuits Syst. Video Technol., July, 2025

Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads.
CoRR, July, 2025

LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs.
CoRR, July, 2025

Efficient Face Image Quality Assessment via Self-training and Knowledge Distillation.
CoRR, July, 2025

CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos.
CoRR, July, 2025

RGC-VQA: An Exploration Database for Robotic-Generated Video Quality Assessment.
CoRR, June, 2025

ICME 2025 Generalizable HDR and SDR Video Quality Measurement Grand Challenge.
CoRR, June, 2025

Quality Assessment and Distortion-aware Saliency Prediction for AI-Generated Omnidirectional Images.
CoRR, June, 2025

DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models.
CoRR, June, 2025

GOBench: Benchmarking Geometric Optics Generation and Understanding of MLLMs.
CoRR, June, 2025

ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images.
IEEE Trans. Vis. Comput. Graph., May, 2025

Unified Approach to Mesh Saliency: Evaluating Textured and Non-Textured Meshes Through VR and Multifunctional Prediction.
IEEE Trans. Vis. Comput. Graph., May, 2025

Time-Smooth Wireless Transmission of Probabilistic Slicing VR 360 Video in MISO-OFDM Systems.
IEEE Trans. Commun., May, 2025

Scaling-up Perceptual Video Quality Assessment.
CoRR, May, 2025

TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs.
CoRR, May, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
CoRR, May, 2025

Exploring Image Quality Assessment from a New Perspective: Pupil Size.
CoRR, May, 2025

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation.
CoRR, May, 2025

Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision.
CoRR, May, 2025

MM-PCQA+: Advancing Multi-Modal Learning for Point Cloud Quality Assessment.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025

Study of Subjective and Objective Naturalness Assessment of AI-Generated Images.
IEEE Trans. Circuits Syst. Video Technol., April, 2025

AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images.
CoRR, April, 2025

LMM4Gen3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs.
CoRR, April, 2025

EEmo-Bench: A Benchmark for Multi-modal Large Language Models on Image Evoked Emotion Assessment.
CoRR, April, 2025

NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results.
CoRR, April, 2025

Omni<sup>2</sup>: Unifying Omnidirectional Image Generation and Editing in an Omni Model.
CoRR, April, 2025

PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving.
CoRR, April, 2025

Towards Explainable Partial-AIGC Image Quality Assessment.
CoRR, April, 2025

LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs.
CoRR, April, 2025

Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes.
CoRR, April, 2025

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy.
CoRR, March, 2025

Information Density Principle for MLLM Benchmarks.
CoRR, March, 2025

No-Reference Image Quality Assessment: Obtain MOS From Image Quality Score Distribution.
IEEE Trans. Circuits Syst. Video Technol., February, 2025

Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model.
CoRR, February, 2025

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment.
CoRR, January, 2025

HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment.
CoRR, January, 2025

IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models.
CoRR, January, 2025

Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images.
IEEE Trans. Multim., 2025

Evaluating Point Cloud From Moving Camera Videos: A No-Reference Metric.
IEEE Trans. Multim., 2025

Aggregate and Discriminate: Pseudo Clips-Guided Boundary Perception for Video Moment Retrieval.
IEEE Trans. Multim., 2025

Quality-Guided Skin Tone Enhancement for Portrait Photography.
IEEE Trans. Multim., 2025

From Haziness to Clarity: A Novel Iterative Memory-Retrospective Emergence Model for Omnidirectional Image Saliency Prediction.
IEEE Trans. Image Process., 2025

How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and Model.
IEEE Trans. Image Process., 2025

Advancing Zero-Shot Digital Human Quality Assessment Through Text-Prompted Evaluation.
IEEE Trans. Image Process., 2025

Blind Image Quality Assessment by Gaussian Mixture Distribution.
IEEE Trans. Image Process., 2025

CT-PCQA: A Convolutional Neural Network and Transformer combined Method for Point Cloud Quality Assessment.
Signal Process. Image Commun., 2025

A study on the user viewing experience of implanted advertisement videos based on visual saliency.
Expert Syst. Appl., 2025

A deep learning approach for music visualization: From audio features to descriptive video generation.
Displays, 2025

LPerceptual Quality Assessment of AI Generated Content Videos: a Dataset and Benchmark.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Visual Saliency Prediction for Augmented Reality Videos.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Machine Vision Quality Assessment for Image Restoration.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

A Dataset and Method for Assessing the Quality of Display Devices.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

HazeCLIP: Towards Language Guided Real-World Image Dehazing.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Explore the Hallucination on Low-level Perception for MLLMs.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Image Quality Assessment: From Human to Machine Preference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

FineVQ: Fine-Grained User Generated Content Video Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

An Empirical Study for Efficient Video Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025


Redundancy Principles for MLLMs Benchmarks.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Boosting power line inspection in bad weather: Removing weather noise with channel-spatial attention-based UNet.
Multim. Tools Appl., December, 2024

Continuous and Overall Quality of Experience Evaluation for Streaming Video Based on Rich Features Exploration and Dual-Stage Attention.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Hidden Barcode in Sub-Images with Invisible Locating Marker.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

Quality-of-Experience Evaluation for Digital Twins in 6G Network Environments.
IEEE Trans. Broadcast., September, 2024

AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

GMS-3DQA: Projection-Based Grid Mini-patch Sampling for 3D Model Quality Assessment.
ACM Trans. Multim. Comput. Commun. Appl., June, 2024

Un-Gaze: A Unified Transformer for Joint Gaze-Location and Gaze-Object Detection.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

MTCAM: A Novel Weakly-Supervised Audio-Visual Saliency Prediction Model With Multi-Modal Transformer.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2024

Synergetic Assessment of Quality and Aesthetic: Approach and Comprehensive Benchmark Dataset.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Blind Image Quality Assessment: A Fuzzy Neural Network for Opinion Score Distribution Prediction.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio.
IEEE Trans. Multim., 2024

Pixel-Learnable 3DLUT With Saturation-Aware Compensation for Image Enhancement.
IEEE Trans. Multim., 2024

How is Visual Attention Influenced by Text Guidance? Database and Model.
IEEE Trans. Image Process., 2024

BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment.
IEEE Trans. Circuits Syst. Video Technol., 2024

ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos.
CoRR, 2024

Video Quality Assessment: A Comprehensive Survey.
CoRR, 2024

Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment.
CoRR, 2024

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric.
CoRR, 2024

MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis.
CoRR, 2024

VQA<sup>2</sup>: Visual Question Answering for Video Quality Assessment.
CoRR, 2024

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
CoRR, 2024

AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results.
CoRR, 2024

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs.
CoRR, 2024

Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming.
CoRR, 2024

Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending.
CoRR, 2024

LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models.
CoRR, 2024

Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model.
CoRR, 2024

UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content.
CoRR, 2024

CMC-Bench: Towards a New Paradigm of Visual Signal Compression.
CoRR, 2024

GAIA: Rethinking Action Quality Assessment for AI-Generated Videos.
CoRR, 2024

Enhancing Blind Video Quality Assessment with Rich Quality-aware Features.
CoRR, 2024

Dual-Branch Network for Portrait Image Quality Assessment.
CoRR, 2024

Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning.
CoRR, 2024

Compression-Realized Deep Structural Network for Video Quality Enhancement.
CoRR, 2024

fMRI Exploration of Visual Quality Assessment.
CoRR, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.
CoRR, 2024

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results.
CoRR, 2024

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results.
CoRR, 2024

Perceptual video quality assessment: a survey.
Sci. China Inf. Sci., 2024

ReLI-QA: A Multidimensional Quality Assessment Dataset for Relighted Human Heads.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

End-to-end Prediction of Streaming Video Quality of Experience: Dataset and Approach.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

ACIQA: A Dataset and Method for Assessing the Imaging Quality of Automotive Cameras.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Perceptual Skin Tone Color Difference Measurement for Portrait Photography.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

GAIA: Rethinking Action Quality Assessment for AI-Generated Videos.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Subjective and Objective Quality-of-Experience Assessment for 3D Talking Heads.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Large Multi-modality Model Assisted AI-Generated Image Quality Assessment.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DSA-QoE: Quality of Experience Evaluation for Streaming Video Based on Dual-Stage Attention.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

Multidimensional Similarity Fusion for Speech Quality Assessment.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

PrefIQA: Human Preference Learning for AI-generated Image Quality Assessment.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

Calculating Color Differences of Images via Siamese Neural Network.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

FS-BAND: A Frequency-Sensitive Banding Detector.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Q-Boost: On Visual Quality Assessment Ability of Low-Level Multi-Modality Foundation Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Optimizing Projection-Based Point Cloud Quality Assessment with Human Preferred Viewpoints Selection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Thqa: A Perceptual Quality Assessment Database for Talking Heads.
Proceedings of the IEEE International Conference on Image Processing, 2024

AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images.
Proceedings of the IEEE International Conference on Image Processing, 2024

SG-JND: Semantic-Guided Just Noticeable Distortion Predictor for Image Compression.
Proceedings of the IEEE International Conference on Image Processing, 2024

A Reduced-Reference Quality Assessment Metric for Textured Mesh Digital Humans.
Proceedings of the IEEE International Conference on Acoustics, 2024

AVSal: Enhancing Video Saliency Prediction Through Audio-Visual Fusion and Temporal Aggregation.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

GLARE: Low Light Image Enhancement via Generative Latent Feature Based Codebook Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024



AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Compression-RQ-VQA: Leveraging Rich Quality-Aware Features for Compressed Video Quality Assessment.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024


UniProcessor: A Text-Induced Unified Low-Level Image Processor.
Proceedings of the Computer Vision - ECCV 2024, 2024

SR-VQA: Super-Resolution Video Quality Assessment Model.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024



2023
A Deep Learning-Based Multidimensional Aesthetic Quality Assessment Method for Mobile Game Images.
IEEE Trans. Games, December, 2023

Inverse-tone-mapped HDR video quality assessment: A new dataset and benchmark.
Displays, December, 2023

A no-reference quality assessment metric for dynamic 3D digital human.
Displays, December, 2023

Blind Image Quality Assessment for Pathological Microscopic Image Under Screen and Immersion Scenarios.
IEEE Trans. Medical Imaging, November, 2023

Blind Quality Assessment for in-the-Wild Images via Hierarchical Feature Fusion and Iterative Mixed Database Training.
IEEE J. Sel. Top. Signal Process., November, 2023

Attentive Deep Image Quality Assessment for Omnidirectional Stitching.
IEEE J. Sel. Top. Signal Process., November, 2023

Audio-visual aligned saliency model for omnidirectional video with implicit neural representation learning.
Appl. Intell., October, 2023

Toward a No-Reference Quality Metric for Camera-Captured Images.
IEEE Trans. Cybern., June, 2023

Image Quality Score Distribution Prediction via Alpha Stable Model.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Deep Neural Network for Blind Visual Quality Assessment of 4K Content.
IEEE Trans. Broadcast., June, 2023

Perceptual quality assessment for fine-grained compressed images.
J. Vis. Commun. Image Represent., February, 2023

A Novel Lightweight Audio-visual Saliency Model for Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Toward Visual Behavior and Attention Understanding for Augmented 360 Degree Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Blind Image Quality Assessment via Cross-View Consistency.
IEEE Trans. Multim., 2023

RIVIE: Robust Inherent Video Information Embedding.
IEEE Trans. Multim., 2023

Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition.
IEEE Trans. Multim., 2023

Subjective and Objective Audio-Visual Quality Assessment for User Generated Content.
IEEE Trans. Image Process., 2023

Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment.
IEEE Trans. Image Process., 2023

Implicit Neural Representation Learning for Hyperspectral Image Super-Resolution.
IEEE Trans. Geosci. Remote. Sens., 2023

Decoupled dynamic group equivariant filter for saliency prediction on omnidirectional image.
Neurocomputing, 2023

Human attention based movie summarization: Dataset and baseline model.
Neurocomputing, 2023

Exploring the Naturalness of AI-Generated Images.
CoRR, 2023

FS-BAND: A Frequency-Sensitive Banding Detector.
CoRR, 2023

Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment.
CoRR, 2023

Joint Gaze-Location and Gaze-Object Detection.
CoRR, 2023

NTIRE 2023 Quality Assessment of Video Enhancement Challenge.
CoRR, 2023

GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment.
CoRR, 2023

Masked Autoencoders as Image Processors.
CoRR, 2023

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images.
CoRR, 2023

Split-Conv: A Resource-efficient Compression Method for Image Quality Assessment Models.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Perceptual Quality Assessment for Video Frame Interpolation.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

The Influence of Text-guidance on Visual Attention.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

DDH-QA: A Dynamic Digital Humans Quality Assessment Database.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

EEP-3DQA: Efficient and Effective Projection-Based 3D Model Quality Assessment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

A Perceptual Quality Assessment Exploration for AIGC Images.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

BH-VQA: Blind High Frame Rate Video Quality Assessment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

A No-Reference Quality Assessment Method for Digital Human Head.
Proceedings of the IEEE International Conference on Image Processing, 2023

Geometry-Aware Video Quality Assessment for Dynamic Digital Human.
Proceedings of the IEEE International Conference on Image Processing, 2023

Audio-Visual Quality Assessment for User Generated Content: Database and Method.
Proceedings of the IEEE International Conference on Image Processing, 2023

Audio-Visual Saliency for Omnidirectional Videos.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Perceptual Quality Assessment for Digital Human Heads.
Proceedings of the IEEE International Conference on Acoustics, 2023

MV-VVQA: Multi-View Learning for No-Reference Volumetric Video Quality Assessment.
Proceedings of the 31st European Signal Processing Conference, 2023

MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


VDPVE: VQA Dataset for Perceptual Video Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Perceptual Quality Assessment of Omnidirectional Audio-Visual Signals.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

AIGCIQA2023: A Large-Scale Image Quality Assessment Database for AI Generated Images: From the Perspectives of Quality, Authenticity and Correspondence.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

2022
Transformed Saliency Dataset.
Dataset, May, 2022

QoE Driven VR 360° Video Massive MIMO Transmission.
IEEE Trans. Wirel. Commun., 2022

SMGEA: A New Ensemble Adversarial Attack Powered by Long-Term Gradient Memories.
IEEE Trans. Neural Networks Learn. Syst., 2022

Dynamic Backlight Scaling Considering Ambient Luminance for Mobile Videos on LCD Displays.
IEEE Trans. Mob. Comput., 2022

HazDesNet: An End-to-End Network for Haze Density Prediction.
IEEE Trans. Intell. Transp. Syst., 2022

Confusing Image Quality Assessment: Toward Better Augmented Reality Experience.
IEEE Trans. Image Process., 2022

RIHOOP: Robust Invisible Hyperlinks in Offline and Online Photographs.
IEEE Trans. Cybern., 2022

Viewing Behavior Supported Visual Saliency Predictor for 360 Degree Videos.
IEEE Trans. Circuits Syst. Video Technol., 2022

No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models.
IEEE Trans. Circuits Syst. Video Technol., 2022

Calculation of ophthalmic diagnostic parameters on a single eye image based on deep neural network.
Multim. Tools Appl., 2022

A brief survey on adaptive video streaming quality assessment.
J. Vis. Commun. Image Represent., 2022

Screen Content Quality Assessment: Overview, Benchmark, and Beyond.
ACM Comput. Surv., 2022

Perceptual Quality Assessment for Digital Human Heads.
CoRR, 2022

Treating Point Cloud as Moving Camera Videos: A No-Reference Quality Assessment Metric.
CoRR, 2022

A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences.
CoRR, 2022

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency.
CoRR, 2022

Perceptual Quality Assessment for Fine-Grained Compressed Images.
CoRR, 2022

Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation.
CoRR, 2022

Confusing Image Quality Assessment: Towards Better Augmented Reality Experience.
CoRR, 2022

Parameterized Image Quality Score Distribution Prediction.
CoRR, 2022

Distinguishing Computer-Generated Images from Photographic Images: a Texture-Aware Deep Learning-Based Method.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

MRIQA: Subjective Method and Objective Model for Magnetic Resonance Image Quality Assessment.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Video-based Human-Object Interaction Detection from Tubelet Tokens.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Subjective Quality Assessment for Images Generated by Computer Graphics.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

A Full- Reference Quality Assessment Metric for Cartoon Images.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

A Deep Learning based No-reference Quality Assessment Model for UGC Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Image Quality Assessment: From Mean Opinion Score to Opinion Score Distribution.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Saliency in Augmented Reality.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

A No-Reference Deep Learning Quality Assessment Method for Super-Resolution Images Based on Frequency Maps.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

A Lightweight Segmentation Network Based on Weak Supervision for COVID-19 Detection.
Proceedings of the Digital Multimedia Communications - The 19th International Forum, 2022

MSPP-IQA: Adaptive Blind Image Quality Assessment Based on Multi-level Spatial Pyramid Pooling.
Proceedings of the Digital Multimedia Communications - The 19th International Forum, 2022

Perceptual Quality Assessment of TTS-Synthesized Speech.
Proceedings of the Digital Multimedia Communications - The 19th International Forum, 2022

Surveillance Video Quality Assessment Based on Quality Related Retraining.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

A Unified Two-Stage Model for Separating Superimposed Images.
Proceedings of the IEEE International Conference on Acoustics, 2022

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows.
Proceedings of the Computer Vision - ECCV 2022, 2022

End-to-End Human-Gaze-Target Detection with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Invisible Markers for Hidden Codes in Offline-to-online Photography.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Where are the Children with Autism Looking in Reality?
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Blind Quality Assessment for in-the-Wild Images via Hierarchical Feature Fusion Strategy.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

Augmented Reality Image Quality Assessment Based on Visual Confusion Theory.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

2021
Learning a Deep Agent to Predict Head Movement in 360-Degree Images.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Perceptual Quality Assessment of Low-light Image Enhancement.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Comparative Perceptual Assessment of Visual Signals Using Free Energy Features.
IEEE Trans. Multim., 2021

Video Frame Interpolation and Enhancement via Pyramid Recurrent Framework.
IEEE Trans. Image Process., 2021

Quality Assessment of Free-Viewpoint Videos by Quantifying the Elastic Changes of Multi-Scale Motion Trajectories.
IEEE Trans. Image Process., 2021

Subjective and Objective Quality Assessment of Compressed Screen Content Videos.
IEEE Trans. Broadcast., 2021

Fine localization and distortion resistant detection of multi-class barcode in complex environments.
Multim. Tools Appl., 2021

Enhancing Decoding Rate of Barcode Decoders in Complex Scenes for IoT Systems.
IEEE Internet Things J., 2021

An Accurate and Efficient 1-D Barcode Detector for Medium of Deployment in IoT Systems.
IEEE Internet Things J., 2021

RANSP: Ranking attention network for saliency prediction on omnidirectional images.
Neurocomputing, 2021

Structured Computational Modeling of Human Visual System for No-reference Image Quality Assessment.
Int. J. Autom. Comput., 2021

No-reference screen content video quality assessment.
Displays, 2021

EAN: Event Adaptive Network for Enhanced Action Recognition.
CoRR, 2021

QoE Driven VR 360 Video Massive MIMO Transmission.
CoRR, 2021

Blind Quality Assessment for in-the-Wild Images via Hierarchical Feature Fusion and Iterative Mixed Database Training.
CoRR, 2021

A Full-Reference Quality Assessment Metric for Fine-Grained Compressed Images.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Inter-Observer Visual Congruency in Video-Viewing.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

A Multi-dimensional Aesthetic Quality Assessment Model for Mobile Game Images.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Lavs: A Lightweight Audio-Visual Saliency Prediction Model.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A Lightweight Saliency Prediction Model for Omnidirectional Images.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A No-Reference Evaluation Metric for Low-Light Image Enhancement.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A No-Reference Visual Quality Metric For 3D Color Meshes.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Deep Learning Based Full-Reference and No-Reference Quality Assessment Models for Compressed UGC Videos.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Attention Based Network For No-Reference UGC Video Quality Assessment.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Deep Audio-Visual Fusion Neural Network for Saliency Estimation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Accurate Compensation Makes the World More Clear for the Visually Impaired.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Modeling Image Quality Score Distribution Using Alpha Stable Model.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Muiqa: Image Quality Assessment Database And Algorithm For Medical Ultrasound Images.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Deep Neural Networks For Full-Reference And No-Reference Audio-Visual Quality Assessment.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Self-Conditioned Probabilistic Learning of Video Rescaling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Perceptual Quality Assessment for Recognizing True and Pseudo 4k Content.
Proceedings of the IEEE International Conference on Acoustics, 2021

Comparative Sharpness Evaluation for Mobile Phone Photos.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

2020
The Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images.
IEEE Trans. Multim., 2020

A Multimodal Saliency Model for Videos With High Audio-Visual Correspondence.
IEEE Trans. Image Process., 2020

Study of Subjective and Objective Quality Assessment of Audio-Visual Signals.
IEEE Trans. Image Process., 2020

A Metric for Light Field Reconstruction, Compression, and Display Quality Evaluation.
IEEE Trans. Image Process., 2020

How is Gaze Influenced by Image Transformations? Dataset and Model.
IEEE Trans. Image Process., 2020

A Wavelet-Predominant Algorithm Can Evaluate Quality of THz Security Image and Identify Its Usability.
IEEE Trans. Broadcast., 2020

DevsNet: Deep Video Saliency Network using Short-term and Long-term Cues.
Pattern Recognit., 2020

MC360IQA: A Multi-channel CNN for Blind 360-Degree Image Quality Assessment.
IEEE J. Sel. Top. Signal Process., 2020

Tiny-BDN: An Efficient and Compact Barcode Detection Network.
IEEE J. Sel. Top. Signal Process., 2020

Perceptual image quality assessment: a survey.
Sci. China Inf. Sci., 2020

Saliency Prediction on Omnidirectional Images with Brain-Like Shallow Neural Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Blind Stereoscopic Image Quality Assessment By Deep Neural Network Of Multi-Level Feature Fusion.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

A Multiple Attributes Image Quality Database for Smartphone Camera Photo Quality Assessment.
Proceedings of the IEEE International Conference on Image Processing, 2020

Automatic Region Selection For Objective Sharpness Assessment Of Mobile Device Photos.
Proceedings of the IEEE International Conference on Image Processing, 2020

Identifying Children with Autism Spectrum Disorder Based on Gaze-Following.
Proceedings of the IEEE International Conference on Image Processing, 2020

Blurry Video Frame Interpolation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
QA4Camera.
Dataset, November, 2019

Multi-Channel Decomposition in Tandem With Free-Energy Principle for Reduced-Reference Image Quality Assessment.
IEEE Trans. Multim., 2019

Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images.
IEEE Trans. Multim., 2019

Objective Quality Evaluation of Dehazed Images.
IEEE Trans. Intell. Transp. Syst., 2019

EMBDN: An Efficient Multiclass Barcode Detection Network for Complicated Environments.
IEEE Internet Things J., 2019

Free-energy principle inspired visual quality assessment: An overview.
Digit. Signal Process., 2019

Utility-oriented resource allocation for 360-degree video transmission over heterogeneous networks.
Digit. Signal Process., 2019

A Saliency Dataset of Head and Eye Movements for Augmented Reality.
CoRR, 2019

GazeGAN: A Generative Adversarial Saliency Model based on Invariance Analysis of Human Gaze During Scene Free Viewing.
CoRR, 2019

A dataset of eye movements for the children with autism spectrum disorder.
Proceedings of the 10th ACM Multimedia Systems Conference, 2019

MC360IQA: The Multi-Channel CNN for Blind 360-Degree Image Quality Assessment.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

A Reading Assistant System for Blind People Based on Hand Gesture Recognition.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2019

Fine Detection and Classification of Multi-class Barcode in Complex Environments.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Video-Based Early ASD Detection via Temporal Pyramid Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

LPHD: A Large-Scale Head Pose Dataset for RGB Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Evaluating Quality of Screen Content Images Via Structural Variation Analysis.
IEEE Trans. Vis. Comput. Graph., 2018

Blind Quality Assessment Based on Pseudo-Reference Image.
IEEE Trans. Multim., 2018

Blind Image Quality Estimation via Distortion Aggravation.
IEEE Trans. Broadcast., 2018

Partial-Reference Sonar Image Quality Assessment for Underwater Transmission.
IEEE Trans. Aerosp. Electron. Syst., 2018

The prediction of head and eye movement for 360 degree images.
Signal Process. Image Commun., 2018

Saliency-induced reduced-reference quality index for natural scene and screen content images.
Signal Process., 2018

Invariance Analysis of Saliency Models versus Human Gaze During Scene Free Viewing.
CoRR, 2018

Eye Fatigue Assessment Using Unobtrusive Eye Tracker.
IEEE Access, 2018

Perceptual Quality Assessment of Omnidirectional Images.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Learning to Predict where the Children with Asd Look.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

2017
Unified Blind Quality Assessment of Compressed Natural, Graphic, and Screen Content Images.
IEEE Trans. Image Process., 2017

A Fast Reliable Image Quality Predictor by Fusing Micro- and Macro-Structures.
IEEE Trans. Ind. Electron., 2017

Visual attention analysis and prediction on human faces.
Inf. Sci., 2017

Terahertz Security Image Quality Assessment by No-reference Model Observers.
CoRR, 2017

Assessment of Visually Induced Motion Sickness in Immersive Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Terahertz Security Image Quality Assessment by No-reference Model Observers.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Dynamic backlight scaling considering ambient luminance for mobile energy saving.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Fixation Prediction through Multimodal Analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Visual attention analysis and prediction on human faces with mole.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Blind quality assessment of compressed images via pseudo structural similarity.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

No-reference quality assessment for image sharpness and noise.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Fixation prediction through multimodal analysis.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Visual attention on human face.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Influence of Spatial Resolution on State-of-the-Art Saliency Models.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A hierarchical saliency detection approach for bokeh images.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

2014
Information security display via uncrowded window.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

DLP based anti-piracy display system.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Demo: DLP based anti-piracy display system.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Sound influences visual attention discriminately in videos.
Proceedings of the Sixth International Workshop on Quality of Multimedia Experience, 2014

Visual attention data for image quality assessment databases.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Information security display system based on temporal psychovisual modulation.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Influence of compression artifacts on visual attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Information security display system based on Spatial Psychovisual Modulation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Uncrowded window inspired information security display.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Dual-view medical image visualization based on spatial-temporal psychovisual modulation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Simultaneous dual-subtitles exhibition via Spatial Psychovisual Modulation.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

2013
Brightness preserving video contrast enhancement using S-shaped Transfer function.
Proceedings of the 2013 Visual Communications and Image Processing, 2013


  Loading...