Yan Wang

Orcid: 0000-0002-4953-2660

Affiliations:
  • East China Normal University, School of Data Science and Engineering, Shanghai, China (since 2025)
  • Fudan University, Academy for Engineering and Technology, Shanghai Engineering Research Center of AI & Robotics, Shanghai, China (PhD 2023)


According to our database1, Yan Wang authored at least 63 papers between 2012 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
EventVAD: Training-Free Event-Aware Video Anomaly Detection.
CoRR, April, 2025

AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process.
CoRR, March, 2025

TR-DQ: Time-Rotation Diffusion Quantization.
CoRR, March, 2025

Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection.
CoRR, February, 2025

A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems.
CoRR, February, 2025

In-Context Meta LoRA Generation.
CoRR, January, 2025

Observe finer to select better: Learning key frame extraction via semantic coherence for dynamic facial expression recognition in the wild.
Inf. Sci., 2025

A survey on RGB, 3D, and multimodal approaches for unsupervised industrial image anomaly detection.
Inf. Fusion, 2025

MambaIC: State Space Models for High-Performance Learned Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models.
ACM Comput. Surv., July, 2024

MGR<sup>3</sup>Net: Multigranularity Region Relation Representation Network for Facial Expression Recognition in Affective Robots.
IEEE Trans. Ind. Informatics, May, 2024

MSC-AD: A Multiscene Unsupervised Anomaly Detection Dataset for Small Defect Detection of Casting Surface.
IEEE Trans. Ind. Informatics, April, 2024

A hierarchical probabilistic underwater image enhancement model with reinforcement tuning.
J. Vis. Commun. Image Represent., 2024

P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision.
CoRR, 2024

GWQ: Gradient-Aware Weight Quantization for Large Language Models.
CoRR, 2024

A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection.
CoRR, 2024

RenderWorld: World Model with Self-Supervised 3D Label.
CoRR, 2024

A Survey on Facial Expression Recognition of Static and Dynamic Emotions.
CoRR, 2024

Hi-EF: Benchmarking Emotion Forecasting in Human-interaction.
CoRR, 2024

All rivers run into the sea: Unified Modality Brain-like Emotional Central Mechanism.
CoRR, 2024

From Efficient Multimodal Models to World Models: A Survey.
CoRR, 2024

Seeking Certainty In Uncertainty: Dual-Stage Unified Framework Solving Uncertainty in Dynamic Facial Expression Recognition.
CoRR, 2024

Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution.
CoRR, 2024

OUS: Scene-Guided Dynamic Facial Expression Recognition.
CoRR, 2024

AccidentBlip2: Accident Detection With Multi-View MotionBlip2.
CoRR, 2024

A<sup>3</sup>lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP.
CoRR, 2024

Mixed noise-guided mutual constraint framework for unsupervised anomaly detection in smart industries.
Comput. Commun., 2024

Empower smart cities with sampling-wise dynamic facial expression recognition via frame-sequence contrastive learning.
Comput. Commun., 2024

LCGen: Mining in Low-Certainty Generation for View-consistent Text-to-3D.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FD-UAD: Unsupervised Anomaly Detection Platform Based on Defect Autonomous Imaging and Enhancement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2024, 2024

Pixel-Level Semantic Correspondence Through Layout-Aware Representation Learning and Multi-Scale Matching Integration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Target and source modality co-reinforcement for emotion understanding from asynchronous multimodal sequences.
Knowl. Based Syst., April, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
CoRR, 2023

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models.
CoRR, 2023

A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Decision-based Sparse Attacks on Video Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
A systematic review on affective computing: emotion models, databases, and recent advances.
Inf. Fusion, 2022

Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization.
CoRR, 2022

DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Video Salient Object Detection via Point Supervision.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly-Supervised Salient Object Detection Using Point Supervison.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Enhancement of Underwater Images With Statistical Model of Background Light and Optimization of Transmission Map.
IEEE Trans. Broadcast., 2020

TCMINet: Face Parsing for Traditional Chinese Medicine Inspection via a Hybrid Neural Network With Context Aggregation.
IEEE Access, 2020

2019
An Experimental-Based Review of Image Enhancement and Image Restoration Methods for Underwater Imaging.
IEEE Access, 2019

Automatic Tongue Image Segmentation For Real-Time Remote Diagnosis.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

2018
A Rapid Scene Depth Estimation Model Based on Underwater Light Attenuation Prior for Underwater Image Restoration.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Shallow-Water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

2016
Towards a secure hybrid adaptive gateway discovery mechanism for intelligent transportation systems.
Secur. Commun. Networks, 2016

2012
Secure gateway localization and communication system for vehicular ad hoc networks.
Proceedings of the 2012 IEEE Global Communications Conference, 2012


  Loading...