Chao Huang

Orcid: 0009-0006-2244-638X

Affiliations:
  • Harbin Institute of Technology, Shenzhen, China
  • Ningbo University, Faculty of Information Science and Engineering, China (former)


According to our database1, Chao Huang authored at least 61 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Lesion-Fusion Neural Network for Multi-View Diabetic Retinopathy Grading.
IEEE J. Biomed. Health Informatics, May, 2025

ZeroSep: Separate Anything in Audio with Zero Training.
CoRR, May, 2025

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness.
CoRR, May, 2025

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought.
CoRR, May, 2025

The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability.
CoRR, April, 2025

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting.
CoRR, April, 2025

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1).
CoRR, April, 2025

FreSca: Unveiling the Scaling Space in Diffusion Models.
CoRR, April, 2025

Generative AI for Cel-Animation: A Survey.
CoRR, January, 2025

Multimodal Evidential Learning for Open-World Weakly-Supervised Video Anomaly Detection.
IEEE Trans. Multim., 2025

Optimal Graph Learning-Based Label Propagation for Cross-Domain Image Classification.
IEEE Trans. Image Process., 2025

Deep Label Propagation With Nuclear Norm Maximization for Visual Domain Adaptation.
IEEE Trans. Image Process., 2025

Prototype-guided and dynamic-aware video anomaly detection.
Neural Networks, 2025

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Multi-view Evidential Learning-based Medical Image Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Information Recovery-Driven Deep Incomplete Multiview Clustering Network.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Weakly Supervised Video Anomaly Detection via Self-Guided Temporal Discriminative Transformer.
IEEE Trans. Cybern., May, 2024

Video-Based Fall Detection Using Human Pose and Constrained Generative Adversarial Network.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

MLFA: Toward Realistic Test Time Adaptive Object Detection by Multi-Level Feature Alignment.
IEEE Trans. Image Process., 2024

Uncertainty-aware prototypical learning for anomaly detection in medical images.
Neural Networks, 2024

Scaling Concept With Text-Guided Diffusion Models.
CoRR, 2024

Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Optimal Graph Learning and Nuclear Norm Maximization for Deep Cross-Domain Robust Label Propagation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Multimodal Representation Distribution Learning for Medical Image Segmentation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Long Short-Term Dynamic Prototype Alignment Learning for Video Anomaly Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Batch Singular Value Polarization and Weighted Semantic Augmentation for Universal Domain Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Partial Multi-View Multi-Label Classification via Semantic Invariance Learning and Prototype Modeling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation.
Proceedings of the Computer Vision - ACCV 2024, 2024

High-Quality Visually-Guided Sound Separation from Diverse Categories.
Proceedings of the Computer Vision - ACCV 2024, 2024

HACDR-Net: Heterogeneous-Aware Convolutional Network for Diabetic Retinopathy Multi-Lesion Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Attention-Induced Embedding Imputation for Incomplete Multi-View Partial Multi-Label Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Class-guided human motion prediction via multi-spatial-temporal supervision.
Neural Comput. Appl., May, 2023

Localized Sparse Incomplete Multi-View Clustering.
IEEE Trans. Multim., 2023

Robust fall detection in video surveillance based on weakly supervised learning.
Neural Networks, 2023

Video Understanding with Large Language Models: A Survey.
CoRR, 2023

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields.
CoRR, 2023

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models.
CoRR, 2023

Information Recovery-Driven Deep Incomplete Multi-view Clustering Network.
CoRR, 2023

AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized and Balanced Efficient Incomplete Multi-view Clustering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Highly Confident Local Structure Based Consensus Graph Learning for Incomplete Multi-view Clustering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Egocentric Audio-Visual Object Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DICNet: Deep Instance-Level Contrastive Network for Double Incomplete Multi-View Multi-Label Classification.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Unsupervised Decomposition and Correction Network for Low-Light Image Enhancement.
IEEE Trans. Intell. Transp. Syst., 2022

Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System.
IEEE Trans. Ind. Informatics, 2022

Self-Supervision-Augmented Deep Autoencoder for Unsupervised Visual Anomaly Detection.
IEEE Trans. Cybern., 2022

Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning.
IEEE Signal Process. Lett., 2022

Pixel-Level Anomaly Detection via Uncertainty-aware Prototypical Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Graph Embedded Pose Regularity Learning via Spatio-Temporal Transformer for Abnormal Behavior Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Deep Object Detection with Example Attribute Based Prediction Modulation.
Proceedings of the IEEE International Conference on Acoustics, 2022

How to Prepare for the Next Pandemic - Investigation of Correlation Between Food Prices and COVID-19 From Global and Local Perspectives.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Online Learning-Based Multi-Stage Complexity Control for Live Video Coding.
IEEE Trans. Image Process., 2021

Inter-layer correlation-based adaptive bit allocation for enhancement layer in scalable high efficiency video coding.
Signal Process. Image Commun., 2021

2019
Multiple classifier-based fast coding unit partition for intra coding in future video coding.
Signal Process. Image Commun., 2019

Encoding Complexity Control for Live Video Applications: An Interpretable Machine Learning Approach.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Efficient CU and PU Decision Based on Neural Network and Gray Level Co-Occurrence Matrix for Intra Prediction of Screen Content Coding.
IEEE Access, 2018


  Loading...