Chao Huang

Orcid: 0000-0003-1490-2171

Affiliations:

Harbin Institute of Technology, Shenzhen, China
Ningbo University, Faculty of Information Science and Engineering, China (former)

According to our database, Chao Huang authored at least 68 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models.

CoRR, October, 2025

High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling.

CoRR, September, 2025

A Lesion-Fusion Neural Network for Multi-View Diabetic Retinopathy Grading.

IEEE J. Biomed. Health Informatics, May, 2025

ZeroSep: Separate Anything in Audio with Zero Training.

CoRR, May, 2025

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness.

CoRR, May, 2025

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought.

CoRR, May, 2025

The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability.

CoRR, April, 2025

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting.

CoRR, April, 2025

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1).

CoRR, April, 2025

FreSca: Unveiling the Scaling Space in Diffusion Models.

CoRR, April, 2025

Generative AI for Cel-Animation: A Survey.

CoRR, January, 2025

Multimodal Evidential Learning for Open-World Weakly-Supervised Video Anomaly Detection.

IEEE Trans. Multim., 2025

Optimal Graph Learning-Based Label Propagation for Cross-Domain Image Classification.

IEEE Trans. Image Process., 2025

Deep Label Propagation With Nuclear Norm Maximization for Visual Domain Adaptation.

IEEE Trans. Image Process., 2025

Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment.

IEEE Trans. Image Process., 2025

Prototype-guided and dynamic-aware video anomaly detection.

Chao Huang, Qianyi Li, Bob Zhang

Neural Networks, 2025

Deep Opinion-Unaware Blind Image Quality Assessment by Learning and Adapting from Multiple Annotators.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Towards VLM-based Hybrid Explainable Prompt Enhancement for Zero-Shot Industrial Anomaly Detection.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Omni-Dimensional State Space Model-driven SAM for Pixel-level Anomaly Detection.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Ex-VAD: Explainable Fine-grained Video Anomaly Detection Based on Visual-Language Models.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Multi-view Evidential Learning-based Medical Image Segmentation.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Information Recovery-Driven Deep Incomplete Multiview Clustering Network.

IEEE Trans. Neural Networks Learn. Syst., November, 2024

Weakly Supervised Video Anomaly Detection via Self-Guided Temporal Discriminative Transformer.

IEEE Trans. Cybern., May, 2024

Video-Based Fall Detection Using Human Pose and Constrained Generative Adversarial Network.

IEEE Trans. Circuits Syst. Video Technol., April, 2024

MLFA: Toward Realistic Test Time Adaptive Object Detection by Multi-Level Feature Alignment.

IEEE Trans. Image Process., 2024

Uncertainty-aware prototypical learning for anomaly detection in medical images.

Neural Networks, 2024

Scaling Concept With Text-Guided Diffusion Models.

CoRR, 2024

Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Optimal Graph Learning and Nuclear Norm Maximization for Deep Cross-Domain Robust Label Propagation.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Multimodal Representation Distribution Learning for Medical Image Segmentation.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Long Short-Term Dynamic Prototype Alignment Learning for Video Anomaly Detection.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Batch Singular Value Polarization and Weighted Semantic Augmentation for Universal Domain Adaptation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Partial Multi-View Multi-Label Classification via Semantic Invariance Learning and Prototype Modeling.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation.

Proceedings of the Computer Vision - ACCV 2024, 2024

High-Quality Visually-Guided Sound Separation from Diverse Categories.

Proceedings of the Computer Vision - ACCV 2024, 2024

HACDR-Net: Heterogeneous-Aware Convolutional Network for Diabetic Retinopathy Multi-Lesion Segmentation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Attention-Induced Embedding Imputation for Incomplete Multi-View Partial Multi-Label Classification.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection.

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Class-guided human motion prediction via multi-spatial-temporal supervision.

Neural Comput. Appl., May, 2023

Localized Sparse Incomplete Multi-View Clustering.

IEEE Trans. Multim., 2023

Robust fall detection in video surveillance based on weakly supervised learning.

Neural Networks, 2023

Video Understanding with Large Language Models: A Survey.

CoRR, 2023

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields.

CoRR, 2023

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models.

CoRR, 2023

Information Recovery-Driven Deep Incomplete Multi-view Clustering Network.

CoRR, 2023

AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized and Balanced Efficient Incomplete Multi-view Clustering.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Highly Confident Local Structure Based Consensus Graph Learning for Incomplete Multi-view Clustering.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Egocentric Audio-Visual Object Localization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DICNet: Deep Instance-Level Contrastive Network for Double Incomplete Multi-View Multi-Label Classification.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Unsupervised Decomposition and Correction Network for Low-Light Image Enhancement.

IEEE Trans. Intell. Transp. Syst., 2022

Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System.

IEEE Trans. Ind. Informatics, 2022

Self-Supervision-Augmented Deep Autoencoder for Unsupervised Visual Anomaly Detection.

IEEE Trans. Cybern., 2022

Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning.

IEEE Signal Process. Lett., 2022

Pixel-Level Anomaly Detection via Uncertainty-aware Prototypical Transformer.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Graph Embedded Pose Regularity Learning via Spatio-Temporal Transformer for Abnormal Behavior Detection.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Deep Object Detection with Example Attribute Based Prediction Modulation.

Proceedings of the IEEE International Conference on Acoustics, 2022

How to Prepare for the Next Pandemic - Investigation of Correlation Between Food Prices and COVID-19 From Global and Local Perspectives.

Yufei Zhao, Chao Huang, Jiebo Luo

Proceedings of the IEEE International Conference on Big Data, 2022

2021

Online Learning-Based Multi-Stage Complexity Control for Live Video Coding.

IEEE Trans. Image Process., 2021

Inter-layer correlation-based adaptive bit allocation for enhancement layer in scalable high efficiency video coding.

Signal Process. Image Commun., 2021

2019

Multiple classifier-based fast coding unit partition for intra coding in future video coding.

Signal Process. Image Commun., 2019

Encoding Complexity Control for Live Video Applications: An Interpretable Machine Learning Approach.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018

Efficient CU and PU Decision Based on Neural Network and Gray Level Co-Occurrence Matrix for Intra Prediction of Screen Content Coding.

IEEE Access, 2018
