Yahong Han

Orcid: 0000-0003-2768-1398

According to our database1, Yahong Han authored at least 189 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Cascade & allocate: A cross-structure adversarial attack against models fusing vision and language.
Inf. Fusion, April, 2024

Joint Correcting and Refinement for Balanced Low-Light Image Enhancement.
IEEE Trans. Multim., 2024

Generalizing to Out-of-Sample Degradations via Model Reprogramming.
IEEE Trans. Image Process., 2024

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization.
CoRR, 2024

Multi-Source Collaborative Gradient Discrepancy Minimization for Federated Domain Generalization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Improving transferable adversarial attack for vision transformers via global attention and local drop.
Multim. Syst., December, 2023

Source-free and black-box domain adaptation via distributionally adversarial training.
Pattern Recognit., November, 2023

Dynamic parameterized learning for unsupervised domain adaptation.
Frontiers Inf. Technol. Electron. Eng., November, 2023

Weakly supervised anomaly detection with multi-level contextual modeling.
Multim. Syst., August, 2023

Domain-specific feature elimination: multi-source domain adaptation for image classification.
Frontiers Comput. Sci., August, 2023

Multi-Source Collaborative Contrastive Learning for Decentralized Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Active and Compact Entropy Search for High-Dimensional Bayesian Optimization.
IEEE Trans. Knowl. Data Eng., 2023

Query-Efficient Black-Box Adversarial Attack With Customized Iteration and Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Weighted progressive alignment for multi-source domain adaptation.
Multim. Syst., 2023

Weakly-Supervised Video Anomaly Detection with Snippet Anomalous Attention.
CoRR, 2023

Joint Correcting and Refinement for Balanced Low-Light Image Enhancement.
CoRR, 2023

A Cross-modal and Redundancy-reduced Network for Weakly-Supervised Audio-Visual Violence Detection.
Proceedings of the ACM Multimedia Asia 2023, 2023

Saliency Prototype for RGB-D and RGB-T Salient Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

OraclePoints: A Hybrid Neural Representation for Oracle Character.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Uncertainty-Aware Variate Decomposition for Self-supervised Blind Image Deblurring.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Discriminative and Contrastive Consistency for Semi-supervised Domain Adaptive Image Classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Exploring Instance Relation for Decentralized Multi-Source Domain Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Reliable and Interpretable Personalized Federated Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Curiosity-Driven Salient Object Detection With Fragment Attention.
IEEE Trans. Image Process., 2022

Action Keypoint Network for Efficient Video Recognition.
IEEE Trans. Image Process., 2022

Image Translation for Oracle Bone Character Interpretation.
Symmetry, 2022

Effective full-scale detection for salient object based on condensing-and-filtering network.
Pattern Recognit., 2022

Instance-Invariant Domain Adaptive Object Detection Via Progressive Disentanglement.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multi-attribute object detection benchmark for smart city.
Multim. Syst., 2022

Complementary spatiotemporal network for video question answering.
Multim. Syst., 2022

Dual collaboration for decentralized multi-source domain adaptation.
Frontiers Inf. Technol. Electron. Eng., 2022

Exploring uncertainty in regression neural networks for construction of prediction intervals.
Neurocomputing, 2022

Instance-sequence reasoning for video question answering.
Frontiers Comput. Sci., 2022

Unidirectional RGB-T salient object detection with intertwined driving of encoding and fusion.
Eng. Appl. Artif. Intell., 2022

Prototype-guided Cross-task Knowledge Distillation for Large-scale Models.
CoRR, 2022

Peng Cheng Object Detection Benchmark for Smart City.
CoRR, 2022

Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Granularity Semantic Clues Extraction for Video Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Mining Valuable Source Domain Instances for Privacy-Preserving Domain Adaptive Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Hierarchical Recurrent Contextual Attention Network for Video Question Answering.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Maintaining Structural Information by Pairwise Similarity for Unsupervised Domain Adaptation.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Logic Rule Guided Attribution with Dynamic Ablation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Hierarchical Memory Decoder for Visual Narrating.
IEEE Trans. Circuits Syst. Video Technol., 2021

Deep multi-scale and multi-modal fusion for 3D object detection.
Pattern Recognit. Lett., 2021

An Evolutionary-Based Black-Box Attack to Deep Neural Network Classifiers.
Mob. Networks Appl., 2021

Visual commonsense reasoning with directional visual connections.
Frontiers Inf. Technol. Electron. Eng., 2021

Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal.
CoRR, 2021

Black-box Probe for Unsupervised Domain Adaptation without Model Transferring.
CoRR, 2021

Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings.
CoRR, 2021

Exploring Uncertainty in Deep Learning for Construction of Prediction Intervals.
CoRR, 2021

Universal-Prototype Augmentation for Few-Shot Object Detection.
CoRR, 2021

Locating Visual Explanations for Video Question Answering.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

WAB'21: 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Video-to-Image Casting: A Flatting Method for Video Analysis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Graph-in-Graph Contrastive Learning for Semi-Supervised Adaptation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Zero Knowledge Adversarial Defense Via Iterative Translation Cycle.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Free Adversarial Training with Layerwise Heuristic Learning.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Adversarial Attack with KD-Tree Searching on Training Set.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings.
Proceedings of the IEEE International Conference on Data Mining, 2021

Vector-Decomposed Disentanglement for Domain-Invariant Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Universal-Prototype Enhancing for Few-Shot Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Discern Depth Under Foul Weather: Estimate PM<sub>2.5</sub> for Depth Inference.
IEEE Trans. Ind. Informatics, 2020

Convolutional Reconstruction-to-Sequence for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Movie Question Answering via Textual Memory and Plot Graph.
IEEE Trans. Circuits Syst. Video Technol., 2020

Sequence in sequence for video captioning.
Pattern Recognit. Lett., 2020

Adaptive iterative attack towards explainable adversarial robustness.
Pattern Recognit., 2020

Multi-Modal fusion with multi-level attention for Visual Dialog.
Inf. Process. Manag., 2020

Hierarchical Memory Decoding for Video Captioning.
CoRR, 2020

Bidirectional Adversarial Training for Semi-Supervised Domain Adaptation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Two-Way Feature-Aligned And Attention-Rectified Adversarial Training.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Video Anomaly Detection Via Predictive Autoencoder With Gradient-Based Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Extract and Merge: Superpixel Segmentation with Regional Attributes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Polishing Decision-Based Adversarial Noise With a Customized Sampling.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Speaker Video Dialog with Frame-Level Temporal Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Reasoning with Heterogeneous Graph Alignment for Video Question Answering.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Introduction to the Special Issue on the Cross-Media Analysis for Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Semisupervised Regression With Optimized Rank for Matrix Data Classification.
IEEE Trans. Cybern., 2019

Image captioning: from structural tetrad to translated sentences.
Multim. Tools Appl., 2019

Multi-cue fusion: Discriminative enhancing for person re-identification.
J. Vis. Commun. Image Represent., 2019

Detecting adversarial examples via prediction difference for deep neural networks.
Inf. Sci., 2019

DCT-CNN-based classification method for the Gongbi and Xieyi techniques of Chinese ink-wash paintings.
Neurocomputing, 2019

Capturing the spatio-temporal continuity for video semantic segmentation.
IET Image Process., 2019

A feature selection framework for video semantic recognition via integrated cross-media analysis and embedded learning.
EURASIP J. Image Video Process., 2019

Convolutional Neural Network Style Transfer Towards Chinese Paintings.
IEEE Access, 2019

Connective Cognition Network for Directional Visual Commonsense Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Ranking Video Salient Object Detection.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Hierarchical Variational Network for User-Diversified & Query-Focused Video Summarization.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Video Interactive Captioning with Human Prompts.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Untargeted Adversarial Attack via Expanding the Semantic Gap.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Visual Dialog with Targeted Objects.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Multi-Timescale Context Encoding for Scene Parsing Prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

3D Shape Retrieval through Multilayer RBF Neural Network.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Curls & Whey: Boosting Black-Box Adversarial Attacks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Adaptive Sparse Confidence-Weighted Learning for Online Feature Selection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Sequential Video VLAD: Training the Aggregation Locally and Temporally.
IEEE Trans. Image Process., 2018

Pooling the Convolutional Layers in Deep ConvNets for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Distribution Sensitive Product Quantization.
IEEE Trans. Circuits Syst. Video Technol., 2018

Discriminative multi-task multi-view feature selection and fusion for multimedia analysis.
Multim. Tools Appl., 2018

Understanding the effective receptive field in semantic image segmentation.
Multim. Tools Appl., 2018

Guest Editorial: Spatio-temporal Feature Learning for Unconstrained Video Analysis.
Multim. Tools Appl., 2018

Multi-task CNN Model for Action Detection.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Sequential Feature Fusion for Object Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

VAL: Visual-Attention Action Localizer.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Spotting and Aggregating Salient Regions for Video Captioning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Explore Multi-Step Reasoning in Video Question Answering.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

HeterStyle: A Heterogeneous Video Style Transfer Application.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Explore Multi-Step Reasoning in Video Question Answering.
Proceedings of the 1st Workshop and Challenge on Comprehensive Video Understanding in the Wild, 2018

Multi-modal Circulant Fusion for Video-to-Language and Backward.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Universal Perturbation Generation for Black-box Attack Using Evolutionary Algorithms.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Image-based Air Pollution Estimation Using Hybrid Convolutional Neural Network.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Schmidt: Image Augmentation for Black-Box Adversarial Attack.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Image-Based PM2.5 Estimation and its Application on Depth Estimation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Movie Question Answering: Remembering the Textual Cues for Layered Visual Contents.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Semisupervised Online Multikernel Similarity Learning for Image Retrieval.
IEEE Trans. Multim., 2017

Semi-Supervised Image-to-Video Adaptation for Video Action Recognition.
IEEE Trans. Cybern., 2017

Semi-supervised tensor learning for image classification.
Multim. Syst., 2017

Guest Editorial: Intermediate representation for vision and multimedia applications.
J. Vis. Commun. Image Represent., 2017

Efficient and Robust Lane Detection Using Three-Stage Feature Extraction with Line Fitting.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Spatio-Temporal Context Networks for Video Question Answering.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multirate Multimodal Video Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Catching the Temporal Regions-of-Interest for Video Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Top attention in line with time: A light-weight strategy.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Choose the Largest Contributor: A Fusion Coefficient Learning Network for Semantic Segmentation.
Proceedings of the Internet Multimedia Computing and Service, 2017

Joint Deep Learning and Gaussian Representation for Person Re-identification.
Proceedings of the Internet Multimedia Computing and Service, 2017

Initialized Frame Attention Networks for Video Question Answering.
Proceedings of the Internet Multimedia Computing and Service, 2017

Video Question Answering Using a Forget Memory Network.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

2016
Guest editorial: web multimedia semantic inference using multi-cues.
World Wide Web, 2016

Sketch4Image: a novel framework for sketch-based image retrieval based on product quantization with coding residuals.
Multim. Tools Appl., 2016

Image attribute learning with ontology guided fused lasso.
Multim. Tools Appl., 2016

Tucker decomposition-based tensor learning for human action recognition.
Multim. Syst., 2016

Semi-supervised feature selection via hierarchical regression for web image classification.
Multim. Syst., 2016

Semi-supervised image clustering with multi-modal information.
Multim. Syst., 2016

Combining neighborhood separable subspaces for classification via sparsity regularized optimization.
Inf. Sci., 2016

Hierarchical support vector machine based structural classification with fused hierarchies.
Neurocomputing, 2016

Cluster structure preserving unsupervised feature selection for multi-view tasks.
Neurocomputing, 2016

Guest editorial: Adaptation methods for multimedia analysis.
Neurocomputing, 2016

Describing Images with Ontology-Aware Dictionary Learning.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Large-Scale E-Commerce Image Retrieval with Top-Weighted Convolutional Neural Networks.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Describing images by feeding LSTM with structural words.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

TSMV: Task-Specific Multi-View Feature Learning.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

2015
Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2015

Compact and Discriminative Descriptor Inference Using Multi-Cues.
IEEE Trans. Image Process., 2015

Robust Face Clustering Via Tensor Decomposition.
IEEE Trans. Cybern., 2015

An Object-Level High-Order Contextual Descriptor Based on Semantic, Spatial, and Scale Cues.
IEEE Trans. Cybern., 2015

Guest Editorial: Ad Hoc Web Multimedia Analysis with Limited Supervision.
Multim. Tools Appl., 2015

Image aesthetics enhancement using composition-based saliency detection.
Multim. Syst., 2015

Tensor rank selection for multimedia analysis.
J. Vis. Commun. Image Represent., 2015

Pooling the Convolutional Layers in Deep ConvNets for Action Recognition.
CoRR, 2015

Supervised Dictionary Learning Based on Relationship Between Edges and Levels.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Summarization-based Video Caption via Deep Neural Networks.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Describing Images with Hierarchical Concepts and Object Class Localization.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Inferring Painting Style with Multi-Task Dictionary Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Discriminative multi-view feature selection and fusion.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Multi-layer supervised dictionary learning for visual classification.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Exploiting the locality information of dense trajectory feature for human action recognition.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

2014
Image Attribute Adaptation.
IEEE Trans. Multim., 2014

Augmenting Image Descriptions Using Structured Prediction Output.
IEEE Trans. Multim., 2014

Regularity Preserved Superpixels and Supervoxels.
IEEE Trans. Multim., 2014

Feature selection with spatial path coding for multimedia analysis.
Inf. Sci., 2014

Image decomposing for inpainting using compressed sensing in DCT domain.
Frontiers Comput. Sci., 2014

What Can We Learn about Motion Videos from Still Images?
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Augmented Image Retrieval using Multi-order Object Layout with Attributes.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Attribute prediction with long-range interactions via path coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Video Segmentation via Adaptive Higher-Order CRF with Windowed Dynamics.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

A Real-World Web Cross-Media Dataset Containing Images, Texts and Videos.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Locality Preserving Hashing Method for Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Output Feature Augmented Lasso.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

2013
Discovering Discriminative Graphlets for Aerial Image Categories Recognition.
IEEE Trans. Image Process., 2013

Image classification with manifold learning for out-of-sample data.
Signal Process., 2013

Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation.
Comput. Vis. Image Underst., 2013

Object coding on the semantic graph for scene classification.
Proceedings of the ACM Multimedia Conference, 2013

Co-Regularized Ensemble for Feature Selection.
Proceedings of the IJCAI 2013, 2013

Robust Tensor Clustering with Non-Greedy Maximization.
Proceedings of the IJCAI 2013, 2013

Visual saliency detection based on photographic composition.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Discriminative Multi-Task Feature Selection.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2012
Image Annotation by Input-Output Structural Grouping Sparsity.
IEEE Trans. Image Process., 2012

Sparse Unsupervised Dimensionality Reduction for Multiple View Data.
IEEE Trans. Circuits Syst. Video Technol., 2012

The heterogeneous feature selection with structural sparsity for multimedia annotation and hashing: a survey.
Int. J. Multim. Inf. Retr., 2012

Correlated attribute transfer with multi-task graph-guided fusion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Graph-guided sparse reconstruction for region tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Stable multi-label boosting for image annotation with structural feature selection.
Sci. China Inf. Sci., 2011

Multi-label Image Annotation by Structural Grouping Sparsity.
Proceedings of the Social Media Modeling and Computing., 2011

2010
Multi-Label Transfer Learning With Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2010

Multiple hypergraph ranking for video concept detection.
J. Zhejiang Univ. Sci. C, 2010

Multiple Hypergraph Clustering of Web Images by MiningWord2Image Correlations.
J. Comput. Sci. Technol., 2010

Multi-label boosting for image annotation by structural grouping sparsity.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multi-Task Sparse Discriminant Analysis (MtSDA) with Overlapping Categories.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Application of Apriori Algorithm in Oracle Bone Inscription Explication.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2006
s-HITSc: an improved model and algorithm for topic distillation on the Web.
Soft Comput., 2006


  Loading...