Yanbin Hao

Orcid: 0000-0002-0695-1566

According to our database1, Yanbin Hao authored at least 55 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FTCM: Frequency-Temporal Collaborative Module for Efficient 3D Human Pose Estimation in Video.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling.
IEEE Trans. Multim., 2024

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model.
CoRR, 2024

Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise.
CoRR, 2024

Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Boosting Few-Shot Learning via Attentive Feature Regularization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Boosting Hyperspectral Image Classification with Dual Hierarchical Learning.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

MLP-JCG: Multi-Layer Perceptron With Joint-Coordinate Gating for Efficient 3D Human Pose Estimation.
IEEE Trans. Multim., 2023

Question-aware dynamic scene graph of local semantic representation learning for visual question answering.
Pattern Recognit. Lett., 2023

CAR: Consolidation, Augmentation and Regulation for Recipe Retrieval.
CoRR, 2023

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.
CoRR, 2023

Selective Volume Mixup for Video Action Recognition.
CoRR, 2023

TKN: Transformer-based Keypoint Prediction Network For Real-time Video Prediction.
CoRR, 2023

CgT-GAN: CLIP-guided Text GAN for Image Captioning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Semantic-based Selection, Synthesis, and Supervision for Few-shot Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D Human Pose Estimation with Spatio-Temporal Criss-Cross Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

How Can Contrastive Pre-training Benefit Audio-Visual Segmentation? A Study from Supervised and Zero-shot Perspectives.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Social Context-aware Person Search in Videos via Multi-modal Cues.
ACM Trans. Inf. Syst., 2022

Spatio-Temporal Collaborative Module for Efficient Action Recognition.
IEEE Trans. Image Process., 2022

Attention in Attention: Modeling Context Correlation for Efficient Video Classification.
IEEE Trans. Circuits Syst. Video Technol., 2022

MF-GAN: Multi-conditional Fusion Generative Adversarial Network for Text-to-Image Synthesis.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Long-term Leap Attention, Short-term Periodic Shift for Video Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Parameterization of Cross-token Relations with Relative Positional Encoding for Vision MLP.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Hourglass Convolutional Network for Efficient Video Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Unified QA-aware Knowledge Graph Generation Based on Multi-modal Modeling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Unsupervised Video Hashing with Multi-granularity Contextualization and Multi-structure Preservation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Multi-directional Knowledge Transfer for Few-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Group Contextualization for Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Learning to Match Anchor-Target Video Pairs With Dual Attentional Holographic Networks.
IEEE Trans. Image Process., 2021

Quantitative Analysis of the Research Trends and Areas in Grassland Remote Sensing: A Scientometrics Analysis of Web of Science from 1980 to 2020.
Remote. Sens., 2021

Auxiliary Diagnosis for COVID-19 with Deep Transfer Learning.
J. Digit. Imaging, 2021

Token Shift Transformer for Video Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Selective Dependency Aggregation for Action Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

NASTER: Non-local Attentional Scene Text Recognizer.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Motion Prediction using Trajectory Cues.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Aggregated Multi-GANs for Controlled 3D Human Motion Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking.
IEEE Trans. Multim., 2020

Cross-Domain Sentiment Encoding through Stochastic Word Embedding.
IEEE Trans. Knowl. Data Eng., 2020

Advance on large scale near-duplicate video retrieval.
Frontiers Comput. Sci., 2020

Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Person-level Action Recognition in Complex Events via TSD-TSM Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cross-sentence Pre-trained Model for Interactive QA matching.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Quantitative Assessment of the Impact of Physical and Anthropogenic Factors on Vegetation Spatial-Temporal Variation in Northern Tibet.
Remote. Sens., 2019

3D human pose estimation via human structure-aware fully connected network.
Pattern Recognit. Lett., 2019

R2GAN: Cross-Modal Recipe Retrieval With Generative Adversarial Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval.
IEEE Trans. Multim., 2017

Unsupervised t-Distributed Video Hashing and Its Deep Hashing Extension.
IEEE Trans. Image Process., 2017

2016
Variability and Changes in Climate, Phenology, and Gross Primary Production of an Alpine Wetland Ecosystem.
Remote. Sens., 2016

基于信息系统属性同态的数据压缩 (Data Compression with Attribute Homomorphism in Information Systems).
计算机科学, 2016

2014
On improving behavior subtraction.
Proceedings of the 2014 IEEE International Conference on Systems, Man, and Cybernetics, 2014

2012
Verification of a threshold concept of ecologically effective precipitation pulse: From plant individuals to ecosystem.
Ecol. Informatics, 2012

2010
The sensitivity of temperate steppe CO<sub>2</sub> exchange to the quantity and timing of natural interannual rainfall.
Ecol. Informatics, 2010

2006
TV Program Recommendation for Multiple Viewers Based on user Profile Merging.
User Model. User Adapt. Interact., 2006


  Loading...