Hongyuan Zhu

ORCID: 0000-0001-5177-8320

Affiliations:
  • A*STAR, Fusionopolis, Singapore


According to our database, Hongyuan Zhu authored at least 92 papers between 2013 and 2025.

Bibliography

2025
Object-level Correlation for Few-Shot Segmentation.
CoRR, September, 2025

Correction: Consistent Prompt Tuning for Generalized Category Discovery.
Int. J. Comput. Vis., August, 2025

Consistent Prompt Tuning for Generalized Category Discovery.
Int. J. Comput. Vis., July, 2025

Evaluating Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition.
ACM Trans. Sens. Networks, March, 2025

WI3D: Weakly Incremental 3D Detection via Vision Foundation Models.
IEEE Trans. Multim., 2025

Beyond Instance Consistency: Investigating View Diversity in Self-supervised Learning.
Trans. Mach. Learn. Res., 2025

Object Adaptive Self-Supervised Dense Visual Pre-Training.
IEEE Trans. Image Process., 2025

Balancing Privacy and Performance: A Many-in-One Approach for Image Anonymization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Learning Student Network Under Universal Label Noise.
IEEE Trans. Image Process., 2024

Deep Supervised Multi-View Learning With Graph Priors.
IEEE Trans. Image Process., 2024

Multi-View Vision Fusion Network: Can 2D Pre-Trained Model Boost 3D Point Cloud Data-Scarce Learning?
IEEE Trans. Circuits Syst. Video Technol., 2024

Blessing few-shot segmentation via semi-supervised learning with noisy support images.
Pattern Recognit., 2024

Revisiting 3D visual grounding with Context-aware Feature Aggregation.
Neurocomputing, 2024

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark.
CoRR, 2024

Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Robust Variational Contrastive Learning for Partially View-unaligned Clustering.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

G-Former: A Grouping Transformer for Weakly Supervised Point Cloud Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024

HCMA'24: The 5th International Workshop on Human-centric Multimedia Analysis Summary.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

Direct Distillation Between Different Domains.
Proceedings of the Computer Vision - ECCV 2024, 2024

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Contributing Dimension Structure of Deep Feature for Coreset Selection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

PrefAce: Face-Centric Pretraining with Self-Structure Aware Distillation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Closer Look at Video Sampling for Sequential Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Unsupervised Contrastive Cross-Modal Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A Closer Look at Few-Shot 3D Point Cloud Classification.
Int. J. Comput. Vis., March, 2023

Dual-Stream Contrastive Learning for Channel State Information Based Human Activity Recognition.
IEEE J. Biomed. Health Informatics, 2023

LPCL: Localized prominence contrastive learning for self-supervised dense visual pre-training.
Pattern Recognit., 2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts.
CoRR, 2023

Exploit the antenna response consistency to define the alignment criteria for CSI data.
CoRR, 2023

Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study.
CoRR, 2023

An Overview of Challenges in Egocentric Text-Video Retrieval.
CoRR, 2023

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
CoRR, 2023

HCMA '23: 4th International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language Models can do Zero-Shot Visual Referring Expression Comprehension.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Semi-Supervised Few-Shot Segmentation with Noisy Support Images.
Proceedings of the IEEE International Conference on Image Processing, 2023

Rethinking Image Super Resolution from Long-Tailed Distribution Learning Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-End 3D Dense Captioning with Vote2Cap-DETR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
A Survey of Embodied AI: From Simulators to Research Tasks.
IEEE Trans. Emerg. Top. Comput. Intell., 2022

Deep Semisupervised Multiview Learning With Increasing Views.
IEEE Trans. Cybern., 2022

Hierarchical Point Cloud Encoding and Decoding With Lightweight Self-Attention Based Model.
IEEE Robotics Autom. Lett., 2022

Locality-Aware Crowd Counting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Point Cloud Instance Segmentation With Semi-Supervised Bounding-Box Mining.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

XAI Beyond Classification: Interpretable Neural Clustering.
J. Mach. Learn. Res., 2022

Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
CoRR, 2022

RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval.
CoRR, 2022

What Makes for Effective Few-shot Point Cloud Classification?
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

HCMA'22: 3rd International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Spectral Representation Learning From Multi-View Data.
IEEE Trans. Image Process., 2021

Single-Image Dehazing via Compositional Adversarial Network.
IEEE Trans. Cybern., 2021

Joint Versus Independent Multiview Hashing for Cross-View Retrieval.
IEEE Trans. Cybern., 2021

Cross-modal discriminant adversarial network.
Pattern Recognit., 2021

A comprehensive survey of procedural video datasets.
Comput. Vis. Image Underst., 2021

Semantic Role Aware Correlation Transformer For Text To Video Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

SPCR: semi-supervised point cloud instance segmentation with perturbation consistency regularization.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

A Diagnostic Study Of Visual Question Answering With Analogical Reasoning.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Learning Cross-Modal Retrieval With Noisy Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

OPQ: Compressing Deep Neural Networks with One-shot Pruning-Quantization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Deep Clustering With Sample-Assignment Invariance Prior.
IEEE Trans. Neural Networks Learn. Syst., 2020

Zero-Shot Image Dehazing.
IEEE Trans. Image Process., 2020

Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection.
IEEE Trans. Image Process., 2020

Holistic Multi-Modal Memory Network for Movie Question Answering.
IEEE Trans. Image Process., 2020

Improving Night-Time Pedestrian Retrieval With Distribution Alignment and Contextual Distance.
IEEE Trans. Ind. Informatics, 2020

A novel hybrid approach for crack detection.
Pattern Recognit., 2020

Partition level multiview subspace clustering.
Neural Networks, 2020

6D Pose Estimation with Correlation Fusion.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Semi-Supervised Multi-Modal Learning with Balanced Spectral Decomposition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
AnomalyNet: An Anomaly Detection Network for Video Surveillance.
IEEE Trans. Inf. Forensics Secur., 2019

Multiple Marginal Fisher Analysis.
IEEE Trans. Ind. Electron., 2019

Clustering with similarity preserving.
Neurocomputing, 2019

6D Pose Estimation with Correlation Fusion.
CoRR, 2019

Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

Multi-view Spectral Clustering Network.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

COMIC: Multi-view Clustering Without Parameter Selection.
Proceedings of the 36th International Conference on Machine Learning, 2019

Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Singe Image Rain Removal with Unpaired Information: A Differentiable Programming Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
k-meansNet: When k-means Meets Differentiable Programming.
CoRR, 2018

DehazeGAN: When Image Dehazing Meets Differential Programming.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text.
CoRR, 2017

2016
Multiple Human Identification and Cosegmentation: A Human-Oriented CRF Approach With Poselets.
IEEE Trans. Multim., 2016

Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation.
J. Vis. Commun. Image Represent., 2016

2015
Diagnosing state-of-the-art object proposal methods.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Multiple foreground recognition and cosegmentation: An object-oriented CRF model with robust higher-order potentials.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Poselet-based multiple human identification and cosegmentation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Object-Level Image Segmentation Using Low Level Cues.
IEEE Trans. Image Process., 2013

Multi-class Cosegmentation with Pairwise Active Learning.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Salient object cutout using Google images.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

