Haiyun Guo

Orcid: 0000-0001-6774-0298

According to our database, Haiyun Guo authored at least 65 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2026
Rubric-based On-policy Distillation.
CoRR, May, 2026

ST-Prune: Training-Free Spatio-Temporal Token Pruning for Vision-Language Models in Autonomous Driving.
CoRR, April, 2026

CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models.
CoRR, April, 2026

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing.
CoRR, April, 2026

PLUME: Latent Reasoning Based Universal Multimodal Embedding.
CoRR, April, 2026

Rethinking Representativeness and Diversity in Dynamic Data Selection.
CoRR, March, 2026

TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval.
CoRR, March, 2026

WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval.
CoRR, February, 2026

R-Diverse: Mitigating Diversity Illusion in Self-Play LLM Training.
CoRR, February, 2026

Active Zero: Self-Evolving Vision-Language Models through Active Environment Exploration.
CoRR, February, 2026

Guidestar-Free Adaptive Optics with Asymmetric Apertures.
CoRR, February, 2026

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval.
CoRR, February, 2026

Continual Instruction Tuning for Large Multimodal Models.
IEEE Trans. Image Process., 2026

HB-Mamba: Hierarchical Bi-directional State Space Modeling for LiDAR Semantic Segmentation in Autonomous Driving.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

PASs-MoE: Mitigating Misaligned Co-drift among Router and Experts via Pathway Activation Subspaces for Continual Learning.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning.
CoRR, November, 2025

MLLM-CBench: A Comprehensive Benchmark for Continual Instruction Tuning of Multimodal LLMs with Chain-of-Thought Reasoning Analysis.
CoRR, August, 2025

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval.
CoRR, August, 2025

MaSA: Mamba-Based Global Feature Selective Aggregator for Efficient Lane Detection.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Referring Expression Instance Retrieval and A Strong End-to-End Baseline.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Semantic-aware Fine-grained Point Augmentation for 3D Multi-modal Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
AAformer: Auto-Aligned Transformer for Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., December, 2024

NeuWS: Neural Wavefront Shaping for guidestar-free imaging through static and dynamic scattering media.
Dataset, July, 2024

Structural Dependence Learning Based on Self-attention for Face Alignment.
Mach. Intell. Res., June, 2024

Monocular Lane Detection Based on Deep Learning: A Survey.
CoRR, 2024

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

WaveMo: Learning Wavefront Modulations to See Through Scattering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Contrastive Learning with Information Compensation for Visible-Infrared Person Re-Identification.
Proceedings of the 14th Asian Control Conference, 2024

2023
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Effectiveness of information and communication technology (ICT) for addictive behaviors: An umbrella review of systematic reviews and meta-analysis of randomized controlled trials.
Comput. Hum. Behav., October, 2023

Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification.
IEEE Trans. Intell. Transp. Syst., April, 2023

Pseudo Label Rectification With Joint Camera Shift Adaptation and Outlier Progressive Recycling for Unsupervised Person Re-Identification.
IEEE Trans. Intell. Transp. Syst., March, 2023

Continual Instruction Tuning for Large Multimodal Models.
CoRR, 2023

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection.
CoRR, 2023

Instance-Proxy Loss for Semi-supervised Learning with Coarse Labels.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

2022
Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Multi-Granularity Mutual Learning Network for Object Re-Identification.
IEEE Trans. Intell. Transp. Syst., 2022

Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification.
CoRR, 2022

Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
CoRR, 2022

Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Unsupervised cycle-consistent person pose transfer.
Neurocomputing, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.
CoRR, 2021

2020
A novel data augmentation scheme for pedestrian detection with attribute preserving GAN.
Neurocomputing, 2020

Unsupervised Domain Adaptive Re-Identification with Feature Adversarial Learning and Self-similarity Clustering.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Identity-Guided Human Semantic Parsing for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adaptive Variance Based Label Distribution Learning for Facial Age Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection.
IEEE Trans. Image Process., 2019

Two-Level Attention Network With Multi-Grain Ranking Loss for Vehicle Re-Identification.
IEEE Trans. Image Process., 2019

Elite Loss for scene text detection.
Neurocomputing, 2019

Vehicle Re-Identification with Refined Part Model.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Cascade Attention Network for Person Re-Identification.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deep embedding network for robust age estimation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2016
Multi-View 3D Object Retrieval With Deep Embedding Network.
IEEE Trans. Image Process., 2016

Multiple deep features learning for object retrieval in surveillance videos.
IET Comput. Vis., 2016

Scale-Adaptive Deconvolutional Regression Network for Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Learning deep compact descriptor with bagging auto-encoders for object retrieval.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015


