Shuo Yang

Orcid: 0000-0001-6145-0150

Affiliations:

Harbin Institute of Technology, School of Computer Science and Technology, China
University of Technology Sydney, Faculty of Engineering and IT, School of Electrical and Data Engineering, NSW, Australia (PhD 2023)

According to our database¹, Shuo Yang authored at least 53 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation.

CoRR, May, 2026

TouchAnything: A Dataset and Framework for Bimanual Tactile Estimation from Egocentric Video.

CoRR, May, 2026

Exploring Data-Free LoRA Transferability for Video Diffusion Models.

CoRR, May, 2026

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue.

CoRR, May, 2026

Noisy Correspondence Rectification in Multimodal Clustering Space for Cross-Modal Matching.

IEEE Trans. Pattern Anal. Mach. Intell., April, 2026

MEGG: replay via maximally extreme GGscore in incremental learning for neural recommendation models.

Data Min. Knowl. Discov., April, 2026

CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think.

CoRR, March, 2026

Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation.

CoRR, February, 2026

Optimizing Few-Step Generation with Adaptive Matching Distillation.

CoRR, February, 2026

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models.

CoRR, February, 2026

Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching.

CoRR, February, 2026

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation.

CoRR, February, 2026

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation.

CoRR, February, 2026

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning.

CoRR, February, 2026

EgoGrasp: World-Space Hand-Object Interaction Estimation from Egocentric Videos.

CoRR, January, 2026

Logic Unseen: Revealing the Logical Blindspots of Vision-Language Models.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

HiconAgent: History Context-aware Policy Optimization for GUI Agents.

CoRR, December, 2025

VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference.

CoRR, November, 2025

Calibrated Multimodal Representation Learning with Missing Modalities.

CoRR, November, 2025

UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation.

CoRR, October, 2025

Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data.

CoRR, July, 2025

Visible-Infrared Person Re-Identification With Real-World Label Noise.

IEEE Trans. Circuits Syst. Video Technol., May, 2025

Cognition-Driven Structural Prior for Instance-Dependent Label Transition Matrix Estimation.

IEEE Trans. Neural Networks Learn. Syst., February, 2025

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

BOOD: Boundary-based Out-Of-Distribution Data Generation.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Stable Fair Graph Representation Learning with Lipschitz Constraint.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Learning from Ambiguous Data with Hard Labels.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

U²D²Net: Unsupervised Unified Image Dehazing and Denoising Network for Single Hazy Image Enhancement.

IEEE Trans. Multim., 2024

Data-efficient Fine-tuning for LLM-based Recommendation.

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Concentrating Estimation Attention: Human Prior Constrained Methods for Robust Classification.

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Mind the Boundary: Coreset Selection via Reconstructing the Decision Boundary.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Context Aggregation for Image Matting.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

A Parametrical Model for Instance-Dependent Label Noise.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Adversarial Recurrent Time Series Imputation.

IEEE Trans. Neural Networks Learn. Syst., April, 2023

Towards Efficient Robotic Software Development by Reusing Behavior Tree Structures for Task Planning Paradigms.

Shuo Yang

Qi Zhang

Complex Syst. Model. Simul., 2023

Dataset Pruning: Reducing Training Data by Examining Generalization Influence.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Speech4Mesh: Speech-Assisted Monocular 3D Facial Reconstruction for Speech-Driven 3D Facial Animation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Bridging the Gap Between Few-Shot and Many-Shot Learning via Distribution Calibration.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Graph-based few-shot learning with transformed feature propagation and optimal class allocation.

Neurocomputing, 2022

An efficient multitask neural network for face alignment, head pose estimation and face tracking.

Expert Syst. Appl., 2022

Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels.

CoRR, 2022

Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network.

Proceedings of the International Conference on Machine Learning, 2022

Objects in Semantic Topology.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-Attention Gated Cognitive Diagnosis For Faster Adaptive Educational Assessments.

Proceedings of the IEEE International Conference on Data Mining, 2022

One Size Does NOT Fit All: Data-Adaptive Adversarial Training.

Shuo Yang

Chang Xu

Proceedings of the Computer Vision - ECCV 2022, 2022

CAFE: Learning to Condense Dataset by Aligning Features.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Estimating Instance-dependent Label-noise Transition Matrix using DNNs.

CoRR, 2021

Free Lunch for Few-shot Learning: Distribution Calibration.

Shuo Yang

Lu Liu

Min Xu

Proceedings of the 9th International Conference on Learning Representations, 2021

Structure-Aware Stabilization of Adversarial Robustness with Massive Contrastive Adversaries.

Proceedings of the IEEE International Conference on Data Mining, 2021

Single-View 3D Object Reconstruction From Shape Priors in Memory.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adversarial Robustness through Disentangled Representations.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Meta3D: Single-View 3D Object Reconstruction from Shape Priors in Memory.

Shuo Yang

Min Xu

Hongxun Yao

CoRR, 2020
