Haoyu Zhao

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives.
CoRR, August, 2025

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors.
CoRR, August, 2025

ShoulderShot: Generating Over-the-Shoulder Dialogue Videos.
CoRR, August, 2025

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction.
CoRR, August, 2025

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
CoRR, July, 2025

GIGNet: A Graph-in-Graph Neural Network for Automatic Modulation Recognition.
IEEE Trans. Veh. Technol., June, 2025

UniAdapter: All-in-One Control for Flexible Video Generation.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

SMAP: Self-supervised Motion Adaptation for Physically Plausible Humanoid Whole-body Control.
CoRR, May, 2025

The Role of Diversity in In-Context Learning for Large Language Models.
CoRR, May, 2025

A Survey of LLM ⨉ DATA.
CoRR, May, 2025

TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation.
CoRR, May, 2025

Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities.
CoRR, May, 2025

DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation.
CoRR, March, 2025

Towards Synthesized and Editable Motion In-Betweening Through Part-Wise Phase Representation.
CoRR, March, 2025

Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set.
CoRR, February, 2025

From interface to inference: mapping the impact of generative artificial intelligence affordances on user risk perception.
Telematics Informatics, 2025

Information poverty in Southwest China: self and interactive behaviors among heterogeneous rural groups.
J. Documentation, 2025

Pathological Prior-Guided Multiple Instance Learning For Mitigating Catastrophic Forgetting in Breast Cancer Whole Slide Image Classification.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
An Appearance-Semantic Descriptor with Coarse-to-Fine Matching for Robust VPR.
Sensors, April, 2024

MSCNet: Dense vehicle counting method based on multi-scale dilated convolution channel-aware deep network.
GeoInformatica, April, 2024

Faster Rates for Compressed Federated Learning with Client-Variance Reduction.
SIAM J. Math. Data Sci., March, 2024

No tricks no bluff, focusing on localizing crisp boundaries in image media.
Neurocomputing, 2024

GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.
CoRR, 2024

Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting.
CoRR, 2024

Serp-Mamba: Advancing High-Resolution Retinal Vessel Segmentation with Selective State-Space Model.
CoRR, 2024

EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation.
CoRR, 2024

SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting.
CoRR, 2024

CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning.
CoRR, 2024

AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding.
CoRR, 2024

LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding.
CoRR, 2024

Can Models Learn Skill Composition from Examples?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WIA-LD2ND: Wavelet-Based Image Alignment for Self-supervised Low-Dose CT Denoising.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MoreStyle: Relax Low-Frequency Constraint of Fourier-Based Image Reconstruction in Generalizable Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

High-Utilization GPGPU Design for Accelerating GEMM Workloads: An Incremental Approach.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

Adversarial Attacks on Combinatorial Multi-Armed Bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Semi-Supervised Unmanned Aerial Vehicle Recognition Method Based on Self-Distillation.
Proceedings of the 10th International Conference on Communication and Information Processing, 2024

A Supervised Contrastive Learning Framework with Graph-Based Feature Extraction for Small-Sample Automatic Modulation Recognition.
Proceedings of the 10th International Conference on Communication and Information Processing, 2024

MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
A novel framework for crowd counting using video and audio.
Comput. Electr. Eng., August, 2023

Dual similarity pre-training and domain difference encouragement learning for vehicle re-identification in the wild.
Pattern Recognit., July, 2023

Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes.
IEEE Trans. Multim., 2023

Vehicle Logo Recognition Using Spatial Structure Correlation and YOLO-T.
Sensors, 2023

Memory-efficient document layout analysis method using LD-net.
Multim. Tools Appl., 2023

Scene-adaptive crowd counting method based on meta learning with dual-input network DMNet.
Frontiers Comput. Sci., 2023

VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model.
CoRR, 2023

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation.
CoRR, 2023

Novel micro-structure design of dielectric layer for capacitive tactile sensor.
Proceedings of the IEEE International Conference on Imaging Systems and Techniques, 2023

Randomized Testing Framework for Dissecting NVIDIA GPGPU Thread Block-To-SM Scheduling Mechanisms.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Task-Specific Skill Localization in Fine-tuned Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RBGC: Repurpose the Buffer of Fixed Graphics Pipeline to Enhance GPU Cache.
Proceedings of the Great Lakes Symposium on VLSI 2023, 2023

Do Transformers Parse while Predicting the Masked Word?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
ECNFP: Edge-constrained network using a feature pyramid for image inpainting.
Expert Syst. Appl., November, 2022

Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification.
IEEE Trans. Multim., 2022

SPACE: Finding Key-Speaker in Complex Multi-Person Scenes.
IEEE Trans. Emerg. Top. Comput., 2022

A Two-Stage Approach to Important Area Detection in Gathering Place Using a Novel Multi-Input Attention Network.
Sensors, 2022

Global-Aware Ranking Deep Metric Learning for Remote Sensing Image Retrieval.
IEEE Geosci. Remote. Sens. Lett., 2022

Dense Vehicle Counting Method Based on Deep Spatio-Temporal Network.
Proceedings of the IEEE Smartworld, 2022

BEER: Fast $O(1/T)$ Rate for Decentralized Nonconvex Optimization with Communication Compression.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SoteriaFL: A Unified Framework for Private Federated Learning with Communication Compression.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Coresets for Vertical Federated Learning: Regularized Linear Regression and $K$-Means Clustering.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Global Optimization: Combining Local Loss With Result Ranking Loss in Remote Sensing Image Retrieval.
IEEE Trans. Geosci. Remote. Sens., 2021

Viewpoint adaptation learning with cross-view distance metric for robust vehicle re-identification.
Inf. Sci., 2021

MSR-FAN: Multi-scale residual feature-aware network for crowd counting.
IET Image Process., 2021

Research on the optimization of the management process on internet of things (Iot) for electronic market.
Electron. Libr., 2021

FedPAGE: A Fast Local Stochastic Gradient Method for Communication-Efficient Federated Learning.
CoRR, 2021

Multi-Objective Optimization for Football Team Member Selection.
IEEE Access, 2021

Combinatorial semi-bandit in the non-stationary environment.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Flexible tactile sensing based on electrical resistance tomography and wavelet image fusion.
Proceedings of the IEEE International Conference on Imaging Systems and Techniques, 2021

Illumination-Enhanced Crowd Counting Based on IC-Net in Low Lighting Conditions.
Proceedings of the Image and Graphics - 11th International Conference, 2021

2020
Negative Curvature Hollow Core Fiber Based All-Fiber Interferometer and Its Sensing Applications to Temperature and Strain.
Sensors, 2020

Distribution Consistency Loss for Large-Scale Remote Sensing Image Retrieval.
Remote. Sens., 2020

Artificial intelligence based ensemble approach for intrusion detection systems.
J. Vis. Commun. Image Represent., 2020

Similarity Retention Loss (SRL) Based on Deep Metric Learning for Remote Sensing Image Retrieval.
ISPRS Int. J. Geo Inf., 2020

Wise optimisation: deep image embedding by informative pair weighting and ranked list learning.
IET Image Process., 2020

Combinatorial Semi-Bandit in the Non-Stationary Environment.
CoRR, 2020

A Two-Stream Approach to Fall Detection With MobileVGG.
IEEE Access, 2020

Combinatorial Pure Exploration for Dueling Bandit.
Proceedings of the 37th International Conference on Machine Learning, 2020

Online Second Price Auction with Semi-Bandit Feedback under the Non-Stationary Setting.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Gradient Method for Continuous Influence Maximization with Budget-Saving Considerations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Image Retrieval Based on Learning to Rank and Multiple Loss.
ISPRS Int. J. Geo Inf., 2019

Distribution Structure Learning Loss (DSLL) Based on Deep Metric Learning for Image Retrieval.
Entropy, 2019

Mildly Overparametrized Neural Nets can Memorize Training Data Efficiently.
CoRR, 2019

Stochastic One-Sided Full-Information Bandit.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

An FPTAS for Stochastic Unbounded Min-Knapsack Problem.
Proceedings of the Frontiers in Algorithmics - 13th International Workshop, 2019

2017
A Grasp Strategy for Polygonal Objects Using a Honeycomb Pneumatic Network Soft Gripper.
Proceedings of the Robot Intelligence Technology and Applications 5, 2017

Intelligent Robot Safety Control System Based on MFC.
Proceedings of the 3rd International Conference on Robotics and Artificial Intelligence, 2017

E-governent, corruption reduction and culture: a study based on panel data of 57 countries.
Proceedings of the 18th Annual International Conference on Digital Government Research, 2017

2015
A Ratiometric Wavelength Measurement Based on a Silicon-on-Insulator Directional Coupler Integrated Device.
Sensors, 2015

Image fusion with Internal Generative Mechanism.
Expert Syst. Appl., 2015

General and Local: Averaged k-Dependence Bayesian Classifiers.
Entropy, 2015

Learning a Flexible K-Dependence Bayesian Classifier from the Chain Rule of Joint Probability Distribution.
Entropy, 2015


  Loading...