Yanhao Zhang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
OASIS: On-Demand Hierarchical Event Memory for Streaming Video Reasoning.
CoRR, April, 2026

Enhanced receptive field multi-scale feature fusion for optical remote sensing target detection.
J. Supercomput., March, 2026

Click-to-Ask: An AI Live Streaming Assistant with Offline Copywriting and Online Interactive QA.
CoRR, March, 2026

Thinking in Streaming Video.
CoRR, March, 2026

SPP-SBL: Space-Power Prior Sparse Bayesian Learning for Block Sparse Recovery.
IEEE Trans. Signal Process., 2026

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OwlCap: Harmonizing Motion-Detail for Video Captioning via HMD-270K and Caption Set Equivalence Reward.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Aligning Cross-View Visual Geometries in LVLMs Through Human-Like Reasoning Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
PIGEON: VLM-Driven Object Navigation via Points of Interest Selection.
CoRR, November, 2025

DMTrack: Spatio-Temporal Multimodal Tracking via Dual-Adapter.
CoRR, August, 2025

Convergence rate of projected subgradient method with time-varying step-sizes.
Optim. Lett., June, 2025

Dynamic-I2V: Exploring Image-to-Video Generation Models via Multimodal LLM.
CoRR, May, 2025

Improved Visual-Spatial Reasoning via R1-Zero-Like Training.
CoRR, April, 2025

H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding.
CoRR, March, 2025

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens.
CoRR, March, 2025

Towards dynamic virtual machine placement based on safety parameters and resource utilization fluctuation for energy savings and QoS improvement in cloud computing.
Future Gener. Comput. Syst., 2025

Delay-aware resource-efficient interleaved task scheduling strategy in spark.
Comput. Sci. Inf. Syst., 2025

Best Subset Selection: Optimal Pursuit for Feature Selection and Elimination.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Free-Moref: Instantly Multiplexing Context Perception Capabilities of Video-Mllms Within Single Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Load-Aware Task-Based Scheduling Policy in Spark.
Proceedings of the 2025 4th International Conference on Big Data, 2025

2024
PoseAnimate: Zero-shot high fidelity pose controllable character animation.
CoRR, 2024

LoopAnimate: Loopable Salient Object Animation.
CoRR, 2024

Learning with Diversification from Block Sparse Signal.
CoRR, 2024

Block Sparse Bayesian Learning: A Diversified Scheme.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LoopAnimate: Loopable Salient Object Animation.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Zero-shot High-fidelity and Pose-controllable Character Animation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Lightweight High-Resolution Subject Matting in the Real World.
Proceedings of the IEEE International Conference on Acoustics, 2024

BARET: Balanced Attention Based Real Image Editing Driven by Target-Text Inversion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Incomplete multi-view clustering via attention-based contrast learning.
Int. J. Mach. Learn. Cybern., December, 2023

An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems.
CoRR, 2023

GAM : Gradient Attention Module of Optimization for Point Clouds Analysis.
CoRR, 2023

Knowledge Distillation from Single to Multi Labels: an Empirical Study.
CoRR, 2023

Matting Moments: A Unified Data-Driven Matting Engine for Mobile AIGC in Photo Gallery.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TALL: Thumbnail Layout for Deepfake Video Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GAM: Gradient Attention Module of Optimization for Point Clouds Analysis.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
RAVEN: Resource Allocation Using Reinforcement Learning for Vehicular Edge Computing Networks.
IEEE Commun. Lett., 2022

2021
Impact of Rapid Urban Sprawl on the Local Meteorological Observational Environment Based on Remote Sensing Images and GIS Technology.
Remote. Sens., 2021

2020
A Case-Based Reasoning Model for Depression Based on Three-Electrode EEG Data.
IEEE Trans. Affect. Comput., 2020

2017
Robust decentralised stabilisation of uncertain large-scale interconnected nonlinear descriptor systems via proportional plus derivative feedback.
Int. J. Syst. Sci., 2017

A Virtual-Reality Based Neurofeedback Game Framework for Depression Rehabilitation using Pervasive Three-Electrode EEG Collector.
Proceedings of the 12th Chinese Conference on Computer Supported Cooperative Work and Social Computing, 2017

Study on Depression Classification Based on Electroencephalography Data Collected by Wearable Devices.
Proceedings of the Brain Informatics - International Conference, 2017

2015
Real-Time Visual Animation of Explosions.
J. Softw., 2015


  Loading...