Zhaoyang Li

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion.
CoRR, May, 2026

Revisiting and Expanding the IPv6 Network Periphery: Global-Scale Measurement and Security Analysis.
CoRR, April, 2026

Hierarchical Codec Diffusion for Video-to-Speech Generation.
CoRR, April, 2026

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search.
CoRR, February, 2026

Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories.
CoRR, January, 2026

Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning.
CoRR, January, 2026

All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection via Design Concept Reproduction.
CoRR, January, 2026

E 2 AD: Enhanced and explainable Alzheimer's disease detection framework via anatomy- and relation-aware cross-modal knowledge distillation.
Medical Image Anal., 2026

Conditional VQ-VAE for Action-Conditioned Motion Generation.
Proceedings of the MultiMedia Modeling, 2026

OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking.
CoRR, November, 2025

FM4Com: Foundation Model for Scene-Adaptive Communication Strategy Optimization.
CoRR, November, 2025

PortGPT: Towards Automated Backporting Using Large Language Models.
CoRR, October, 2025

3rd Place Solution to ICCV LargeFineFoodAI Retrieval.
CoRR, October, 2025

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation.
CoRR, July, 2025

Cross-attention and Self-attention for Audio-visual Speaker Diarization in MISP-Meeting Challenge.
CoRR, June, 2025

Trajectory Planning of Radio Source Tracking for the New Feed Cabin Mechanism in FAST.
IEEE Trans. Control. Syst. Technol., May, 2025

KPIs 2024 Challenge: Advancing Glomerular Segmentation from Patch- to Slide-Level.
CoRR, February, 2025

OCELOT 2023: Cell detection from cell-tissue interaction challenge.
Medical Image Anal., 2025

Feature aware-contrastive learning network for arbitrary-sized image steganalysis.
J. Vis. Commun. Image Represent., 2025

Deconfounded and debiased estimation for high-dimensional linear regression under hidden confounding with application to omics data.
Bioinform., 2025

Cross-attention and Self-attention for Audio-visual Speaker Diarization in MISP-Meeting Challenge.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Two-Stage Temporal ConvTransformer for Continuous Sign Language Recognition.
Proceedings of the 51st Annual Conference of the IEEE Industrial Electronics Society, 2025

Vehicle Following control using Transformer-based Soft Actor-Critic with Behavior Cloning.
Proceedings of the 51st Annual Conference of the IEEE Industrial Electronics Society, 2025

MamKO: Mamba-based Koopman operator for modeling and predictive control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Reliable Learning From LLM Features for Multimodal Emotion and Intent Joint Understanding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Enhancing Task-Specific Feature Learning with LLMs for Multimodal Emotion and Intent Joint Understanding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

BrainLoc: Brain Signal-Based Object Detection with Multi-modal Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Multimodal Dual Cross-Attention Fusion Strategy for Autonomous Garbage Classification System.
IEEE Trans. Ind. Informatics, November, 2024

An Exploration into the Fault Diagnosis of Analog Circuits Using Enhanced Golden Eagle Optimized 1D-Convolutional Neural Network (CNN) with a Time-Frequency Domain Input and Attention Mechanism.
Sensors, January, 2024

Evaluating and Advancing Multimodal Large Language Models in Ability Lens.
CoRR, 2024

Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization.
CoRR, 2024

Dynamically Expanding Capacity of Autonomous Driving with Near-Miss Focused Training Framework.
CoRR, 2024

Machine learning-based input-augmented Koopman modeling and predictive control of nonlinear processes.
Comput. Chem. Eng., 2024

Risk-Aware Non-Myopic Motion Planner for Large-Scale Robotic Swarm Using CVaR Constraints.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Recognize Anything: A Strong Image Tagging Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Structurally incoherent adaptive weighted low-rank matrix decomposition for image classification.
Appl. Intell., November, 2023

Research on Denoising Method for Hydroelectric Unit Vibration Signal Based on ICEEMDAN-PE-SVD.
Sensors, July, 2023

Injection control algorithm of diesel electronic control system based on neural network technology.
Int. J. Syst. Assur. Eng. Manag., April, 2023

Cooperative control strategy of wheel-legged robot based on attitude balance.
Robotica, February, 2023

Salt-and-pepper denoising method for colour images based on tensor low-rank prior and implicit regularization.
IET Image Process., February, 2023

DPW-RRM: Random Routing Mutation Defense Method Based on Dynamic Path Weight.
KSII Trans. Internet Inf. Syst., 2023

The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions.
CoRR, 2023

Defect Detection in Computer Motherboard Assembly through Fusion of Multi-Scale Features and Attention Mechanisms.
Proceedings of the 9th International Conference on Systems and Informatics, 2023

Dimensional Optimization and Anti-Disturbance Analysis of an Upgraded Feed Mechanism in FAST.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

PRIME: 3D Human Pose and Body Shape Recovery with Perspective Projection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Error Analysis of the Cable-Driven Parallel Robot in the Upgraded Feed Cabin of FAST.
Proceedings of the International Conference on Advanced Robotics and Mechatronics, 2023

Forward Kinematics and Natural Frequency Analysis of the Upgraded Feed Cabin in FAST.
Proceedings of the International Conference on Advanced Robotics and Mechatronics, 2023

Research on IC curve feature extraction and lithium battery SOH estimation method based on QPSO-BP Algorithm.
Proceedings of the International Conference on Computers, 2023

2022
A Cooperative Caching Scheme for VCCN With Mobility Prediction and Consistent Hashing.
IEEE Trans. Intell. Transp. Syst., 2022

Lightweight and Efficient Distributed Cooperative Intrusion Detection System for Intelligent Swarms.
Proceedings of the International Conference on Networking and Network Applications, 2022

Target-Aware Auto-Augmentation for Unsupervised Domain Adaptive Object Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Local thinning of 3D stereo images based on symmetric decryption algorithm.
Microprocess. Microsystems, 2021

Cross-Modal Pyramid Translation for RGB-D Scene Recognition.
Int. J. Comput. Vis., 2021

Extracting knowledge from features with multilevel abstraction.
CoRR, 2021

Box Re-Ranking: Unsupervised False Positive Suppression for Domain Adaptive Pedestrian Detection.
CoRR, 2021

FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Information Bottleneck Disentanglement for Identity Swapping.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Popularity prediction caching based on logistic regression in vehicular content centric networks.
Int. J. Ad Hoc Ubiquitous Comput., 2020

A Two-Stream Graph Convolutional Neural Network for Dynamic Traffic Flow Forecasting.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

2019
Deep Adversarial Multi-view Clustering Network.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Neural Network for Detailed Human Depth Estimation From a Single Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks.
IEEE Trans. Circuits Syst. Video Technol., 2018

Combining ConvNets with hand-crafted features for action recognition based on an HMM-SVM classifier.
Multim. Tools Appl., 2018

Detecting Evil-Twin Attack with the Crowd Sensing of Landmark in Physical Layer.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2018

2016
Combining ConvNets with Hand-Crafted Features for Action Recognition Based on an HMM-SVM Classifier.
CoRR, 2016

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Review of Heterogeneous Wireless Fusion in Mobile 5G Networks: Benefits and Challenges.
Proceedings of the Collaborate Computing: Networking, Applications and Worksharing, 2016

2014
Compressor Design for a 30fs-300J 10PW Ti: sapphire Laser - Divided-compressor with an Object-Image-Grating Self-tiling Tiled Grating.
Proceedings of 2nd International Conference on Photonics, Optics and Laser Technology, 2014

2011
An Improved Approximation Algorithm for a Class of Batch Scheduling Problems.
Proceedings of the Advanced Intelligent Computing - 7th International Conference, 2011


  Loading...