Yuxuan Cai

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
Ask Only When Needed: Proactive Retrieval from Memory and Skills for Experience-Driven Lifelong Agents.
CoRR, April, 2026

DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval.
CoRR, April, 2026

WebForge: Breaking the Realism-Reproducibility-Scalability Trilemma in Browser Agent Benchmark.
CoRR, April, 2026

GenMask: Adapting DiT for Segmentation via Direct Mask Generation.
CoRR, March, 2026

AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution.
CoRR, March, 2026

Yunque DeepResearch Technical Report.
CoRR, January, 2026

Octopus: A Robust and Privacy-Preserving Scheme for Compressed Gradients in Federated Learning.
IEEE Trans. Dependable Secur. Comput., 2026

Synergistic Bayesian Optimization and Reinforcement Learning with Bidirectional Interaction for Efficient VLSI Constraint Tuning.
Proceedings of the 31st Asia and South Pacific Design Automation Conference, 2026

2025
Dynamic Expert Routing for Unsupervised Continual Anomaly Detection.
IEEE Trans. Ind. Informatics, October, 2025

FastMTP: Accelerating LLM Inference with Enhanced Multi-Token Prediction.
CoRR, September, 2025

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark.
CoRR, September, 2025

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark.
CoRR, August, 2025

Qwen-Image Technical Report.
CoRR, August, 2025

HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding.
CoRR, July, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.
CoRR, June, 2025

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding.
CoRR, June, 2025

Task-Core Memory Management and Consolidation for Long-term Continual Learning.
CoRR, May, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Integrating Frequency Adaptive Normalization Into Inland Ship Trajectory Prediction: A Non-Stationary Time Series Forecasting Approach.
Proceedings of the 28th IEEE International Conference on Intelligent Transportation Systems, 2025

Prediction-Enhanced Soft Actor-Critic for Optimal Energy Management of Electric Vehicles.
Proceedings of the 51st Annual Conference of the IEEE Industrial Electronics Society, 2025

Surrounding Vehicle-Aware Predictive Torque Distribution for Dual-Motor Electric Vehicles.
Proceedings of the 51st Annual Conference of the IEEE Industrial Electronics Society, 2025

Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

A Comprehensive Library for Benchmarking Multi-Class Visual Anomaly Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
A Discrepancy Aware Framework for Robust Anomaly Detection.
IEEE Trans. Ind. Informatics, March, 2024

Robot Collisions Classification Based on Variational Mode Decomposition of Vibration Measurements.
IEEE Trans. Instrum. Meas., 2024

SecFed: A Secure and Efficient Federated Learning Based on Multi-Key Homomorphic Encryption.
IEEE Trans. Dependable Secur. Comput., 2024

Assessing the Association Between Urban Amenities and Urban Green Space Transformation in Guangzhou.
ISPRS Int. J. Geo Inf., 2024

Fleximo: Towards Flexible Text-to-Human Motion Video Generation.
CoRR, 2024

Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention.
CoRR, 2024

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.
CoRR, 2024

Allegro: Open the Black Box of Commercial-Level Video Generation Model.
CoRR, 2024

Attention-Guided Perturbation for Unsupervised Image Anomaly Detection.
CoRR, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection.
CoRR, 2024

Anomaly Detection by Adapting a pre-trained Vision Language Model.
CoRR, 2024

Yi: Open Foundation Models by 01.AI.
CoRR, 2024

Zephyr: A High-Performance Framework for Graph Attention Networks on Heterogeneous Data.
Proceedings of the 23rd IEEE International Conference on Trust, 2024

Dual Kalman Filter Based on Maximum Correntropy Criterion for Adaptive Decoding in Brain-Machine Interface.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

High-Performance Temporal Reversible Spiking Neural Networks with O(L) Training Memory and O(1) Inference Cost.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Control backbone detection in omnigenic disease neighborhood targeting core module.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

2023
ESMAC: Efficient and Secure Multi-Owner Access Control With TEE in Multi-Level Data Processing.
IEEE Trans. Dependable Secur. Comput., 2023

RevColV2: Exploring Disentangled Representations in Masked Image Modeling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reversible Column Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Comparing BOLD and VASO-CBV population receptive field estimates in human visual cortex.
NeuroImage, 2022

2021
Adaptation to visual numerosity changes neural numerosity selectivity.
NeuroImage, 2021

Individualized cognitive neuroscience needs 7T: Comparing numerosity maps at 3T and 7T MRI.
NeuroImage, 2021

2019
Eyes Closed Elevates Brain Intrinsic Activity of Sensory Dominance Networks: A Classifier Discrimination Analysis.
Brain Connect., 2019

2017
Exploring the Associations Between Intrinsic Brain Connectivity and Creative Ability Using Functional Connectivity Strength and Connectome Analysis.
Brain Connect., 2017


  Loading...