Yucheng Zhao

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2026

IGARN: IVUS Guidewire Artifact Removal Network based on unpaired unsupervised learning.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2026

2025

Dexbotic: Open-Source Vision-Language-Action Toolbox.

[BibT_eX]

[DOI]

CoRR, October, 2025

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies.

[BibT_eX]

[DOI]

CoRR, October, 2025

ManiAgent: An Agentic Framework for General Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, October, 2025

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction.

[BibT_eX]

[DOI]

CoRR, October, 2025

LLaDA-VLA: Vision Language Diffusion Action Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Hita: Holistic Tokenizer for Autoregressive Image Generation.

[BibT_eX]

[DOI]

CoRR, July, 2025

ROSA: Harnessing Robot States for Vision-Language and Action Alignment.

[BibT_eX]

[DOI]

CoRR, June, 2025

PADriver: Towards Personalized Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, May, 2025

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness.

[BibT_eX]

[DOI]

CoRR, April, 2025

Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

Multimode Huauv Aerial trajectory tracking with Antisaturation control.

[BibT_eX]

[DOI]

Int. J. Robotics Autom., 2025

CSF-NET: Cross-Modal Spatiotemporal Fusion Network for Pulmonary Nodule Malignancy Predicting.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

PADriver: Towards Personalized Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

Reconstructive Visual Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2024

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

[BibT_eX]

[DOI]

CoRR, 2024

Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks.

[BibT_eX]

[DOI]

CoRR, 2024

Stream Query Denoising for Vectorized HD Map Construction.

[BibT_eX]

[DOI]

CoRR, 2024

Methods for Removing Artifacts from EEG signals: A review.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Conference on Software Quality, 2024

Different Control Methods for FES Cycling: Review.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Conference on Software Quality, 2024

Stream Query Denoising for Vectorized HD-Map Construction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

ADriver-I: A General World Model for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2023

VLM-Eval: A General Evaluation on Video Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Attention-Guided Contrastive Masked Image Modeling for Transformer-Based Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2023

Research on Construction Method of IoT Knowledge System Based on Knowledge Graph.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2023

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Video Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Evaluation and prediction of free driving behavior type based on fuzzy comprehensive support vector machine.

[BibT_eX]

[DOI]

J. Intell. Fuzzy Syst., 2022

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Peripheral Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP.

[BibT_eX]

[DOI]

CoRR, 2021

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework.

[BibT_eX]

[DOI]

CoRR, 2021

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AsymmNet: Towards Ultralight Convolution Neural Networks Using Asymmetrical Bottlenecks.

[BibT_eX]

[DOI]

Haojin Yang

Zhen Shen

Yucheng Zhao

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019

Joint face alignment and segmentation via deep multi-task learning.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

2015

Abnormal traffic-indexed state estimation: A cyber-physical fusion approach for Smart Grid attack detection.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2015

Yucheng Zhao

Bibliography

Loading...