Yucheng Zhao

Orcid: 0000-0002-9610-3025

According to our database, Yucheng Zhao authored at least 40 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
IGARN: IVUS Guidewire Artifact Removal Network based on unpaired unsupervised learning.
Biomed. Signal Process. Control., 2026

2025
Hita: Holistic Tokenizer for Autoregressive Image Generation.
CoRR, July, 2025

ROSA: Harnessing Robot States for Vision-Language and Action Alignment.
CoRR, June, 2025

PADriver: Towards Personalized Autonomous Driving.
CoRR, May, 2025

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness.
CoRR, April, 2025

Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

Multimode HUAUV Aerial Trajectory Tracking with Antisaturation Control.
Int. J. Robotics Autom., 2025

CSF-NET: Cross-Modal Spatiotemporal Fusion Network for Pulmonary Nodule Malignancy Predicting.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Reconstructive Visual Instruction Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving.
CoRR, 2024

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
CoRR, 2024

Stream Query Denoising for Vectorized HD Map Construction.
CoRR, 2024

Methods for Removing Artifacts from EEG signals: A review.
Proceedings of the 24th IEEE International Conference on Software Quality, 2024

Different Control Methods for FES Cycling: Review.
Proceedings of the 24th IEEE International Conference on Software Quality, 2024

Stream Query Denoising for Vectorized HD-Map Construction.
Proceedings of the Computer Vision - ECCV 2024, 2024

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
ADriver-I: A General World Model for Autonomous Driving.
CoRR, 2023

VLM-Eval: A General Evaluation on Video Large Language Models.
CoRR, 2023

Attention-Guided Contrastive Masked Image Modeling for Transformer-Based Self-Supervised Learning.
Proceedings of the IEEE International Conference on Image Processing, 2023

Research on Construction Method of IoT Knowledge System Based on Knowledge Graph.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2023

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Video Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Evaluation and prediction of free driving behavior type based on fuzzy comprehensive support vector machine.
J. Intell. Fuzzy Syst., 2022

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Peripheral Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP.
CoRR, 2021

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework.
CoRR, 2021

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AsymmNet: Towards Ultralight Convolution Neural Networks Using Asymmetrical Bottlenecks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
Joint face alignment and segmentation via deep multi-task learning.
Multim. Tools Appl., 2019

2015
Abnormal traffic-indexed state estimation: A cyber-physical fusion approach for Smart Grid attack detection.
Future Gener. Comput. Syst., 2015

