Yutong Bai

Orcid: 0000-0002-6210-7757

According to our database, Yutong Bai authored at least 57 papers between 2017 and 2026.

Bibliography

2026
Hierarchical Document-Aware Interest Profiling in Personalized Search.
IEEE Trans. Knowl. Data Eng., April, 2026

Lifting Embodied World Models for Planning and Control.
CoRR, April, 2026

PreSight: Preoperative Outcome Prediction for Parkinson's Disease via Region-Prior Morphometry and Patient-Specific Weighting.
CoRR, March, 2026

Virtual Biopsy for Intracranial Tumors Diagnosis on MRI.
CoRR, February, 2026

ReBA-Pred-Net: Weakly-Supervised Regional Brain Age Prediction on MRI.
CoRR, February, 2026

Deep learning aided intelligent signal recognition for backscatter based metamaterial passive Internet of Things system.
Complex Intell. Syst., 2026

2025
Vibe Spaces for Creatively Connecting and Expressing Visual Concepts.
CoRR, December, 2025

Pillar-0: A New Frontier for Radiology Foundation Models.
CoRR, November, 2025

GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation.
CoRR, October, 2025

Transformers Discover Molecular Structure Without Graph Priors.
CoRR, October, 2025

PD-Diag-Net: Clinical-Priors guided Network on Brain MRI for Auxiliary Diagnosis of Parkinson's Disease.
CoRR, September, 2025

The Serial Scaling Hypothesis.
CoRR, July, 2025

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy.
CoRR, June, 2025

"I Know It When I See It": Mood Spaces for Connecting and Expressing Visual Concepts.
CoRR, April, 2025

Vector Quantized Feature Fields for Fast 3D Semantic Lifting.
CoRR, March, 2025

Subthalamic nucleus stimulation at high and low frequencies engages different brain networks to enhance gait performance in Parkinson's disease.
NeuroImage, 2025

REOrdering Patches Improves Vision Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Whole-Body Conditioned Egocentric Video Prediction.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Intent-Oriented Dynamic Interest Modeling for Personalized Web Search.
ACM Trans. Inf. Syst., July, 2024

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
CoRR, 2024

Analyzing The Language of Visual Tokens.
CoRR, 2024

Evaluating Multiview Object Consistency in Humans and Image Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The Impact of Blockchain Implementations on Supply Chain Collaboration.
Proceedings of the Advances in Production Management Systems. Production Management Systems for Volatile, Uncertain, Complex, and Ambiguous Environments, 2024

Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Dynamic Multi-attribute Interest for Personalized Product Search.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Finding Visual Task Vectors.
Proceedings of the Computer Vision - ECCV 2024, 2024

Masked Autoencoders are Secretly Efficient Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Sequential Modeling Enables Scalable Learning for Large Vision Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024

2023
Understanding Pan-Sharpening via Generalized Inverse.
CoRR, 2023

Intriguing Properties of Text-guided Diffusion Models.
CoRR, 2023

Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

CoKe: Contrastive Learning for Robust Keypoint Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Making Your First Choice: To Address Cold Start Problem in Medical Active Learning.
Proceedings of the Medical Imaging with Deep Learning, 2023

Can CNNs Be More Robust Than Transformers?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Masked Autoencoders Enable Efficient Knowledge Distillers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Low-frequency oscillations link frontal and parietal cortex with subthalamic nucleus in conflicts.
NeuroImage, 2022

Making Your First Choice: To Address Cold Start Problem in Vision Active Learning.
CoRR, 2022

CateNorm: Categorical Normalization for Robust Medical Image Segmentation.
Proceedings of the Domain Adaptation and Representation Transfer - 4th MICCAI Workshop, 2022

Fast AdvProp.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Point-Level Region Contrast for Object Detection Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransFG: A Transformer Architecture for Fine-Grained Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
TransFG: A Transformer Architecture for Fine-grained Recognition.
CoRR, 2021

Glance-and-Gaze Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Are Transformers more robust than CNNs?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mask Guided Matting via Progressive Refinement Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Unsupervised Part Discovery via Feature Alignment.
CoRR, 2020

Can Temporal Information Help with Contrastive Self-Supervised Learning?
CoRR, 2020

CoKe: Localized Contrastive Learning for Robust Keypoint Detection.
CoRR, 2020

C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
SSDC-DenseNet: A Cost-Effective End-to-End Spectral-Spatial Dual-Channel Dense Network for Hyperspectral Image Classification.
IEEE Access, 2019

Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints From Limited Training Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A computer-assisted instructional method based on machine learning in software testing class.
Comput. Appl. Eng. Educ., 2018

2017
Underactuated control of swing in orbit debris towing removal via tether space robots.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

