Yuexiang Zhai

According to our database¹, Yuexiang Zhai authored at least 24 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

DexHoldem: Playing Texas Hold'em with Dexterous Embodied System.

[BibT_eX]

[DOI]

CoRR, May, 2026

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2024

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement.

[BibT_eX]

[DOI]

Ruiqi Zhang

Yuexiang Zhai

Andrea Zanette

CoRR, 2024

RLIF: Interactive Imitation Learning as Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the Conference on Parsimony and Learning, 2024

Closed-Loop Transcription via Convolutional Sparse Coding.

[BibT_eX]

[DOI]

Proceedings of the Conference on Parsimony and Learning, 2024

2023

Investigating the Catastrophic Forgetting in Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, 2023

Understanding the Complexity Gains of Single-Task RL with a Curriculum.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Computational Benefits of Intermediate Rewards for Hierarchical Planning.

[BibT_eX]

[DOI]

CoRR, 2021

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training.

[BibT_eX]

[DOI]

Carlos Fernandez-Granda

Qing Qu

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2020

Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Analysis of the Optimization Landscapes for Overcomplete Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Complete Dictionary Learning via 𝓁<sup>4</sup>-Norm Maximization over the Orthogonal Group.

[BibT_eX]

[DOI]

CoRR, 2019

Learning to Reconstruct 3D Manhattan Wireframes From a Single Image.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Yuexiang Zhai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...