Yuexiang Zhai

According to our database1, Yuexiang Zhai authored at least 23 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
J. Mach. Learn. Res., 2024

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning.
CoRR, 2024

Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement.
CoRR, 2024

RLIF: Interactive Imitation Learning as Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning.
Proceedings of the Conference on Parsimony and Learning, 2024

Closed-Loop Transcription via Convolutional Sparse Coding.
Proceedings of the Conference on Parsimony and Learning, 2024

2023
Investigating the Catastrophic Forgetting in Multimodal Large Language Models.
CoRR, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.
CoRR, 2023

Understanding the Complexity Gains of Single-Task RL with a Curriculum.
Proceedings of the International Conference on Machine Learning, 2023

2022
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning.
J. Artif. Intell. Res., 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Computational Benefits of Intermediate Rewards for Hierarchical Planning.
CoRR, 2021

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group.
J. Mach. Learn. Res., 2020

Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Analysis of the Optimization Landscapes for Overcomplete Representation Learning.
CoRR, 2019

Complete Dictionary Learning via 𝓁<sup>4</sup>-Norm Maximization over the Orthogonal Group.
CoRR, 2019

Learning to Reconstruct 3D Manhattan Wireframes From a Single Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...