Steven Lu
Affiliations:- Bloomberg, New York, NY USA
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2023
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023