Jinyi Hu

Orcid: 0009-0002-9440-4198

According to our database¹, Jinyi Hu authored at least 30 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, February, 2026

Exploring Perceptual Limitations of Multimodal LLMs on Small Visual Objects.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

2025

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience.

[BibT_eX]

[DOI]

CoRR, December, 2025

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents.

[BibT_eX]

[DOI]

CoRR, January, 2025

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

NVILA: Efficient Frontier Visual Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GUICourse: From General Vision Language Model to Versatile GUI Agent.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions.

[BibT_eX]

[DOI]

CoRR, 2024

NVILA: Efficient Frontier Visual Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.

[BibT_eX]

[DOI]

CoRR, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring Perceptual Limitation of Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

scMulan: A Multitask Generative Pre-Trained Language Model for Single-Cell Analysis.

[BibT_eX]

[DOI]

Proceedings of the Research in Computational Molecular Biology, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LEGENT: Open Platform for Embodied Agents.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2024

2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.

[BibT_eX]

[DOI]

CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Towards Interpretable Natural Language Understanding with Explanations as Latent Variables.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generating Major Types of Chinese Classical Poetry in a Uniformed Framework.

[BibT_eX]

[DOI]

Jinyi Hu

Maosong Sun

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Jinyi Hu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...