Jinyi Hu

Orcid: 0009-0002-9440-4198

According to our database1, Jinyi Hu authored at least 30 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning.
CoRR, February, 2026

Exploring Perceptual Limitations of Multimodal LLMs on Small Visual Objects.
Trans. Mach. Learn. Res., 2026

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer.
Trans. Mach. Learn. Res., 2026

2025
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience.
CoRR, December, 2025

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents.
CoRR, January, 2025

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

NVILA: Efficient Frontier Visual Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GUICourse: From General Vision Language Model to Versatile GUI Agent.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions.
CoRR, 2024

NVILA: Efficient Frontier Visual Language Models.
CoRR, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
CoRR, 2024

Exploring Perceptual Limitation of Multimodal Large Language Models.
CoRR, 2024

scMulan: A Multitask Generative Pre-Trained Language Model for Single-Cell Analysis.
Proceedings of the Research in Computational Molecular Biology, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LEGENT: Open Platform for Embodied Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2024

2023
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots.
CoRR, 2023

2022
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Towards Interpretable Natural Language Understanding with Explanations as Latent Variables.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generating Major Types of Chinese Classical Poetry in a Uniformed Framework.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


  Loading...