Ying Shen

Affiliations:
  • University of Illinois Urbana-Champaign, IL, USA
  • Virginia Tech, Blacksburg, VA, USA (former)


According to our database, Ying Shen authored at least 20 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer.
CoRR, June, 2025

R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation.
CoRR, May, 2025

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models.
CoRR, February, 2025

ELBA: Learning by Asking for Embodied Visual Navigation and Task Completion.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Modality-Specialized Synergizers for Interleaved Vision-Language Generalists.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models.
CoRR, 2024

Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations.
CoRR, 2024

InternalInspector I²: Robust Confidence Estimation in LLMs through Internal States.
CoRR, 2024

Many-to-many Image Generation with Auto-regressive Diffusion Models.
CoRR, 2024

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

InternalInspector I²: Robust Confidence Estimation in LLMs through Internal States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
The Art of SOCRATIC QUESTIONING: Zero-shot Multimodal Reasoning with Recursive Thinking and Self-Questioning.
CoRR, 2023

Learning by Asking for Embodied Visual Navigation and Task Completion.
CoRR, 2023

The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2019
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Efficient Low-rank Multimodal Fusion With Modality-Specific Factors.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
