Yuchi Wang

Orcid: 0009-0006-3242-3851

According to our database1, Yuchi Wang authored at least 20 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control.
CoRR, April, 2026

TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Human or LLM as Standardized Patients? A Comparative Study for Medical Education.
CoRR, November, 2025

SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model.
CoRR, October, 2025

Reinforcement Learning Meets Large Language Models: A Survey of Advancements and Applications Across the LLM Lifecycle.
CoRR, September, 2025

Towards Assessing Medical Ethics from Knowledge to Practice.
CoRR, August, 2025

YOLO-SSFA: A Lightweight Real-Time Infrared Detection Method for Small Targets.
Inf., 2025

Multiple Queries with Multiple Keys: A Precise Prompt Matching Paradigm for Prompt-based Continual Learning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

VidTwin: Video VAE with Decoupled Structure and Dynamics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Modeling Interactions Between Stocks Using LLM-Enhanced Graphs for Volume Prediction.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Proxy Tuning for Financial Sentiment Analysis: Overcoming Data Scarcity and Computational Barriers.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement.
CoRR, 2024

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

GAIA: Zero-shot Talking Avatar Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond.
CoRR, 2023


  Loading...