We stand with Ukraine

We stand with Ukraine

Yuchi Wang

Orcid: 0009-0006-3242-3851

According to our database¹, Yuchi Wang authored at least 20 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control.

[DOI]

,

,

,

,

,

,

CoRR, April, 2026

TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation.

[DOI]

Victor Shea-Jay Huang

,

,

,

,

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Human or LLM as Standardized Patients? A Comparative Study for Medical Education.

[DOI]

,

,

,

,

,

CoRR, November, 2025

SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Reinforcement Learning Meets Large Language Models: A Survey of Advancements and Applications Across the LLM Lifecycle.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Towards Assessing Medical Ethics from Knowledge to Practice.

[DOI]

,

,

,

,

,

,

,

CoRR, August, 2025

YOLO-SSFA: A Lightweight Real-Time Infrared Detection Method for Small Targets.

[DOI]

,

,

,

,

Inf., 2025

Multiple Queries with Multiple Keys: A Precise Prompt Matching Paradigm for Prompt-based Continual Learning.

[DOI]

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing.

[DOI]

,

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

VidTwin: Video VAE with Decoupled Structure and Dynamics.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Modeling Interactions Between Stocks Using LLM-Enhanced Graphs for Volume Prediction.

[DOI]

,

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Proxy Tuning for Financial Sentiment Analysis: Overcoming Data Scarcity and Computational Barriers.

[DOI]

,

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints.

[DOI]

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

GAIA: Zero-shot Talking Avatar Generation.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Loading...