Xinyu Wei

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing.
CoRR, February, 2026

GENIUS: Generative Fluid Intelligence Evaluation Suite.
CoRR, February, 2026

2025
MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition.
CoRR, December, 2025

VideoVerse: How Far is Your T2V Generator from a World Model?
CoRR, October, 2025

Retrieval Feedback Memory Enhancement Large Model Retrieval Generation Method.
CoRR, August, 2025

CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation.
CoRR, August, 2025

Dynamic Embedding of Hierarchical Visual Features for Efficient Vision-Language Fine-Tuning.
CoRR, August, 2025

Separation and Collaboration: Two-Level Routing Grouped Mixture-of-Experts for Multi-Domain Continual Learning.
CoRR, August, 2025

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos.
CoRR, June, 2025

TIIF-Bench: How Does Your T2I Model Follow Your Instructions?
CoRR, June, 2025

Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO.
CoRR, May, 2025

Traffic Models of Dynamic Periodic Event-Triggered Control Systems.
IEEE Trans. Control. Netw. Syst., March, 2025

Are Large Language Models Good In-context Learners for Financial Sentiment Analysis?
CoRR, March, 2025

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Understanding the Economic Consequences of Information Security Incidents on Companies and Industry Peers: The Moderating Roles of Incident Attribution and Organizational Response.
Proceedings of the 46th International Conference on Information Systems, 2025

Tracing Your Roots: Exploring the Security Issues of Root Certificates in Android TLS Connections.
Proceedings of the Information Security and Cryptology - 21st International Conference, 2025

2024
Accretionary Learning With Deep Neural Networks With Applications.
IEEE Trans. Cogn. Commun. Netw., April, 2024

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions.
CoRR, 2024

MAVIS: Mathematical Visual Instruction Tuning.
CoRR, 2024

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception.
CoRR, 2024

In-field grading and sorting technology of apples: A state-of-the-art review.
Comput. Electron. Agric., 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Cloud-Device Collaborative Learning for Multimodal Large Language Models.
CoRR, 2023

2022
Privacy-Preserving Robust Federated Learning with Distributed Differential Privacy.
Proceedings of the IEEE International Conference on Trust, 2022

2021
Accretionary Learning with Deep Neural Networks.
CoRR, 2021

2020
Characteristic analysis of humidity control in a fresh-keeping container using CFD model.
Comput. Electron. Agric., 2020

2019
Camouflage Design of Analysis Based on HSV Color Statistics and K-means Clustering.
CoRR, 2019

A Books Recommendation Approach Based on Online Bookstore Data.
CoRR, 2019

Impact of Information Access on Poverty Alleviation Effectiveness: Evidence From China.
IEEE Access, 2019

A Hierarchical Framwork with Improved Loss for Large-scale Multi-modal Video Identification.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Real-Time Monocular Visual SLAM by Combining Points and Lines.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Spatiotemporal Attention Networks for Wind Power Forecasting.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

2017
A Non-Orthogonal Selection Cooperation Protocol with Interference in Multi-Source Cooperative Networks.
Wirel. Pers. Commun., 2017


  Loading...