Yuhui Zhang

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think.
CoRR, April, 2026

CellFluxRL: Biologically-Constrained Virtual Cell Modeling via Reinforcement Learning.
CoRR, March, 2026

Uncertainty Quantification for Distribution-to-Distribution Flow Matching in Scientific Imaging.
CoRR, March, 2026

Fine-tuning MLLMs Without Forgetting Is Easier Than You Think.
CoRR, March, 2026

Tool Verification for Test-Time Reinforcement Learning.
CoRR, March, 2026

Understanding the Fine-Grained Knowledge Capabilities of Vision-Language Models.
CoRR, February, 2026

SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking.
CoRR, February, 2026

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions.
CoRR, January, 2026

Audio-Driven Talking Face Generation with Blink Embedding and Hash Grid Landmarks Encoding.
CoRR, January, 2026

RadDiff: Describing Differences in Radiology Image Sets with Natural Language.
CoRR, January, 2026

A Deep-DiD Method to Estimate Heterogeneous Treatment Effects: Application to Content Creator Selection.
Mark. Sci., 2026

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Intelligent recognition of GPR road hidden defect images based on feature fusion and attention mechanism.
CoRR, December, 2025

Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning.
CoRR, December, 2025

Lightweight framework for underground pipeline recognition and spatial localization based on multi-view 2D GPR images.
CoRR, December, 2025

Three Forms of Stochastic Injection for Improved Distribution-to-Distribution Generative Modeling.
CoRR, October, 2025

No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models.
CoRR, October, 2025

MuSLR: Multimodal Symbolic Logical Reasoning.
CoRR, September, 2025

Closing the Modality Gap for Mixed Modality Search.
CoRR, July, 2025

Can Large Language Models Match the Conclusions of Systematic Reviews?
CoRR, May, 2025

A 2D Semantic-Aware Position Encoding for Vision Transformers.
CoRR, May, 2025

A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI.
CoRR, March, 2025

EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking.
CoRR, February, 2025

CellFlow: Simulating Cellular Morphology Changes via Flow Matching.
CoRR, February, 2025

Temporal Preference Optimization for Long-Form Video Understanding.
CoRR, January, 2025

Intelligent Recognition of GPR Road Hidden Defect Images Based on Feature Fusion and Attention Mechanism.
IEEE Trans. Geosci. Remote. Sens., 2025

Lightweight Framework for Underground Pipeline Recognition and Spatial Localization Based on Multiview 2-D GPR Images.
IEEE Trans. Geosci. Remote. Sens., 2025

Performance enhancement of LED displays through optimized light distribution.
Displays, 2025

Dilated residual convolutional network for surface electromyographic hand gesture recognition.
Biomed. Signal Process. Control., 2025

An extended variational autoencoder for cross-subject electromyograph gesture recognition.
Biomed. Signal Process. Control., 2025

AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

TULiP: Test-Time Uncertainty Estimation via Linearization and Weight Perturbation.
Proceedings of the Neural Information Processing - 32nd International Conference, 2025

CellFlux: Simulating Cellular Morphology Changes via Flow Matching.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Video Action Differencing.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RDA: Regularized Domain Adaptation for Multimedia Event Extraction.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

DataLab: A Unified Platform for LLM-Powered Business Intelligence.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Data or Language Supervision: What Makes CLIP Better than DINO?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MAKAR: a Multi-Agent framework based Knowledge-Augmented Reasoning for Grounded Multimodal Named Entity Recognition.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

The Academic as an Entrepreneur: Real Options and Co-authoring in IS Scholarship.
Proceedings of the 31st Americas Conference on Information Systems: Intelligent Technologies for a Better Future, 2025

NegVQA: Can Vision Language Models Understand Negation?
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Single-Shot Direct Transmission Terahertz Imaging Based on Intense Broadband Terahertz Radiation.
Sensors, July, 2024

Denoising cosine similarity: A theory-driven approach for efficient representation learning.
Neural Networks, January, 2024

Online cross session electromyographic hand gesture recognition using deep learning and transfer learning.
Eng. Appl. Artif. Intell., January, 2024

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding.
CoRR, 2024

A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data.
CoRR, 2024

Augmenting Particle Swarm Optimization with Simulated Annealing and Dimensional Learning for UAVs Path Planning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Why are Visually-Grounded Language Models Bad at Image Classification?
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Micro-Bench: A Microscopy Benchmark for Vision-Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

FedSecurity: A Benchmark for Attacks and Defenses in Federated Learning and Federated LLMs.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Competitive Swarm Optimizer with Momentum for Numerical Optimization.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2024

Robust VAEs via Generating Process of Noise Augmented Data.
Proceedings of the IEEE International Symposium on Information Theory, 2024

MuEP: A Multimodal Benchmark for Embodied Planning with Foundation Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Open-World Learning Under Dataset Shift.
Proceedings of the IEEE Conference on Artificial Intelligence, 2024

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Locally Informed Competitive Swarm Optimizer with an External Archive for Multimodal Optimization.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

VideoAgent: Long-Form Video Understanding with Large Language Model as Agent.
Proceedings of the Computer Vision - ECCV 2024, 2024

Describing Differences in Image Sets with Natural Language.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
A trust region based local Bayesian optimization without exhausted optimization of acquisition function.
Evol. Syst., October, 2023

Deep Clustering With a Constraint for Topological Invariance Based on Symmetric InfoNCE.
Neural Comput., July, 2023

A multi-objective evolutionary algorithm based on mixed encoding for community detection.
Multim. Tools Appl., April, 2023

Quantifying the Effects of Snow on the Beginning of Vegetation Growth in the Mongolian Plateau.
Remote. Sens., March, 2023

Co-creation environment with cloud virtual reality and real-time artificial intelligence toward the design of molecular robots.
J. Integr. Bioinform., March, 2023

Personalized immune subtypes based on machine learning predict response to checkpoint blockade in gastric cancer.
Briefings Bioinform., January, 2023

Inverse Scaling: When Bigger Isn't Better.
Trans. Mach. Learn. Res., 2023

Holistic Evaluation of Language Models.
Trans. Mach. Learn. Res., 2023

Multi-domain adaptation for cross-domain semantic slot filling.
Eng. Appl. Artif. Intell., 2023

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
CoRR, 2023

FedMLSecurity: A Benchmark for Attacks and Defenses in Federated Learning and LLMs.
CoRR, 2023

DGP-Net: Dense Graph Prototype Network for Few-Shot SAR Target Recognition.
CoRR, 2023

Towards Understanding the Mechanism of Contrastive Learning via Similarity Structure: A Theoretical Analysis.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adjusting Exploitation and Exploration Rates of Differential Evolution: A Novel Mutation Strategy.
Proceedings of the Digital Multimedia Communications, 2023

Diagnosing and Rectifying Vision Models using Language.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Effects of Personalized Nudging in eLearning Environments.
Proceedings of the 44th International Conference on Information Systems, 2023

Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation.
Proceedings on "I Can't Believe It's Not Better: Failure Modes in the Age of Foundation Models" at NeurIPS 2023 Workshops, 2023

An Exploration of Operational Image Recognition Application for Agricultural Meteorological Observation Based on YOLO5 Framework.
Proceedings of the 11th International Conference on Agro-Geoinformatics, 2023

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation.
Proceedings of the Machine Learning for Health, 2022

Deep Self-Supervised Learning of Speech Denoising from Noisy Speeches.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Effects of the Bias Magnetic Field and Annealing on the Magnetization of Terfenol-D Films.
Proceedings of the 2022 IEEE Sensors, Dallas, TX, USA, October 30 - Nov. 2, 2022, 2022

Unfettered Access Tokens: Discovering Security Flaws of the Access Token in Smart Home Platforms.
Proceedings of the IEEE International Conference on Communications, 2022

2021
GTree: an Open-source Tool for Dense Reconstruction of Brain-wide Neuronal Population.
Neuroinformatics, 2021

Biomedical and clinical English model packages for the Stanza Python NLP library.
J. Am. Medical Informatics Assoc., 2021

An improved method for soft tissue modeling.
Biomed. Signal Process. Control., 2021

Fsft-Net: Face Transfer Video Generation With Few-Shot Views.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Design and Implementation of Intelligent Agricultural Meteorological App.
Proceedings of the 9th International Conference on Agro-Geoinformatics, 2021

2020
Spatio-Temporal Mapping of Multi-Satellite Observed Column Atmospheric CO2 Using Precision-Weighted Kriging Method.
Remote. Sens., 2020

Construction of force haptic reappearance system based on Geomagic Touch haptic device.
Comput. Methods Programs Biomed., 2020

A Genetic Algorithm-Based Solver for Small-Scale Jigsaw Puzzles.
Proceedings of the Advances in Swarm Intelligence - 11th International Conference, 2020

Enhancing Transformer with Sememe Knowledge.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Intelligent Online Shopping Store for Specialties of Fujian.
Proceedings of the 3rd IEEE International Conference on Knowledge Innovation and Invention, 2020

Inducing Grammar from Long Short-Term Memory Networks by Shapley Decomposition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

Stanza: A Python Natural Language Processing Toolkit for Many Human Languages.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
VetTag: improving automated veterinary diagnosis coding via large-scale language modeling.
npj Digit. Medicine, 2019

Irregular Convolutional Auto-Encoder on Point Clouds.
CoRR, 2019

Long Term Background Reference Based Satellite Video Coding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Study on the Effect of App Reverse Cycle Propagation Under Multi-screen Propagation.
Proceedings of the HCI International 2019 - Posters - 21st International Conference, 2019

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
DeepTag: inferring diagnoses from veterinary clinical notes.
npj Digit. Medicine, 2018

Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding.
CoRR, 2018

Taxi or Hitchhiking: Predicting Passenger's Preferred Service on Ride Sharing Platforms.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

2016
Novel Downhole Electromagnetic Flowmeter for Oil-Water Two-Phase Flow in High-Water-Cut Oil-Producing Wells.
Sensors, 2016

Separation cutoff for upward skip-free chains.
J. Appl. Probab., 2016

2015
Reverse Training for Leaf Image Set Classification.
Proceedings of the Advanced Intelligent Computing Theories and Applications, 2015

2013
Spatial Distribution of Wall Shear Stress in Common Carotid Artery by Color Doppler Flow Imaging.
J. Digit. Imaging, 2013

2012
Broadband circularly polarized patch antenna for small satellites applications.
Proceedings of the 8th International Symposium on Communication Systems, 2012

2011
Traffic Risk Assessment of Freeway On-Ramp and Off-Ramp Areas Based on Simulation Analysis.
Proceedings of the Modeling Risk Management in Sustainable Construction, 2011

A component inspection algorithm based on low-dimensional image feature.
Proceedings of the Third International Conference on Digital Image Processing, 2011

2006
Four-Wing attractors: from Pseudo to Real.
Int. J. Bifurc. Chaos, 2006

