Bihui Yu

Orcid: 0000-0001-8614-7350

According to our database, Bihui Yu authored at least 33 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
StructVRM: Aligning Multimodal Reasoning with Structured and Verifiable Reward Models.
CoRR, August, 2025

Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions.
CoRR, August, 2025

SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches.
CoRR, August, 2025

LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation.
CoRR, July, 2025

ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering.
CoRR, June, 2025

ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering.
CoRR, May, 2025

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification.
CoRR, February, 2025

Brain-inspired computing based on deep learning for human-computer interaction: A review.
Neurocomputing, 2025

EEGTCT: Electroencephalogram-Based Chinese Text Decoding.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Beyond Relevance: Utility-Driven Retrieval for Visual Document Question Answering.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

ChiImpAVE: An Open-Source Benchmark for Chinese Implicit Attribute Value Extraction.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Enhancing human-like multimodal reasoning: a new challenging dataset and comprehensive framework.
Neural Comput. Appl., November, 2024

BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search.
CoRR, 2024

Synth-Empathy: Towards High-Quality Synthetic Empathy Data.
CoRR, 2024

Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data.
CoRR, 2024

Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning.
CoRR, 2024

mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning.
CoRR, 2024

A survey on advancements in image-text multimodal models: From general techniques to biomedical implementations.
Comput. Biol. Medicine, 2024

DGHC: A Hybrid Algorithm for Multi-Modal Named Entity Recognition Using Dynamic Gating and Correlation Coefficients With Visual Enhancements.
IEEE Access, 2024

Faster and More Efficient Subject Image Generation for Text-to-Image Diffusion Models.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

SAM-Wav2lip++: Enhancing Behavioral Realism in Synthetic Agents Through Audio-Driven Speech and Action Refinement.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Interpretable and Generalizable Spatiotemporal Predictive Learning with Disentangled Consistency.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-consistency Training.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Rational Sensibility: LLM Enhanced Empathetic Response Generation Guided by Self-presentation Theory.
CoRR, 2023

Human-computer Interaction for Brain-inspired Computing Based on Machine Learning And Deep Learning: A Review.
CoRR, 2023

A Survey on Image-text Multimodal Models.
CoRR, 2023

Enhancing Human-like Multi-Modal Reasoning: A New Challenging Dataset and Comprehensive Framework.
CoRR, 2023

TED-CS: Textual Enhanced Sensitive Video Detection with Common Sense Knowledge.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

2022
Feature-guided Multimodal Sentiment Analysis towards Industry 4.0.
Comput. Electr. Eng., 2022

2021
Research on SPARQL Semantic Query Technology Based on Knowledge Hybrid Storage.
Proceedings of the ICACS '21: 2021 The 5th International Conference on Algorithms, Computing and Systems, Xi'an, China, September 24, 2021

