Guoxin Wang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Perception-decision-execution coordination mechanism driven dynamic autonomous collaboration method for human-like collaborative robot based on multimodal large language model.
Robotics Comput. Integr. Manuf., 2026

A review on large language models for industrial embodied intelligence.
Adv. Eng. Informatics, 2026

2025
The method for underground personnel behavior recognition based on multi-information flow collaborative graph convolutional neural networks.
Signal Image Video Process., November, 2025

Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning.
CoRR, September, 2025

MoCrop: Training Free Motion Guided Cropping for Efficient Video Action Recognition.
CoRR, September, 2025

Emergence of Hierarchies in Multi-Agent Self-Organizing Systems Pursuing a Joint Objective.
CoRR, August, 2025

JoyTTS: LLM-based Spoken Chatbot With Voice Cloning.
CoRR, July, 2025

Debiased Estimation and Inference for Spatial-Temporal EEG/MEG Source Imaging.
IEEE Trans. Medical Imaging, March, 2025

Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support.
CoRR, February, 2025

OMAL-YOLOv8: real-time detection algorithm for insulator defects based on optimized feature fusion.
J. Real Time Image Process., January, 2025

DAR-Prompt: Dynamic Regulation in Prompt Tuning for Multi-Label Zero-Shot Learning.
IEEE Trans. Image Process., 2025

Digital twin-driven self-adaptive reconfiguration planning method of smart manufacturing systems using game theory and deep Q-network for industry 5.0.
J. Ind. Inf. Integr., 2025

CS2former: Multimodal feature fusion transformer with dual channel-spatial feature extraction module for bipolar disorder diagnosis.
Comput. Medical Imaging Graph., 2025

MHAD: Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
The real-time detection method for coal gangue based on YOLOv8s-GSC.
J. Real Time Image Process., April, 2024

An alternative three-dimensional subspace method based on conic model for unconstrained optimization.
RAIRO Oper. Res., January, 2024

LMKG: A large-scale and multi-source medical knowledge graph for intelligent medicine applications.
Knowl. Based Syst., 2024

3SHNet: Boosting image-sentence retrieval via visual semantic-spatial self-highlighting.
Inf. Process. Manag., 2024

JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation.
CoRR, 2024

JoyType: A Robust Design for Multilingual Visual Text Creation.
CoRR, 2024

JoyHallo: Digital human model for Mandarin.
CoRR, 2024

Multi modality fusion transformer with spatio-temporal feature aggregation module for psychiatric disorder diagnosis.
Comput. Medical Imaging Graph., 2024

Robust deep learning from incomplete annotation for accurate lung nodule detection.
Comput. Biol. Medicine, 2024

TaD: A Plug-and-Play Task-Aware Decoding Method to Better Adapt LLMs on Downstream Tasks.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

A Bi-Pyramid Multimodal Fusion Method for the Diagnosis Of Bipolar Disorders.
Proceedings of the IEEE International Conference on Acoustics, 2024

PYRA: Parallel Yielding Re-activation for Training-Inference Efficient Task Adaptation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
ChatGPT Performs on the Chinese National Medical Licensing Examination.
J. Medical Syst., December, 2023

Automatic identification of building structure types using unmanned aerial vehicle oblique images and deep learning considering facade prior knowledge.
Int. J. Digit. Earth, December, 2023

Robotic drilling for the Chinese Chang'E 5 lunar sample-return mission.
Int. J. Robotics Res., July, 2023

A Three-Dimensional Subspace Algorithm Based on the Symmetry of the Approximation Model and WYL Conjugate Gradient Method.
Symmetry, 2023

A three-stage algorithm for coordinate controlling of multi-intersection signal.
Expert Syst. Appl., 2023

Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Clasification.
CoRR, 2023

Kosmos-2.5: A Multimodal Literate Model.
CoRR, 2023

Understanding the Impact of AI Decision speed and Historical Decision Quality on User adoption in AI-assisted Decision Making.
Proceedings of the 27th Pacific Asia Conference on Information Systems, 2023


Unifying Vision, Text, and Layout for Universal Document Processing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-modal Contrastive-Generative Pre-training for Fine-grained Skin Disease Diagnosis.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

Effect of AI Decision Speed on User Adoption in Human-AI Collaboration: The Moderating Role of Historical Decision Quality.
Proceedings of the 29th Americas Conference on Information Systems, 2023

Can AI Chatbots with Anthropomorphic Attributes Enhance User Engagement in Emotional Support Settings? Investigating the Role of Conversational Styles and Avatar Type.
Proceedings of the 29th Americas Conference on Information Systems, 2023

2022
A Class of Three-Dimensional Subspace Conjugate Gradient Algorithms for Unconstrained Optimization.
Symmetry, 2022

Automatic extraction of building geometries based on centroid clustering and contour analysis on oblique images taken by unmanned aerial vehicles.
Int. J. Geogr. Inf. Sci., 2022

Understanding Long Documents with Different Position-Aware Attentions.
CoRR, 2022

A Simple yet Effective Learnable Positional Encoding Method for Improving Document Transformer Model.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Signal adaptive cooperative control of two adjacent traffic intersections using a two-stage algorithm.
Expert Syst. Appl., 2021

BoningKnife: Joint Entity Mention Detection and Typing for Nested NER via prior Boundary Knowledge.
CoRR, 2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding.
CoRR, 2021

Image Haze Removal Algorithm Using a Logarithmic Guide Filtering and Multi-Channel Prior.
IEEE Access, 2021

Understanding the Status of Reemployment of Elderly Talents in Digital Society: Evidence from Aged Job Search Website in China.
Proceedings of the 20th Wuhan International Conference on E-Business, 2021

Research on Early Warning of Non-performing Loans of Small and Medium-sized Micro-enterprises Under the Background of COVID-19 - Taking XX Branch of N Bank as an Example.
Proceedings of the 20th Wuhan International Conference on E-Business, 2021

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
A Novel Image Dehazing Algorithm via Adaptive Gamma-Correction and Modified AMEF.
IEEE Access, 2020

Reinforcement learning based curiosity-driven testing of Android applications.
Proceedings of the ISSTA '20: 29th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2020

Visual Style Extraction from Chart Images for Chart Restyling.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Enhanced Meta-Learning for Cross-Lingual Named Entity Recognition with Minimal Resources.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition.
CoRR, 2019

DeepMRT at the NTCIR-14 FinNum Task: A Hybrid Neural Model for Numeral Type Classification in Financial Tweets.
Proceedings of the 14th NTCIR Conference on Evaluation of Information Access Technologies, 2019

CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Expected residual minimization formulation for a class of stochastic linear second-order cone complementarity problems.
Eur. J. Oper. Res., 2018

Optimization of industrial boiler combustion control system based on genetic algorithm.
Comput. Electr. Eng., 2018

Research and Implementation of UHPLC Pump Dual-axis Cooperative Control.
Proceedings of the 5th International Conference on Systems and Informatics, 2018

2016
Local Adaptive Calibration of the Satellite-Derived Surface Incident Shortwave Radiation Product Using Smoothing Spline.
IEEE Trans. Geosci. Remote. Sens., 2016

Evaluation of the Reanalysis Surface Incident Shortwave Radiation Products from NCEP, ECMWF, GSFC, and JMA Using Satellite and Surface Observations.
Remote. Sens., 2016

2008
Experimental Study for Automatic Colony Counting System Based Onimage Processing.
Proceedings of the Computer and Computing Technologies in Agriculture II, Volume 2, 2008

2006
Determination of Design Ground Motion For Critical Engineering Structures Based on Probabilistic Seismic Hazard Analysis.
Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), 2006


  Loading...