Bin Li

Orcid: 0000-0002-6508-5071

Affiliations:
  • Shenzhen Institute of Advanced Technology, Shenzhen, China
  • Hunan University, College of Electrical and Information Engineering, Changsha, China (PhD)


According to our database1, Bin Li authored at least 75 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
WOW-Seg: A Word-free Open World Segmentation Model.
CoRR, May, 2026

DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation.
CoRR, January, 2026

DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models.
CoRR, January, 2026

DarwinTOD: LLM Driven Lifelong Self Evolution for Task Oriented Dialog Systems.
CoRR, January, 2026

Hierarchical Prototype Alignment for Video Temporal Grounding.
Entropy, 2026

MSA-UNet3+: Multi-Scale Attention UNet3+ with New Supervised Prototypical Contrastive Loss for Coronary DSA Image Segmentation.
Biomed. Signal Process. Control., 2026

LumiCRS: Asymmetric contrastive prototype learning for long-tail conversational recommender systems.
Appl. Soft Comput., 2026

Boosting Large Language Models for Mental Manipulation Detection via Data Augmentation and Distillation.
Proceedings of the ACM Web Conference 2026, 2026

Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Med-CRAFT: Automated Construction of Interpretable and Multi-Hop Video Workloads via Knowledge Graph Traversal.
CoRR, December, 2025

Adaptive Residual-Update Steering for Low-Overhead Hallucination Mitigation in Large Vision Language Models.
CoRR, November, 2025

Context-Aware Pseudo-Label Scoring for Zero-Shot Video Summarization.
CoRR, October, 2025

HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST.
CoRR, September, 2025

DyBBT: Dynamic Balance via Bandit inspired Targeting for Dialog Policy with Cognitive Dual-Systems.
CoRR, September, 2025

Consistency-Aware Parameter-Preserving Knowledge Editing Framework for Multi-Hop Question Answering.
CoRR, September, 2025

TsqLoRA: Towards Sensitivity and Quality Low-Rank Adaptation for Efficient Fine-Tuning.
CoRR, September, 2025

CPCLDETECTOR: Knowledge Enhancement and Alignment Selection for Chinese Patronizing and Condescending Language Detection.
CoRR, September, 2025

DA-Mamba: Dialogue-aware selective state-space model for multimodal engagement estimation.
CoRR, September, 2025

MVCL-DAF++: Enhancing Multimodal Intent Recognition via Prototype-Aware Contrastive Alignment and Coarse-to-Fine Dynamic Attention Fusion.
CoRR, September, 2025

MedFact-R1: Towards Factual Medical Reasoning via Pseudo-Label Augmentation.
CoRR, September, 2025

Low-Cost Test-Time Adaptation for Robust Video Editing.
CoRR, July, 2025

M<sup>3</sup>-Med: A Benchmark for Multi-lingual, Multi-modal, and Multi-hop Reasoning in Medical Instructional Video Understanding.
CoRR, July, 2025

Small but mighty: enhancing time series forecasting with lightweight LLMs.
J. Supercomput., June, 2025

ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant.
CoRR, May, 2025

Enhancing video temporal grounding with large language model-based data augmentation.
J. Supercomput., April, 2025

Ask2Loc: Learning to Locate Instructional Visual Answers by Asking Questions.
CoRR, April, 2025

MSA-UNet3+: Multi-Scale Attention UNet3+ with New Supervised Prototypical Contrastive Loss for Coronary DSA Image Segmentation.
CoRR, April, 2025

Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion.
CoRR, April, 2025

MambaVesselNet: A Novel Approach to Blood Vessel Segmentation Based on State-Space Models.
IEEE J. Biomed. Health Informatics, March, 2025

VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention.
CoRR, February, 2025

ClinKD: Cross-Modal Clinic Knowledge Distiller For Multi-Task Medical Images.
CoRR, February, 2025

FocusMorph: A novel multi-scale fusion network for 3D brain MR image registration.
Pattern Recognit., 2025

MCD-Temporal: Constructing a New Time-Entropy Enhanced Dynamic Weighted Heterogeneous Ensemble for Cognitive Level Classification.
Informatics, 2025

SpineMamba: Enhancing 3D spinal segmentation in clinical imaging through residual visual Mamba layers and shape priors.
Comput. Medical Imaging Graph., 2025

Tcn-Net: A Novel multi-branch segmentation network for vertebrae MRI and X-ray image.
Biomed. Signal Process. Control., 2025

Overview of the NLPCC 2025 Shared Task 4: Multi-modal, Multilingual, and Multi-hop Medical Instructional Video Question Answering Challenge.
Proceedings of the Natural Language Processing and Chinese Computing, 2025

Learning to Unify Audio, Visual and Text for Audio-Enhanced Visual Answer Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

LoRA-PAR: A Flexible Dual-System LoRA Partitioning Approach to Efficient LLM Fine-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Towards Visual-Prompt Temporal Answer Grounding in Instructional Video.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Towards better Chinese-centric neural machine translation for low-resource languages.
Comput. Speech Lang., March, 2024

Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization.
CoRR, 2024

SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors.
CoRR, 2024

Distinct but correct: generating diversified and entity-revised medical response.
Sci. China Inf. Sci., 2024

Large Language Models With Holistically Thought Could Be Better Doctors.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Overview of the NLPCC 2024 Shared Task 7: Multi-lingual Medical Instructional Video Question Answering.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Bilateral personalized dialogue generation with contrastive learning.
Soft Comput., March, 2023

Large Language Models Need Holistically Thought in Medical Conversational QA.
CoRR, 2023

Neural Comprehension: Language Models with Compiled Neural Networks.
CoRR, 2023

Overview of the NLPCC 2023 Shared Task: Chinese Medical Instructional Video Question Answering.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Visual Answer Localization with Cross-Modal Mutual Knowledge Transfer.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning To Locate Visual Answer In Video Corpus Using Question.
Proceedings of the IEEE International Conference on Acoustics, 2023

Large Language Models are Better Reasoners with Self-Verification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Prompt-based system for personality and interpersonal reactivity prediction.
Softw. Impacts, 2022

Artificial Text Detection with Multiple Training Strategies.
CoRR, 2022

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs.
CoRR, 2022

Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video.
CoRR, 2022

Continuing Pre-trained Model with Multiple Training Strategies for Emotional Classification.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

Prompt-based Pre-trained Model for Personality and Interpersonal Reactivity Prediction.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

LingJing at SemEval-2022 Task 3: Applying DeBERTa to Lexical-level Presupposed Relation Taxonomy with Knowledge Transfer.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

LingJing at SemEval-2022 Task 1: Multi-task Self-supervised Pre-training for Multilingual Reverse Dictionary.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

MedConQA: Medical Conversational Question Answering System based on Knowledge Graphs.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Multi-tasking and Multi-stage Chinese Minority Pre-trained Language Model.
Proceedings of the Machine Translation - 18th China Conference, 2022

VPAI_Lab at MedVidQA 2022: A Two-Stage Cross-modal Fusion Method for Medical Instructional Video Classification.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

A Knowledge storage and semantic space alignment Method for Multi-documents dialogue generation.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

ANACONDA: Adversarial training with iNtrust loss in ACrONym DisambiguAtion.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

ADBCMM : Acronym Disambiguation by Building Counterfactuals and Multilingual Mixing.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

SimCLAD: A Simple Framework for Contrastive Learning of Acronym Disambiguation.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

PSG: Prompt-based Sequence Generation for Acronym Extraction.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

A Novel Initial Reminder Framework for Acronym Extraction.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

2021
ADBCMM : Acronym Disambiguation by Building Counterfactuals and Multilingual Mixing.
CoRR, 2021

SimCLAD: A Simple Framework for Contrastive Learning of Acronym Disambiguation.
CoRR, 2021

More but Correct: Generating Diversified and Entity-revised Medical Response.
CoRR, 2021

Bilateral Personalized Dialogue Generation with Dynamic Persona-Aware Fusion.
CoRR, 2021


  Loading...