Dianbo Sui

Orcid: 0000-0002-5200-2265

According to our database, Dianbo Sui authored at least 60 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning.
CoRR, May, 2026

Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives.
CoRR, February, 2026

Beyond Confidence: The Rhythms of Reasoning in Generative Models.
CoRR, February, 2026

EchoReview: Learning Peer Review from the Echoes of Scientific Citations.
CoRR, February, 2026

Towards enhanced LLM pretraining: Dynamic checkpoint merging via generation quality.
Inf. Fusion, 2026

2025
Large Language Models for Planning: A Comprehensive and Systematic Survey.
CoRR, May, 2025

Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers.
CoRR, May, 2025

LFTF: Locating First and Then Fine-Tuning for Mitigating Gender Bias in Large Language Models.
CoRR, May, 2025

LLMSR@XLLM25: An Empirical Study of LLM for Structural Reasoning.
CoRR, May, 2025

A Federated Social Recommendation Approach with Enhanced Hypergraph Neural Network.
ACM Trans. Intell. Syst. Technol., February, 2025

Ask and Retrieve Knowledge: Towards Proactive Asking with Imperfect Information in Medical Multi-turn Dialogues.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

VPO: Reasoning Preferences Optimization Based on V-Usable Information.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Unlocking Hidden Capabilities: A Self-Improving Workflow for Chatbots to Utilize Unintegrated Services.
Proceedings of the IEEE International Conference on Web Services, 2025

A Weighted Preference Optimization Service Recommendation Method Based on Knowledge Graph and Large Language Model.
Proceedings of the IEEE International Conference on Web Services, 2025

Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

M2Edit: Locate and Edit Multi-Granularity Knowledge in Multimodal Large Language Model.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SHARP: Steering Hallucination in LVLMs via Representation Engineering.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Exploring Deductive and Inductive Reasoning Capabilities of Large Language Models in Procedural Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

A Framework for Effective Invocation Methods of Various LLM Services.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Advanced News Event Clustering via Topic Enhanced Modeling with Multi-Aspect Contrastive Learning.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

ScEdit: Script-based Assessment of Knowledge Editing.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Joint Entity and Relation Extraction With Set Prediction Networks.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression.
ACM Trans. Asian Low Resour. Lang. Inf. Process., February, 2024

TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs.
CoRR, 2024

Mitigating Gender Bias in Code Large Language Models via Model Editing.
CoRR, 2024

HBot: A Chatbot for Healthcare Applications in Traditional Chinese Medicine Based on Human Body 3D Visualization.
CoRR, 2024

Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging.
CoRR, 2024

Checkpoint Merging via Bayesian Optimization in LLM Pretraining.
CoRR, 2024

A Survey on Effective Invocation Methods of Massive LLM Services.
CoRR, 2024

Economics Arena for Large Language Models.
CoRR, 2024

Woodpecker: hallucination correction for multimodal large language models.
Sci. China Inf. Sci., 2024

A Privacy-Preserving Framework for Medical Chatbot Based on LLM with Retrieval Augmented Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Can We Debias Multimodal Large Language Models via Model Editing?
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RealWeb: A Benchmark for Universal Instruction Following in Realistic Web Services Navigation.
Proceedings of the IEEE International Conference on Web Services, 2024

Learning Dynamic Knowledge Graph Embedding in Evolving Service Ecosystems via Meta-Learning.
Proceedings of the IEEE International Conference on Web Services, 2024

Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data.
Proceedings of the Service-Oriented Computing - 22nd International Conference, 2024

A Federated Graph to Embedding Approach for Knowledge Graph Completion.
Proceedings of the IEEE International Conference on Acoustics, 2024

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Analyzing Chain-of-thought Prompting in Black-Box Large Language Models via Estimated V-information.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Novel Relation Detection: Discovering Unknown Relation Types via Multi-Strategy Self-Supervised Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Learning with Partial Annotations for Event Detection.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Class Lifelong Learning for Intent Detection via Structure Consolidation Networks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
CASIA at SemEval-2022 Task 11: Chinese Named Entity Recognition for Complex and Ambiguous Entities.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

CogKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021
Knowledge Guided Metric Learning for Few-Shot Text Classification.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Set Generation Networks for End-to-End Knowledge Base Population.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Distantly Supervised Relation Extraction in Federated Settings.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Document-level Event Extraction via Parallel Prediction Networks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

A Large-Scale Chinese Multimodal NER Dataset with Speech Clues.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Joint Entity and Relation Extraction with Set Prediction Networks.
CoRR, 2020

FedED: Federated Learning via Ensemble Distillation for Medical Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Graph-Based Knowledge Integration for Question Answering over Dialogue.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Leverage Lexical Knowledge for Chinese Named Entity Recognition via Collaborative Graph Network.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

