Xingyuan Bu

Orcid: 0000-0002-6445-4306

According to our database1, Xingyuan Bu authored at least 34 papers between 2016 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents.
CoRR, August, 2025

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation.
CoRR, May, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
CoRR, April, 2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.
CoRR, April, 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
CoRR, February, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.
CoRR, February, 2025

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models.
CoRR, February, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Large-Scale Object Detection in the Wild With Imbalanced Data Distribution, and Multi-Labels.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
CoRR, 2024

Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment.
CoRR, 2024

Online Learning of Multiple Tasks and Their Relationships : Testing on Spam Email Data and EEG Signals Recorded in Construction Fields.
CoRR, 2024

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level.
CoRR, 2024

RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
GAIA-Universe: Everything is Super-Netify.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

2022
Visual Encoding and Debiasing for CTR Prediction.
CoRR, 2022

Beyond Bounding Box: Multimodal Knowledge Learning for Object Detection.
CoRR, 2022

Visual Encoding and Debiasing for CTR Prediction.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
GAIA: A Transfer Learning System of Object Detection That Fits Your Needs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
DETR for Pedestrian Detection.
CoRR, 2020

Large-Scale Object Detection in the Wild From Imbalanced Multi-Labels.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning a robust representation via a deep network on symmetric positive definite manifolds.
Pattern Recognit., 2019

Deep convolutional network with locality and sparsity constraints for texture classification.
Pattern Recognit., 2019

Learning an Efficient Network for Large-Scale Hierarchical Object Detection with Data Imbalance: 3rd Place Solution to Open Images Challenge 2019.
CoRR, 2019

2018
Solution for Large-Scale Hierarchical Object Detection Datasets with Incomplete Annotation and Data Imbalance.
CoRR, 2018

2017
Learning a Robust Representation via a Deep Network on Symmetric Positive Definite Manifolds.
CoRR, 2017

2016
Attention Estimation for Input Switch in Scalable Multi-display Environments.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016


  Loading...