Xinlong Chen

Orcid: 0009-0009-7146-9782

According to our database1, Xinlong Chen authored at least 42 papers between 2012 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models.
CoRR, April, 2026

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization.
CoRR, February, 2026

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions.
CoRR, February, 2026

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models.
CoRR, February, 2026

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.
CoRR, February, 2026

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search.
CoRR, January, 2026

DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models.
CoRR, January, 2026

Curriculum-guided graph self-augmentation: A progressive deepening framework for GNNs.
Neural Networks, 2026

UniTrain: A universal iterative semi-supervised training framework for graph representation learning.
Neural Networks, 2026

Feature space variation-based active learning sample query strategy for graph deep learning.
Expert Syst. Appl., 2026

2025
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models.
CoRR, December, 2025

VABench: A Comprehensive Benchmark for Audio-Video Generation.
CoRR, December, 2025

The Unseen Bias: How Norm Discrepancy in Pre-Norm MLLMs Leads to Visual Information Loss.
CoRR, December, 2025

A Fine-grained Classification Method for Cross-domain Policy Texts Based on Instruction Tuning.
Inf. Syst. Frontiers, October, 2025

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration.
CoRR, October, 2025

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.
CoRR, September, 2025

D<sup>2</sup>HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs.
CoRR, September, 2025

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.
CoRR, June, 2025

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.
CoRR, May, 2025

Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models.
CoRR, January, 2025

GSSCL: A framework for Graph Self-Supervised Curriculum Learning based on clustering label smoothing.
Neural Networks, 2025

SE-GSSL: Soft-Mask enhanced graph self-supervised learning with multi-aspect knowledge encoding and adaptive sample selection.
Knowl. Based Syst., 2025

A Novel YJQR-LSTM Model for Nonparametric Probabilistic Sustainable Agriculture Wind Power Forecasting Based on Intelligent IoT.
IEEE Internet Things J., 2025

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
LightCapsGNN: light capsule graph neural network for graph classification.
Knowl. Inf. Syst., October, 2024

TSAE-UNet: A Novel Network for Multi-Scene and Multi-Temporal Water Body Detection Based on Spatiotemporal Feature Extraction.
Remote. Sens., 2024

DWSSA: Alleviating over-smoothness for deep Graph Neural Networks.
Neural Networks, 2024

A novel surface deformation prediction method based on AWC-LSTM model.
Int. J. Appl. Earth Obs. Geoinformation, 2024

Training Graph Transformers via Curriculum-Enhanced Attention Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Curriculum-Enhanced Residual Soft An-Isotropic Normalization for Over-Smoothness in Deep GNNs.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Spacecraft Depth Completion Based on the Gray Image and the Sparse Depth Map.
IEEE Trans. Aerosp. Electron. Syst., October, 2023

Exploiting Temporal-Spatial Feature Correlations for Sequential Spacecraft Depth Completion.
Remote. Sens., October, 2023

Position Awareness Network for Noncooperative Spacecraft Pose Estimation Based on Point Cloud.
IEEE Trans. Aerosp. Electron. Syst., February, 2023

Semi-Supervised Node Classification via Semi-Global Graph Transformer Based on Homogeneity Augmentation.
Parallel Process. Lett., 2023

Graph Contrastive Representation Learning with Input-Aware and Cluster-Aware Regularization.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

2022
Integrated Control of Attitude Maneuver and Vibration Suppression Using Pyramid-Type SGCMGs.
IEEE Trans. Aerosp. Electron. Syst., 2022

2020
Design and Analysis of Preload Control for Space Debris Impact Adhesion Capture Method.
IEEE Access, 2020

2013
On the convergence of a modified regularized Newton method for convex optimization with singular solutions.
J. Comput. Appl. Math., 2013

2012
A note on "A globally convergent BFGS method with nonmonotone line search for non-convex minimization".
Appl. Math. Comput., 2012


  Loading...