Xuelong Li

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
Dual-frequency awareness network for lightweight super resolution.
Pattern Recognit., 2026

AFPN: Alignment feature pyramid network for real-time semantic segmentation.
Pattern Recognit., 2026

Vision and acoustic emission multi-modal learning for aircraft crack monitoring.
Adv. Eng. Informatics, 2026

2025
CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs.
CoRR, October, 2025

Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models.
CoRR, October, 2025

<i>M</i><sup>3</sup>\<i>PDB</i>: A Multimodal, Multi-Label, Multilingual Prompt Database for Speech Generation.
CoRR, August, 2025

Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation.
CoRR, August, 2025

DIFFA: Large Language Diffusion Models Can Listen and Understand.
CoRR, July, 2025

Technical Report of TeleChat2, TeleChat2.5 and T1.
CoRR, July, 2025

METER: Multi-modal Evidence-based Thinking and Explainable Reasoning - Algorithm and Benchmark.
CoRR, July, 2025

HAMLET: Hyperadaptive Agent-based Modeling for Live Embodied Theatrics.
CoRR, July, 2025

HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models.
CoRR, June, 2025

Learn Beneficial Noise as Graph Augmentation.
CoRR, May, 2025

AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars.
CoRR, May, 2025

Decentralized Nonconvex Composite Federated Learning with Gradient Tracking and Momentum.
CoRR, April, 2025

Task-Oriented Feature Compression for Multimodal Understanding via Device-Edge Co-Inference.
CoRR, March, 2025

Large model enhanced computational ghost imaging.
CoRR, March, 2025

NFIG: Autoregressive Image Generation with Next-Frequency Prediction.
CoRR, March, 2025

DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model.
CoRR, February, 2025

Improve LLM-as-a-Judge Ability as a General Ability.
CoRR, February, 2025

Leader and Follower: Interactive Motion Generation under Trajectory Constraints.
CoRR, February, 2025

AudioSpa: Spatializing Sound Events with Text.
CoRR, February, 2025

Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR.
CoRR, January, 2025

Consensus multi-view spectral clustering network with unified similarity.
Neural Networks, 2025

Potential region attention network for RGB-D salient object detection.
Neural Networks, 2025

Augment Mandarin to Cantonese Speech Databases via Retrieval-Augmented Generation and Speech Synthesis.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

MSMAR-RL: Multi-Step Masked-Attention Recovery Reinforcement Learning for Safe Maneuver Decision in High-Speed Pursuit-Evasion Game.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

2024
Robust multilayer bootstrap networks in ensemble for unsupervised representation learning and clustering.
Pattern Recognit., 2024

On Atangana-Baleanu fractional granular calculus and its applications to fuzzy economic models in market equilibrium.
J. Comput. Appl. Math., 2024

VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation.
CoRR, 2024

AI Flow at the Network Edge.
CoRR, 2024

2022
An improved Henry gas solubility optimization algorithm based on Lévy flight and Brown motion.
Appl. Intell., 2022


  Loading...