Xuelong Geng

Orcid: 0009-0008-1243-1955

According to our database1, Xuelong Geng authored at least 14 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Construction and validation of high altitude pulmonary edema prediction models based on deep learning and quantitative analysis of X-ray images.
BMC Medical Imaging, December, 2026

OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs.
CoRR, March, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.
CoRR, March, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.
CoRR, January, 2026

2025
WEST: LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction.
CoRR, September, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.
CoRR, August, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.
CoRR, February, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.
CoRR, January, 2025

Three-Dimensional Trajectory Tracking Control Strategy for Underactuated UUVs Based on Improved ADRC.
Symmetry, 2025

A Method for Assisting Guided Projectile SINS/GNSS Integrated Navigation System During GNSS Signal Rejection Based on TOC-NARX.
IEEE Access, 2025

Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text.
CoRR, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024


  Loading...