Shuai Fan

Orcid: 0009-0007-0260-6080

Affiliations:
  • AI Speech Co., Ltd., Suzhou, China


According to our database1, Shuai Fan authored at least 17 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning.
CoRR, March, 2025

DFM: Dialogue foundation model for universal large-scale dialogue-oriented task learning.
AI Open, 2025

One-Dimensional Object Detection for Streaming Text Segmentation of Meeting Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Reducing Tool Hallucination via Reliability Alignment.
CoRR, 2024

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity.
CoRR, 2024

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge.
CoRR, 2024

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback.
CoRR, 2024

ChemDFM: Dialogue Foundation Model for Chemistry.
CoRR, 2024

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

The X-Lance Technical Report for Interspeech 2024 Speech Processing using Discrete Speech Unit Challenge.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

DiffDub: Person-Generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-Encoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

Sparsity-Accelerated Training for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2022
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2018
Joint Spoken Language Understanding and Domain Adaptive Language Modeling.
Proceedings of the Intelligence Science and Big Data Engineering, 2018


  Loading...