Shifu Xiong

Orcid: 0000-0003-4759-147X

According to our database, Shifu Xiong authored at least 16 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Streaming Speech Recognition with Decoder-Only Large Language Models and Latency Optimization.
CoRR, January, 2026

2025
Lightweight Audio-Visual Wake Word Spotting With Diverse Acoustic Knowledge Distillation.
IEEE Trans. Circuits Syst. Video Technol., July, 2025

HPCNet: Hybrid Pixel and Contour Network for Audio-Visual Speech Enhancement With Low-Quality Video.
IEEE J. Sel. Top. Signal Process., May, 2025

MISP-QEKS: A Large-Scale Dataset with Multimodal Cues for Query-by-Example Keyword Spotting.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Language Adaptation Wake Word Spotting via Latent Space from Pre-Trained Speech Models.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading.
IEEE Trans. Multim., 2024

Deep CLAS: Deep Contextual Listen, Attend and Spell.
CoRR, 2024

Layer-Adaptive Low-Rank Adaptation of Large ASR Model for Low-Resource Multilingual Scenarios.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Exploring Semi-Supervised, Subcategory Classification and Subwords Alignment for Visual Wake Word Spotting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

2022
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2018
The USTC-NEL Speech Translation system at IWSLT 2018.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

2016
Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2014
The Vietnamese speech recognition based on rectified linear units deep neural network and spoken term detection system combination.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Lattice based optimization of bottleneck feature extractor with linear transformation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

