Shucong Zhang

According to our database1, Shucong Zhang authored at least 25 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Robust Unsupervised Adaptation of a Speech Recogniser Using Entropy Minimisation and Speaker Codes.
CoRR, June, 2025

Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition.
CoRR, May, 2025

Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use.
CoRR, May, 2025

Benchmarking Rotary Position Embeddings for Automatic Speech Recognition.
CoRR, January, 2025

Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.
Comput. Speech Lang., 2024

Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition.
CoRR, 2024

Open-Source Conversational AI with SpeechBrain 1.0.
CoRR, 2024

Linear-Complexity Self-Supervised Learning for Speech Processing.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition.
CoRR, 2023

Real-Time Personalised Speech Enhancement Transformers with Dynamic Cross-attended Speaker Representations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the (In)Efficiency of Acoustic Feature Extractors for Self-Supervised Speech Representation Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement.
CoRR, 2022

Transformer-Based Streaming ASR with Cumulative Attention.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers.
CoRR, 2020

When Can Self-Attention Be Replaced by Feed Forward Layers?
CoRR, 2020

Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Trainable Dynamic Subsampling for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Windowed Attention Mechanisms for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations.
J. Multivar. Anal., 2018


  Loading...