Shucong Zhang

According to our database, Shucong Zhang authored at least 26 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition.

CoRR, May, 2025

Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use.

CoRR, May, 2025

Benchmarking Rotary Position Embeddings for Automatic Speech Recognition.

CoRR, January, 2025

Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Robust Unsupervised Adaptation of a Speech Recogniser Using Entropy Minimisation and Speaker Codes.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.

Comput. Speech Lang., 2024

Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition.

CoRR, 2024

Open-Source Conversational AI with SpeechBrain 1.0.

CoRR, 2024

Linear-Complexity Self-Supervised Learning for Speech Processing.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition.

CoRR, 2023

Real-Time Personalised Speech Enhancement Transformers with Dynamic Cross-attended Speaker Representations.

Shucong Zhang, Malcolm Chadwick, Alberto Gil C. P. Ramos, Titouan Parcollet, Rogier van Dalen, Sourav Bhattacharya

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the (In)Efficiency of Acoustic Feature Extractors for Self-Supervised Speech Representation Learning.

Titouan Parcollet, Shucong Zhang, Rogier van Dalen, Alberto Gil C. P. Ramos, Sourav Bhattacharya

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement.

Shucong Zhang, Malcolm Chadwick, Alberto Gil C. P. Ramos, Sourav Bhattacharya

CoRR, 2022

Transformer-Based Streaming ASR with Cumulative Attention.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers.

CoRR, 2020

When Can Self-Attention Be Replaced by Feed Forward Layers?

CoRR, 2020

Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness.

Cong-Thanh Do, Shucong Zhang, Thomas Hain

Proceedings of the 28th European Signal Processing Conference, 2020

2019

Trainable Dynamic Subsampling for End-to-End Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Windowed Attention Mechanisms for Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations.

Shucong Zhang, Yong Zhou

J. Multivar. Anal., 2018
