Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Speech ReaLLM - Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time.

[BibT_eX]

[DOI]

Frank Seide

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

D-Router: Decoupled Content Routers with Remote Content Store.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Communications, 2024

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models.

[BibT_eX]

[DOI]

Raghuraman Krishnamoorthi

Proceedings of the IEEE International Conference on Acoustics, 2024

End-to-End Speech Recognition Contextualization with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Effective Internal Language Model Training and Fusion for Factorized Transducer Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Prompting Large Language Models with Speech Recognition Abilities.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Selection of Text-to-speech Data to Augment ASR Training.

[BibT_eX]

[DOI]

CoRR, 2023

Multi-Head State Space Model for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improved Repetitive Control with Enhanced Active Damping Method for 400Hz Inverter.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023

Anchored Speech Recognition with Neural Transducers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Explore the weakness: Instructive exploration adversarial robust reinforcement learning.

[BibT_eX]

[DOI]

Chunyang Wu

Fei Zhu

Quan Liu

J. King Saud Univ. Comput. Inf. Sci., 2022

Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios.

[BibT_eX]

[DOI]

CoRR, 2021

Denoising method of ECG signal with power threshold function under wavelet transform and smoothing filter✱.

[BibT_eX]

[DOI]

Chunyang Wu

Bei-Wei Zhang

Jinhai Li

Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Hybrid Event / Melbourne, VIC, Australia, December 14 - 17, 2021, 2021

Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Transformer-Based Acoustic Modeling for Streaming Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Weak-Attention Suppression for Transformer Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018

Structured deep neural networks for speech recognition.

[BibT_eX]

[DOI]

Chunyang Wu

PhD thesis, 2018

Improving Interpretability and Regularization in Deep Learning.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

2017

A machine learning-based method for the large-scale evaluation of the qualities of the urban environment.

[BibT_eX]

[DOI]

Comput. Environ. Urban Syst., 2017

I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep Activation Mixture Model for Speech Recognition.

[BibT_eX]

[DOI]

Chunyang Wu

Mark J. F. Gales

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Stimulated training for automatic speech recognition and keyword search in limited resource conditions.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

A machine learning method for the large-scale evaluation of urban visual environment.

[BibT_eX]

[DOI]

Lun Liu

Hui Wang

Chunyang Wu

CoRR, 2016

Stimulated Deep Neural Network for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining i-vector representation and structured neural networks for rapid adaptation.

[BibT_eX]

[DOI]

Chunyang Wu

Penny Karanasou

Mark J. F. Gales

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Incentive-compatible adaptive-width channel allocation for non-cooperative wireless networks.

[BibT_eX]

[DOI]

Int. J. Sens. Networks, 2015

Multi-basis adaptive neural network for rapid adaptation in speech recognition.

[BibT_eX]

[DOI]

Chunyang Wu

Mark J. F. Gales

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A novel word reordering method for statistical machine translation.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

2012

Regression with Phrase Indicators for Estimating MT Quality.

[BibT_eX]

[DOI]

Chunyang Wu

Hai Zhao

Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

AMPLE: A Novel Incentive Approach to Adaptive-Width Channel Allocation in Multi-hop, Non-cooperative Wireless Networks.

[BibT_eX]

[DOI]

Proceedings of the Wireless Algorithms, Systems, and Applications, 2012

Chinese Coreference Resolution via Ordered Filtering.

[BibT_eX]

[DOI]

Xiaotian Zhang

Chunyang Wu

Hai Zhao

Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Chunyang Wu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...