Chunyang Wu

Orcid: 0009-0001-9007-7495

According to our database1, Chunyang Wu authored at least 48 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Speech-N-LlaMA: Improving Speech LLMs with Multi-Pass Training.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Efficient Streaming LLM for Speech Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Pneumonia Detection and Classification Using Lightweight ShuffleNetV2 Based on Transfer Learning.
Proceedings of the 2025 2nd International Conference on Generative Artificial Intelligence and Information Security, 2025

2024
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech.
CoRR, 2024

The Llama 3 Herd of Models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
et al.
CoRR, 2024

Speech ReaLLM - Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time.
CoRR, 2024

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Speech ReaLLM - Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

D-Router: Decoupled Content Routers with Remote Content Store.
Proceedings of the IEEE International Conference on Communications, 2024

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

End-to-End Speech Recognition Contextualization with Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Effective Internal Language Model Training and Fusion for Factorized Transducer Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

Prompting Large Language Models with Speech Recognition Abilities.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data.
CoRR, 2023

Towards Selection of Text-to-speech Data to Augment ASR Training.
CoRR, 2023

Multi-Head State Space Model for Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improved Repetitive Control with Enhanced Active Damping Method for 400Hz Inverter.
Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023

Anchored Speech Recognition with Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Explore the weakness: Instructive exploration adversarial robust reinforcement learning.
J. King Saud Univ. Comput. Inf. Sci., 2022

Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios.
CoRR, 2021

Denoising method of ECG signal with power threshold function under wavelet transform and smoothing filter✱.
Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Hybrid Event / Melbourne, VIC, Australia, December 14 - 17, 2021, 2021

Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Transformer-Based Acoustic Modeling for Streaming Speech Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications.
Proceedings of the IEEE International Conference on Acoustics, 2021

Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition.
CoRR, 2020

Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Weak-Attention Suppression for Transformer Based Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
Structured deep neural networks for speech recognition.
PhD thesis, 2018

Improving Interpretability and Regularization in Deep Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

2017
A machine learning-based method for the large-scale evaluation of the qualities of the urban environment.
Comput. Environ. Urban Syst., 2017

I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep Activation Mixture Model for Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Stimulated training for automatic speech recognition and keyword search in limited resource conditions.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
A machine learning method for the large-scale evaluation of urban visual environment.
CoRR, 2016

Stimulated Deep Neural Network for Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining i-vector representation and structured neural networks for rapid adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Incentive-compatible adaptive-width channel allocation for non-cooperative wireless networks.
Int. J. Sens. Networks, 2015

Multi-basis adaptive neural network for rapid adaptation in speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A novel word reordering method for statistical machine translation.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

2012
Regression with Phrase Indicators for Estimating MT Quality.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

AMPLE: A Novel Incentive Approach to Adaptive-Width Channel Allocation in Multi-hop, Non-cooperative Wireless Networks.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2012

Chinese Coreference Resolution via Ordered Filtering.
Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012


  Loading...