Chanwoo Kim

Orcid: 0000-0003-4085-2470

Affiliations:
  • Samsung Research, Seoul, South Korea


According to our database, Chanwoo Kim authored at least 64 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech.
CoRR, 2024

2023
On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition.
CoRR, 2023

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction.
CoRR, 2023

Counterfactual Two-Stage Debiasing For Video Corpus Moment Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Accent Learning for Under-Resourced Accents Using Native Language Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space.
CoRR, 2022

Into-TTS : Intonation Template based Prosody Control System.
CoRR, 2022

Conformer-Based on-Device Streaming Speech Recognition with KD Compression and Two-Pass Architecture.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Macro-Block Dropout for Improved Regularization in Training End-to-End Speech Recognition Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Cross-Modal Decision Regularization for Simultaneous Speech Translation.
Proceedings of the Interspeech 2022, 2022

Prototypical speaker-interference loss for target voice separation using non-parallel audio samples.
Proceedings of the Interspeech 2022, 2022

2021
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems.
CoRR, 2021

Convolution-Based Attention Model With Positional Encoding For Streaming Speech Recognition On Embedded Devices.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Streaming End-to-End Speech Recognition with Jointly Trained Neural Feature Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

Task Aware Multi-Task Learning for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

Comparative Study of Different Tokenization Strategies for Streaming End-to-End ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

A Comparison of Streaming Models and Data Augmentation Methods for Robust Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Semi-Supervised Transfer Learning for Language Expansion of End-to-End Speech Recognition Models to Low-Resource Languages.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Voice to Action: Spoken Language Understanding for Memory-Constrained Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

HiTNet: Byte-to-BPE Hierarchical Transcription Network for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Two-Pass End-to-End ASR Model Compression.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Faster Re-translation Using Non-Autoregressive Model For Simultaneous Neural Machine Translation.
CoRR, 2020

Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios.
Proceedings of the Interspeech 2020, 2020

Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing.
Proceedings of the Interspeech 2020, 2020

Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Small Energy Masking for Improved Neural Network Training for End-To-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-end Speech-to-Text Translation with Modality Agnostic Meta-Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Review of On-Device Fully Neural End-to-End Automatic Speech Recognition Algorithms.
Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

2019
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning.
CoRR, 2019

Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System.
Proceedings of the Interspeech 2019, 2019

Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Robust Recognition of Reverberant and Noisy Speech Using Coherence-based Processing.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Power-Law Nonlinearity with Maximally Uniform Distribution Criterion for Improved Neural Network Training in Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Improved Multi-Stage Training of Online Attention-Based Encoder-Decoder Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition.
Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sound Source Separation Using Phase Difference and Reliable Mask Selection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Robust Speech Recognition Based on Binaural Auditory Processing.
Proceedings of the Interspeech 2017, 2017


Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home.
Proceedings of the Interspeech 2017, 2017

Binaural processing for robust recognition of degraded speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A Subband-Based Stationary-Component Suppression Method Using Harmonics and Power Ratio for Reverberant Speech Recognition.
IEEE Signal Process. Lett., 2016

2014
Robust speech recognition in reverberant environments using subband-based steady-state monaural and binaural suppression.
Proceedings of the INTERSPEECH 2014, 2014

Robust speech recognition using temporal masking and thresholding algorithm.
Proceedings of the INTERSPEECH 2014, 2014

2012
Two-microphone source separation algorithm based on statistical modeling of angle distributions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Delta-spectral cepstral coefficients for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Binaural sound source separation motivated by auditory processing.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Automatic selection of thresholds for signal separation algorithms based on interaural delay.
Proceedings of the INTERSPEECH 2010, 2010

Nonlinear enhancement of onset for robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction.
Proceedings of the INTERSPEECH 2009, 2009

Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.
Proceedings of the INTERSPEECH 2009, 2009

Power function-based power distribution normalization algorithm for robust speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Robust speech recognition using a Small Power Boosting algorithm.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis.
Proceedings of the INTERSPEECH 2008, 2008

2006
Physiologically-motivated synchrony-based processing for robust automatic speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

