Nam Soo Kim

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Integrated DNN-based model adaptation technique for noise-robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Weakly labeled acoustic event detection using local detector and global classifier.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Overlapping acoustic event classification based on joint training with source separation.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

DNN-Based Voice Activity Detection with Multi-Task Learning.

[BibT_eX]

[DOI]

Tae Gyoon Kang

IEICE Trans. Inf. Syst., 2016

DNN-Based Feature Enhancement Using Joint Training Framework for Robust Multichannel Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Multi-microphone approach for reliable acoustic data transmission.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2016

Two-stage noise aware training using asymmetric deep denoising autoencoder.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

NMF-based source separation utilizing prior knowledge on encoding vector.

[BibT_eX]

[DOI]

Kisoo Kwon

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

DNN-Based Sound Event Detection with Exemplar-Based Approach for Noise Reduction.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Acoustic Scene Classification Using Parallel Combination of LSTM and CNN.

[BibT_eX]

[DOI]

Soo Hyun Bae

In Kyu Choi

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Incremental approach to NMF basis estimation for audio source separation.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

DNN-based voice activity detection with local feature shift technique.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

NMF-Based Speech Enhancement Using Bases Update.

[BibT_eX]

[DOI]

Kisoo Kwon

IEEE Signal Process. Lett., 2015

NMF-based Target Source Separation Using Deep Neural Network.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2015

Tampering Detection Scheme for Speech Signals using Formant Enhancement based Watermarking.

[BibT_eX]

[DOI]

J. Inf. Hiding Multim. Signal Process., 2015

Target Source Separation Based on Discriminative Nonnegative Matrix Factorization Incorporating Cross-Reconstruction Error.

[BibT_eX]

[DOI]

Kisoo Kwon

IEICE Trans. Inf. Syst., 2015

Supervised Denoising Pre-Training for Robust ASR with DNN-HMM.

[BibT_eX]

[DOI]

Kang Hyun Lee

IEICE Trans. Inf. Syst., 2015

An acoustic data transmission system based on audio data hiding: method and performance evaluation.

[BibT_eX]

[DOI]

Jae Choi

EURASIP J. Audio Speech Music. Process., 2015

DNN-based residual echo suppression.

[BibT_eX]

[DOI]

Chul Min Lee

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker adaptation using relevance vector regression for HMM-based expressive TTS.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Reverberation-robust acoustic indoor localization.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Acoustic modeling and parameter generation using relevance vector machines for speech synthesis.

[BibT_eX]

[DOI]

Doo Hwa Hong

Joun Yeop Lee

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Stereophonic Acoustic Echo Suppression Incorporating Spectro-Temporal Correlations.

[BibT_eX]

[DOI]

Chul Min Lee

IEEE Signal Process. Lett., 2014

Spectro-Temporal Filtering for Multichannel Speech Enhancement in Short-Time Fourier Transform Domain.

[BibT_eX]

[DOI]

Yu Gwang Jin

IEEE Signal Process. Lett., 2014

Factored Maximum Penalized Likelihood Kernel Regression for HMM-Based Style-Adaptive Speech Synthesis.

[BibT_eX]

[DOI]

June Sig Sung

Doo Hwa Hong

IEEE J. Sel. Top. Signal Process., 2014

Formant enhancement based speech watermarking for tampering detection.

[BibT_eX]

[DOI]

Shengbei Wang

Masashi Unoki

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A data-driven approach to speech enhancement using Gaussian process.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

NMF-based speech enhancement incorporating deep neural network.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker Adaptation Using Nonlinear Regression Techniques for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Crossband filtering for stereophonic acoustic echo suppression.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Speech enhancement combining statistical models and NMF with update of speech and noise bases.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Reverberation and noise robust feature enhancement using multiple inputs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Parametric multichannel noise reduction algorithm utilizing temporal correlations in reverberant environment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Reverberation and Noise Robust Feature Compensation Based on IMM.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2013

Factored maximum likelihood kernelized regression for HMM-based singing voice synthesis.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Robust Audio Data Hiding Method Based on Phase of Modulated Complex Lapped Transform.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Blind method of estimating speech transmission index from reverberant speech signals.

[BibT_eX]

[DOI]

Proceedings of the 21st European Signal Processing Conference, 2013

Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

IMM-based feature compensation robust to slowly time-varying noise and reverberation.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012

Speech Feature Mapping Based on Switching Linear Dynamic System.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Spectral Magnitude Adjustment for MCLT-Based Acoustic Data Transmission.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Outlier Detection and Removal for HMM-Based Speech Synthesis with an Insufficient Speech Database.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Factored MLLR Adaptation Algorithm for HMM-based Expressive TTS.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Quality Enhancement of Audio Watermarking for Data Transmission in Aerial Space Based on Segmental SNR Adjustment.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Artificial stereo data generation for speech feature mapping.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Feature enhancement error compensation for noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the International Multi-Conference on Systems, Signals & Devices, 2012

2011

Factored MLLR Adaptation.

[BibT_eX]

[DOI]

June Sig Sung

Doo Hwa Hong

IEEE Signal Process. Lett., 2011

Speech Enhancement Based on Data-Driven Residual Gain Estimation.

[BibT_eX]

[DOI]

Yu Gwang Jin

IEICE Trans. Inf. Syst., 2011

Factored MLLR Adaptation for Singing Voice Generation.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Decision Tree-Based Clustering with Outlier Detection for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A data-driven residual gain approach for two-stage speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Switching linear dynamic transducer for stereo data based speech feature mapping.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Acoustic Data Transmission Based on Modulated Complex Lapped Transform.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Frequency-Domain Double-Talk Detection Based on the Gaussian Mixture Model.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Robust Data Hiding for MCLT Based Acoustic Data Transmission.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Study of Prominence Detection Based on Various Phone-Specific Features.

[BibT_eX]

[DOI]

Sung Soo Kim

IEICE Trans. Inf. Syst., 2010

On Detecting Target Acoustic Signals Based on Non-negative Matrix Factorization.

[BibT_eX]

[DOI]

Yu Gwang Jin

IEICE Trans. Inf. Syst., 2010

Estimation of Phone Mismatch Penalty Matricesfor Two-Stage Keyword Spotting.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Implementation of HMM-Based Human Activity Recognition Using Single Triaxial Accelerometer.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Voice activity detection based on statistical models and machine learning approaches.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2010

Excitation modeling based on waveform interpolation for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Phone mismatch penalty matrices for two-stage keyword spotting via multi-pass phone recognizer.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Multichannel noise reduction using low order RTF estimate.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Audio Fingerprinting Based on Multiple Hashing in DCT Domain.

[BibT_eX]

[DOI]

Yu Liu

IEEE Signal Process. Lett., 2009

Global Soft Decision Employing Support Vector Machine For Speech Enhancement.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2009

Computationally Efficient Cepstral Domain Feature Compensation.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

Speech reinforcement based on partial masking effect.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

DCT based multiple hashing technique for robust audio fingerprinting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Analysis and Improvement of Speech/Music Classification for 3GPP2 SMV Based on GMM.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2008

Voice Activity Detection Based on Conditional MAP Criterion.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2008

Frame Splitting Scheme for Error-Robust Audio Streaming over Packet-Switching Networks.

[BibT_eX]

[DOI]

IEICE Trans. Commun., 2008

Improved Frame Mode Selection for AMR-WB+ Based on Decision Tree.

[BibT_eX]

[DOI]

Jong Kyu Kim

IEICE Trans. Inf. Syst., 2008

Decision tree based frame mode selection for AMR-WB+.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Cepstral domain feature compensation based on diagonal approximation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

On Using Multiple Models for Automatic Speech Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Perceptual Reinforcement of Speech Signal Based on Partial Specific Loudness.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Feature Compensation Incorporating Modeling Error Statistics.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

A Statistical Model-Based Residual Echo Suppression.

[BibT_eX]

[DOI]

Seung Yeol Lee

IEEE Signal Process. Lett., 2007

Voice activity detection based on a family of parametric distributions.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2007

Multiple statistical models for soft decision in noisy speech enhancement.

[BibT_eX]

[DOI]

Pattern Recognit., 2007

Speech Enhancement Based on Perceptually Comfortable Residual Noise.

[BibT_eX]

[DOI]

IEICE Trans. Commun., 2007

Feature Compensation with Model-Based Estimation for Noise Masking.

[BibT_eX]

[DOI]

Young Joon Kim

IEICE Trans. Inf. Syst., 2007

Improved Global Soft Decision Using Smoothed Global Likelihood Ratio for Speech Enhancement.

[BibT_eX]

[DOI]

IEICE Trans. Commun., 2007

Speech reinforcement based on partial specific loudness.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A multiple-model based framework for automatic speech segmentation.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A statistical model based post-filtering algorithm for residual echo suppression.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Feature Compensation using More Accurate Statistics of Modeling Error.

[BibT_eX]

[DOI]

Jong Kyu Kim

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Voice activity detection based on multiple statistical models.

[BibT_eX]

[DOI]

Sanjit K. Mitra

IEEE Trans. Signal Process., 2006

A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2006

Signal modification for ADPCM based on analysis-by-synthesis framework.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2006

Automatic Speech Segmentation Based on Boundary-Type Candidate Selection.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2006

Speech enhancement based on residual noise shaping.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automatic speech segmentation with multiple statistical models.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Clean speech feature estimation based on soft spectral masking.

[BibT_eX]

[DOI]

Young Joon Kim

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Signal modification incorporating perceptual weighting filter.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Rapid online adaptation based on transformation space model evolution.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2005

Statistical modeling of speech signals based on generalized gamma distribution.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2005

An approach to robust unsupervised speaker adaptation.

[BibT_eX]

[DOI]

Dong Jin Seo

IEEE Signal Process. Lett., 2005

Feature compensation based on switching linear dynamic model.

[BibT_eX]

[DOI]

Richard M. Stern

IEEE Signal Process. Lett., 2005

Image probability distribution based on generalized gamma function.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2005

Pitch estimation of speech signal based on adaptive lattice notch filter.

[BibT_eX]

[DOI]

Sanjit K. Mitra

Signal Process., 2005

Feature compensation based on switching linear dynamic model and soft decision.

[BibT_eX]

[DOI]

Bong Kyoung Kim

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A new structural preprocessor for low-bit rate speech coding.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Voice Activity Detection based on Generalized Gamma Distribution.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Signal modification for robust speech coding.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2004

Discriminative training for concatenative speech synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2004

Feature compensation based on soft decision.

[BibT_eX]

[DOI]

Young Joon Kim

Hyun Woo Kim

IEEE Signal Process. Lett., 2004

Rapid online adaptation using speaker space model evolution.

[BibT_eX]

[DOI]

Speech Commun., 2004

Maximum a posteriori adaptation of HMM parameters based on speaker space projection.

[BibT_eX]

[DOI]

Speech Commun., 2004

A Statistical Model-Based V/UV Decision under Background Noise Environments.

[BibT_eX]

[DOI]

Sanjit K. Mitra

IEICE Trans. Inf. Syst., 2004

Distorted Speech Rejection for Automatic Speech Recognition in Wireless Communication.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2004

Speech probability distribution based on generalized gama distribution.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Inner product based-multiband vector quantization for wideband speech coding at 16 kbps.

[BibT_eX]

[DOI]

Seung Yeol Lee

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Discriminative weight training for unit-selection based speech synthesis.

[BibT_eX]

[DOI]

Chong Kyu Kim

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Feature compensation technique for robust speech recognition in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Likelihood ratio test with complex laplacian model for voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Online adaptation using speatransformation space model evolution.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

A preprocessor for low-bit-rate speech coding.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2002

Feature domain compensation of nonstationary noise for robust speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2002

Markov models based on speaker space model evolution.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Generalized analysis-by-synthesis based on system identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

A new double-talk detector using echo path estimation.

[BibT_eX]

[DOI]

Hae Kyung Jung

Taejeong Kim

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Rapid speaker adaptation using probabilistic principal component analysis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2001

Robust correlation estimation for EMAP-based speaker adaptation.

[BibT_eX]

[DOI]

Eugene Jon

IEEE Signal Process. Lett., 2001

EMAP-based speaker adaptation with robust correlation estimation.

[BibT_eX]

[DOI]

Eugene Jon

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Filtering on hidden Markov models.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2000

Spectral enhancement based on global soft decision.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2000

Bayesian speaker adaptation based on probabilistic principal component analysis.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech enhancement: new approaches to soft decision.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

A statistical model-based voice activity detection.

[BibT_eX]

[DOI]

Jongseo Sohn

Wonyong Sung

IEEE Signal Process. Lett., 1999

Time-varying noise compensation using multiple Kalman filters.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Deleted strategy for MMI-based HMM training.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1998

IMM-based estimation for slowly evolving environments.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 1998

Nonstationary environment compensation based on sequential estimation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 1998

Statistical linear approximation for environment compensation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 1998

Speech recognition in noisy environments using first-order vector Taylor series.

[BibT_eX]

[DOI]

Do Yeong Kim

Speech Commun., 1998

1997

Statistically reliable deleted interpolation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1997

Frame-correlated hidden Markov model based on extended logarithmic pool.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1997

Model-based approach for robust speech recognition in noisy environements with multiple noise sources.

[BibT_eX]

[DOI]

Do Yeong Kim

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995

On estimating robust probability distribution in HMM-based speech recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1995

1990

Generalized training of hidden Markov model parameters for speech recognition.

[BibT_eX]

[DOI]