Hong Kook Kim

Sensors, April, 2024

Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4.

[BibT_eX]

[DOI]

CoRR, 2024

Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9.

[BibT_eX]

[DOI]

Do Hyun Lee

Yoonah Song

CoRR, 2024

Graph neural networks based framework to analyze social media platforms for malicious user detection.

[BibT_eX]

[DOI]

Appl. Soft Comput., 2024

Knowledge Distillation-Based Training of Speech Enhancement for Noise-Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

Duk-Jo Kong

IEEE Access, 2024

Leveraging Low-Rank Adaptation for Parameter-Efficient Fine-Tuning in Multi-Speaker Adaptive Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Changi Hong

Jung Hyuk Lee

IEEE Access, 2024

Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE Consumer Communications & Networking Conference, 2024

2023

Informer-Based Temperature Prediction Using Observed and Numerical Weather Prediction Data.

[BibT_eX]

[DOI]

Jimin Jun

Sensors, August, 2023

Adversarial Continual Learning to Transfer Self-Supervised Speech Representations for Voice Pathology Detection.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2023

End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model.

[BibT_eX]

[DOI]

Sensors, 2023

Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4.

[BibT_eX]

[DOI]

CoRR, 2023

Vehicle CAN Bus Data Prediction Using Transformers with Auxiliary Decoder Loss.

[BibT_eX]

[DOI]

Muhammad Aasim Rafique

Muhammad Ishfaq Hussain

Byung-Geun Lee

Moongu Jeon

Proceedings of the IEEE International Conference on Consumer Electronics, 2023

Sound Event Detection Using EfficientNet-B2 with an Attentional Pyramid Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2023

Non-Parallel Voice Conversion Using Cycle-Consistent Adversarial Networks with Self-Supervised Representations.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE Consumer Communications & Networking Conference, 2023

2022

Two-Step Joint Optimization with Auxiliary Loss Function for Noise-Robust Speech Recognition.

[BibT_eX]

[DOI]

Sensors, 2022

An Efficient Compression Method of Underwater Acoustic Sensor Signals for Underwater Surveillance.

[BibT_eX]

[DOI]

Sensors, 2022

DenseBert4Ret: Deep bi-modal for image retrieval.

[BibT_eX]

[DOI]

Inf. Sci., 2022

Auxiliary Loss of Transformer with Residual Connection for End-to-End Speaker Diarization.

[BibT_eX]

[DOI]

Yechan Yu

Dongkeon Park

Proceedings of the IEEE International Conference on Acoustics, 2022

Sound Event Detection Using Attention and Aggregation-Based Feature Pyramid Network.

[BibT_eX]

[DOI]

Proceedings of the 27th Asia Pacific Conference on Communications, 2022

2021

TAU-Net: Temporal Activation U-Net Shared With Nonnegative Matrix Factorization for Speech Enhancement in Unseen Noise Environments.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Temperature Prediction Based on Bidirectional Long Short-Term Memory and Convolutional Neural Network Combining Observed and Numerical Forecast Data.

[BibT_eX]

[DOI]

Sensors, 2021

Self-training with noisy student model and semi-supervised loss function for dcase 2021 challenge task 4.

[BibT_eX]

[DOI]

CoRR, 2021

Polyphonic Sound Event Detection Based on Residual Convolutional Recurrent Neural Network With Semi-Supervised Loss Function.

[BibT_eX]

[DOI]

IEEE Access, 2021

Verbal Abuse Classification Using Multiple Deep Neural Networks.

[BibT_eX]

[DOI]

Hyunju Park

Proceedings of the International Conference on Artificial Intelligence in Information and Communication, 2021

2020

Deep-Learning-Based Detection of Infants with Autism Spectrum Disorder Using Auto-Encoder Feature Representation.

[BibT_eX]

[DOI]

Sensors, 2020

Sparsity-based phase spectrum compensation for single-channel speech source separation.

[BibT_eX]

[DOI]

Digit. Signal Process., 2020

Polyphonic sound event detection based on convolutional recurrent neural networks with semi-supervised loss function for DCASE challenge 2020 task 4.

[BibT_eX]

[DOI]

CoRR, 2020

Two-Stage Polyphonic Sound Event Detection Based on Faster R-CNN-LSTM with Multi-Token Connectionist Temporal Classification.

[BibT_eX]

[DOI]

In Young Park

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

U-Net-Based Single-Channel Wind Noise Reduction in Outdoor Environments.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Consumer Electronics (ICCE), 2020

2019

Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones.

[BibT_eX]

[DOI]

Sensors, 2019

Directional Audio Rendering Using a Neural Network Based Personalized HRTF.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-linear Acoustic Echo Cancellation Based on Mel-Frequency Domain Volterra Filtering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2019

Multi-Channel Audio Source Separation Using Azimuth-Frequency Analysis and Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence in Information and Communication, 2019

2018

Coordinate-based direction-of-arrival estimation method using distributed microphones.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Single-channel speech dereverberation based on block-wise weighted prediction error and nonnegative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2018

2017

A lossless compression method incorporating sensor fault detection for underwater acoustic sensor array.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2017

Audio enhancement using local SNR-based sparse binary mask estimation and spectral imputation.

[BibT_eX]

[DOI]

Digit. Signal Process., 2017

Rediscovering 50 years of discoveries in speech and language processing: A survey.

[BibT_eX]

[DOI]

Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

Design of multi-channel indoor noise database for speech processing in noise.

[BibT_eX]

[DOI]

Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

Low-Frequency Ultrasonic Communication for Speech Broadcasting in Public Transportation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Application of low-frequency ultrasonic communication to audio marker for augmented reality.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2017

Speech emotion recognition based on multi-task learning using a convolutional neural network.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Noncoherent Low-Frequency Ultrasonic Communication System with Optimum Symbol Length.

[BibT_eX]

[DOI]

Myung Jong Lee

Int. J. Distributed Sens. Networks, 2016

Underwater acoustic sensor fault detection for passive sonar systems.

[BibT_eX]

[DOI]

Proceedings of the First International Workshop on Sensing, 2016

Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Subband-based upmixing of stereo to 5.1-channel audio signals using deep neural networks.

[BibT_eX]

[DOI]

Su-Yeon Park

Chan Jun Chun

Proceedings of the International Conference on Information and Communication Technology Convergence, 2016

A discriminative training method incorporating pronunciation variations for dysarthric automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Adaptive Speech Streaming Based on Speech Quality Estimation and Artificial Bandwidth Extension for Voice over Wireless Multimedia Sensor Networks.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2015

Conversion of nearly monaural audio to 5.1-channel audio for portable multimedia devices.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Two-stage lexicon optimization of G2P-converted pronunciation dictionary based on statistical acoustic confusability measure.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Lexicon Optimization for WFST-Based Speech Recognition Using Acoustic Distance Based Confusability Measure and G2P Conversion.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

2014

Multi-channel audio recording based on superdirective beamforming for portable multimedia recording devices.

[BibT_eX]

[DOI]

Chan Jun Chun

IEEE Trans. Consumer Electron., 2014

Direction-of-arrival based SNR estimation for dual-microphone speech enhancement.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

A Packet Loss Concealment Technique Improving Quality of Service for Wideband Speech Coding in Wireless Sensor Networks.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2014

Nonnegative Matrix Factorization Based Adaptive Noise Sensing over Wireless Sensor Networks.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2014

Reducing Speech Noise for Patients with Dysarthria in Noisy Environments.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

Noise variance estimation based on dual-channel phase difference for speech enhancement.

[BibT_eX]

[DOI]

Digit. Signal Process., 2014

Hybrid probabilistic adaptation mode controller for generalized sidelobe cancellers applied to multi-microphone speech enhancement.

[BibT_eX]

[DOI]

Digit. Signal Process., 2014

Single-channel speech enhancement based on non-negative matrix factorization and online noise adaptation.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Feasibility Study for Objective Measurement on Sound Localization Using Auditory Evoked Potential.

[BibT_eX]

[DOI]

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Audio restoration based on multi-band spectral subtraction and missing data imputation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2014

2013

An MDCT-domain audio denoising method with a block switching scheme.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2013

Mechanical noise suppression based on non-negative matrix factorization and multi-band spectral subtraction for digital cameras.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2013

Ultrasonic Sensor-Based Personalized Multichannel Audio Rendering for Multiview Broadcasting Services.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2013

Target-to-non-target directional ratio estimation based on dual-microphone phase differences for target-directional speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Multi-band spectral subtraction based zoom-noise suppression for digital cameras.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2013

2012

A user voice reduction algorithm based on binaural signal separation for portable digital imaging devices.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2012

Dysarthric Speech Recognition Error Correction Using Weighted Finite State Transducers Based on Context-Dependent Pronunciation Variation.

[BibT_eX]

[DOI]

Proceedings of the Computers Helping People with Special Needs, 2012

Adaptation mode control with residual noise estimation for beamformer-based multi-channel speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Non-negative Matrix Factorization Based Noise Reduction for Noise Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2012

2011

Probabilistic spectral gain modification applied to beamformer-based noise reduction in a car environment.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2011

A smart background music mixing algorithm for portable digital imaging devices.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2011

Sound source elevation using spectral notch filtering and directional band boosting in stereo loudspeaker reproduction.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2011

Burst Packet Loss Concealment Using Multiple Codebooks and Comfort Noise for CELP-Type Speech Coders in Wireless Sensor Networks.

[BibT_eX]

[DOI]

Sensors, 2011

Adaptive Redundant Speech Transmission over Wireless Multimedia Sensor Networks Based on Estimation of Perceived Speech Quality.

[BibT_eX]

[DOI]

Jin Ah Kang

Sensors, 2011

Phonetically Balanced Text Corpus Design Using a Similarity Measure for a Stereo Super-Wideband Speech Database.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

MDCT-Domain Packet Loss Concealment for Scalable Wideband Speech Coding.

[BibT_eX]

[DOI]

Nam In Park

Proceedings of the Ubiquitous Computing and Multimedia Applications, 2011

A Smart Error Protection Scheme Based on Estimation of Perceived Speech Quality for Portable Digital Speech Streaming Systems.

[BibT_eX]

[DOI]

Jin Ah Kang

Proceedings of the Ubiquitous Computing and Multimedia Applications, 2011

Audio Effect for Highlighting Speaker's Voice Corrupted by Background Noise on Portable Digital Imaging Devices.

[BibT_eX]

[DOI]

Proceedings of the Ubiquitous Computing and Multimedia Applications, 2011

High-Quality and Low-Complexity Real-Time Voice Changing with Seamless Switching for Digital Imaging Devices.

[BibT_eX]

[DOI]

Proceedings of the Ubiquitous Computing and Multimedia Applications, 2011

Complexity Reduction of Virtual Reverberation Filtering Based on Index-Based Convolution for Resource-Constrained Devices.

[BibT_eX]

[DOI]

Proceedings of the Ubiquitous Computing and Multimedia Applications, 2011

Preprocessing of Dysarthric Speech in Noise Based on CV-Dependent Wiener Filtering.

[BibT_eX]

[DOI]

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Hybrid probabilistic adaptation mode controller for generalized sidelobe canceller-based target-directional speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Artificial Bandwidth Extension of Narrowband Speech Signals for the Improvement of Perceptual Speech Communication Quality.

[BibT_eX]

[DOI]

Nam In Park

Proceedings of the Communication and Networking, 2011

Discrimination of Speech Activity and Impact Noise Using an Accelerometer and a Microphone in a Car Environment.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2011

Quality-Aware Loss-Robust Scalable Speech Streaming Based on Speech Quality Estimation.

[BibT_eX]

[DOI]

Jin Ah Kang

Proceedings of the Communication and Networking, 2011

Crosstalk Cancellation for Spatial Sound Reproduction in Portable Devices with Stereo Loudspeakers.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2011

Perceptual Enhancement of Sound Field Reproduction in a Nearly Monaural Sensing System.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2011

2010

Entropy coding of compressed feature parameters for distributed speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2010

Despeckling of medical ultrasound images using Daubechies complex wavelet transform.

[BibT_eX]

[DOI]

Signal Process., 2010

Acoustic Model Combination Incorporated With Mask-Based Multi-Channel Source Separation for Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2010

A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Immersive modeling system (IMMS) for personal electronic products using a multi-modal interface.

[BibT_eX]

[DOI]

Comput. Aided Des., 2010

An Integrated Approach of 3D Sound Rendering Techniques for Sound Externalization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

SNR-based mask compensation for computational auditory scene analysis applied to speech recognition in a car environment.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

On the use of feature-space MLLR adaptation for non-native speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Statistical Model-Based Voice Activity Detection Using Spatial Cues and Log Energy for Dual-Channel Noisy Speech Recognition.

[BibT_eX]

[DOI]

Min Hwa Shin

Proceedings of the Communication and Networking, 2010

Design and Implementation of a Video-Zoom Driven Digital Audio-Zoom System for Portable Digital Imaging Devices.

[BibT_eX]

[DOI]

Proceedings of the Signal Processing and Multimedia, 2010

A Packet Loss Concealment Algorithm Robust to Burst Packet Loss Using Multiple Codebooks and Comfort Noise for CELP-Type Speech Coders.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2010

Duration Model-Based Post-processing for the Performance Improvement of a Keyword Spotting System.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2010

Complexity Reduction of WSOLA-Based Time-Scale Modification Using Signal Period Estimation.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2010

3D Sound Techniques for Sound Source Elevation in a Loudspeaker Listening Environment.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2010

A Real-Time Audio Upmixing Method from Stereo to 7.1-Channel Audio.

[BibT_eX]

[DOI]

Proceedings of the Communication and Networking, 2010

2009

Cepstrum-Domain Model Combination Based on Decomposition of Speech and Noise Using MMSE-LSA for ASR in Noisy Environments.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Bandwidth-Scalable Stereo Audio Coding Based on a Layered Structure.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

A media-specific FEC based on huffman coding for distributed speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Acoustic model combination to compensate for residual noise in multi-channel source separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Class-dependent and differential Huffman coding of compressed feature parameters for distributed speech recognition.

[BibT_eX]

[DOI]

Deok Su Kim

Proceedings of the IEEE International Conference on Acoustics, 2009

Upmixing Stereo Audio into 5.1 Channel Audio for Improving Audio Realism.

[BibT_eX]

[DOI]

Proceedings of the Signal Processing, Image Processing and Pattern Recognition, 2009

MLLR/MAP adaptation using pronunciation variation for non-native speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Cepstral domain interpretations of line spectral frequencies.

[BibT_eX]

[DOI]

Signal Process., 2008

HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Gammatone-domain model combination for consonant recognition in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Mask estimation incorporating time-frequency trajectories for a CASA-based ASR front-end.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Acoustic and pronunciation model adaptation for context-independent and context-dependent pronunciation variability of non-native speech.

[BibT_eX]

[DOI]

Mina Kim

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

A Statistical Approach to Error Compensation in Spectral Quantization.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2007

Bandwidth Extension of a Narrowband Speech Coder for Music Streaming Services Over IP Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Signal Processing Systems, 2007

Non-native pronunciation variation modeling using an indirect data driven method.

[BibT_eX]

[DOI]

Mina Kim

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Bandwidth Extension of a Narrowband Speech Coder for Music Delivery over IP.

[BibT_eX]

[DOI]

Proceedings of the Advances in Hybrid Information Technology, 2006

Acoustic Model Adaptation Based on Pronunciation Variability Analysis for Non-Native Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Highly Adaptive Acoustic Echo Cancellation Solution for VoIP Conferencing Systems.

[BibT_eX]

[DOI]

Umar Iqbal Choudhry

JongWon Kim

Proceedings of the 2006 IEEE/ACS International Conference on Computer Systems and Applications (AICCSA 2006), 2006

2005

Procedural Constraints in the Extended RBAC and the Coloured Petri Net Modeling.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

A CELP coder using MFCC for server-based speech recognition in mobile.

[BibT_eX]

Gil Ho Lee

Proceedings of the Signal and Image Processing (SIP 2005), 2005

A MFCC-based CELP speech coder for server-based speech recognition in network environments.

[BibT_eX]

[DOI]

Gil Ho Lee

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Error Prediction in Spoken Dialog: From Signal-to-Noise Ratio to Semantic Confidence Scores.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Harmonic Model Based Excitation Enhancement for Low-Bit-Rate Speech Coding.

[BibT_eX]

[DOI]

Mi Suk Lee

Chul Hong Kwon

IEICE Trans. Inf. Syst., 2004

Compensation of Speech Coding Distortion for Wireless Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2004

A Forward-Backward Voice Packet Loss Concealment Algorithm for Multimedia over IP Network Services.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Robust speech recognition in client-server scenarios.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Why speech recognizers make errors ? a robustness view.

[BibT_eX]

[DOI]

Mazin G. Rahim

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Improving the transcoding capability of speech coders.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2003

Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2003

2002

Performance improvement of a bitstream-based front-end for wireless speech recognition in adverse environments.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2002

An adaptive short-term postfilter based on pseudo-cepstral representation of line spectral frequencies.

[BibT_eX]

[DOI]

Speech Commun., 2002

Algorithms for distributed speech recognition in a noisy automobile environment.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Cepstrum-domain model combination based on decomposition of speech and noise for noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

A phase generation method for speech reconstruction from spectral envelope and pitch intervals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

A bitstream-based front-end for wireless speech recognition on IS-136 communications system.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2001

A frame erasure concealment algorithm based on gain parameter re-estimation for CELP coders.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2001

A new distortion measure for spectral quantization based on the LSF intermodel interlacing property.

[BibT_eX]

[DOI]

Mi Suk Lee

Speech Commun., 2001

Robust speech recognition techniques applied to a speech in noise task.

[BibT_eX]

[DOI]

Donald Hindle

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Feature enhancement for a bitstream-based front-end in wireless speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

On approximating line spectral frequencies to LPC cepstral coefficients.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2000

Speech recognition using quantized LSP parameters and their transformations in digital communication.

[BibT_eX]

[DOI]

Speech Commun., 2000

Bitstream-based feature extraction for wireless speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Use of spectral autocorrelation in spectral envelope linear prediction for speech recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1999

Interlacing properties of line spectrum pair frequencies.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1999

A 4 kbps adaptive fixed code-excited linear prediction speech coder.

[BibT_eX]

[DOI]

Mi Suk Lee

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

LSP weighting functions based on spectral sensitivity and mel-frequency warping for speech recognition in digital communication.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Adaptive encoding of fixed codebook in CELP coders.

[BibT_eX]

[DOI]