Yu Tsao

According to our database1, Yu Tsao authored at least 233 papers between 2001 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes.
IEEE Trans. Multim., 2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.
CoRR, 2021

The AS-NU System for the M2VoC Challenge.
CoRR, 2021

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification.
CoRR, 2021

EMA2S: An End-to-End Multimodal Articulatory-to-Speech System.
CoRR, 2021

Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification.
CoRR, 2021

Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario.
CoRR, 2021

MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

2020
Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder.
IEEE J. Biomed. Health Informatics, 2020

Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion.
IEEE Trans. Emerg. Top. Comput. Intell., 2020

Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multichannel Speech Enhancement by Raw Waveform-Mapping Using Fully Convolutional Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation.
IEEE Trans. Cogn. Dev. Syst., 2020

Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement.
IEEE Signal Process. Lett., 2020

WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement.
IEEE Signal Process. Lett., 2020

Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality.
IEEE Signal Process. Lett., 2020

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.
Comput. Speech Lang., 2020

Unsupervised neural adaptation model based on optimal transport for spoken language identification.
CoRR, 2020

Domain-adaptive Fall Detection Using Deep Adversarial Training.
CoRR, 2020

Speech Enhancement with Zero-Shot Model Selection.
CoRR, 2020

Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems.
CoRR, 2020

One Shot Learning for Speech Separation.
CoRR, 2020

Speech enhancement guided by contextual articulatory information.
CoRR, 2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement.
CoRR, 2020

The Academia Sinica Systems of Voice Conversion for VCC2020.
CoRR, 2020

Improved Lite Audio-Visual Speech Enhancement.
CoRR, 2020

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application.
CoRR, 2020

Using Deep Learning and Explainable Artificial Intelligence in Patients' Choices of Hospital Levels.
CoRR, 2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing.
CoRR, 2020

SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning.
CoRR, 2020

Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders.
CoRR, 2020

The IPIN 2019 Indoor Localisation Competition - Description and Results.
IEEE Access, 2020

Incorporating Broad Phonetic Information for Speech Enhancement.
Proceedings of the Interspeech 2020, 2020

iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning.
Proceedings of the Interspeech 2020, 2020

SERIL: Noise Adaptive Speech Enhancement Using Regularization-Based Incremental Learning.
Proceedings of the Interspeech 2020, 2020

Lite Audio-Visual Speech Enhancement.
Proceedings of the Interspeech 2020, 2020

Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-Based Voice Conversion System.
Proceedings of the Interspeech 2020, 2020

Space-Time Guided Association Learning For Unsupervised Person Re-Identification.
Proceedings of the IEEE International Conference on Image Processing, 2020

Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Computation-Performance Optimization of Convolutional Neural Networks With Redundant Filter Removal.
IEEE Trans. Circuits Syst. I Regul. Pap., 2019

Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques.
IEEE Signal Process. Lett., 2019

Deep progressive multi-scale attention for acoustic event classification.
CoRR, 2019

MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation.
CoRR, 2019

MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing.
CoRR, 2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement.
CoRR, 2019

Distributed Microphone Speech Enhancement based on Deep Learning.
CoRR, 2019

The ASVspoof 2019 database.
CoRR, 2019

Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation.
CoRR, 2019

Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement.
CoRR, 2019

Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks.
CoRR, 2019

Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training.
Biomed. Signal Process. Control., 2019

Evaluating Indoor Positioning Systems in a Shopping Mall: The Lessons Learned From the IPIN 2018 Competition.
IEEE Access, 2019

Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders.
IEEE Access, 2019

Garment Detectives: Discovering Clothes and Its Genre in Consumer Photos.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement.
Proceedings of the 2019 International Symposium on Intelligent Signal Processing and Communication Systems, 2019

Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric.
Proceedings of the Interspeech 2019, 2019

Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection.
Proceedings of the Interspeech 2019, 2019

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion.
Proceedings of the Interspeech 2019, 2019

IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network.
Proceedings of the Interspeech 2019, 2019

Noise Adaptive Speech Enhancement Using Domain Adversarial Training.
Proceedings of the Interspeech 2019, 2019

Incorporating Symbolic Sequential Modeling for Speech Enhancement.
Proceedings of the Interspeech 2019, 2019

Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR.
Proceedings of the Interspeech 2019, 2019

Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement.
Proceedings of the Interspeech 2019, 2019

Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech.
Proceedings of the Interspeech 2019, 2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement.
Proceedings of the 36th International Conference on Machine Learning, 2019

Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine.
Proceedings of the 27th European Signal Processing Conference, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion.
Proceedings of the 27th European Signal Processing Conference, 2019

Subjective Feedback-based Neural Network Pruning for Speech Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

A Pruned-CELP Speech Codec Using Denoising Autoencoder with Spectral Compensation for Quality and Intelligibility Enhancement.
Proceedings of the IEEE International Conference on Artificial Intelligence Circuits and Systems, 2019

2018
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks.
IEEE Trans. Emerg. Top. Comput. Intell., 2018

Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Bone-conducted speech enhancement using deep denoising autoencoder.
Speech Commun., 2018

SmartHear: A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies.
IEEE Syst. J., 2018

Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition.
Sensors, 2018

Locally Linear Embedding Based Post-Filtering for Speech Enhancement.
J. Inf. Sci. Eng., 2018

Voice Conversion Based on Locally Linear Embedding.
J. Inf. Sci. Eng., 2018

Robustness against the channel effect in pathological voice detection.
CoRR, 2018

Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform.
CoRR, 2018

Speech Dereverberation Based on Integrated Deep and Ensemble Learning.
CoRR, 2018

Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller.
IEEE Access, 2018

A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN).
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform.
Proceedings of the 2018 IEEE International Workshop on Signal Processing Systems, 2018

WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

IOS-based Ear Scale application for Clinical Audiology and Otology Usage.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via DiscreteWavelet Transform.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Hearing aids APP design based on deep learning technology.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Exemplar-Based Spectral Detail Compensation for Voice Conversion.
Proceedings of the Interspeech 2018, 2018

Temporal Attentive Pooling for Acoustic Event Detection.
Proceedings of the Interspeech 2018, 2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.
Proceedings of the Interspeech 2018, 2018

An Industrial IoT Analysis System Based on Machining Data of Metal Materials.
Proceedings of the International Conference on Fuzzy Theory and Its Applications, 2018

A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Enhancement and Analysis of Conversational Speech: JSALT 2017.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Congruent Visual Stimulation Facilitates Auditory Frequency Change Detection: An ERP Study.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Improving the performance of hearing aids in noisy environments based on deep learning technology.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation.
IEEE Trans. Biomed. Eng., 2017

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.
IEEE Trans. Biomed. Eng., 2017

S1 and S2 Heart Sound Recognition Using Deep Neural Networks.
IEEE Trans. Biomed. Eng., 2017

Personalizing Recurrent-Neural-Network-Based Language Model by Social Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

A Replay Spoofing Detection System Based on Discriminative Autoencoders.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

Regularization of neural network model with distance metric learning for i-vector based spoken language identification.
Comput. Speech Lang., 2017

Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT).
Comput. Speech Lang., 2017

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
CoRR, 2017

Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller.
CoRR, 2017

Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network.
CoRR, 2017

Multi-Metrics Learning for Speech Enhancement.
CoRR, 2017

Experimental Study on Extreme Learning Machine Applications for Speech Enhancement.
IEEE Access, 2017

A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom.
IEEE Access, 2017

以軟體為基礎建構語音增強系統使用者介面 (Development of a software-based User-Interface of Speech Enhancement System) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

改進的向量空間可適性濾波器用於聲學回聲消除 (Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

多樣訊雜比之訓練語料於降噪自動編碼器其語音強化功能之初步研究 (A Preliminary Study of Various SNR-level Training Data in the Denoising Auto-encoder (DAE) Technique for Speech Enhancement) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Object-based on-line video summarization for internet of video things.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Discriminative Autoencoders for Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement.
Proceedings of the Interspeech 2017, 2017

Wavelet Speech Enhancement Based on Robust Principal Component Analysis.
Proceedings of the Interspeech 2017, 2017

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks.
Proceedings of the Interspeech 2017, 2017

A locally linear embbeding based postfiltering approach for speech enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Discriminative autoencoders for speaker verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Track-Clustering Error Evaluation for Track-Based Multi-camera Tracking System Employing Human Re-identification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Fast locally linear embedding algorithm for exemplar-based voice conversion.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Raw waveform-based speech enhancement by fully convolutional networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Acoustic echo cancellation using deep cerebellar model articulation controller.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization.
IEEE Signal Process. Lett., 2016

Generalized maximum a posteriori spectral amplitude estimation for speech enhancement.
Speech Commun., 2016

Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus.
Speech Commun., 2016

Transportation Modes Classification Using Sensors on Smartphones.
Sensors, 2016

Maximum Entropy Learning with Deep Belief Networks.
Entropy, 2016

Robust Beamforming Against DoA Mismatch Using Subspace-Constrained Diagonal Loading.
CoRR, 2016

Image Retrieval Using Color-Aware Tag on Progressive Image Search and Recommendation System.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A pseudo-task design in multi-task learning deep neural network for speaker recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving the performance of speech perception in noisy environment based on an FAME strategy.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Incorporating local environment information with ensemble neural networks to robust automatic speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Dictionary update for NMF-based voice conversion using an encoder-decoder network.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Locally Linear Embedding for Exemplar-Based Spectral Conversion.
Proceedings of the Interspeech 2016, 2016

Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification.
Proceedings of the Interspeech 2016, 2016

Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation.
Proceedings of the Interspeech 2016, 2016

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A study of mobile advertisement recommendation using real big data from AdLocus.
Proceedings of the IEEE 5th Global Conference on Consumer Electronics, 2016

A linear regression model with dynamic pulse transit time features for noninvasive blood pressure prediction.
Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2016

Temporal Modulation Spectral Restoration for Robust Speech Recognition.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Adaptive subspace-constrained diagonal loading.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Voice conversion from non-parallel corpora using variational auto-encoder.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Audio-visual speech enhancement using deep neural networks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Compensating for Orientation Mismatch in Robust Wi-Fi Localization Using Histogram Equalization.
IEEE Trans. Veh. Technol., 2015

Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm.
IEEE Signal Process. Lett., 2015

Ensemble environment modeling using affine transform group.
Speech Commun., 2015

Rapid Converging M-Max Partial Update Least Mean Square Algorithms with New Variable Step-Size Methods.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015

Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation.
IEICE Trans. Inf. Syst., 2015

類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

Sparse representation with temporal max-smoothing for acoustic event detection.
Proceedings of the INTERSPEECH 2015, 2015

Speech recognition with temporal neural networks.
Proceedings of the INTERSPEECH 2015, 2015

A discriminative post-filter for speech enhancement in hearing aids.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Multimodal arousal rating using unsupervised fusion technique.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A new frequency lowering technique for Mandarin-speaking hearing aid users.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Temporal alignment for deep neural networks.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

A probabilistic interpretation for artificial neural network-based voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Variable Selection Linear Regression for Robust Speech Recognition.
IEICE Trans. Inf. Syst., 2014

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation.
Comput. Speech Lang., 2014

Effect of adaptive envelope compression in simulated electric hearing in reverberation.
Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

Acoustic feature conversion using a polynomial based feature transferring algorithm.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Spectral patch based sparse coding for acoustic event detection.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Ensemble modeling of denoising autoencoder for speech spectrum restoration.
Proceedings of the INTERSPEECH 2014, 2014

Automatic speech recognition with primarily temporal envelope information.
Proceedings of the INTERSPEECH 2014, 2014

Clustering-based i-vector formulation for speaker recognition.
Proceedings of the INTERSPEECH 2014, 2014

An adaptive envelope compression strategy for speech processing in cochlear implants.
Proceedings of the INTERSPEECH 2014, 2014

Ensemble of machine learning algorithms for cognitive and physical speaker load detection.
Proceedings of the INTERSPEECH 2014, 2014

A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Sparse representation based on a bag of spectral exemplars for acoustic event detection.
Proceedings of the IEEE International Conference on Acoustics, 2014

Speech enhancement using segmental nonnegative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014

Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Evaluation of generalized maximum a posteriori spectral amplitude (GMAPA) speech enhancement algorithm in hearing aids.
Proceedings of the IEEE International Symposium on Consumer Electronics, 2013

Recurrent neural network based language model personalization by social network crowdsourcing.
Proceedings of the INTERSPEECH 2013, 2013

Speech enhancement based on deep denoising autoencoder.
Proceedings of the INTERSPEECH 2013, 2013

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition.
Proceedings of the INTERSPEECH 2013, 2013

Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training.
Proceedings of the INTERSPEECH 2013, 2013

Sparse maximum entropy deep belief nets.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Semantic Naïve Bayes Classifier for Document Classification.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Filtering on the temporal probability sequence in histogram equalization for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speech enhancement using generalized maximum a posteriori spectral amplitude estimator.
Proceedings of the IEEE International Conference on Acoustics, 2013

Robust Wi-Fi location fingerprinting against device diversity based on spatial mean normalization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Incorporating global variance in the training phase of GMM-based voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
A study on cepstral sub-band normalization for robust ASR.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Acoustic space partition based on broad phonetic class for ensemble acoustic modeling.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Exploring mutual information for GMM-based spectral conversion.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A Study of Mutual Information for GMM-Based Spectral Conversion.
Proceedings of the INTERSPEECH 2012, 2012

Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation.
Proceedings of the INTERSPEECH 2012, 2012

A linear projection approach to environment modeling for robust speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

A sampling-based environment population projection approach for rapid acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

A particle filter feature compensation approach to robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Shrinkage model adaptation in automatic speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2009

Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling.
Proceedings of the 3rd International Universal Communication Symposium, 2009

A study on soft margin estimation of linear regression parameters for speaker adaptation.
Proceedings of the INTERSPEECH 2009, 2009

Ensemble speaker and speaking environment modeling approach with advanced online estimation process.
Proceedings of the IEEE International Conference on Acoustics, 2009

MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
An ensemble speaker and speaking environment modeling approach to robust speech recognition.
PhD thesis, 2008

Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process.
Proceedings of the INTERSPEECH 2008, 2008

A programmable analog radial-basis-function based classifier.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
An ensemble modeling approach to joint characterization of speaker and speaking environments.
Proceedings of the INTERSPEECH 2007, 2007

Detection-based ASR in the automatic speech attribute transcription project.
Proceedings of the INTERSPEECH 2007, 2007

Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
A vector space approach to environment modeling for robust speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

A study on detection based automatic speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

2005
Segmental eigenvoice with delicate eigenspace for improved speaker adaptation.
IEEE Trans. Speech Audio Process., 2005

A study on separation between acoustic models and its applications.
Proceedings of the INTERSPEECH 2005, 2005

A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2001
Segmental eigenvoice for rapid speaker adaptation.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001


  Loading...