Yu Tsao

Supratip Ghose

Chia-Yu Chang

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Deep Learning and Explainable Artificial Intelligence to Predict Patients' Choice of Hospital Levels in Urban and Rural Areas.

[BibT_eX]

[DOI]

Proceedings of the MEDINFO 2021: One World, One Health - Global Partnership for Digital Innovation, 2021

MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Attention-Based Multi-Task Learning for Speech-Enhancement and Speaker-Identification in Multi-Speaker Dialogue Scenario.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

EMA2S: An End-to-End Multimodal Articulatory-to-Speech System.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

One Shot Learning for Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Enhancement with Zero-Shot Model Selection.

[BibT_eX]

[DOI]

Chiou-Shann Fuh

Hsin-Min Wang

Aswin Shanmugam Subramanian

Proceedings of the 29th European Signal Processing Conference, 2021

A Study of Incorporating Articulatory Movement Information in Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder.

[BibT_eX]

[DOI]

Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2021

Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

A Study on Speech Enhancement Based on Diffusion Probabilistic Model.

[BibT_eX]

[DOI]

Yen-Ju Lu

Shinji Watanabe

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues.

[BibT_eX]

[DOI]

Zicheng Feng

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, 2020

Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput. Intell., 2020

Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders.

[BibT_eX]

[DOI]

Cheng Yu

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multichannel Speech Enhancement by Raw Waveform-Mapping Using Fully Convolutional Networks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation.

[BibT_eX]

[DOI]

Hsiao-Lan Sharon Wang

Valerio Mario Salerno

IEEE Trans. Cogn. Dev. Syst., 2020

Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality.

[BibT_eX]

[DOI]

Szu-Wei Fu

Chien-Feng Liao

IEEE Signal Process. Lett., 2020

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2020

Domain-adaptive Fall Detection Using Deep Adversarial Training.

[BibT_eX]

[DOI]

CoRR, 2020

ECG Signal Super-resolution by Considering Reconstruction and Cardiac Arrhythmias Classification Loss.

[BibT_eX]

[DOI]

CoRR, 2020

Speech enhancement guided by contextual articulatory information.

[BibT_eX]

[DOI]

CoRR, 2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2020

The Academia Sinica Systems of Voice Conversion for VCC2020.

[BibT_eX]

[DOI]

Yu-Huai Peng

Cheng-Hung Hu

Alexander Chao-Fu Kang

CoRR, 2020

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application.

[BibT_eX]

[DOI]

Alexander Chao-Fu Kang

CoRR, 2020

Using Deep Learning and Explainable Artificial Intelligence in Patients' Choices of Hospital Levels.

[BibT_eX]

[DOI]

Lichin Chen

Ji-Tian Sheu

CoRR, 2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing.

[BibT_eX]

[DOI]

CoRR, 2020

SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders.

[BibT_eX]

[DOI]

Cheng Yu

Antonio Ramón Jiménez Ruiz

CoRR, 2020

The IPIN 2019 Indoor Localisation Competition - Description and Results.

[BibT_eX]

[DOI]

Joaquín Torres-Sospedra

Antoni Pérez-Navarro

Germán Martín Mendoza-Silva

Emilio Sansano-Sansano

Vicente Cortés Puschel

Tomás Lungenstrass Poulsen

Adriano J. C. Moreira

IEEE Access, 2020

Incorporating Broad Phonetic Information for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

SERIL: Noise Adaptive Speech Enhancement Using Regularization-Based Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

Lite Audio-Visual Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-Based Voice Conversion System.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

Space-Time Guided Association Learning For Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2020

Exponentiated magnitude spectrogram-based relative-to-maximum masking for speech enhancement in adverse environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2020

Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Cross-Technology Interference Mitigation Using Fully Convolutional Denoising Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE Global Communications Conference, 2020

STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing.

[BibT_eX]

[DOI]

Germán Martín Mendoza-Silva

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Computation-Performance Optimization of Convolutional Neural Networks With Redundant Filter Removal.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2019

Toward Automating Oral Presentation Scoring During Principal Certification Program Using Audio-Video Low-Level Behavior Profiles.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2019

Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2019

Deep progressive multi-scale attention for acoustic event classification.

[BibT_eX]

[DOI]

CoRR, 2019

MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation.

[BibT_eX]

[DOI]

CoRR, 2019

MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing.

[BibT_eX]

[DOI]

CoRR, 2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2019

Distributed Microphone Speech Enhancement based on Deep Learning.

[BibT_eX]

[DOI]

CoRR, 2019

The ASVspoof 2019 database.

[BibT_eX]

[DOI]

CoRR, 2019

Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation.

[BibT_eX]

[DOI]

CoRR, 2019

Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement.

[BibT_eX]

[DOI]

Natalie Yu-Hsien Wang

Hsiao-Lan Sharon Wang

CoRR, 2019

Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2019

Evaluating Indoor Positioning Systems in a Shopping Mall: The Lessons Learned From the IPIN 2018 Competition.

[BibT_eX]

[DOI]

Valérie Renaudin

Miguel Ortiz

Johan Perul

Joaquín Torres-Sospedra

Antonio Ramón Jiménez

Antoni Pérez-Navarro

IEEE Access, 2019

Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders.

[BibT_eX]

[DOI]

IEEE Access, 2019

Speech enhancement based on the integration of fully convolutional network, temporal lowpass filtering and spectrogram masking.

[BibT_eX]

[DOI]

Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

Garment Detectives: Discovering Clothes and Its Genre in Consumer Photos.

[BibT_eX]

[DOI]

Shintami Chusnul Hidayati

Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine.

[BibT_eX]

[DOI]

Jia-Ching Wang

Hsin-Min Wang

Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement.

[BibT_eX]

[DOI]

Join W. C. Sigalingging

Jia-Ching Wang

Proceedings of the 2019 International Symposium on Intelligent Signal Processing and Communication Systems, 2019

Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Noise Adaptive Speech Enhancement Using Domain Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Incorporating Symbolic Sequential Modeling for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.

[BibT_eX]

[DOI]

Wen-Chin Huang

Yi-Chiao Wu

Chen-Chou Lo

Patrick Lumban Tobing

Proceedings of the Interspeech 2019, 2019

Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech.

[BibT_eX]

[DOI]

Li-Wei Chen

Hung-yi Lee

Proceedings of the Interspeech 2019, 2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion.

[BibT_eX]

[DOI]

Wen-Chin Huang

Yi-Chiao Wu

Hsin-Te Hwang

Patrick Lumban Tobing

Proceedings of the 27th European Signal Processing Conference, 2019

Subjective Feedback-based Neural Network Pruning for Speech Enhancement.

[BibT_eX]

[DOI]

Fuqiang Ye

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

A Pruned-CELP Speech Codec Using Denoising Autoencoder with Spectral Compensation for Quality and Intelligibility Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Artificial Intelligence Circuits and Systems, 2019

2018

Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput. Intell., 2018

Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Bone-conducted speech enhancement using deep denoising autoencoder.

[BibT_eX]

[DOI]

Hung-Ping Liu

Chiou-Shann Fuh

Speech Commun., 2018

SmartHear: A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies.

[BibT_eX]

[DOI]

IEEE Syst. J., 2018

Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition.

[BibT_eX]

[DOI]

Sensors, 2018

Locally Linear Embedding Based Post-Filtering for Speech Enhancement.

[BibT_eX]

[DOI]

J. Inf. Sci. Eng., 2018

Voice Conversion Based on Locally Linear Embedding.

[BibT_eX]

[DOI]

J. Inf. Sci. Eng., 2018

Robustness against the channel effect in pathological voice detection.

[BibT_eX]

[DOI]

CoRR, 2018

Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform.

[BibT_eX]

[DOI]

CoRR, 2018

Speech Dereverberation Based on Integrated Deep and Ensemble Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller.

[BibT_eX]

[DOI]

IEEE Access, 2018

A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN).

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Workshop on Signal Processing Systems, 2018

WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

IOS-based Ear Scale application for Clinical Audiology and Otology Usage.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via DiscreteWavelet Transform.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Hearing aids APP design based on deep learning technology.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Exemplar-Based Spectral Detail Compensation for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2018, 2018

Temporal Attentive Pooling for Acoustic Event Detection.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2018, 2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2018, 2018

An Industrial IoT Analysis System Based on Machining Data of Metal Materials.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Fuzzy Theory and Its Applications, 2018

A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Enhancement and Analysis of Conversational Speech: JSALT 2017.

[BibT_eX]

[DOI]

Mahesh Krishnamoorthy

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Congruent Visual Stimulation Facilitates Auditory Frequency Change Detection: An ERP Study.

[BibT_eX]

[DOI]

Lei Wang

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Improving the performance of hearing aids in noisy environments based on deep learning technology.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation.

[BibT_eX]

[DOI]

IEEE Trans. Biomed. Eng., 2017

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.

[BibT_eX]

[DOI]

IEEE Trans. Biomed. Eng., 2017

S1 and S2 Heart Sound Recognition Using Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Biomed. Eng., 2017

Personalizing Recurrent-Neural-Network-Based Language Model by Social Network.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

A Replay Spoofing Detection System Based on Discriminative Autoencoders.

[BibT_eX]

[DOI]

Int. J. Comput. Linguistics Chin. Lang. Process., 2017

Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm.

[BibT_eX]

[DOI]

Jin Li-You

Ying-Ren Chien

Int. J. Comput. Linguistics Chin. Lang. Process., 2017

Regularization of neural network model with distance metric learning for i-vector based spoken language identification.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT).

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller.

[BibT_eX]

[DOI]

CoRR, 2017

Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network.

[BibT_eX]

[DOI]

CoRR, 2017

Multi-Metrics Learning for Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2017

Experimental Study on Extreme Learning Machine Applications for Speech Enhancement.

[BibT_eX]

[DOI]

IEEE Access, 2017

A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom.

[BibT_eX]

[DOI]

IEEE Access, 2017

以軟體為基礎建構語音增強系統使用者介面 (Development of a software-based User-Interface of Speech Enhancement System) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

改進的向量空間可適性濾波器用於聲學回聲消除 (Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm) [In Chinese].

[BibT_eX]

[DOI]

Jin Li-You

Ying-Ren Chien

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

多樣訊雜比之訓練語料於降噪自動編碼器其語音強化功能之初步研究 (A Preliminary Study of Various SNR-level Training Data in the Denoising Auto-encoder (DAE) Technique for Speech Enhancement) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Object-based on-line video summarization for internet of video things.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Discriminative Autoencoders for Acoustic Modeling.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

Wavelet Speech Enhancement Based on Robust Principal Component Analysis.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

A locally linear embbeding based postfiltering approach for speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Discriminative autoencoders for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Track-Clustering Error Evaluation for Track-Based Multi-camera Tracking System Employing Human Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise.

[BibT_eX]

[DOI]

Hsiao-Lan Sharon Wang

Lieber Po-Hung Li

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Fast locally linear embedding algorithm for exemplar-based voice conversion.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Raw waveform-based speech enhancement by fully convolutional networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Acoustic echo cancellation using deep cerebellar model articulation controller.

[BibT_eX]

[DOI]

Shih-Wei Lan

Junghsi Lee

Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016

Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2016

Generalized maximum a posteriori spectral amplitude estimation for speech enhancement.

[BibT_eX]

[DOI]

Speech Commun., 2016

Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus.

[BibT_eX]

[DOI]

Speech Commun., 2016

Transportation Modes Classification Using Sensors on Smartphones.

[BibT_eX]

[DOI]

Sensors, 2016

Maximum Entropy Learning with Deep Belief Networks.

[BibT_eX]

[DOI]

Entropy, 2016

Robust Beamforming Against DoA Mismatch Using Subspace-Constrained Diagonal Loading.

[BibT_eX]

[DOI]

CoRR, 2016

Image Retrieval Using Color-Aware Tag on Progressive Image Search and Recommendation System.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A pseudo-task design in multi-task learning deep neural network for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving the performance of speech perception in noisy environment based on an FAME strategy.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Incorporating local environment information with ensemble neural networks to robust automatic speech recognition.

[BibT_eX]

[DOI]

Chia-Yung Hsu

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Dictionary update for NMF-based voice conversion using an encoder-decoder network.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Locally Linear Embedding for Exemplar-Based Spectral Conversion.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.

[BibT_eX]

[DOI]

Szu-Wei Fu

Xugang Lu

Proceedings of the Interspeech 2016, 2016

Speech enhancement via ensemble modeling NMF adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

Leveraging nonnegative matrix factorization in processing the temporal modulation spectrum for speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users.

[BibT_eX]

[DOI]

Yen-Teh Liu

Ronald Y. Chang

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A study of mobile advertisement recommendation using real big data from AdLocus.

[BibT_eX]

[DOI]

Proceedings of the IEEE 5th Global Conference on Consumer Electronics, 2016

A linear regression model with dynamic pulse transit time features for noninvasive blood pressure prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2016

Temporal Modulation Spectral Restoration for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Adaptive subspace-constrained diagonal loading.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Voice conversion from non-parallel corpora using variational auto-encoder.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Audio-visual speech enhancement using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Compensating for Orientation Mismatch in Robust Wi-Fi Localization Using Histogram Equalization.

[BibT_eX]

[DOI]

Shih-Hau Fang

Chu-Hsuan Wang

IEEE Trans. Veh. Technol., 2015

Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm.

[BibT_eX]

[DOI]

Shih-Hau Fang

Yao Shiao

IEEE Signal Process. Lett., 2015

Ensemble environment modeling using affine transform group.

[BibT_eX]

[DOI]

Speech Commun., 2015

Rapid Converging M-Max Partial Update Least Mean Square Algorithms with New Variable Step-Size Methods.

[BibT_eX]

[DOI]

Jin Li-You

Ying-Ren Chien

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015

Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2015

類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese].

[BibT_eX]

[DOI]

Chia-Yung Hsu

Jia-Ching Wang

Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

Sparse representation with temporal max-smoothing for acoustic event detection.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2015, 2015

Speech recognition with temporal neural networks.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2015, 2015

A deep neural network based approach to mandarin consonant/vowel separation.

[BibT_eX]

[DOI]

Yen-Teh Liu

Ronald Y. Chang

Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

Temporal information in tone recognition.

[BibT_eX]

[DOI]

Payton Lin

Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

A discriminative post-filter for speech enhancement in hearing aids.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Multimodal arousal rating using unsupervised fusion technique.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A new frequency lowering technique for Mandarin-speaking hearing aid users.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Temporal alignment for deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

A probabilistic interpretation for artificial neural network-based voice conversion.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Variable Selection Linear Regression for Robust Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2014

Effect of adaptive envelope compression in simulated electric hearing in reverberation.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

Acoustic feature conversion using a polynomial based feature transferring algorithm.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Spectral patch based sparse coding for acoustic event detection.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Ensemble modeling of denoising autoencoder for speech spectrum restoration.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

Automatic speech recognition with primarily temporal envelope information.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

Clustering-based i-vector formulation for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

An adaptive envelope compression strategy for speech processing in cochlear implants.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

Ensemble of machine learning algorithms for cognitive and physical speaker load detection.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Sparse representation based on a bag of spectral exemplars for acoustic event detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Speech enhancement using segmental nonnegative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Evaluation of generalized maximum a posteriori spectral amplitude (GMAPA) speech enhancement algorithm in hearing aids.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Consumer Electronics, 2013

Recurrent neural network based language model personalization by social network crowdsourcing.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Speech enhancement based on deep denoising autoencoder.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition.

[BibT_eX]

[DOI]

Bo Li

Khe Chai Sim

Proceedings of the INTERSPEECH 2013, 2013

Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Sparse maximum entropy deep belief nets.

[BibT_eX]

[DOI]

How Jing

Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Semantic Naïve Bayes Classifier for Document Classification.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Filtering on the temporal probability sequence in histogram equalization for robust speech recognition.

[BibT_eX]

[DOI]

Jeih-Weih Hung

Proceedings of the IEEE International Conference on Acoustics, 2013

Speech enhancement using generalized maximum a posteriori spectral amplitude estimator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Robust Wi-Fi location fingerprinting against device diversity based on spatial mean normalization.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Incorporating global variance in the training phase of GMM-based voice conversion.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

A study on cepstral sub-band normalization for robust ASR.

[BibT_eX]

[DOI]

Jeih-Weih Hung

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Acoustic space partition based on broad phonetic class for ensemble acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Exploring mutual information for GMM-based spectral conversion.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A Study of Mutual Information for GMM-Based Spectral Conversion.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2012, 2012

Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation.

[BibT_eX]

[DOI]

Ting-Yao Hu

Lin-Shan Lee

Proceedings of the INTERSPEECH 2012, 2012

A linear projection approach to environment modeling for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2011, 2011

A sampling-based environment population projection approach for rapid acoustic model adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

A particle filter feature compensation approach to robust speech recognition.

[BibT_eX]

[DOI]

Aleem Mushtaq

Proceedings of the INTERSPEECH 2010, 2010

Shrinkage model adaptation in automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2010, 2010

An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Universal Communication Symposium, 2009

A study on soft margin estimation of linear regression parameters for speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2009, 2009

Ensemble speaker and speaking environment modeling approach with advanced online estimation process.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

An ensemble speaker and speaking environment modeling approach to robust speech recognition.

[BibT_eX]

[DOI]

PhD thesis, 2008

Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2008, 2008

A programmable analog radial-basis-function based classifier.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

An ensemble modeling approach to joint characterization of speaker and speaking environments.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2007, 2007

Detection-based ASR in the automatic speech attribute transcription project.

[BibT_eX]

[DOI]

Antonio Moreno-Daniel

Jeremy Morris

Yu Wang

Proceedings of the INTERSPEECH 2007, 2007

Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

A vector space approach to environment modeling for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2006, 2006

A study on detection based automatic speech recognition.

[BibT_eX]

[DOI]

Chengyuan Ma

Proceedings of the INTERSPEECH 2006, 2006

2005

Segmental eigenvoice with delicate eigenspace for improved speaker adaptation.

[BibT_eX]

[DOI]

Shang-Ming Lee

Lin-Shan Lee

IEEE Trans. Speech Audio Process., 2005

A study on separation between acoustic models and its applications.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2005, 2005

A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition.

[BibT_eX]

[DOI]