Proceedings of the 2025 28th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2025

Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition.

[BibT_eX]

[DOI]

Chien-Chun Wang

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

QAMRO: Quality-aware Adaptive Margin Ranking Optimization for Human-aligned Assessment of Audio Generation Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Revealing the Role of Audio Channels in ASR Performance Degradation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

DRASP: A Dual-Resolution Attentive Statistics Pooling Framework for Automatic MOS Prediction.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Leveraging Retrieval-Augmented Generation for Culturally Inclusive Hakka Chatbots: Design Insights and User Perceptions.

[BibT_eX]

[DOI]

Chen-Chi Chang

Han-Pi Chang

Hung-Shin Lee

CoRR, 2024

Effective Noise-Aware Data Simulation For Domain-Adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

VoxHakka: A Dialectally Diverse Multi-Speaker Text-to-Speech System for Taiwanese Hakka.

[BibT_eX]

[DOI]

Li-Wei Chen

Hung-Shin Lee

Chen-Chi Chang

Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024

Benchmarking Cognitive Domains for LLMS: Insights from Taiwanese Hakka Culture.

[BibT_eX]

[DOI]

Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024

2023

Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2023

The North System for Formosa Speech Recognition Challenge 2023.

[BibT_eX]

[DOI]

Li-Wei Chen

Kai-Chen Cheng

Hung-Shin Lee

CoRR, 2023

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

CasNet: Investigating Channel Robustness for Speech Separation.

[BibT_eX]

[DOI]

CoRR, 2022

A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference.

[BibT_eX]

[DOI]

CoRR, 2022

Filter-based Discriminative Autoencoders for Children Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-Target Filter and Detector for Speaker Diarization.

[BibT_eX]

[DOI]

CoRR, 2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Chain-based Discriminative Autoencoders for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

AlloST: Low-Resource Speech Translation Without Source Transcription.

[BibT_eX]

[DOI]

Yao-Fei Cheng

Hung-Shin Lee

Hsin-Min Wang

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Melody Harmonization Using Orderless Nade, Chord Balancing, and Blocked Gibbs Sampling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Generation of Speaker Representations Using Heterogeneous Training Batch Assembly.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2020

Joint Training of Guided Learning and Mean Teacher Models for Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

The Academia Sinica Systems of Voice Conversion for VCC2020.

[BibT_eX]

[DOI]

Yu-Huai Peng

Cheng-Hung Hu

Alexander Chao-Fu Kang

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019

Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Multi-task Learning for Acoustic Modeling Using Articulatory Attributes.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

2017

A Replay Spoofing Detection System Based on Discriminative Autoencoders.

[BibT_eX]

[DOI]

Int. J. Comput. Linguistics Chin. Lang. Process., 2017

基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統 (Speaker Diarization based on I-vector PLDA Scoring and using GMM-HMM Forced Alignment) [In Chinese].

[BibT_eX]

[DOI]

Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Discriminative Autoencoders for Acoustic Modeling.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Discriminative autoencoders for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Incorporating proximity information in relevance language modeling for extractive speech summarization.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Clustering-based i-vector formulation for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Ensemble of machine learning algorithms for cognitive and physical speaker load detection.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker verification using kernel-based binary classifiers with binary operation derived features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

I-vector based language modeling for spoken document retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Subspace-based phonotactic language recognition using multivariate dynamic linear models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

A Study of Language Modeling for Chinese Spelling Check.

[BibT_eX]

[DOI]

Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, 2013

2012

Subspace-Based Feature Representation and Learning for Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Learning the Similarity of Audio Music in Bag-of-frames Representation from Tagged Music Data.

[BibT_eX]

[DOI]

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

2010

Exploiting semantic associative information in topic modeling.

[BibT_eX]

[DOI]

Meng-Sung Wu

Hung-Shin Lee

Hsin-Min Wang

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification.

[BibT_eX]

[DOI]

Hung-Shin Lee

Hsin-Min Wang

Berlin Chen

Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009

相似度比率式鑑別分析應用於大詞彙連續語音辨識 (Likelihood Ratio Based Discriminant Analysis for Large Vocabulary Continuous Speech Recognition) [In Chinese].

[BibT_eX]

[DOI]

Hung-Shin Lee

Berlin Chen

Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Empirical error rate minimization based linear discriminant analysis.

[BibT_eX]

[DOI]

Hung-Shin Lee

Berlin Chen

Proceedings of the IEEE International Conference on Acoustics, 2009

Generalized likelihood ratio discriminant analysis.

[BibT_eX]

[DOI]

Hung-Shin Lee

Berlin Chen

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates.

[BibT_eX]

[DOI]

Hung-Shin Lee

Berlin Chen

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Linear discriminant feature extraction using weighted classification confusion information.

[BibT_eX]

[DOI]

Hung-Shin Lee

Berlin Chen

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Training data selection for improving discriminative training of acoustic models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Hung-Shin Lee

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...