Chia-Ping Chen

Orcid: 0000-0002-7022-3061

According to our database1, Chia-Ping Chen authored at least 96 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Speaker Verification System Based on Time Delay Neural Network with Pre-activated CNN Stem and Deep Layer Aggregation.
J. Inf. Sci. Eng., March, 2024

Training Speech Recognition Model with Speech Synthesis and Text Discriminator.
J. Inf. Sci. Eng., March, 2024

Improving Speech Synthesis by Automatic Speech Recognition and Speech Discriminator.
J. Inf. Sci. Eng., January, 2024

2023
Sound Event Detection System Based on VGGSKCCT Model Architecture with Knowledge Distillation.
Appl. Artif. Intell., 2023

Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Regression-based Sound Event Detection with Semi-supervised Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

A Lightweight Speaker Verification Model For Edge Device.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Audio Time-Scale Modification with Temporal Compressing Networks.
CoRR, 2022

On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting.
CoRR, 2022

On the Efficiency of Integrating Self-Supervised Learning and Meta-Learning for User-Defined Few-Shot Keyword Spotting.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Lightweight Sound Event Detection Model with RepVGG Architecture.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Development of Mandarin-English code-switching speech synthesis system.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Mandarin-English Code-Switching Speech Recognition System for Specific Domain.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Investigation of feature processing modules and attention mechanisms in speaker verification system.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Denoising Likelihood Score Matching for Conditional Score-based Data Generation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Discussion on domain generalization in the cross-device speaker verification system.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

Exploiting Low-Resource Code-Switching Data to Mandarin-English Speech Recognition Systems.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

RCRNN-based Sound Event Detection System with Specific Speech Resolution.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

Improving Time Delay Neural Network Based Speaker Recognition with Convolutional Block and Feature Aggregation Methods.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Systems for Low-Resource Speech Recognition Tasks in Open Automatic Speech Recognition and Formosa Speech Recognition Challenges.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-supervised Sound Event Detection Using Multiscale Channel Attention and Multiple Consistency Training.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

CLCC: Contrastive Learning for Color Constancy.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semi-Supervised Sound Event Detection Using Self-Attention and Multiple Techniques of Consistency Training.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
NSYSU+CHT Speaker Verification System for Far-Field Speaker Verification Challenge 2020.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Real-Time Single-Speaker Taiwanese-Accented Mandarin Speech Synthesis System.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Improving Embedding-based Neural-Network Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Explorable Tone Mapping Operators.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Learning Camera-Aware Noise Models.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
以三元組損失微調時延神經網路語者嵌入函數之語者辨識系統(Time Delay Neural Network-based Speaker Embedding Function Fine-tuned with Triplet Loss for Distance-based Speaker Recognition).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

即時中文語音合成系統(Real-Time Mandarin Speech Synthesis System).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

AI Deep Learning with Multiple Labels for Sentiment Classification of Tweets.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Transfer-Representation Learning for Detecting Spoofing Attacks with Converted and Synthesized Speech in Automatic Speaker Verification System.
Proceedings of the Interspeech 2019, 2019

Speaker Characterization Using TDNN-LSTM Based Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Image Haze Removal By Adaptive CycleGAN.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Combining De-noising Auto-encoder and Recurrent Neural Networks in End-to-End Automatic Speech Recognition for Noise Robustness.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

deepSA2018 at SemEval-2018 Task 1: Multi-task Learning of Different Label for Affect in Tweets.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

結合卷積神經網路與遞迴神經網路於推文極性分類 (Combining Convolutional Neural Network and Recurrent Neural Network for Tweet Polarity Classification) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Effective Attention Mechanism in Dynamic Models for Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
deepSA at SemEval-2017 Task 4: Interpolated Deep Neural Networks for Sentiment Analysis in Twitter.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Using Teacher-Student Model For Emotional Speech Recognition[In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Speech emotion recognition with ensemble learning methods.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Speech emotion recognition with skew-robust neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
標記對於類神經語音情緒辨識系統辨識效果之影響(Effects of Label in Neural Speech Emotion Recognition System)[In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

Support Super-Vector Machines in Automatic Speech Emotion Recognition.
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

以多層感知器辨識情緒於國台客語料庫 (Use Multilayer Perceptron To Recognize Emotion in Mandarin, Taiwanese and Hakka Database) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

Integration of orthogonal feature detectors in parameter learning of artificial neural networks to improve robustness and the evaluation on hand-written digit recognition tasks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Recurrent neural network-based language models with variation in net topology, language, and granularity.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Verifying the long-range dependency of RNN language models.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

2014
Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

台灣情緒語料庫建置與辨識 (An Emotional Speech Database in Taiwan: Collection and Recognition) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014

Speech emotion recognition with cross-lingual databases.
Proceedings of the INTERSPEECH 2014, 2014

Natural speech synthesis based on hybrid approach with candidate expansion and verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
基於Sphinx 可快速個人化行動數字語音辨識系統 (Quickly Personalizable Mobile Digit Speech Recognition System Based on Sphinx) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

基於時域上基週同步疊加法之歌聲合成系統 (Singing Voice Synthesis System Based on Time Domain-Pitch Synchronized Overlap and Add) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Query-Document Relevance Topic Models.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2013

Yet another Gaussian mixture model-based feature compensation method for robust noisy-digit recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature space dimension reduction in speech emotion recognition using support vector machine.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Intrinsic Illumination Subspace for Lighting Insensitive Face Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Integrating Recognition and Retrieval With Relevance Feedback for Spoken Term Detection.
IEEE Trans. Speech Audio Process., 2012

Speaker-Dependent Model Interpolation for Statistical Emotional Speech Synthesis.
EURASIP J. Audio Speech Music. Process., 2012

Robust dialogue act detection based on partial sentence tree, derivation rule, and spectral clustering algorithm.
EURASIP J. Audio Speech Music. Process., 2012

應用串接方法於連續變化轉速之四行程引擎聲音合成 (Concatenation-based Method for the Synthesis of Engine Noise with Continuously Varying Speed) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

Cross-lingual frame selection method for polyglot speech synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Data-driven rescaled Teager energy cepstral coefficients for noise-robust speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Noise-robust speech feature processing with empirical mode decomposition.
EURASIP J. Audio Speech Music. Process., 2011

Real-time hand tracking on depth images.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Improved spoken term detection using support vector machines based on lattice context consistency.
Proceedings of the IEEE International Conference on Acoustics, 2011

Improved spoken term detection with graph-based re-ranking in feature space.
Proceedings of the IEEE International Conference on Acoustics, 2011

Semantic Information and Derivation Rules for Robust Dialogue Act Detection in a Spoken Dialogue System.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
A hidden Markov model-based approach for emotional speech synthesis.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

A framework integrating different relevance feedback scenarios and approaches for spoken term detection.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Transformational Breathing between Present and Past: Virtual Exhibition System of the Mao-Kung Ting.
Proceedings of the Advances in Multimedia Modeling, 2010

Auditory front-ends for noise-robust automatic speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Empirical mode decomposition for noise-robust automatic speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Improved spoken term detection by discriminative training of acoustic models based on user relevance feedback.
Proceedings of the INTERSPEECH 2010, 2010

Improved spoken term detection by feature space pseudo-relevance feedback.
Proceedings of the INTERSPEECH 2010, 2010

Turning Rust into Gold: An ancient artifact as an interactive artwork.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

MOMI-Cosegmentation: Simultaneous Segmentation of Multiple Objects among Multiple Images.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Noise-Robust Speech Features Based on Cepstral Time Coefficients.
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

A Framework for Machine Translation Output Combination.
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Noise-robust feature extraction based on forward masking.
Proceedings of the INTERSPEECH 2009, 2009

Speaker diarization using divide-and-conquer.
Proceedings of the INTERSPEECH 2009, 2009

Pixel-based correspondence and shape reconstruction for moving objects.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

2007
MVA Processing of Speech Features.
IEEE Trans. Speech Audio Process., 2007

2006
An Approach to Using the Web as a Live Corpus for Spoken Transliteration Name Access.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006

Automatic Learning of Context-Free Grammar.
Proceedings of the 18th Conference on Computational Linguistics and Speech Processing, 2006

Chinese input method based on reduced Mandarin phonetic alphabet.
Proceedings of the INTERSPEECH 2006, 2006

The 4-Source Photometric Stereo Under General Unknown Lighting.
Proceedings of the Computer Vision, 2006

2005
An Approach of Using the Web as a Live Corpus for Spoken Transliteration Name Access.
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005

Focused word segmentation for ASR.
Proceedings of the INTERSPEECH 2005, 2005

Lighting Normalization with Generic Intrinsic Illumination Subspace for Face Recognition.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Speech Feature Smoothing for Robust ASR.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Image set compression through minimal-cost prediction structures.
Proceedings of the 2004 International Conference on Image Processing, 2004

2002
Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Low-resource noise-robust feature post-processing on Aurora 2.0.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002


  Loading...