Kuan-Yu Chen

Orcid: 0000-0001-9656-7551

Affiliations:
  • National Taiwan University of Science and Technology, Computer Science and Information Engineering Department, Taipei, Taiwan
  • National Taiwan University, Department of Computer Science and Information Engineering, Taipei, Taiwan (PhD 2015)


According to our database, Kuan-Yu Chen authored at least 102 papers between 2005 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations.
CoRR, May, 2025

An Attention-Based Framework With Multistation Information for Earthquake Early Warnings.
IEEE Trans. Geosci. Remote. Sens., 2025

RATE: A Retrieval-Augmented Transformer for Regional Earthquake Early Warning.
IEEE Geosci. Remote. Sens. Lett., 2025

2023
HypR: A comprehensive study for ASR hypothesis revising with a reference corpus.
CoRR, 2023

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

2022
An Attention-Based Hypocenter Estimator for Earthquake Localization.
IEEE Trans. Geosci. Remote. Sens., 2022

Non-Autoregressive ASR Modeling Using Pre-Trained Language Models for Chinese Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Toward Zero-Shot and Zero-Resource Multilingual Question Answering.
IEEE Access, 2022

A Context-Aware Knowledge Transferring Strategy for CTC-Based ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Chinese Movie Dialogue Question Answering Dataset.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

2021
Audio-Aware Spoken Multiple-Choice Question Answering With Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021.
CoRR, 2021

Non-autoregressive Transformer-based End-to-end ASR using BERT.
CoRR, 2021

ntust-nlp-2 at ROCLING-2021 Shared Task: BERT-based semantic analyzer with word-level information.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

A Flexible and Extensible Framework for Multiple Answer Modes Question Answering.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

Speech Recognition by Simply Fine-Tuning BERT.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Intelligent Real-Time Earthquake Detection by Recurrent Neural Networks.
IEEE Trans. Geosci. Remote. Sens., 2020

Enhanced Language Modeling with Proximity and Sentence Relatedness Information for Extractive Broadcast News Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

A Preliminary Study on Leveraging Meta Learning Technique for Code-switching Speech Recognition.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

An Audio-Enriched BERT-Based Framework for Spoken Multiple-Choice Question Answering.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Spoken Multiple-Choice Question Answering Using Multi-turn Audio-extracter BERT.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
GALs: 基於對抗式學習之整列式摘要法 (GALs: A GAN-based Listwise Summarizer).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
An Information Distillation Framework for Extractive Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Essence Vector-Based Query Modeling for Spoken Document Retrieval.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
A Position-Aware Language Modeling Framework for Extractive Broadcast News Speech Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017

On the Use of Neural Network Modeling Techniques for Spoken Document Retrieval.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

An Empirical Comparison of Contemporary Unsupervised Approaches for Extractive Speech Summarization.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

使用查詢意向探索與類神經網路於語音文件檢索之研究 (Exploring Query Intent and Neural Network modeling Techniques for Spoken Document Retrieval) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Discriminative Autoencoders for Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Leveraging manifold learning for extractive broadcast news summarization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A locality-preserving essence vector modeling framework for spoken document retrieval.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Neural relevance-aware query modeling for spoken document retrieval.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Exploring the use of unsupervised query modeling techniques for speech recognition and summarization.
Speech Commun., 2016

Leveraging Multi-Task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016

Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016

Extractive speech summarization leveraging convolutional neural network techniques.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

融合多任務學習類神經網路聲學模型訓練於會議語音辨識之研究 (Leveraging Multi-task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

運用序列到序列生成架構於重寫式自動摘要 (Exploiting Sequence-to-Sequence Generation Framework for Automatic Abstractive Summarization) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Exploring Word Mover's Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improved spoken document summarization with coverage modeling techniques.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Learning to Distill: The Essence Vector Modeling Framework.
Proceedings of the COLING 2016, 2016

Exploiting graph regularized nonnegative matrix factorization for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

A novel paragraph embedding method for spoken document summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Combining Relevance Language Modeling and Clarity Measure for Extractive Speech Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Extractive Broadcast News Summarization Leveraging Recurrent Neural Network Language Modeling Techniques.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

A Probabilistic Framework for Chinese Spelling Check.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2015

Extractive Spoken Document Summarization with Representation Learning Techniques.
Int. J. Comput. Linguistics Chin. Lang. Process., 2015

Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2015

表示法學習技術於節錄式語音文件摘要之研究 (A Study on Representation Learning Techniques for Extractive Spoken Document Summarization) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

可讀性預測於中小學國語文教科書及優良課外讀物之研究 (A Study of Readability Prediction on Elementary and Secondary Chinese Textbooks and Excellent Extracurricular Reading Materials) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

使用詞向量表示與概念資訊於中文大詞彙連續語音辨識之語言模型調適 (Exploring Word Embedding and Concept Information for Language Model Adaptation in Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

調變頻譜分解之改良於強健性語音辨識 (Several Refinements of Modulation Spectrum Factorization for Robust Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

Positional language modeling for extractive broadcast news speech summarization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Leveraging word embeddings for spoken document summarization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

I-vector based language modeling for query representation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Incorporating paragraph embeddings and density peaks clustering for spoken document summarization.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Incorporating proximity information in relevance language modeling for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
One-dimensional approximate point set pattern matching with L_p-norm.
Theor. Comput. Sci., 2014

Leveraging topical and positional cues for language modeling in speech recognition.
Multim. Tools Appl., 2014

Enhancing Query Formulation for Spoken Document Retrieval.
J. Inf. Sci. Eng., 2014

A conservative scheme for solving coupled surface-bulk convection-diffusion equations with an application to interfacial flows with soluble surfactant.
J. Comput. Phys., 2014

探究新穎語句模型化技術於節錄式語音摘要 (Investigating Novel Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014

Enhanced language modeling for extractive speech summarization with sentence relatedness information.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A recurrent neural network language modeling framework for extractive speech summarization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Effective pseudo-relevance feedback for language modeling in extractive speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2014

I-vector based language modeling for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2014

Leveraging Effective Query Modeling Techniques for Speech Recognition and Summarization.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

A margin-based discriminative modeling approach for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Leveraging relevance cues for language modeling in speech recognition.
Inf. Process. Manag., 2013

A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings.
Algorithmica, 2013

改良語句模型技術於節錄式語音摘要之研究 (Improved Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Incorporating proximity information for relevance language modeling in speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Semantic Naïve Bayes Classifier for Document Classification.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Sentence modeling for extractive speech summarization.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Weighted matrix factorization for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

Effective pseudo-relevance feedback for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

Effective pseudo-relevance feedback for language modeling in speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

A Study of Language Modeling for Chinese Spelling Check.
Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, 2013

2012
Efficient retrieval of approximate palindromes in a run-length encoded string.
Theor. Comput. Sci., 2012

Spoken Document Retrieval With Unsupervised Query Modeling Techniques.
IEEE Trans. Speech Audio Process., 2012

Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques.
IEICE Trans. Inf. Syst., 2012

Word Relevance Modeling for Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Finding all sorting tandem duplication random loss operations.
J. Discrete Algorithms, 2011

A note on pressure accuracy in immersed boundary method for Stokes flow.
J. Comput. Phys., 2011

Approximate Point Set Pattern Matching with L_p-Norm.
Proceedings of the String Processing and Information Retrieval, 2011

實證探究多種鑑別式語言模型於語音辨識之研究 (Empirical Comparisons of Various Discriminative Language Models for Speech Recognition) [In Chinese].
Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing, 2011

Leveraging Relevance Cues for Improved Spoken Document Retrieval.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Relevance language modeling for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Query modeling for spoken document retrieval.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Hardness of comparing two run-length encoded strings.
J. Complex., 2010

Finding All Approximate Gapped Palindromes.
Int. J. Found. Comput. Sci., 2010

Identifying Approximate Palindromes in Run-Length Encoded Strings.
Proceedings of the Algorithms and Computation - 21st International Symposium, 2010

Latent topic modeling of word vicinity information for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
主題語言模型於大詞彙連續語音辨識之研究 (On the Use of Topic Models for Large-Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Approximate Matching for Run-Length Encoded Strings Is 3sum-Hard.
Proceedings of the Combinatorial Pattern Matching, 20th Annual Symposium, 2009

2007
On the range maximum-sum segment query problem.
Discret. Appl. Math., 2007

2006
Improved algorithms for the k maximum-sums problems.
Theor. Comput. Sci., 2006

2005
Optimal algorithms for locating the longest and shortest segments satisfying a sum or an average constraint.
Inf. Process. Lett., 2005

