Kwanghee Choi

Orcid: 0000-0001-5254-1093

According to our database1, Kwanghee Choi authored at least 47 papers between 2007 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Prosodic ABX: A Language-Agnostic Method for Measuring Prosodic Contrast in Speech Representations.
CoRR, April, 2026

An Empirical Recipe for Universal Phone Recognition.
CoRR, March, 2026

Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson's Disease.
CoRR, March, 2026

Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces.
CoRR, March, 2026

[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic.
CoRR, February, 2026

The CMU-AIST submission for the ICME 2025 Audio Encoder Challenge.
CoRR, January, 2026

PRiSM: Benchmarking Phone Realization in Speech Models.
CoRR, January, 2026

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration.
CoRR, January, 2026

2025
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model.
CoRR, October, 2025

CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset.
CoRR, September, 2025

OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

On-device Streaming Discrete Speech Units.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Discrete Speech Unit Extraction via Independent Component Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2025

2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024

ESPnet-EZ: Python-Only ESPnet For Easy Fine-Tuning And Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Self-Supervised Speech Representations are More Phonetic than Semantic.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2024

Correcting Faulty Road Maps by Image Inpainting.
Proceedings of the IEEE International Conference on Acoustics, 2024

Understanding Probe Behaviors Through Variational Bounds of Mutual Information.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2024

Wav2Gloss: Generating Interlinear Glossed Text from Speech.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Unsupervised Speech Representation Pooling Using Vector Quantization.
CoRR, 2023

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset.
CoRR, 2023

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

TiDAL: Learning Training Dynamics for Active Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Automatic Severity Classification of Dysarthric Speech by Using Self-Supervised Model with Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Data-driven Approach for Automatically Correcting Faulty Road Maps.
CoRR, 2022

Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning.
CoRR, 2022

Opening the Black Box of wav2vec Feature Encoder.
CoRR, 2022

TiDAL: Learning Training Dynamics for Active Learning.
CoRR, 2022

Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil.
CoRR, 2022

Combating the instability of mutual information-based losses via regularization.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Distilling a Pretrained Language Model to a Multilingual ASR Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Temporal Knowledge Distillation for on-device Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Pretraining Neural Architecture Search Controllers with Locality-based Self-Supervised Learning.
CoRR, 2021

Disentangling Label Distribution for Long-Tailed Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Regularized Mutual Information Neural Estimation.
CoRR, 2020

2019
EVCMR: A tool for the quantitative evaluation and visualization of cardiac MRI data.
Comput. Biol. Medicine, 2019

2015
A novel and scalable communication-history-based Knapsack authentication framework for IEEE 802.11 networks.
Proceedings of the 2015 IEEE Conference on Communications and Network Security, 2015

2014
Securing smart home: Technologies, security challenges, and security requirements.
Proceedings of the IEEE Conference on Communications and Network Security, 2014

2007
Object Based Contour Detection by Using Graph-cut on Stereo Image.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), 2007


  Loading...