Minchan Kim

Orcid: 0000-0002-8150-765X

According to our database1, Minchan Kim authored at least 21 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Efficient Parallel Audio Generation Using Group Masked Language Modeling.
IEEE Signal Process. Lett., 2024

Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech.
IEEE Signal Process. Lett., 2024

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models.
CoRR, 2024

Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction.
CoRR, 2024

2023
Pre-and Post-Contact Policy Decomposition for Non-Prehensile Manipulation with Zero-Shot Sim-To-Real Transfer.
IROS, 2023

EM-Network: Oracle Guided Self-distillation for Sequence Learning.
Proceedings of the International Conference on Machine Learning, 2023

Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison.
Proceedings of the IEEE International Conference on Acoustics, 2023

Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
DNN-Based Approach to Mitigate Multipath Errors of Differential GNSS Reference Stations.
IEEE Trans. Intell. Transp. Syst., 2022

A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization.
IEEE Signal Process. Lett., 2022

Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech.
CoRR, 2022

Disentangled Speaker Representation Learning via Mutual Information Minimization.
CoRR, 2022

Fully Unsupervised Training of Few-Shot Keyword Spotting.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus.
Proceedings of the Interspeech 2022, 2022

Hologram clinical system for standardization of brain computer interface.
Proceedings of the 13th International Conference on Information and Communication Technology Convergence, 2022

2021
Expressive Text-to-Speech Using Style Tag.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2017
Optimized GNSS Station Selection to Support Long-Term Monitoring of Ionospheric Anomalies for Aircraft Landing Systems.
IEEE Trans. Aerosp. Electron. Syst., 2017

Feasibility Studies for Applications of Long-Term Ionospheric Anomaly Monitor.
IEEE Trans. Aerosp. Electron. Syst., 2017

2016
GCMA: guaranteed contiguous memory allocator.
SIGBED Rev., 2016

2014
A Comprehensive Method for GNSS Data Quality Determination to Improve Ionospheric Data Analysis.
Sensors, 2014


  Loading...