Kyogu Lee

Orcid: 0000-0002-4210-0312

According to our database1, Kyogu Lee authored at least 155 papers between 2004 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Reverse Engineering of Music Mixing Graphs with Differentiable Processors and Iterative Pruning.
CoRR, September, 2025

Differentiable Acoustic Radiance Transfer.
CoRR, September, 2025

Vo-Ve: An Explainable Voice-Vector for Speaker Identity Evaluation.
CoRR, June, 2025

Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ.
CoRR, June, 2025

Few-step Adversarial Schrödinger Bridge for Generative Speech Enhancement.
CoRR, June, 2025

A Real-Time Speech Enhancement Processor for Hearing Aids in 28-nm CMOS.
IEEE J. Solid State Circuits, May, 2025

MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction.
CoRR, May, 2025

The Effects of Musical Factors on the Perception of Auditory Illusions.
Top. Cogn. Sci., January, 2025

Understanding Audio-Text Retrieval Through Singular Value Decomposition.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

SynthRL: Cross-domain Synthesizer Sound Matching via Reinforcement Learning.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Speaking Without Sound: Multi-speaker Silent Speech Voicing with Facial Inputs Only.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Synthetic Dataset Generation for String Ensemble Separation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

DOSE: Drum One-Shot Extraction from Music Mixture.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Variable Bitrate Residual Vector Quantization for Audio Coding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Uncertainty-Aware Self-Training for CTC-Based Automatic Speech Recognition.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Incorporating real-world object into virtual reality: using mobile device input with augmented virtuality.
Multim. Tools Appl., May, 2024

Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control.
CoRR, 2024

Do Captioning Metrics Reflect Music Semantic Alignment?
CoRR, 2024

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression.
CoRR, 2024

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch.
CoRR, 2024

Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings.
CoRR, 2024

Wavespace: A Highly Explorable Wavetable Generator.
CoRR, 2024

Searching For Music Mixing Graphs: A Pruning Approach.
CoRR, 2024

Multidimensional Interpolants.
CoRR, 2024

Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling.
CoRR, 2024

Inverse Nonlinearity Compensation of Hyperelastic Deformation in Dielectric Elastomer for Acoustic Actuation.
CoRR, 2024

Inverse Nonlinearity Compensation of Dielectric Elastomers for Acoustic Actuation.
IEEE Access, 2024

Music-Driven Synchronous Dance Generation Considering K-Pop Musical and Choreographical Characteristics.
IEEE Access, 2024

Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hear Your Face: Face-based voice conversion with F0 estimation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper.
Proceedings of the IEEE International Conference on Acoustics, 2024

String Sound Synthesizer On Gpu-Accelerated Finite Difference Scheme.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations.
Proceedings of the IEEE International Conference on Acoustics, 2024

Music Auto-Tagging with Robust Music Representation Learned via Domain Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2024

Emosical: An Emotion-Annotated Musical Theatre Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Combinatorial music generation model with song structure graph analysis.
CoRR, 2023

Beat-Aligned Spectrogram-to-Sequence Generation of Rhythm-Game Charts.
CoRR, 2023

Yet Another Generative Model for Room Impulse Response Estimation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

AECSQI: Referenceless Acoustic Echo Cancellation Measures Using Speech Quality and Intelligibility Improvement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Music De-Limiter Networks Via Sample-Wise Gain Inversion.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Exploiting Time-Frequency Conformers for Music Audio Enhancement.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-Refining of Pseudo Labels for Music Source Separation With Noisy Labeled Data.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Towards a New Interface for Music Listening: A User Experience Study on YouTube.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Semi-supervised Learning for Continuous Emotional Intensity Controllable Speech Synthesis with Disentangled Representations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Blind Estimation of Audio Processing Graph.
Proceedings of the IEEE International Conference on Acoustics, 2023

Global HRTF Interpolation Via Learned Affine Transformation of Hyper-Conditioned Features.
Proceedings of the IEEE International Conference on Acoustics, 2023

Neural Fourier Shift for Binaural Speech Rendering.
Proceedings of the IEEE International Conference on Acoustics, 2023

Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2023

Show Me the Instruments: Musical Instrument Retrieval From Mixture Audio.
Proceedings of the IEEE International Conference on Acoustics, 2023

Medleyvox: An Evaluation Dataset for Multiple Singing Voices Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Pop2Piano : Pop Audio-Based Piano Cover Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Differentiable Artificial Reverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Continuous Emotional Intensity Controllable Speech Synthesis using Semi-supervised Learning.
CoRR, 2022

Improving Audio-Language Learning with MixGen and Multi-Level Test-Time Augmentation.
CoRR, 2022

Expressive Singing Synthesis Using Local Style Token and Dual-path Pitch Encoder.
CoRR, 2022

Translating Melody to Chord: Structured and Flexible Harmonization of Melody With Transformer.
IEEE Access, 2022

Exploiting Negative Preference in Content-based Music Recommendation with Contrastive Learning.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

Sketching the Expression: Flexible Rendering of Expressive Piano Performance with Self-Supervised Learning.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Towards robust music source separation on loud commercial music.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-To-End Music Remastering System Using Self-Supervised And Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations.
CoRR, 2021

Real-time Denoising and Dereverberation with Tiny Recurrent U-Net.
CoRR, 2021

Cross-Domain Semi-Supervised Audio Event Classification Using Contrastive Regularization.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Room Adaptive Conditioning Method for Sound Event Classification in Reverberant Environments.
Proceedings of the IEEE International Conference on Acoustics, 2021

Reverb Conversion Of Mixed Vocal Tracks Using An End-To-End Convolutional Deep Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

Real-Time Denoising and Dereverberation wtih Tiny Recurrent U-Net.
Proceedings of the IEEE International Conference on Acoustics, 2021

Neural Audio Fingerprint for High-Specific Audio Retrieval Based on Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net.
CoRR, 2020

Do Channels Matter? Illuminating Interpersonal Influence on Music Recommendations.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Exploring Aligned Lyrics-informed Singing Voice Separation.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Exploiting Multi-Modal Features from Pre-Trained Networks for Alzheimer's Dementia Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech.
Proceedings of the 8th International Conference on Learning Representations, 2020

Disentangling Timbre and Singing Style with Multi-Singer Singing Synthesis System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Digital Watermarking For Protecting Audio Classification Datasets.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Musical Pitch Affects Brightness Judgment of a Concurrent Visual Object.
Proceedings of the 42th Annual Meeting of the Cognitive Science Society, 2020

2019
Sequential Skip Prediction with Few-shot in Streamed Music Contents.
CoRR, 2019

Automatic Choreography Generation with Convolutional Encoder-decoder Network.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Audio Query-based Music Source Separation.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

VirtuosoNet: A Hierarchical RNN-based System for Modeling Expressive Piano Performance.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Adversarially Trained End-to-End Korean Singing Voice Synthesis System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Phase-Aware Speech Enhancement with Deep Complex U-Net.
Proceedings of the 7th International Conference on Learning Representations, 2019

Enhancing Music Features by Knowledge Transfer from User-item Log Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Dance motion generation by recombination of body parts from motion source.
Intell. Serv. Robotics, 2018

Listen to Dance: Music-driven choreography generation using Autoregressive Encoder-Decoder Network.
CoRR, 2018

Content-based feature exploration for transparent music recommendation using self-attentive genre classification.
CoRR, 2018

Separation of Instrument Sounds using Non-negative Matrix Factorization with Spectral Envelope Constraints.
CoRR, 2018

Music Source Separation Using Stacked Hourglass Networks.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Cover Song Identification Using Song-to-Song Cross-Similarity Matrix with Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Exploiting Continuity/Discontinuity of Basis Vectors in Spectrogram Decomposition for Harmonic-Percussive Sound Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Utilizing context-relevant keywords extracted from a large collection of user-generated documents for music discovery.
Inf. Process. Manag., 2017

Robust Singing Transcription System Using Local Homogeneity in the Harmonic Structure.
IEICE Trans. Inf. Syst., 2017

Audio Cover Song Identification using Convolutional Neural Network.
CoRR, 2017

Lyrics-to-Audio Alignment by Unsupervised Discovery of Repetitive Patterns in Vowel Acoustics.
IEEE Access, 2017

A Data-driven Approach to Identifying Music Listener Groups based on Users' Playrate Distributions of Listening Events.
Proceedings of the Adjunct Publication of the 25th Conference on User Modeling, 2017

Chord Generation from Symbolic Melody Using BLSTM Networks.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Cover Song Identification with Metric Learning Using Distance as a Feature.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Singing Voice Separation Using RPCA with Weighted l_1 -norm.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Ensemble of Convolutional Neural Networks for Weakly-supervised Sound Event Detection Using Multiple Scale Input.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Audio Event Detection Using Multiple-Input Convolutional Neural Network.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Application of precise indoor position tracking to immersive virtual reality with translational movement support.
Multim. Tools Appl., 2016

Detecting fingering of overblown flute sound using sparse feature learning.
EURASIP J. Audio Speech Music. Process., 2016

Acoustic scene classification using convolutional neural network and multiple-width frequency-delta data augmentation.
CoRR, 2016

Metrics for Electronic-Nursing-Record-Based Narratives: cross-sectional analysis.
Appl. Clin. Inform., 2016

WhichHand: automatic recognition of a smartphone's position in the hand using a smartwatch.
Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct, 2016

Learning Temporal Features Using a Deep Neural Network and its Application to Music Genre Classification.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

2015
Minimum Cost Data Aggregation for Wireless Sensor Networks Computing Functions of Sensed Data.
J. Sensors, 2015

Enhanced auditory feedback for Korean touch screen keyboards.
Int. J. Hum. Comput. Stud., 2015

Effects of Auditory Feedback on Menu Selection in Hand-Gesture Interfaces.
IEEE Multim., 2015

Escaping your comfort zone: A graph-based recommender system for finding novel recommendations among relevant items.
Expert Syst. Appl., 2015

Harmonic-Percussive Source Separation Using Harmonicity and Sparsity Constraints.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Informed source separation from monaural music with limited binary time-frequency annotation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Using Dynamically Promoted Experts for Music Recommendation.
IEEE Trans. Multim., 2014

Vocal Separation from Monaural Music Using Temporal/Spectral Continuity and Sparsity Constraints.
IEEE Signal Process. Lett., 2014

Application of non-negative spectrogram decomposition with sparsity constraints to single-channel speech enhancement.
Speech Commun., 2014

Fast Parallel Implementation for Random Network Coding on Embedded Sensor Nodes.
Int. J. Distributed Sens. Networks, 2014

Music recommendation using text analysis on song requests to radio stations.
Expert Syst. Appl., 2014

A Highly Parallelized Decoder for Random Network Coding leveraging GPGPU.
Comput. J., 2014

#nowplaying the future billboard: mining music listening behaviors of twitter users for hit song prediction.
Proceedings of the SoMeRA'14, 2014

Integration of a Precise Indoor Position Tracking Algorithm with an HMD-Based Virtual Reality System.
Proceedings of the 2nd ACM International Workshop on Immersive Media Experiences, 2014

Vocal separation using extended robust principal component analysis with Schatten p/lp-norm and scale compression.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014

Hierarchical Approach to Detect Common Mistakes of Beginner Flute Players.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Transcribing Frequency Modulated Musical Expressions from Polyphonic Music Using HMM Constrained Shift Invariant PLCA.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

A pairwise approach to simultaneous onset/offset detection for singing voice using correntropy.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Music similarity-based approach to generating dance motion sequence.
Multim. Tools Appl., 2013

Using Experts Among Users for Novel Movie Recommendations.
J. Comput. Sci. Eng., 2013

Acoustic scene classification using sparse feature learning and event-based pooling.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Sound Spray - can-shaped sound effect device.
Proceedings of the 13th International Conference on New Interfaces for Musical Expression, 2013

A Musical Performance Evaluation System for Beginner Musician based on Real-time Score Following.
Proceedings of the 13th International Conference on New Interfaces for Musical Expression, 2013

Recommending Music Based on Probabilistic Latent Semantic Analysis on Korean Radio Episodes.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Note onset detection based on harmonic cepstrum regularity.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Using Music Notation for Teaching Computer Programming.
Proceedings of the 21st International Conference on Computers in Education, 2013

2012
Voicon: An Interactive Gestural Microphone For Vocal Performance.
Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012

FutureGrab: A wearable subtractive synthesizer using hand gesture.
Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012

2011
Retrieval of the Extreme Values under Deadline Constraints in Wireless Sensor Networks.
Sensors, 2011

My head is your tail: applying link analysis on long-tailed music listening behavior for music recommendation.
Proceedings of the 2011 ACM Conference on Recommender Systems, 2011

SWAF: Towards a Web Application Framework for Composition and Documentation of Soundscape.
Proceedings of the 11th International Conference on New Interfaces for Musical Expression, 2011

Mood Classfication from Musical Audio Using User Group-Dependent Models.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

2010
A super-resolution spectrogram using coupled PLCA.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Towards a Class-Based Representation of Perceptual Tempo for Music Retrieval.
Proceedings of the International Conference on Machine Learning and Applications, 2009

2008
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio.
IEEE Trans. Speech Audio Process., 2008

Segmentation-Based Lyrics-Audio Alignment using Dynamic Programming.
Proceedings of the ISMIR 2008, 2008

2007
A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models.
Proceedings of the Adaptive Multimedial Retrieval: Retrieval, 2007

2006
Automatic Chord Recognition from Audio Using a HMM with Supervised Learning.
Proceedings of the ISMIR 2006, 2006

Automatic Chord Recognition from Audio Using Enhanced Pitch Class Profile.
Proceedings of the 2006 International Computer Music Conference, 2006

2005
Explicit onset modeling of sinusoids using time reassignment.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Implementation of a Highly Diffusing 2-D DigitalWaveguide Mesh with a Quadratic Residue Diffuser.
Proceedings of the 2004 International Computer Music Conference, 2004

Auditory Display of Hyperspectral Colon Tissue Images Using Vocal Synthesis Models.
Proceedings of the ICAD 2004: The 10th Meeting of the International Conference on Auditory Display, 2004


  Loading...