Minje Kim

This is a disambiguation page: it contains multiple papers by persons with the same or a similar name.

Bibliography

2025
Low-Resource Audio Codec (LRAC): 2025 Challenge Description.
CoRR, October, 2025

SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit 3D Meshes.
CoRR, September, 2025

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine.
CoRR, July, 2025

Discrete Audio Tokens: More Than a Survey!
Trans. Mach. Learn. Res., 2025

TGIF: Talker Group-Informed Familiarization of Target Speaker Extraction.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

DTA: Dual Temporal-channel-wise Attention for Spiking Neural Networks.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024
IoT Edge-Cloud: An Internet-of-Things Edge-Empowered Cloud System for Device Management in Smart Spaces.
IEEE Netw., May, 2024

Genhancer: High-Fidelity Speech Enhancement via Generative Modeling on Discrete Codec Tokens.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Generative De-Quantization for Neural Speech Codec Via Latent Diffusion.
Proceedings of the IEEE International Conference on Acoustics, 2024

Hyperbolic Distance-Based Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Scalable and Efficient Speech Enhancement Using Modified Cold Diffusion: A Residual Learning Approach.
Proceedings of the IEEE International Conference on Acoustics, 2024

Personalized Neural Speech Codec.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Comparative Analysis of Poetry Reading Audio: Singing, Narrating, or Somewhere in Between?
Proceedings of the IEEE International Conference on Acoustics, 2024

Dense Hand-Object (HO) GraspNet with Full Grasping Taxonomy and Dynamics.
Proceedings of the Computer Vision - ECCV 2024, 2024

BiTT: Bi-Directional Texture Reconstruction of Interacting Two Hands from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks.
CoRR, 2023

2022
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement.
CoRR, 2022

Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding.
CoRR, 2022

Stochastic gradient descent-based support vector machines training optimization on Big Data and HPC frameworks.
Concurr. Comput. Pract. Exp., 2022

Deep Adaptive AEC: Hybrid of Deep Learning and Adaptive Acoustic Echo Cancellation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Upmixing Via Style Transfer: A Variational Autoencoder for Disentangling Spatial Images And Musical Content.
Proceedings of the IEEE International Conference on Acoustics, 2022

Don't Separate, Learn To Remix: End-To-End Neural Remixing With Joint Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spain-Net: Spatially-Informed Stereophonic Music Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Neural Remixer: Learning to Remix Music with Interactive Control.
CoRR, 2021

Self-Supervised Learning for Personalized Speech Enhancement.
CoRR, 2021

Scalable and Efficient Neural Speech Coding.
CoRR, 2021

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding.
CoRR, 2021

Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model Selection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Source-Aware Neural Speech Coding for Noisy Speech Compression.
Proceedings of the IEEE International Conference on Acoustics, 2021

Singular Value Decomposition for Compression of Large-Scale Radio Frequency Signals.
Proceedings of the 29th European Signal Processing Conference, 2021

An Open-Sourced Time-Frequency Domain RF Classification Framework.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

AutoQ: Automated Kernel-Wise Neural Network Quantization.
Proceedings of the 8th International Conference on Learning Representations, 2020

Efficient and Scalable Neural Residual Waveform Coding with Collaborative Quantization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Dual-Staged Context Aggregation Method towards Efficient End-to-End Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Autotuner: A Pitch Correcting Network for Singing Performances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Efficient Context Aggregation for End-to-End Speech Enhancement Using a Densely Connected Convolutional and Recurrent Network.
CoRR, 2019

AutoQB: AutoML for Network Quantization and Binarization on Mobile Devices.
CoRR, 2019

Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances.
CoRR, 2019

Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Cascaded Cross-Module Residual Learning Towards Lightweight End-to-End Speech Coding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Intonation: A Dataset of Quality Vocal Performances Refined by Spectral Clustering on Pitch Congruence.
Proceedings of the IEEE International Conference on Acoustics, 2019

Performance Optimization on Model Synchronization in Parallel Stochastic Gradient Descent Based SVM.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music.
CoRR, 2018

On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising.
CoRR, 2018

DeepPicar: A Low-Cost Deep Neural Network-Based Autonomous Car.
Proceedings of the 24th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2018

Bitwise Source Separation on Hashed Spectra: An Efficient Posterior Estimation Scheme Using Partial Rank Order Metrics.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Collaborative Speech Dereverberation: Regularized Tensor Factorization for Crowdsourced Multi-Channel Recordings.
Proceedings of the 26th European Signal Processing Conference, 2018

Creative leaps in musical ecosystems: early warning signals of critical transitions in professional jazz.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017
On Exploiting Structured Human Interactions to Enhance Sensing Accuracy in Cyber-physical Systems.
ACM Trans. Cyber Phys. Syst., 2017

XNOR-POP: A processing-in-memory architecture for binary Convolutional Neural Networks in Wide-IO2 DRAMs.
Proceedings of the 2017 IEEE/ACM International Symposium on Low Power Electronics and Design, 2017

Towards expressive instrument synthesis through smooth frame-by-frame reconstruction: From string to woodwind.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Collaborative Deep Learning for speech enhancement: A run-time model selection method using autoencoders.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015
Exploiting structured human interactions to enhance estimation accuracy in cyber-physical systems.
Proceedings of the ACM/IEEE Sixth International Conference on Cyber-Physical Systems, 2015

2011
Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation.
IEEE J. Sel. Top. Signal Process., 2011

2010
Nonnegative matrix partial co-factorization for drum source separation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Blind rhythmic source separation: Nonnegativity and repeatability.
Proceedings of the IEEE International Conference on Acoustics, 2010

2006
Monaural Music Source Separation: Nonnegativity, Sparseness, and Shift-Invariance.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

2005
On Spectral Basis Selection for Single Channel Polyphonic Music Separation.
Proceedings of the Artificial Neural Networks: Formal Models and Their Applications, 2005

