Minje Kim

Tae-Kyun Kim

CoRR, September, 2025

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine.

CoRR, July, 2025

Discrete Audio Tokens: More Than a Survey!

Trans. Mach. Learn. Res., 2025

TGIF: Talker Group-Informed Familiarization of Target Speaker Extraction.

Tsun-An Hsieh

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

DTA: Dual Temporal-channel-wise Attention for Spiking Neural Networks.

Minjun Kim

Xu Yang

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024

IoT Edge-Cloud: An Internet-of-Things Edge-Empowered Cloud System for Device Management in Smart Spaces.

IEEE Netw., May, 2024

Genhancer: High-Fidelity Speech Enhancement via Generative Modeling on Discrete Codec Tokens.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation.

Tsun-An Hsieh

Heeyoul Choi

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Generative De-Quantization for Neural Speech Codec Via Latent Diffusion.

Haici Yang

Inseon Jang

Proceedings of the IEEE International Conference on Acoustics, 2024

Hyperbolic Distance-Based Speech Separation.

Proceedings of the IEEE International Conference on Acoustics, 2024

Scalable and Efficient Speech Enhancement Using Modified Cold Diffusion: A Residual Learning Approach.

Trausti T. Kristjansson

Proceedings of the IEEE International Conference on Acoustics, 2024

Personalized Neural Speech Codec.

Proceedings of the IEEE International Conference on Acoustics, 2024

A Comparative Analysis of Poetry Reading Audio: Singing, Narrating, or Somewhere in Between?

Kahyun Choi

Proceedings of the IEEE International Conference on Acoustics, 2024

Dense Hand-Object (HO) GraspNet with Full Grasping Taxonomy and Dynamics.

Proceedings of the Computer Vision - ECCV 2024, 2024

BiTT: Bi-Directional Texture Reconstruction of Interacting Two Hands from a Single Image.

Tae-Kyun Kim

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Native Multi-Band Audio Coding within Hyper-Autoencoded Reconstruction Propagation Networks.

Inseon Jang

CoRR, 2023

2022

The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement.

Anastasia Kuznetsova

CoRR, 2022

Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding.

Haici Yang

Wootaek Lim

CoRR, 2022

Stochastic gradient descent-based support vector machines training optimization on Big Data and HPC frameworks.

Pulasthi Wickramasinghe

Concurr. Comput. Pract. Exp., 2022

Deep Adaptive AEC: Hybrid of Deep Learning and Adaptive Acoustic Echo Cancellation.

Trausti T. Kristjansson

Proceedings of the IEEE International Conference on Acoustics, 2022

Upmixing Via Style Transfer: A Variational Autoencoder for Disentangling Spatial Images And Musical Content.

Proceedings of the IEEE International Conference on Acoustics, 2022

Don't Separate, Learn To Remix: End-To-End Neural Remixing With Joint Optimization.

Proceedings of the IEEE International Conference on Acoustics, 2022

SpaIn-Net: Spatially-Informed Stereophonic Music Source Separation.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Neural Remixer: Learning to Remix Music with Interactive Control.

CoRR, 2021

Self-Supervised Learning for Personalized Speech Enhancement.

CoRR, 2021

Scalable and Efficient Neural Speech Coding.

CoRR, 2021

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding.

CoRR, 2021

Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model Selection.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding.

Seungkwon Beack

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Source-Aware Neural Speech Coding for Noisy Speech Compression.

Proceedings of the IEEE International Conference on Acoustics, 2021

Singular Value Decomposition for Compression of Large-Scale Radio Frequency Signals.

R. David Badger

Proceedings of the 29th European Signal Processing Conference, 2021

An Open-Sourced Time-Frequency Domain RF Classification Framework.

R. David Badger

Kristopher H. Jung

Proceedings of the 29th European Signal Processing Conference, 2021

2020

Sparse Mixture of Local Experts for Efficient Speech Enhancement.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

AutoQ: Automated Kernel-Wise Neural Network Quantization.

Proceedings of the 8th International Conference on Learning Representations, 2020

Efficient and Scalable Neural Residual Waveform Coding with Collaborative Quantization.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Dual-Staged Context Aggregation Method towards Efficient End-to-End Speech Enhancement.

Kai Zhen

Mi Suk Lee

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Autotuner: A Pitch Correcting Network for Singing Performances.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Efficient Context Aggregation for End-to-End Speech Enhancement Using a Densely Connected Convolutional and Recurrent Network.

Kai Zhen

Mi Suk Lee

CoRR, 2019

AutoQB: AutoML for Network Quantization and Binarization on Mobile Devices.

CoRR, 2019

Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances.

CoRR, 2019

Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Cascaded Cross-Module Residual Learning Towards Lightweight End-to-End Speech Coding.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Intonation: A Dataset of Quality Vocal Performances Refined by Spectral Clustering on Pitch Congruence.

Proceedings of the IEEE International Conference on Acoustics, 2019

Performance Optimization on Model Synchronization in Parallel Stochastic Gradient Descent Based SVM.

Vibhatha Lakmal Abeykoon

Geoffrey Charles Fox

Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018

A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music.

CoRR, 2018

On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising.

CoRR, 2018

DeepPicar: A Low-Cost Deep Neural Network-Based Autonomous Car.

Michael Garrett Bechtel

Elise McEllhiney

Heechul Yun

Proceedings of the 24th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2018

Bitwise Source Separation on Hashed Spectra: An Efficient Posterior Estimation Scheme Using Partial Rank Order Metrics.

Lijiang Guo

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Collaborative Speech Dereverberation: Regularized Tensor Factorization for Crowdsourced Multi-Channel Recordings.

Sanna Wager

Proceedings of the 26th European Signal Processing Conference, 2018

Creative leaps in musical ecosystems: early warning signals of critical transitions in professional jazz.

Matthew Setzler

Tyler Marghetis

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017

On Exploiting Structured Human Interactions to Enhance Sensing Accuracy in Cyber-physical Systems.

ACM Trans. Cyber Phys. Syst., 2017

XNOR-POP: A processing-in-memory architecture for binary Convolutional Neural Networks in Wide-IO2 DRAMs.

Proceedings of the 2017 IEEE/ACM International Symposium on Low Power Electronics and Design, 2017

Towards expressive instrument synthesis through smooth frame-by-frame reconstruction: From string to woodwind.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Collaborative Deep Learning for speech enhancement: A run-time model selection method using autoencoders.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015

Exploiting structured human interactions to enhance estimation accuracy in cyber-physical systems.

Proceedings of the ACM/IEEE Sixth International Conference on Cyber-Physical Systems, 2015

2011

Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation.

IEEE J. Sel. Top. Signal Process., 2011

2010

Nonnegative matrix partial co-factorization for drum source separation.

Proceedings of the IEEE International Conference on Acoustics, 2010

Blind rhythmic source separation: Nonnegativity and repeatability.

Proceedings of the IEEE International Conference on Acoustics, 2010

2006

Monaural Music Source Separation: Nonnegativity, Sparseness, and Shift-Invariance.

Seungjin Choi

Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

2005

On Spectral Basis Selection for Single Channel Polyphonic Music Separation.
