Juhan Nam

ORCID: 0000-0003-2664-2119

Affiliations:
  • KAIST, Music and Audio Computing Lab, Republic of Korea


According to our database, Juhan Nam authored at least 88 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting.
CoRR, 2024

T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis.
CoRR, 2024

A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance.
CoRR, 2024

DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech.
CoRR, 2024

2023
Editorial for TISMIR Special Collection: Cultural Diversity in MIR Research.
Trans. Int. Soc. Music. Inf. Retr., January, 2023

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation.
CoRR, 2023

VoiceLDM: Text-to-Speech with Environmental Context.
CoRR, 2023

K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling.
CoRR, 2023

A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription.
CoRR, 2023

Music Playlist Title Generation Using Artist Information.
CoRR, 2023

All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Motion to Dance Music Generation using Latent Diffusion Model.
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

A Computational Evaluation Framework for Singable Lyric Translation.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

LP-MusicCaps: LLM-Based Pseudo Music Captioning.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Sense of Convergence: Exploring the Artistic Potential of Cross-modal Sensory Transfer in Virtual Reality.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2023

A Phoneme-Informed Neural Network Model For Note-Level Singing Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Study of Audio Mixing Methods for Piano Transcription in Violin-Piano Ensembles.
Proceedings of the IEEE International Conference on Acoustics, 2023

Toward Universal Text-To-Music Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Textless Speech-to-Music Retrieval Using Emotion Similarity.
Proceedings of the IEEE International Conference on Acoustics, 2023

PrimaDNN': A Characteristics-Aware DNN Customization for Singing Technique Detection.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Deep Learning and Knowledge Integration for Music Audio Analysis (Dagstuhl Seminar 22082).
Dagstuhl Reports, 2022

Neural Vocoder Feature Estimation for Dry Singing Voice Separation.
CoRR, 2022

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words.
CoRR, 2022

Analysis and detection of singing techniques in repertoires of J-POP solo singers.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification.
Proceedings of the Interspeech 2022, 2022

Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Melody-Unsupervision Model for Singing Voice Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

Seung-ee and Kkaebi: A VR-Mobile Cross Platform Game based on Co-Presence for a Balanced Immersive Experience.
Proceedings of the Extended Abstracts of the Annual Symposium on Computer-Human Interaction in Play, 2022

The Melody of the Mysterious Stones: A VR Mindfulness Game Using Sound Spatialization.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Classy Trash Monster: An Educational Game for Teaching Machine Learning to Non-major Students.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
Music Playlist Title Generation: A Machine-Translation Approach.
CoRR, 2021

PocketVAE: A Two-step Model for Groove Generation and Control.
CoRR, 2021

Reverse-Engineering The Transition Regions of Real-World DJ Mixes using Sub-band Analysis with Convex Optimization.
Proceedings of the 21st International Conference on New Interfaces for Musical Expression, 2021

Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Semantic Tagging of Singing Voices in Popular Music Recordings.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Novelty and influence of creative works, and quantifying patterns of advances based on probabilistic references networks.
EPJ Data Sci., 2020

Semi-supervised learning using teacher-student models for vocal melody extraction.
CoRR, 2020

Musical Word Embedding: Bridging the Gap between Listening Contexts and Music.
CoRR, 2020

Metric learning vs classification for disentangled music representation learning.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Polyphonic Piano Transcription Using Autoregressive Multi-State Note Model.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Semi-supervised learning using teacher-student models for vocal melody extraction.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Disentangled Multidimensional Metric Learning for Music Similarity.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Korean Singing Voice Synthesis Based on Auto-Regressive Boundary Equilibrium GAN.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Deep Learning for Audio-Based Music Classification and Tagging: Teaching Computers to Distinguish Rock from Bach.
IEEE Signal Process. Mag., 2019

Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing.
IEEE J. Sel. Top. Signal Process., 2019

Comparison and Analysis of SampleCNN Architectures for Audio Classification.
IEEE J. Sel. Top. Signal Process., 2019

Temporal Feedback Convolutional Recurrent Neural Networks for Keyword Spotting.
CoRR, 2019

Representation Learning of Music Using Artist, Album, and Track Information.
CoRR, 2019

Zero-shot Learning and Knowledge Transfer in Music Classification and Tagging.
CoRR, 2019

Quantifying Novelty and Influence, and the Patterns of Paradigm Shifts.
CoRR, 2019

A Cross-Scape Plot Representation for Visualizing Symbolic Melodic Similarity.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

VirtuosoNet: A Hierarchical RNN-based System for Modeling Expressive Piano Performance.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Zero-shot Learning for Audio-based Music Classification and Tagging.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Graph Neural Network for Music Score Data and Modeling Expressive Piano Performance.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
A Hybrid of Deep Audio Feature and i-vector for Artist Recognition.
CoRR, 2018

Deep Content-User Embedding Model for Music Recommendation.
CoRR, 2018

Representation Learning of Music Using Artist Labels.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Revisiting Singing Voice Detection: A quantitative review and the future outlook.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

A Timbre-based Approach to Estimate Key Velocity from Polyphonic Piano Recordings.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Singing Expression Transfer from One Voice to Another for a Given Song.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sample-Level CNN Architectures for Music Auto-Tagging Using Raw Waveforms.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Multi-Level and Multi-Scale Feature Aggregation Using Pretrained Convolutional Neural Networks for Music Auto-Tagging.
IEEE Signal Process. Lett., 2017

Raw Waveform-based Audio Classification Using Sample-level CNN Architectures.
CoRR, 2017

Audio-to-score alignment of piano music using RNN-based automatic music transcription.
CoRR, 2017

Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms.
CoRR, 2017

Multi-Level and Multi-Scale Feature Aggregation Using Sample-level Deep Convolutional Neural Networks for Music Classification.
CoRR, 2017

Multi-Level and Multi-Scale Feature Aggregation Using Pre-trained Convolutional Neural Networks for Music Auto-tagging.
CoRR, 2017

ForceClicks: Enabling Efficient Button Interaction with Single Finger Touch.
Proceedings of the Tenth International Conference on Tangible, 2017

Note Intensity Estimation of Piano Recordings by Score-Informed NMF.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Combining Multi-Scale Features Using Sample-Level Deep Convolutional Neural Networks for Weakly Supervised Sound Event Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Use the Force: Incorporating Touch Force Sensors into Mobile Music Interaction.
Proceedings of the Music Technology with Swing - 13th International Symposium, 2017

2016
Melody Extraction on Vocal Segments Using Multi-Column Deep Neural Networks.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

2015
Augmenting Room Acoustics and System Interaction for Intentional Control of Audio Feedback.
Proceedings of the Looking Back, 2015

Toward Certain Sonic Properties of an Audio Feedback System by Evolutionary Control of Second-Order Structures.
Proceedings of the Evolutionary and Biologically Inspired Music, Sound, Art and Design, 2015

2013
Acoustic scene classification using sparse feature learning and event-based pooling.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

2012
Optimized Polynomial Spline Basis Function Design for Quasi-Bandlimited Classical Waveform Synthesis.
IEEE Signal Process. Lett., 2012

Learning Sparse Feature Representations for Music Annotation and Retrieval.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Sound Recognition in Mixtures.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

2011
A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Multimodal Deep Learning.
Proceedings of the 28th International Conference on Machine Learning, 2011

2010
Alias-Suppressed Oscillators Based on Differentiated Polynomial Waveforms.
IEEE Trans. Speech Audio Process., 2010

Efficient Antialiasing Oscillator Algorithms Using Low-Order Fractional Delay Filters.
IEEE Trans. Speech Audio Process., 2010

A super-resolution spectrogram using coupled PLCA.
Proceedings of the INTERSPEECH 2010, 2010

