Wei-Hsiang Liao

Affiliations:
  • Sony Research Inc., sec.2, Music Foundation Model Team, Japan
  • Pierre and Marie Curie University, Paris, France (PhD 2015)
  • National Cheng Kung University, Tainan, Taiwan (former)


According to our database, Wei-Hsiang Liao authored at least 45 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution.
CoRR, July, 2025

Fx-Encoder++: Extracting Instrument-Wise Audio Effects Representations from Mixtures.
CoRR, July, 2025

Large-Scale Training Data Attribution for Music Generative Models via Unlearning.
CoRR, June, 2025

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors.
CoRR, June, 2025

Can Large Language Models Predict Audio Effects Parameters from Natural Language?
CoRR, May, 2025

A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?
CoRR, May, 2025

Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior.
CoRR, May, 2025

DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions.
CoRR, April, 2025

SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music Editing.
CoRR, April, 2025

Cross-Modal Learning for Music-to-Music-Video Description Generation.
CoRR, March, 2025

Music Foundation Model as Generic Booster for Music Downstream Tasks.
Trans. Mach. Learn. Res., 2025

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Variable Bitrate Residual Vector Quantization for Audio Coding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
The Sound Demixing Challenge 2023 - Music Demixing Track.
Trans. Int. Soc. Music. Inf. Retr., January, 2024

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
Trans. Mach. Learn. Res., 2024

OpenMU: Your Swiss Army Knife for Music Understanding.
CoRR, 2024

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression.
CoRR, 2024

Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning.
CoRR, 2024

LOCKEY: A Novel Approach to Model Authentication and Deepfake Tracking.
CoRR, 2024

Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer.
CoRR, 2024

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation.
CoRR, 2024

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch.
CoRR, 2024

Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data.
CoRR, 2024

Searching For Music Mixing Graphs: A Pruning Approach.
CoRR, 2024

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning.
CoRR, 2024

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage.
CoRR, 2024

PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Towards Assessing Data Replication in Music Generation With Music Similarity Metrics on Raw Audio.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

SilentCipher: Deep Audio Watermarking.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Manifold Preserving Guided Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2024

Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2024

On the Language Encoder of Contrastive Cross-modal Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Automatic Piano Transcription With Hierarchical Frequency-Time Transformer.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Preventing oversmoothing in VAE via generalized variance parameterization.
Neurocomputing, 2022

Automatic music mixing with deep learning and out-of-domain data.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022

Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Preventing Posterior Collapse Induced by Oversmoothing in Gaussian VAE.
CoRR, 2021

2015
Modelling and transformation of sound textures and environmental sounds. (Modélisation et transformation de textures sonores et des sons environnementaux).
PhD thesis, 2015

2010
A SOT based digital audio coder using reference frame ordering method.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

