Hao-Wen Dong

Orcid: 0000-0002-5765-7594

According to our database, Hao-Wen Dong authored at least 30 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections.
CoRR, June, 2025

REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing.
CoRR, May, 2025

Deriving Representative Structure from Music Corpora.
CoRR, February, 2025

TeaserGen: Generating Teasers for Long Documentaries.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FUTGA-MIR: Enhancing Fine-grained and Temporally-aware Music Understanding with Music Information Retrieval.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Generative AI for Music and Audio.
PhD thesis, 2024

Broadband topology optimization of three-dimensional structural-acoustic interaction with reduced order isogeometric FEM/BEM.
J. Comput. Phys., 2024

Generative AI for Music and Audio.
CoRR, 2024

Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset.
CoRR, 2024

Futga: Towards Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation.
CoRR, 2024

Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

2023
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Multitrack Music Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2023

Equipping Pretrained Unconditional Music Transformers with Instrument and Genre Controls.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Multitrack Music Transformer: Learning Long-Term Dependencies in Music with Diverse Instruments.
CoRR, 2022

Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Deep Performer: Score-to-Audio Music Performance Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Bach Violin Dataset.
Dataset, October, 2021

An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

2020
Automatic Melody Harmonization with Triad Chords: A Comparative Study.
CoRR, 2020

MusPy: A Toolkit for Symbolic Music Generation.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

2019
Towards a Deeper Understanding of Adversarial Losses.
CoRR, 2019

2018
Lakh Pianoroll Dataset.
Dataset, February, 2018

Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation.
CoRR, 2018

Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
MuseGAN: Symbolic-domain Music Generation and Accompaniment with Multi-track Sequential Generative Adversarial Networks.
CoRR, 2017

