Zeyu Xie

ORCID: 0009-0001-9546-3301

According to our database¹, Zeyu Xie authored at least 27 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

LARA-Gen: Enabling Continuous Emotion Control for Music Generation Models via Latent Affective Representation Alignment.

CoRR, October, 2025

When Audio Generators Become Good Listeners: Generative Features for Understanding Tasks.

CoRR, September, 2025

UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities.

CoRR, September, 2025

STAR: Speech-to-Audio Generation via Representation Learning.

CoRR, September, 2025

FakeSound2: A Benchmark for Explainable and Generalizable Deepfake Sound Detection.

CoRR, September, 2025

PicoAudio2: Temporal Controllable Text-to-Audio Generation with Natural Language Description.

CoRR, September, 2025

Overview of the Amphion Toolkit (v0.2).

CoRR, January, 2025

Robust Visual Food Recognition for Enriching Nutrition Knowledge Bases.

IEEE Trans. Multim., 2025

Dual-robust divergence-free and efficiently decoupled scheme for three-dimensional thermally coupled magnetohydrodynamic systems.

Zeyu Xie

Haiyan Su

Xinlong Feng

Commun. Nonlinear Sci. Numer. Simul., 2025

AudioTime: A Temporally-aligned Audio-text Benchmark Dataset.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

PicoAudio: Enabling Precise Temporal Controllability in Text-to-Audio Generation.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Research on Pedestrian and Cyclist Classification Method Based on Micro-Doppler Effect.

Sensors, October, 2024

Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation.

CoRR, 2024

FakeSound: Deepfake General Audio Detection.

CoRR, 2024

Phonetic and Lexical Discovery of a Canine Language using HuBERT.

CoRR, 2024

FakeSound: Deepfake General Audio Detection.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds.

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Audio Generation Diversity with Visual Information.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Improving Audio Caption Fluency with Automatic Error Correction.

CoRR, 2023

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Enhance Temporal Relations in Audio Captioning with Sound Event Detection.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Can Audio Captions Be Evaluated With Image Caption Metrics?

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Intuitionistic Fuzzy C-Means Algorithm Based on Membership Information Transfer-Ring and Similarity Measurement.

Sensors, 2021

Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Medical image fusion using the PCNN based on IQPSO in NSST domain.

IET Image Process., 2020
