Zeyu Xie

Orcid: 0009-0001-9546-3301

According to our database1, Zeyu Xie authored at least 21 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Overview of the Amphion Toolkit (v0.2).
CoRR, January, 2025

Robust Visual Food Recognition for Enriching Nutrition Knowledge Bases.
IEEE Trans. Multim., 2025

Dual-robust divergence-free and efficiently decoupled scheme for three-dimensional thermally coupled magnetohydrodynamic systems.
Commun. Nonlinear Sci. Numer. Simul., 2025

AudioTime: A Temporally-aligned Audio-text Benchmark Dataset.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

PicoAudio: Enabling Precise Temporal Controllability in Text-to-Audio Generation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Research on Pedestrian and Cyclist Classification Method Based on Micro-Doppler Effect.
Sensors, October, 2024

Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation.
CoRR, 2024

FakeSound: Deepfake General Audio Detection.
CoRR, 2024

Phonetic and Lexical Discovery of a Canine Language using HuBERT.
CoRR, 2024

FakeSound: Deepfake General Audio Detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Audio Generation Diversity with Visual Information.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Improving Audio Caption Fluency with Automatic Error Correction.
CoRR, 2023

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Enhance Temporal Relations in Audio Captioning with Sound Event Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Can Audio Captions Be Evaluated With Image Caption Metrics?
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Intuitionistic Fuzzy C-Means Algorithm Based on Membership Information Transfer-Ring and Similarity Measurement.
Sensors, 2021

Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Medical image fusion using the PCNN based on IQPSO in NSST domain.
IET Image Process., 2020


  Loading...