Xiaobin Zhuang

Orcid: 0000-0002-0285-6705

According to our database1, Xiaobin Zhuang authored at least 17 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Sounding that Object: Interactive Object-Aware Image to Audio Generation.
CoRR, June, 2025

MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation.
CoRR, June, 2025

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models.
CoRR, May, 2025

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model.
CoRR, April, 2025

ReViT: Vision Transformer Accelerator With Reconfigurable Semantic-Aware Differential Attention.
IEEE Trans. Computers, March, 2025

DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation.
CoRR, February, 2025

Sound-VECaps: Improving Audio Generation with Visually Enhanced Captions.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Improving Audio Generation with Visual Enhanced Caption.
CoRR, 2024

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models.
CoRR, 2024

2023
A Survey on Deep Learning for Chinese Medical Named Entity Recognition.
Proceedings of the 9th International Conference on Computing and Artificial Intelligence, 2023

2022
KaraTuner: Towards End-to-End Natural Pitch Correction for Singing Voice in Karaoke.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Chinese Electronic Medical Record Named Entity Recognition Based on Bi-RNN-LSTM-RNN-CRF.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

2021
KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke.
CoRR, 2021

Litesing: Towards Fast, Lightweight and Expressive Singing Voice Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2016
Real-time vehicle detection with foreground-based cascade classifier.
IET Image Process., 2016

Control of a nursing bed based on a hybrid brain-computer interface.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

2013
Algorithm for vision-based vehicle detection and classification.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013


  Loading...