Zhiqi Ai

ORCID: 0009-0005-1034-9972

According to our database, Zhiqi Ai authored at least 12 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Effective User-defined Keyword Spotting with Dual-stage Matching, Multi-modal Enrollment, and Continual Adaptation.

CoRR, May, 2026

StereoDETR: Stereo-Based Transformer for 3D Object Detection.

IEEE Trans. Circuits Syst. Video Technol., April, 2026

Enhanced Fingerprint-Based Positioning With Practical Imperfections: Deep Learning-Based Approaches.

IEEE Wirel. Commun., February, 2026

2025

Dual Data Scaling for Robust Two-Stage User-Defined Keyword Spotting.

CoRR, October, 2025

Stage-wise Adaptive Label Distribution for Facial Age Estimation.

CoRR, September, 2025

Towards Robust Speaker Recognition against Intrinsic Variation with Foundation Model Few-shot Tuning and Effective Speech Synthesis.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

VoxAging: Continuously Tracking Speaker Aging with a Large-Scale Longitudinal Dataset in English and Mandarin.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

StableTTS: Towards Efficient Denoising Acoustic Decoder for Text to Speech Synthesis with Consistency Flow Matching.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024

Optimizing feature fusion for improved zero-shot adaptation in text-to-speech synthesis.

EURASIP J. Audio Speech Music. Process., December, 2024

Enhancing Open-Set Speaker Identification Through Rapid Tuning With Speaker Reciprocal Points and Negative Sample.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis.

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting.

Zhiqi Ai, Zhiyong Chen, Shugong Xu

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
