Zhiqi Ai

Orcid: 0009-0005-1034-9972

According to our database1, Zhiqi Ai authored at least 12 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Effective User-defined Keyword Spotting with Dual-stage Matching, Multi-modal Enrollment, and Continual Adaptation.
CoRR, May, 2026

StereoDETR: Stereo-Based Transformer for 3D Object Detection.
IEEE Trans. Circuits Syst. Video Technol., April, 2026

Enhanced Fingerprint-Based Positioning With Practical Imperfections: Deep Learning-Based Approaches.
IEEE Wirel. Commun., February, 2026

2025
Dual Data Scaling for Robust Two-Stage User-Defined Keyword Spotting.
CoRR, October, 2025

Stage-wise Adaptive Label Distribution for Facial Age Estimation.
CoRR, September, 2025

Towards Robust Speaker Recognition against Intrinsic Variation with Foundation Model Few-shot Tuning and Effective Speech Synthesis.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

VoxAging: Continuously Tracking Speaker Aging with a Large-Scale Longitudinal Dataset in English and Mandarin.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

StableTTS: Towards Efficient Denoising Acoustic Decoder for Text to Speech Synthesis with Consistency Flow Matching.
Proceedings of the IEEE International Conference on Acoustics, 2025

2024
Optimizing feature fusion for improved zero-shot adaptation in text-to-speech synthesis.
EURASIP J. Audio Speech Music. Process., December, 2024

Enhancing Open-Set Speaker Identification Through Rapid Tuning With Speaker Reciprocal Points and Negative Sample.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024


  Loading...