We stand with Ukraine

We stand with Ukraine

Tian-Hao Zhang

Orcid: 0000-0003-3456-8262

According to our database¹, Tian-Hao Zhang authored at least 17 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

PALM-Bench: A Comprehensive Benchmark for Personalized Audio-Language Models.

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2026

IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition.

[DOI]

,

,

,

,

,

,

,

CoRR, January, 2026

2025

Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores.

[DOI]

,

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition.

[DOI]

,

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

I<sup>2</sup>TTS: Image-Indicated Immersive Text-to-Speech Synthesis with Spatial Perception.

[DOI]

,

,

,

,

,

,

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles.

[DOI]

,

,

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Improving Multi-Type License Plate Recognition via Learning Globally and Contrastively.

[DOI]

,

,

,

,

,

IEEE Trans. Intell. Transp. Syst., September, 2024

M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.

[DOI]

,

,

,

,

,

Pattern Recognit. Lett., 2024

Transmitted and Aggregated Self-Attention for Automatic Speech Recognition.

[DOI]

,

,

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CIF-T: A Novel CIF-Based Transducer Architecture for Automatic Speech Recognition.

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Self-supervised contrastive speaker verification with nearest neighbor positive instances.

[DOI]

,

,

Chuan-Fei Zhang

,

,

,

Pattern Recognit. Lett., September, 2023

Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition.

[DOI]

,

,

,

CoRR, 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Stable Speech Emotion Recognition with Head-k-Pooling Loss.

[DOI]

,

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition.

[DOI]

Chuan-Fei Zhang

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

2020

Comparison of Different Missing-Imputation Methods for MAIAC (Multiangle Implementation of Atmospheric Correction) AOD in Estimating Daily PM2.5 Levels.

[DOI]

,

,

,

,

,

,

,

Remote. Sens., 2020

Loading...