Tian-Hao Zhang

Orcid: 0000-0003-3456-8262

According to our database, Tian-Hao Zhang authored at least 17 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
PALM-Bench: A Comprehensive Benchmark for Personalized Audio-Language Models.
CoRR, January, 2026

IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition.
CoRR, January, 2026

2025
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

I²TTS: Image-Indicated Immersive Text-to-Speech Synthesis with Spatial Perception.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Improving Multi-Type License Plate Recognition via Learning Globally and Contrastively.
IEEE Trans. Intell. Transp. Syst., September, 2024

M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.
Pattern Recognit. Lett., 2024

Transmitted and Aggregated Self-Attention for Automatic Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CIF-T: A Novel CIF-Based Transducer Architecture for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Self-supervised contrastive speaker verification with nearest neighbor positive instances.
Pattern Recognit. Lett., September, 2023

Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition.
CoRR, 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Stable Speech Emotion Recognition with Head-k-Pooling Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Comparison of Different Missing-Imputation Methods for MAIAC (Multiangle Implementation of Atmospheric Correction) AOD in Estimating Daily PM2.5 Levels.
Remote. Sens., 2020

