Tian-Hao Zhang

Orcid: 0000-0003-3456-8262

According to our database1, Tian-Hao Zhang authored at least 14 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Improving Multi-Type License Plate Recognition via Learning Globally and Contrastively.
IEEE Trans. Intell. Transp. Syst., September, 2024

M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.
Pattern Recognit. Lett., 2024

Transmitted and Aggregated Self-Attention for Automatic Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CIF-T: A Novel CIF-Based Transducer Architecture for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Self-supervised contrastive speaker verification with nearest neighbor positive instances.
Pattern Recognit. Lett., September, 2023

Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition.
CoRR, 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Stable Speech Emotion Recognition with Head-k-Pooling Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Comparison of Different Missing-Imputation Methods for MAIAC (Multiangle Implementation of Atmospheric Correction) AOD in Estimating Daily PM2.5 Levels.
Remote. Sens., 2020


  Loading...