Chao Zhang

According to our database1, Chao Zhang authored at least 29 papers between 2013 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2020
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications.
IEEE J. Sel. Top. Signal Process., 2020

Introduction to the Special Issue on Deep Learning for Multi-Modal Intelligence Across Speech, Language, Vision, and Heterogeneous Signals.
IEEE J. Sel. Top. Signal Process., 2020

Neural Kalman Filtering for Speech Enhancement.
CoRR, 2020

2019
Improved Large-margin Softmax Loss for Speaker Diarisation.
CoRR, 2019

Discriminative Neural Clustering for Speaker Diarisation.
CoRR, 2019

Speaker diarisation using 2D self-attentive combination of embeddings.
CoRR, 2019

Direct-Path Signal Cross-Correlation Estimation for Sound Source Localization in Reverberation.
Proceedings of the Interspeech 2019, 2019

Multi-Span Acoustic Modelling Using Raw Waveform Signals.
Proceedings of the Interspeech 2019, 2019

PyHTK: Python Library and ASR Pipelines for HTK.
Proceedings of the IEEE International Conference on Acoustics, 2019

Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs.
CoRR, 2018

Semi-tied Units for Efficient Gating in LSTM and Highway Networks.
Proceedings of the Interspeech 2018, 2018

Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems.
Proceedings of the Interspeech 2018, 2018

High Order Recurrent Neural Networks for Acoustic Modelling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem.
PLoS Comput. Biol., 2017

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.
Proceedings of the Interspeech 2016, 2016

System combination with log-linear models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
A general artificial neural network extension for HTK.
Proceedings of the INTERSPEECH 2015, 2015

Parameterised sigmoid and reLU hidden activation functions for DNN acoustic modelling.
Proceedings of the INTERSPEECH 2015, 2015

Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages.
Proceedings of the INTERSPEECH 2015, 2015

The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation.
Proceedings of the INTERSPEECH 2015, 2015

Cambridge university transcription systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the cambridge university alignment systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Structured discriminative models using deep neural-network features.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Standalone training of context-dependent deep neural network acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Investigation of multilingual deep neural networks for spoken term detection.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013


  Loading...